Efficiently loading datasets in OpenCV C++

So I am looking into training a machine learning model for handwritten text recognition using OpenCV and C++. I would like to clarify that I am not looking for a deep learning approach.
Now, obviously you need a dataset, and there should be a way to load this dataset so that you can extract features from it and later train on those features.
One way to do this is with the following:

#include <opencv2/opencv.hpp>
#include <vector>

std::vector<cv::String> fn;
cv::glob(path, fn, true);                // recursively collect every file name under path
for (size_t k = 0; k < fn.size(); k++) {
    cv::Mat im = cv::imread(fn[k]);      // load the k-th image
    // do other stuff
}

But will this method be optimal if the dataset contains a large number of images?
For example, if I were to use this to read the GTI vehicle dataset, which has 6000 images, won't this be inefficient?
Also, in Python you can use sklearn's train_test_split to divide your dataset into training, validation and test sets. Assuming that I have extracted the features, how exactly can we split our dataset into train, test and validation sets in OpenCV using C++?
So, to summarize:

  1. What are the methods to efficiently load large datasets so that we can later extract features and appropriately train a machine learning model on them?
  2. How does the train/test/validation split work in OpenCV C++?

so, what are you trying to use instead?

there's a TrainData class, and there are loaders/parsers for special datasets, but none of them may be feasible for you
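if it's mainly the split you're after: once you have a features Mat (one sample per row, CV_32F) and a labels Mat (CV_32S), TrainData can do a randomized train/test split for you (a separate validation set you'd have to carve out yourself), roughly like this:

#include <opencv2/ml.hpp>

// features: one sample per row (CV_32F), labels: one class id per row (CV_32S)
cv::Ptr<cv::ml::TrainData> tdata =
    cv::ml::TrainData::create(features, cv::ml::ROW_SAMPLE, labels);
tdata->setTrainTestSplitRatio(0.8, /*shuffle=*/true);   // 80% train, 20% test

cv::Mat trainSamples = tdata->getTrainSamples();
cv::Mat trainLabels  = tdata->getTrainResponses();
cv::Mat testSamples  = tdata->getTestSamples();
cv::Mat testLabels   = tdata->getTestResponses();

any ml model can then be trained directly with model->train(tdata).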

I was thinking of using conventional approaches like SVM/KNN. One such methodology is here

and an SVM+HOG example is already mentioned in the OpenCV samples
I also saw your post on Sequence files in OpenCV C++ - Stack Overflow
The datasets module in opencv_contrib seems to answer my questions on loading datasets and annotations.
But I would still like to know how we go about doing the train/test/validation split in C++. Is there a specific class for it?

You said that none of the parsers will be feasible; since I am working on the IAM handwriting dataset, what could be the problem? If these parsers won't work, should I stick to simply using the cv::glob method as mentioned in the post above?

so, it's this?

just curious, how do you plan to extract single chars from it (do you, even)?
(it seems to have “whole word” images)

you'd also have to parse the XML metadata to get to the ground-truth responses, I guess.
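opencv's FileStorage only reads its own xml/yml flavour, so you'd need a general xml parser for that. a rough sketch with tinyxml2 -- the "word" / "id" / "text" element and attribute names are only a guess at the IAM layout, check the actual files:

#include <tinyxml2.h>
#include <map>
#include <string>

// collect id -> transcription from every <word> element, wherever it is nested
static void collectWords(tinyxml2::XMLElement* e, std::map<std::string, std::string>& gt)
{
    if (!e) return;
    if (std::string(e->Name()) == "word") {
        const char* id = e->Attribute("id");
        const char* text = e->Attribute("text");
        if (id && text) gt[id] = text;
    }
    for (tinyxml2::XMLElement* c = e->FirstChildElement(); c; c = c->NextSiblingElement())
        collectWords(c, gt);
}

std::map<std::string, std::string> loadGroundTruth(const std::string& xmlPath)
{
    std::map<std::string, std::string> gt;
    tinyxml2::XMLDocument doc;
    if (doc.LoadFile(xmlPath.c_str()) == tinyxml2::XML_SUCCESS)
        collectWords(doc.RootElement(), gt);
    return gt;
}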

yep, that's the one.
For starters, most HWR pipelines have a word segmentation step. Once individual words are extracted, you can go about training your ML model on them.
Here is a link, but it's in Python.
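Roughly what I have in mind, as a quick C++ sketch (threshold, dilate so the letters of one word merge into a single blob, then take bounding boxes of the external contours; the kernel size is just a guess and would need tuning):

#include <opencv2/opencv.hpp>
#include <vector>

std::vector<cv::Rect> segmentWords(const cv::Mat& gray)
{
    cv::Mat bin;
    cv::threshold(gray, bin, 0, 255, cv::THRESH_BINARY_INV | cv::THRESH_OTSU);
    // dilate mostly horizontally so characters of one word connect
    cv::Mat kernel = cv::getStructuringElement(cv::MORPH_RECT, cv::Size(15, 3));
    cv::dilate(bin, bin, kernel);
    std::vector<std::vector<cv::Point>> contours;
    cv::findContours(bin, contours, cv::RETR_EXTERNAL, cv::CHAIN_APPROX_SIMPLE);
    std::vector<cv::Rect> words;
    for (const auto& c : contours)
        words.push_back(cv::boundingRect(c));
    return words;
}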

As for parsing the XML metadata to get the ground truth, any tips on how to go about that?
Also, if you could give some pointers on how to use the TrainData class for reading an image dataset, that would be nice. I am not coming up with anything.

it does not do that. you're expected to create it with a single data Mat (one feature vector per row) and a labels Mat (one label per row)
where and how you get that data is still up to you
however, there are some samples you could look at
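e.g. something like this to get from a folder of images to descriptors + labels (the folder-per-class layout, file pattern and HOG parameters are just made up for the example):

#include <opencv2/opencv.hpp>
#include <string>
#include <vector>

// one HOG descriptor + one label per image, one folder per class
void loadFolder(const std::string& dir, int classId,
                std::vector<std::vector<float>>& descriptors, std::vector<int>& labels)
{
    cv::HOGDescriptor hog(cv::Size(64, 128), cv::Size(16, 16),
                          cv::Size(8, 8), cv::Size(8, 8), 9);
    std::vector<cv::String> files;
    cv::glob(dir + "/*.png", files, true);
    for (const auto& f : files) {
        cv::Mat img = cv::imread(f, cv::IMREAD_GRAYSCALE);
        if (img.empty()) continue;
        cv::resize(img, img, hog.winSize);   // HOG needs a fixed window size
        std::vector<float> d;
        hog.compute(img, d);
        descriptors.push_back(d);
        labels.push_back(classId);
    }
}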

Hmm, well in this case the features are already extracted, so it's easy to use fstream-based options to read them.
Let's set aside handwritten recognition for a moment.
Say I am talking about a general image classification problem, like vehicle detection. If the dataset is, say, LFW or GTI, how do I proceed with reading those images?

I think this example suits me much better. Could you explain the reasoning behind the convert_to_ml function here?

as said before, opencv’s ml classes expect all train data in a single Mat,
this function copies HOG descriptors to rows in that Mat.
(later, an SVM is trained on that)
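in other words, something along these lines (simplified, not the exact sample code):

#include <opencv2/opencv.hpp>
#include <vector>

// pack one descriptor per row into a single CV_32F Mat for cv::ml
cv::Mat toMLData(const std::vector<std::vector<float>>& descriptors)
{
    CV_Assert(!descriptors.empty());
    cv::Mat data((int)descriptors.size(), (int)descriptors[0].size(), CV_32F);
    for (int r = 0; r < data.rows; ++r)
        cv::Mat(descriptors[r]).reshape(1, 1).copyTo(data.row(r));
    return data;
}

after that, training is just svm->train(toMLData(descriptors), cv::ml::ROW_SAMPLE, cv::Mat(labels)).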

if you’re asking about how to most efficiently load data… don’t worry about that until you have measured the time it takes, and it’s a significant portion of the whole workflow.
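e.g. wrapping the loading loop like this will already tell you whether it's worth optimizing at all:

#include <opencv2/core/utility.hpp>
#include <iostream>

cv::TickMeter tm;
tm.start();
// ... load / decode the images here ...
tm.stop();
std::cout << "loading took " << tm.getTimeSec() << " s" << std::endl;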