Hi…I am trying to find faces of human using cfg(yolov3-face.cfg) and weight files(yolov3-wider_16000.weights) in realtime. I can get approximately 5 FPS in CPU. But it is not sufficient for my application. I noticed that single frame execution takes 200 ms. Out of it nearly 170ms spent for net.forward() line to execute. How to reduce the time taken for this line. Accuracy is good. I have attached part of the code below. Anyone please help me to come out from this issue.
Hi .
You should build your OpenCV with CUDA support . Then you will gain a huge prformance from your application. You can refer to this link : Build OpenCV with CUDA
An other option is using OpenVINO as your inference backend. You will gain a good performance with OpenVINO , but not as much as CUDA.
You can refer to this link for OpenCV and OpenVINO configuration :OpenCV with OpenVINO
@Maheswari.R Yes , sure . You can do it easily. I haven’t try it on windows, but there are much topics on the internet that you can use them.
For example : link1 and link2 and link3
Do you compile your code as debug profile, or release profile?
Both the OpenCV and other libraries ( like Dlib ) have awful performance under debug.
If this is the case, try to build in release profile, and check performance.