1FPS using Cuda (Nvidia Jetson Orin Nano)

I am not sure why, but after building OpenCV with CUDA support, I am only able to achieve 1 fps, regardless of the input video quality.

Does anyone know what might be causing this? I am unable to figure this out. As far as I understand, this should not be happening on the Nvidia Jetson Orin Nano.

Demonstration: Watch OPENCV with CUDA LOW FPS JETSON ORIN NANO | Streamable

What are you doing with CUDA, DNN inference? If so do you have FPS figures for the model you are using on different hardware?

I’m using CUDA with OpenCV for webcam processing in an emotion detection model. When the webcam preview opens, the FPS is very low. However, on different hardware, specifically with my RTX 4090, I get 30 FPS.

according to this, it should perform well enough for common models:

you have provided no debuggable details. I would guess that you aren’t actually using the GPU. something went wrong.

As @crackwitz said we need an MRE. That said if you can only get 30FPS on an RTX 4090 the back of the envelope calculation of fps on the jetson based purely on flops (not tensor cores/fp16 etc. as we don’t have an MRE) would be something like

30 * 1280 GLOPS (Jetson Orin Nano 8GB) / 82000 GFLOPS (RTX 4090) ~= 0.4 fps

1 Like