How are you timing that? Are you including the upload to and download from the GPU in the timing? Which GPU and CPU are you comparing? Are you timing a single call? The first run on the GPU is always orders of magnitude slower than subsequent ones. Are you using C++ or Python?
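To make the first-run point concrete, here is a minimal sketch of timing the first call separately from the steady-state calls. It uses only the Python standard library. `benchmark` and `FakeOp` are hypothetical names made up for this illustration, and `FakeOp` only simulates a one-off setup cost with `time.sleep`; it is not an OpenCV call.

```python
import statistics
import time


def benchmark(fn, warmup=3, runs=10):
    """Time a zero-argument callable.

    Returns (first_call_seconds, median_steady_state_seconds).
    The first call is timed on its own because it may include one-off setup.
    """
    start = time.perf_counter()
    fn()
    first_call_s = time.perf_counter() - start

    # Remaining warm-up calls are executed but not timed.
    for _ in range(warmup - 1):
        fn()

    # Timed steady-state calls.
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)

    return first_call_s, statistics.median(samples)


class FakeOp:
    """Hypothetical stand-in for a GPU call: the first call pays a setup cost."""

    def __init__(self):
        self.initialised = False

    def __call__(self):
        if not self.initialised:
            time.sleep(0.05)  # simulated one-off initialisation
            self.initialised = True
        time.sleep(0.001)  # simulated per-call work


if __name__ == "__main__":
    first, steady = benchmark(FakeOp())
    print(f"first call: {first * 1e3:.1f} ms, steady state: {steady * 1e3:.1f} ms")
```

With a real GPU routine, `fn` would have to block until the GPU work has finished, otherwise only the launch is measured. Whether `fn` also wraps the upload and download decides whether transfer time is part of the number you are comparing against the CPU.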