What cuda stream notes mean?

nao.akm · October 20, 2021, 3:15pm

I would like to know the meanings of cuda stream notes.

https://docs.opencv.org/4.5.3/d9/df3/classcv_1_1cuda_1_1Stream.html#details

in the above page, the following is written as note:

Currently, you may face problems if an operation is enqueued twice with different data.

I want to call knnMatchAsync() for so many train descriptors, thousands of descriptors.
In this case, can I call knnMatchAsync() many times with a single cuda stream, or not?
If not, I have to create so many cuda stream objects.
How many cuda streams can be created simultaneously? Or, can I reuse cuda stream repeatedly?

https://docs.opencv.org/4.5.3/dd/dc5/classcv_1_1cuda_1_1DescriptorMatcher.html#a5911f6cbdbd03c7782cc8dd925ba4a3e

My environment is as follows:

os: linux/windows
opencv: 4.4.0
platform: java w/ org. bytedeco opencv-platform-gpu
cuda: 11.2
gpu: nvidia turing architecture

cudawarped · October 20, 2021, 3:44pm

I am not sure which routines that refers to, if I were you I would inspect the source code for knnMatchAsync() to see if there are any global variables which are being set. That said I am pretty sure it is not possible for the following to happen

next call may update the memory before the previous one has been finished

as kernels and async memory operations launched in the same stream should be executed synchronously with respect to each other. Now if you issued the same operation to multiple streams that could be a problem if a global variable was used so it may be refering to the way npp (lots of the CUDA routines in OpenCV are built on top of npp libs) used to work pre CUDA 10.1 where all operations had to be performed in the same stream.

Alternatively this could be the way CUDA used to operate when streams were first implemented, although I don’t remember that.

Topic		Replies	Views
OpenCV-cuda : run the same function in parallel on diferent data using streams? C++ cuda	6	639	August 5, 2022
CUDA flag to create a cv::cuda::Stream that supports asynchronous calls C++ gpu , cuda	1	1120	July 20, 2022
How to use cuda::SparsePyrLKOpticalFlow in multi thread environment C++ multithreading , cuda , optflow	11	988	July 30, 2022
Opencv cuda stream optimisation C++ cuda	1	1075	August 18, 2022
What cuda stream do OpenCV::Cuda functions use? cuda	8	394	March 28, 2024

What cuda stream notes mean?

Related topics