Yes, I have been reading these threads. It seems the best way forward is to work through __cuda_array_interface__ and use CuPy to get the data into a PyTorch tensor. I am not yet sure whether I will have to wait for the OpenCV stream to complete before starting the PyTorch processing. Ideally, I would push the PyTorch inference onto the existing OpenCV stream.
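
For what it's worth, here is a rough sketch of how that hand-off might look. `CudaArrayView` and `tensor_on_opencv_stream` are names I made up for illustration. The sketch assumes an 8-bit GpuMat and an OpenCV build whose Python bindings expose `GpuMat.cudaPtr()`, `GpuMat.step` and `Stream.cudaPtr()`, plus a PyTorch version that has `torch.cuda.ExternalStream`. I have not run the GPU part, so treat it as a starting point rather than a verified recipe.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CudaArrayView:
    """Hypothetical wrapper describing pitched HxWxC device memory."""
    ptr: int                      # raw device pointer, e.g. from GpuMat.cudaPtr()
    rows: int
    cols: int
    channels: int
    step: int                     # bytes per row, including any padding
    typestr: str = "|u1"          # assumption: 8-bit unsigned pixels
    itemsize: int = 1
    stream: Optional[int] = None  # optional stream handle for the consumer to sync on

    @property
    def __cuda_array_interface__(self):
        # Layout follows version 3 of the CUDA Array Interface.
        iface = {
            "shape": (self.rows, self.cols, self.channels),
            "typestr": self.typestr,
            "data": (self.ptr, False),  # (pointer, read_only)
            "strides": (self.step, self.channels * self.itemsize, self.itemsize),
            "version": 3,
        }
        if self.stream is not None:
            iface["stream"] = self.stream
        return iface


def tensor_on_opencv_stream(gpu_mat, cv_stream):
    """Untested sketch: needs a CUDA build of OpenCV, CuPy and PyTorch."""
    import cupy as cp
    import torch

    width, height = gpu_mat.size()
    view = CudaArrayView(ptr=gpu_mat.cudaPtr(), rows=height, cols=width,
                         channels=gpu_mat.channels(), step=gpu_mat.step)
    # Wrap the OpenCV stream so PyTorch work is queued behind the OpenCV work.
    shared = torch.cuda.ExternalStream(cv_stream.cudaPtr())
    with torch.cuda.stream(shared):
        # Zero-copy: gpu_mat must stay alive for as long as the tensor is in use.
        tensor = torch.as_tensor(cp.asarray(view), device="cuda")
    return tensor, shared
```

If this works as I expect, running the model inside `with torch.cuda.stream(shared):` should enqueue the inference after whatever OpenCV has already put on that stream, so no explicit wait on the OpenCV stream would be needed. Operations within a single CUDA stream execute in order, which is the property this relies on.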