I’m using PyTorch on the GPU, trying to read each frame and, say, check whether there’s a cat in it. I want to extract all the frames containing cats and create a new video from only those frames.
My videos are in 1080p (1920x1080).
To explain my thought process so you can see where the bottleneck is (hopefully you’re familiar with PyTorch):
OpenCV CUDA loads frames into a PyTorch DataLoader (which I’ll set up with `num_workers=4` and `pin_memory=True`), then the DataLoader sends the frames to the model.
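Concretely, this is roughly what I’m picturing (just a sketch: `FrameDataset`, the file name, and the stand-in classifier are all placeholders of mine, and the decode here is still plain `cv2.VideoCapture` on the CPU, since I haven’t solved the CUDA reader part yet):

```python
import cv2
import torch
from torch.utils.data import DataLoader, IterableDataset, get_worker_info

class FrameDataset(IterableDataset):
    """Yield (frame_index, tensor) pairs. Each worker decodes the whole
    video but only yields every num_workers-th frame, so no frame is
    duplicated across workers."""
    def __init__(self, path):
        self.path = path

    def __iter__(self):
        info = get_worker_info()
        wid = info.id if info else 0
        nw = info.num_workers if info else 1
        cap = cv2.VideoCapture(self.path)
        idx = 0
        while True:
            ok, frame = cap.read()  # BGR uint8, 1080x1920x3
            if not ok:
                break
            if idx % nw == wid:
                # HWC uint8 -> CHW float32 in [0, 1]
                yield idx, torch.from_numpy(frame).permute(2, 0, 1).float() / 255.0
            idx += 1
        cap.release()

loader = DataLoader(FrameDataset("input.mp4"), batch_size=16,
                    num_workers=4, pin_memory=True)

model = torch.nn.Conv2d(3, 1, 3).cuda().eval()  # stand-in for the cat classifier
with torch.no_grad():
    for indices, batch in loader:
        batch = batch.cuda(non_blocking=True)  # async copy from pinned memory
        scores = model(batch)
        # ... threshold scores, collect the indices of frames with cats ...
        # (batches interleave across workers, so I'd sort indices at the end)
```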
I sort of see what you mean in your explanation of pinned memory and streams; hopefully PyTorch’s DataLoader can take care of that.
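For what it’s worth, my understanding of what the DataLoader would be saving me from doing by hand is something like this (a sketch only; the model and batch are dummies):

```python
import torch

model = torch.nn.Conv2d(3, 8, 3).cuda()             # stand-in model
batch = torch.empty(4, 3, 1080, 1920).pin_memory()  # page-locked host tensor

copy_stream = torch.cuda.Stream()
with torch.cuda.stream(copy_stream):
    # non_blocking=True only actually overlaps when the source is pinned
    gpu_batch = batch.to("cuda", non_blocking=True)
torch.cuda.current_stream().wait_stream(copy_stream)  # copy finishes before compute
gpu_batch.record_stream(torch.cuda.current_stream())  # allocator bookkeeping across streams
out = model(gpu_batch)
```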
I’m completely lost on the OpenCV CUDA code, and on how to patch OpenCV’s CUDA module so that AV1 decoding works with Video Codec SDK 11.0.
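For context, this is the part I can’t get working. My understanding is that, once OpenCV is built with CUDA and the Video Codec SDK, the decode loop would look roughly like this (AV1 decode on top of that supposedly also needs SDK 11.0+ and a GPU with an AV1-capable NVDEC, e.g. Ampere, but I may be wrong about the details):

```python
import cv2

# cv2.cudacodec only exists in an OpenCV build compiled with NVCUVID /
# the Video Codec SDK; the stock pip wheels don't include it.
reader = cv2.cudacodec.createVideoReader("input.mp4")
while True:
    ok, gpu_frame = reader.nextFrame()  # cv2.cuda.GpuMat, stays on the GPU
    if not ok:
        break
    # ideally hand gpu_frame straight to PyTorch here,
    # rather than bringing it back with gpu_frame.download()
```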
Note: I’d like to do everything on the GPU to keep things future-proof, since I’m not sure I’ll always be using an Intel CPU.
Overall, do you think my pipeline would work out?
I managed to do some testing: the reading part takes ~3h (without the DataLoader) and the writing ~1h.
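The writing test was a plain CPU loop along these lines (a sketch; `cat_indices` is a placeholder for whatever frame indices the model ends up flagging):

```python
import cv2

cap = cv2.VideoCapture("input.mp4")
fps = cap.get(cv2.CAP_PROP_FPS)
fourcc = cv2.VideoWriter_fourcc(*"mp4v")
writer = cv2.VideoWriter("cats_only.mp4", fourcc, fps, (1920, 1080))

cat_indices = {0, 1, 2}  # placeholder: indices the model flagged as cats
idx = 0
while True:
    ok, frame = cap.read()
    if not ok:
        break
    if idx in cat_indices:
        writer.write(frame)
    idx += 1

cap.release()
writer.release()
```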