cudacodec::VideoReader has a high parallel frame loss rate

If you are splicing the streams into one large picture, that implies there could be some synchronization between them. Could that be what is causing each read thread to wait?

Is it always 1-2 seconds, or is it only for the first 30 seconds or so while they all catch up with each other?
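
One way to answer that is to log the gap per stream and average it over fixed time windows, so you can see whether it settles after startup or stays constant. Below is a minimal sketch in plain Python. The function name `lag_by_window`, the `(wall_clock_s, lag_s)` sample format, and the 10-second window are my assumptions for illustration, not anything from OpenCV; you would have to collect the samples yourself in each read thread.

```python
from collections import defaultdict

def lag_by_window(samples, window_s=10.0):
    """Average the measured lag per time window for one stream.

    samples:  iterable of (wall_clock_s, lag_s) pairs (hypothetical format,
              recorded by your own read thread).
    window_s: window length in seconds (assumed default of 10 s).
    Returns a dict {window_start_s: mean lag in that window}, in time order.
    """
    sums = defaultdict(lambda: [0.0, 0])  # window start -> [lag total, count]
    for t, lag in samples:
        start = int(t // window_s) * window_s
        sums[start][0] += lag
        sums[start][1] += 1
    return {start: total / n for start, (total, n) in sorted(sums.items())}

# Example: lag shrinks after the first window, which would suggest a startup catch-up.
samples = [(0, 2.0), (5, 1.0), (12, 0.5), (18, 0.5)]
print(lag_by_window(samples))
```

If the averages drop toward zero after the first few windows, it is probably just the streams catching up at startup. If they stay around 1-2 seconds, something is likely blocking the readers the whole time.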