Reusing Mat memory as much as possible when decoding a sequence of images

if you decode into a new Mat, and then subregion-copy that into the “page-locked” one, how does that perform?