GpuMat simple operations: Error: Gpu API call (invalid configuration argument) in cv::cudev::grid_transform_detail::TransformDispatcher

Yes, as if I knew anything about GPU programming, CUDA, NPP, or what blocks are. All I want is to use the OpenCV library, which I know a bit, together with my new CUDA-capable GPU and see how much faster things can get. I don't even know how to implement a cuda::repeat() myself in a way that isn't incredibly slow: using src.row(a).copyTo(dst.row(b)) or src.copyTo(dst(ROI)) is slow, and it gets even slower when I try to use OpenMP.
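For reference, this is what a repeat has to compute, written as a minimal sketch in plain C++ with no OpenCV or CUDA involved. The function name repeatTiles and the flat row-major std::vector layout are assumptions made for illustration only, not part of any OpenCV API. Each destination element only reads one source element, which is why the operation should map well onto a GPU in principle.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not an OpenCV function): tile a row-major
// rows x cols buffer ny times vertically and nx times horizontally.
// Destination element (y, x) reads source element (y % rows, x % cols).
std::vector<float> repeatTiles(const std::vector<float>& src,
                               std::size_t rows, std::size_t cols,
                               std::size_t ny, std::size_t nx) {
    assert(src.size() == rows * cols);
    const std::size_t dstRows = rows * ny;
    const std::size_t dstCols = cols * nx;
    std::vector<float> dst(dstRows * dstCols);
    for (std::size_t y = 0; y < dstRows; ++y) {
        for (std::size_t x = 0; x < dstCols; ++x) {
            dst[y * dstCols + x] = src[(y % rows) * cols + (x % cols)];
        }
    }
    return dst;
}
```

Calling repeatTiles(src, 2, 3, 2, 2) on a 2x3 buffer gives a 4x6 buffer in which every 2x3 tile is a copy of the source.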

My issue with transpose is more about the multidimensionality, i.e. the number of channels:

Mat c = Mat(5, 7, CV_32FC(8));
GpuMat g = GpuMat(c);
cv::transpose(c, c); //works
cuda::transpose(g, g); //crash

But with CV_32FC(9), cv::transpose crashes as well.
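One possible explanation is the size of a single matrix element in bytes. The following is a minimal sketch in plain C++ (no OpenCV needed), assuming the limit stated in the cuda::transpose documentation that only 1-, 4- and 8-byte elements are supported. The names elemSizeBytes and cudaTransposeSupports are hypothetical helpers for illustration, not OpenCV functions.

```cpp
#include <cassert>
#include <cstddef>

// Bytes per matrix element: bytes per channel times number of channels.
// For CV_32FC(n) this is 4 * n, since a 32-bit float takes 4 bytes.
constexpr std::size_t elemSizeBytes(std::size_t bytesPerChannel, int channels) {
    return bytesPerChannel * static_cast<std::size_t>(channels);
}

// Assumption taken from the cuda::transpose documentation:
// only 1-, 4- and 8-byte elements are supported.
constexpr bool cudaTransposeSupports(std::size_t elemSize) {
    return elemSize == 1 || elemSize == 4 || elemSize == 8;
}
```

Under that assumption, CV_32FC(8) has 32-byte elements and falls outside what cuda::transpose accepts, while CV_32FC1 (4 bytes) or CV_32FC2 (8 bytes) would be fine. CV_32FC(9) has 36-byte elements, so the CPU version may simply have a higher limit that sits somewhere between 32 and 36 bytes; that part is inferred from the behaviour above, not from documentation.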