Yes, as if I knew anything about GPU programming, CUDA, NPP, or what blocks are. All I want is to use the OpenCV library, which I know a bit, together with my new CUDA-capable GPU and see how much faster it can get. I don't even know how to implement a cuda::repeat() myself in a way that isn't incredibly slow: copying with src.row(a).copyTo(dst.row(b)) or src.copyTo(dst(ROI)) is slow, and it gets even slower when I try to use OpenMP.
My issue with transpose is more about the multidimensionality:
Mat c(5, 7, CV_32FC(8));   // 8 float channels = 32 bytes per element
cuda::GpuMat g(c);         // upload to the GPU
cv::transpose(c, c);       // works
cuda::transpose(g, g);     // crash
But using CV_32FC(9) also makes cv::transpose crash.
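My guess is that this comes down to bytes per element rather than the transpose itself. As far as I can tell from the docs, cuda::transpose only handles 1-, 4- and 8-byte elements, and cv::transpose seems to stop at 32 bytes per element. That would explain why 8 float channels (32 bytes) work on the CPU while 9 (36 bytes) do not. Below is a rough sketch in plain C++ of a fallback that transposes an interleaved multi-channel buffer for any channel count. The names transposeInterleaved and elemSizeBytes are made up for illustration and are not OpenCV functions, and the buffer is assumed to be continuous (no row padding).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: bytes per element for an n-channel float image,
// i.e. what CV_32FC(n) occupies per pixel.
std::size_t elemSizeBytes(std::size_t channels)
{
    return sizeof(float) * channels;
}

// Hypothetical helper, not part of OpenCV: transposes a rows x cols image
// whose pixels hold `channels` interleaved floats. Assumes a continuous
// buffer without row padding.
std::vector<float> transposeInterleaved(const std::vector<float>& src,
                                        std::size_t rows,
                                        std::size_t cols,
                                        std::size_t channels)
{
    assert(src.size() == rows * cols * channels);

    std::vector<float> dst(src.size());
    for (std::size_t r = 0; r < rows; ++r) {
        for (std::size_t c = 0; c < cols; ++c) {
            // Pixel (r, c) in the source becomes pixel (c, r) in the result,
            // which has `rows` pixels per row.
            const float* in = src.data() + (r * cols + c) * channels;
            float* out = dst.data() + (c * rows + r) * channels;
            for (std::size_t k = 0; k < channels; ++k) {
                out[k] = in[k];
            }
        }
    }
    return dst;
}
```

Calling transposeInterleaved(data, 5, 7, 9) would give the 7 x 5 result for the 9-channel case. This is only the naive CPU loop, so it says nothing about how fast a GPU version could be.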