I have found that sometimes, depending on the picture when I apply CUDA Fast, it detects less keypoints than the CPU version.
I am using png pics of 1448*648 pixels. (288.6KB) and when I apply my algorithm CUDA Fast finds (around- it is not consistent) 1591 points while the CPU finds consistently 1826 keypoints.
And the curios things is that all the keypoints found by CUDA are with Y<524 (that means that all keypoints with Y>524 are not found)
what could be happening here?