I ran into a problem because I assumed that SIFT descriptors are normalized float vectors.
So I printed them out and was surprised to see integer values stored in a float array!
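For reference, here is roughly how to reproduce this, assuming OpenCV >= 4.4 (where `cv::SIFT` lives in the main module); `"image.png"` is just a placeholder path:

```cpp
// Minimal reproduction: compute SIFT descriptors and print the first row.
#include <opencv2/opencv.hpp>
#include <cstdio>
#include <vector>

int main()
{
    cv::Mat img = cv::imread("image.png", cv::IMREAD_GRAYSCALE);
    cv::Ptr<cv::SIFT> sift = cv::SIFT::create();
    std::vector<cv::KeyPoint> kp;
    cv::Mat desc; // CV_32F, one 128-dimensional row per keypoint
    sift->detectAndCompute(img, cv::noArray(), kp, desc);
    if (!desc.empty())
        for (int k = 0; k < desc.cols; k++)
            std::printf("%g ", desc.at<float>(0, k)); // prints whole numbers: 0 17 113 ...
    std::printf("\n");
    return 0;
}
```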
I did some research, and according to this question on Stack Overflow, the normalized vector is multiplied by 512, then cast to an unsigned char, which is then cast back to a float.
This seems very inefficient in terms of both computation time and storage space. So what's the idea behind this?
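If I understand that answer correctly, the scheme is something like the sketch below. This is my own simplification, not OpenCV's actual code (the real implementation also clips large bins at 0.2 and renormalizes before this step, which I'm omitting):

```cpp
// Simplified sketch of the described quantization: normalize, scale by 512,
// saturate to [0, 255] like a cast to uchar, and store back into the float array.
#include <algorithm>
#include <cmath>

void quantize_descriptor(float* d, int len) // len is 128 for SIFT
{
    float nrm2 = 0.f;
    for (int k = 0; k < len; k++)
        nrm2 += d[k] * d[k]; // squared L2 norm
    float scale = 512.f / std::max(std::sqrt(nrm2), 1e-7f);
    for (int k = 0; k < len; k++)
    {
        int q = static_cast<int>(std::lround(d[k] * scale));
        d[k] = static_cast<float>(std::clamp(q, 0, 255)); // integer value in a float
    }
}
```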
It makes sense to compress/quantize the float values into uint8 values: 4x less storage definitely has an effect on CPU caches, and comparing/subtracting 8-bit integers is very cheap and vectorizable, versus having to handle 32-bit floats, where you don't actually need the precision.
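For example, the squared L2 distance between two quantized descriptors needs nothing but small-integer arithmetic (a sketch, not OpenCV's matcher code):

```cpp
// Squared L2 distance on uint8 descriptors in plain integer math.
// Differences fit in 16 bits and the sum in 32 bits (128 * 255^2 = 8323200),
// so compilers auto-vectorize this loop easily; no float pipeline needed.
#include <cstdint>

int sqdist_u8(const uint8_t* a, const uint8_t* b, int len) // len = 128
{
    int acc = 0;
    for (int k = 0; k < len; k++)
    {
        int d = int(a[k]) - int(b[k]);
        acc += d * d;
    }
    return acc;
}
```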
Look at this (highly optimized and undocumented) code: there is a case for when the output is explicitly float, and a case for when it is uint8.
Indeed, they apply the uint8 scaling (multiply by 512, saturate to the 0-255 range) to the float result case as well. That's silly, in my opinion.
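If you want descriptors back on (roughly) the unit sphere, you can divide the returned floats by 512 yourself. Note that the quantization and saturation are lossy, so this only approximates the original normalized vector; `renormalize` is a hypothetical helper name:

```cpp
// Hypothetical helper: scale SIFT's float output back toward unit norm.
#include <opencv2/core.hpp>

cv::Mat renormalize(const cv::Mat& desc) // desc: CV_32F, N x 128 from SIFT
{
    cv::Mat out;
    desc.convertTo(out, CV_32F, 1.0 / 512.0); // undo the 512 scale factor
    return out;
}
```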