How to improve the performance of image matching

Nigel_Lee · November 6, 2022, 5:21am

I am working on a project about giving a test image, I have to retrieve similar images from a folder with 1000 images inside.

For example, my test image:

The images expected to be retrieved are some ancient Rome architectures (cannot post due to new user)

However, there are some images that are to confuse my program which are some mountains, which the colors are quite similar to my test image.

These are my functions :

import cv2 as cv
import numpy as np
from glob import glob

#Comparing images
def compareImgs(img1, img2):
# resize img2 to img1
img2 = cv.resize(img2, (img1.shape[1], img1.shape[0]))
diff = cv.absdiff(img1, img2)
return diff.sum()

#Comparing Histogram
def compareImgs_hist(img1, img2):
width, height = img1.shape[1], img1.shape[0]
img2 = cv.resize(img2, (width, height))
num_bins = 10
hist1 = [0] * num_bins
hist2 = [0] * num_bins
bin_width = 255.0 / num_bins + 1e-4
hist1 = cv.calcHist([img1], [0], None, [num_bins], [0, 255])
hist2 = cv.calcHist([img2], [0], None, [num_bins], [0, 255])
sum = 0
for i in range(num_bins):
sum += abs(hist1[i] - hist2[i])
return sum / float(width * height)

def retrieval():
src_input = cv.imread(“building.jpg”)
src_gray = cv.cvtColor(src_input, cv.COLOR_BGR2GRAY)
# read image database (1000 images)
database = sorted(glob(database_dir + “/*.jpg”))
for img in database:
# read image
img_rgb = cv.imread(img)
# convert to gray scale
img_gray = cv.cvtColor(img_rgb, cv.COLOR_BGR2GRAY)
# compare the two images
diff = compareImgs(src_gray, img_gray)
# compare the two images by histogram, uncomment the following line to use histogram
diff = compareImgs_hist(src_gray, img_gray)
# find the minimum difference
if diff <= min_diff:
# update the minimum difference
min_diff = diff
# update the most similar image
closest_img = img_rgb
result = img

print("the most similar image is %s, the pixel-by-pixel difference is %f " % (result, min_diff))
print("\n")

cv.imshow("Result", closest_img)
cv.waitKey(0)
cv.destroyAllWindows()

The result is always the mountain but not buildings, I know there are some differences like the test image is a skyscraper while the expected images are some ancient architecture. But is that a possible way that I can improve the performance?

This is my first time posting here so if I have something stated not clearly or wrong please let me know. Thanks a lot!

Nigel_Lee · November 6, 2022, 5:22am

Expected image:
266

Nigel_Lee · November 6, 2022, 5:23am

Image to confuse the program:
850

matti.vuori · November 6, 2022, 1:37pm

Similar histogram doesn’t tell much… I would find contours in the images. In the building there would be many rectangular ones, whereas in the mountains, very different ones.

crackwitz · November 6, 2022, 5:35pm

this is a huge field of research. you should not imagine that this will be trivial.

Nigel_Lee · November 9, 2022, 2:21pm

So it is hard to use histogram in this condition?

Topic		Replies	Views
Using OpenCV for Image Similarity Python	12	13656	November 30, 2021
Want to speed up template matching in OpenCV python Python	1	3664	December 27, 2020
Image difference after Image registration and alignment Python	5	2349	October 27, 2022
Matching and Code improvement questions Python matchtemplate	8	1606	August 17, 2022
Measuring image similarity with opencv Python calib3d , homography	1	791	August 10, 2023

How to improve the performance of image matching

Related topics