I noticed that my script exceeds the maximum memory usage (16 GB) pretty fast, and after running it under memory-profiler
I saw that cv2.imread(filename)
takes up 6.7 GiB after just 20 iterations out of ~1400 overall.
Here is a snippet from the profiler output (the rest of the code is commented out anyway):
Line #    Mem usage    Increment  Occurrences   Line Contents
=============================================================
    90     59.8 MiB      0.0 MiB           1   img_array = []
    91     59.8 MiB      0.0 MiB           1   size = 0
    92   6827.2 MiB     -2.1 MiB          21   for idx, path in enumerate(path_list):
    93   6827.2 MiB      0.0 MiB          21       if idx == 20:
    94   6827.2 MiB      0.0 MiB           1           return
    95   5728.9 MiB      0.0 MiB          20       print('------------------------------------------------------------------------')
    96   5728.9 MiB      0.0 MiB          20       print(f'Iteration Number: {idx}, In Directory: {path}')
    97   5728.9 MiB      0.0 MiB          20       print('------------------------------------------------------------------------')
    98   6827.6 MiB      5.8 MiB       61620       for filename in sorted(glob.glob(path + '/*.png'), key=len):
    99   6827.7 MiB   6757.6 MiB       61600           img = cv2.imread(filename)
   100   6827.7 MiB     -0.1 MiB       61600           height, width, _ = img.shape
   101   6827.7 MiB     -0.1 MiB       61600           size = (width, height)
   102   6827.7 MiB      3.7 MiB       61600           img_array.append(img)
Each image is 224 x 171 pixels, and ~62k images were read in total during this test.
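For reference, here is a rough back-of-the-envelope check of the raw pixel data kept alive in img_array, assuming imread decodes each PNG to its default 8-bit 3-channel BGR array (the numbers are taken from the profiler output above):

```python
# Hypothetical sanity check, not part of the original script.
n_images = 61600           # imread calls reported by the profiler
height, width = 171, 224   # image size from the post (224 x 171)
bytes_per_image = height * width * 3  # uint8 BGR -> 3 bytes per pixel

total_gib = n_images * bytes_per_image / 2**30
print(f'{total_gib:.2f} GiB')  # -> 6.59 GiB
```

That is almost exactly the 6757.6 MiB (~6.6 GiB) increment the profiler attributes to the imread line, which suggests the memory is held by the decoded arrays accumulated in img_array rather than leaked by imread itself.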