EfficientSAM input size

florent.falipou · November 7, 2024, 10:51am

Hi, I am using OpenCV 4.10, and wanted to try some of the neural networks available at GitHub - opencv/opencv_zoo: Model Zoo For OpenCV DNN and Benchmarks.

I am interested in EfficientSAM. I managed to make it work in C++, but I couldn’t find how to change the image resolution. It seems to always output 640x640 images and to run the segmentation at that input size. With images in 4K or higher, it lacks precision.
I am really new to the DNN module, so the answer might be obvious…
Can you guide me?

Thank you!

crackwitz · November 7, 2024, 10:57am

did you run this file?

the _preprocess function resizes all inputs to be 640 by 640. if you give it something larger, it’ll take it. the output might be that size, but you can resize that to fit your input. yes, that means it won’t be “pixel-accurate”. it probably isn’t even if the input is sized exactly as the model requires.

try changing the occurrences of 640. try doubling that, or halving. not all arbitrary numbers might work, only some specific sizes. if nothing works, the model is probably not fully convolutional.

github.com

opencv/opencv_zoo/blob/main/models/image_segmentation_efficientsam/efficientSAM.py

import numpy as np
import cv2 as cv

class EfficientSAM:
    def __init__(self, modelPath, backendId=0, targetId=0):
        self._modelPath = modelPath
        self._backendId = backendId
        self._targetId = targetId

        self._model = cv.dnn.readNet(self._modelPath)
        self._model.setPreferableBackend(self._backendId)
        self._model.setPreferableTarget(self._targetId)
        # 3 inputs
        self._inputNames = ["batched_images", "batched_point_coords", "batched_point_labels"]  
        
        self._outputNames = ['output_masks']  # actual output layer name
        self._currentInputSize = None
        self._inputSize = [640, 640]  # input size for the model

    @property

This file has been truncated.

florent.falipou · November 7, 2024, 11:07am

Thank you for the quick reply!
I tried resizing my input image to 2*640 (1280x1280) to see if the output would change, but it remained at 640x640.
I think this model cannot handle other dimensions.

Again, thanks for your quick response

crackwitz · November 7, 2024, 3:28pm

I didn’t suggest that.

I suggested editing the code of that script that runs the network.

some places on the internet talk about input being 1024 by 1024, so who knows what’s possible.

florent.falipou · November 7, 2024, 3:50pm

From what I understand of the file you pointed to, the input size is applied through self._inputSize, in this line:

        image = cv.resize(image, self._inputSize)

which basically resizes the image before it is fed to the network.
Is there something I missed?
