Getting metric scale around object location?

Hi, I'm using OpenCV to try to retrieve the depth to an object I've already detected; I'm doing this to get pixels per cm at the object's location.
So far the constraints I have are:

  1. I can't use a depth camera or a depth sensor
  2. I can't place a reference object at the object location

My current approach is to use my phone camera to take two pictures a certain distance (the baseline) apart to simulate stereoscopy.
I have calibrated my camera using the standard chessboard pattern; I'm wondering whether using other patterns would help.

I also tried to make the calibration metric by adding a 'square_size' parameter, like:

    import cv2
    import numpy as np

    square_size = 0.024  # chessboard square size in metres (24 mm)
    termCriteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 30, 0.001)
    # world points on the Z=0 plane, scaled to metric units by the square size
    worldPtsCur = np.zeros((nRows*nCols, 3), np.float32)
    worldPtsCur[:, :2] = np.mgrid[0:nCols, 0:nRows].T.reshape(-1, 2) * square_size
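
For context, this is roughly how those world points feed into the calibration (a minimal sketch; imgPaths, nRows and nCols are placeholders for my actual setup):

    import cv2
    import numpy as np

    # Collect metric world points and detected corners for every calibration image
    worldPts, imgPts = [], []
    for path in imgPaths:
        gray = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2GRAY)
        found, corners = cv2.findChessboardCorners(gray, (nCols, nRows))
        if found:
            corners = cv2.cornerSubPix(gray, corners, (11, 11), (-1, -1), termCriteria)
            worldPts.append(worldPtsCur)
            imgPts.append(corners)

    # Because worldPtsCur is in metres, the calibration extrinsics come out in metres too
    rms, cameraMatrix, distCoeffs, rvecs, tvecs = cv2.calibrateCamera(
        worldPts, imgPts, gray.shape[::-1], None, None)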

Then I do feature matching and use the matched points to estimate the essential matrix E and recover R and t, like:

    # Calculate the essential matrix from the matched points
    E, mask = cv2.findEssentialMat(pts1, pts2, cameraMatrix, method=cv2.RANSAC, prob=0.999, threshold=1.0)
    # Decompose the essential matrix into R and a unit-length translation t
    _, R, t, mask = cv2.recoverPose(E, pts1, pts2, cameraMatrix)
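
Since my triangulation below uses normalized (K-free) points, I then build the projection matrices roughly like this (a sketch; scaling t by the measured baseline is my assumption, because recoverPose only returns a unit-length translation, and 0.10 m is just an example value):

    import numpy as np

    baseline_m = 0.10  # example value, stand-in for my measured camera shift
    # recoverPose returns t only up to scale (unit length), so scale it by the baseline
    P1 = np.hstack([np.eye(3), np.zeros((3, 1))])      # first camera at the origin
    P2 = np.hstack([R, baseline_m * t.reshape(3, 1)])  # second camera: [R | s*t]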

I've noticed here that the x element of t is negative; since my images go left to right, I'd expect it to be positive, am I correct in that assumption?
Lastly, I tried two things: computing depth directly from the x-axis disparity with the equation
depth = f * baseline / disparity
and triangulating with cv2 like:

    # Triangulate the 3D point from the two normalized image points
    points_4D = cv2.triangulatePoints(P1, P2, normalized_point1[:2].reshape(2, 1),
                                      normalized_point2[:2].reshape(2, 1))

    # Convert from homogeneous coordinates to 3D
    points_3D = points_4D[:3] / points_4D[3]  # normalize by the fourth coordinate

    # Extract the Z-coordinate as the metric depth
    depth = points_3D[2][0]

    print("Metric Depth (Z-coordinate):", depth)

But the depth result from both methods does not match my ground truth on my test set.
Is my approach generally correct?
What are the sources of inaccuracy that could be harming my depth calculation?
What can I do better as a whole?

Update:

  1. Calibration images: the two furthest ones were removed because the chessboard corners could not be found.
  2. A sample image pair showing the feature matches found between the two views.
  3. Selecting a point manually in the left image and searching along the epipolar line in the right image for the best match (a sketch of this search is below the list).
  4. Then using cv2.triangulatePoints to triangulate the undistorted points to get depth,
    but my depth results are off from my ground truth.
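
Here is roughly what the epipolar search from step 3 looks like (a minimal sketch; F, left_img, right_img and the clicked point (x0, y0) are placeholders for my actual data, and the 17x17 window is arbitrary):

    import cv2
    import numpy as np

    # Epipolar line in the right image corresponding to the clicked left-image point
    line = cv2.computeCorrespondEpilines(
        np.array([[[x0, y0]]], dtype=np.float32), 1, F).reshape(3)
    a, b, c = line  # line equation: a*x + b*y + c = 0

    patch = left_img[y0 - 8:y0 + 9, x0 - 8:x0 + 9]  # template around the clicked point
    best_score, best_pt = -1.0, None
    for x in range(8, right_img.shape[1] - 8):
        if abs(b) < 1e-6:
            break  # near-vertical epipolar line; would need to sweep over y instead
        y = int(round(-(a * x + c) / b))
        if 8 <= y < right_img.shape[0] - 8:
            cand = right_img[y - 8:y + 9, x - 8:x + 9]
            score = cv2.matchTemplate(cand, patch, cv2.TM_CCOEFF_NORMED)[0][0]
            if score > best_score:
                best_score, best_pt = score, (x, y)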

Some things to note as well:

  1. I think my calibration results are quite decent: I got fx ≈ 3043, and the manual calculation f_px = focal_length_mm / pixel_size_mm,
    i.e. 4.25 mm / 0.0014 mm, gives ≈ 3036.
    Is that a very big difference?
  2. For some reason the x component of the translation vector t from cv2.recoverPose is negative. The images are passed left to right, so if I'm understanding correctly it should be positive, correct? I took the images right to left but then passed them in left to right; I don't understand where I went wrong here.

Any advice is appreciated.