Howdy folks.
I have a strong strategic interest in building something like this;
I know Python and basic cpp and js, and I understand the basics of opencv. Any hints?
Howdy folks.
I have a strong strategic interest in building something like this;
I know Python and basic cpp and js, and I understand the basics of opencv. Any hints?
human pose estimation.
thatāll give you points for hands and feet. match that to previously known (manually noted down) positions of these holds. if hand/foot is close enough to some hold, āactivateā the hold.
add some perspective transforms to map between the walls and the camera view.
Have you seen any solution with āmultipleā perspective transforms?
Meaning, assume a wall of polygons with handholds and the distance from the camera is the z axis. Some handholds are further from the camera, others are closer to the camera. Can I find x y z coordinates with any degree of accuracy? How about coords for vertices of the polygons? Hands and feet?
If itās unmanageable without additional hardware (beyond a webcam) Iāll look into it.
you can overengineer this, but thatās a waste of time.
the āmost expensiveā part of this is modeling the wall and holds.
make that cheap to do, so it takes a mere minute to model a wall. make that manual, because automating it is a waste of time.
make sure the camera and projector stay in their places for the duration of a game. I think thatās not too much to ask for.
take a camera picture.
show a quad on the projector, such that you also see it on your screen. adjust its corners interactively, to coincide with the corners you picked previously.
now you have enough data.
holds need no specific detection. they donāt move. they are fixed regions.
the only thing that does need detection is the people.
in the video you linked, the registration/pose estimation is glitchy. people who ādoā this, rather than āoverengineerā this, usually get slightly glitchy results, but nobody cares, because it works well enough. so, donāt overthink it. their glitching is due to them running wall-camera pose estimation needlessly at runtime. they should do that once at the beginning, and keep that pose info.