why does everyone suddenly seem to want to do this?
here’s one of many existing discussions of this:
I can tell you right now that your approach is doomed. feature matching on an ellipse does not work. it has no features.
real Video Assistant Referee systems are a LOT more complicated. they also work with more reliable information than just the video feed.