What do I need to detect and track multiple objects in HD Video?

Good morning all,

I need to track about 20 objects in real time (30 frames per second) from a video stream. The video resolution needs to be at least 1920 x 1080. Each tracked object will have a unique black letter/number on a white background. Each letter/number will be about 45 x 45 pixels, which is easily readable by the human eye.
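
For a rough sense of the workload, here is how those numbers work out. This is only a back-of-the-envelope sketch in plain Python. The variable names are just illustrative, and it says nothing about how fast any particular detector or tracker would actually run.

```python
# Rough per-frame budget from the requirements stated above.
# Plain arithmetic only; no camera or vision library is involved.
WIDTH, HEIGHT = 1920, 1080   # minimum video resolution
FPS = 30                     # real-time target
NUM_OBJECTS = 20             # approximate number of tracked objects
MARKER_SIDE = 45             # letter/number size in pixels (square)

frame_budget_ms = 1000 / FPS                      # time available per frame
pixels_per_frame = WIDTH * HEIGHT                 # pixels to scan each frame
pixels_per_second = pixels_per_frame * FPS        # sustained pixel throughput
marker_pixels = NUM_OBJECTS * MARKER_SIDE ** 2    # pixels covered by all markers
marker_fraction = marker_pixels / pixels_per_frame

print(f"Time budget per frame: {frame_budget_ms:.1f} ms")
print(f"Pixels per frame:      {pixels_per_frame:,}")
print(f"Pixels per second:     {pixels_per_second:,}")
print(f"Marker coverage:       {marker_fraction:.2%} of each frame")
```

So everything (capture, detection, reading the character, and updating the tracks) has to fit in about 33 ms per frame, and the markers themselves cover only about 2% of each frame.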

I would like to stick with Python if at all possible.

I suspect this is beyond what a Raspberry Pi, or even a mini-PC, can handle. Perhaps an Nvidia Orin Nano?

Thoughts, opinions, suggestions, anything to help would be greatly appreciated.