Break image down to bottles only

Yes, Yolo should be a good solution to detect and identify the bottles. You should only need a good training database.

As this is more a machine learning problem, you could start with choosing a DNN framework (like TensorFlow or PyTorch) and follow the tutorials for training a Yolo network and apply it to your images. Using OpenCV is optional for this project.