My colleague had a quick lock at the ngraph source code, and checking the OpenVINO docs suggests, that loading a model automatically checks to load the model onto the chip with the lowest number of already loaded models. That would mean that indeed, the models are properly spread across the chips.
A quick FPS counter for the inference confirmed this.