Yes, the interocular distance between the eyes helps the brain to judge scale (750 mm being the most common). But the arctan value (distance behind the lens of your eye and it's retina) is also important for judgement of scale in the brain. If you lose an eye you are still going to be able to judge scale by that distance. A camera doesn't have periferal vision. We do. That's where the comparison between eye and camera has to end. We can only extend what a normal lens sees by adding a bigger sensor/bigger negative.
It's really a very simple principle. An eye that is only 8 mm deep belongs to a small person. An eye that is 300 mm deep belongs to a very large person. If you put two 300 mm lenses at a normal interocular distance of 750 mm you will still have the feeling that the world in front of you is a miniature.
The German way of making films was to view the camera as a person (the audience) or a character (pushing dramatic events either through straight POV or the "perspective"). That's why normal lenses was extremely important. You had to have a dramatic reason for turning the audience eyes small or large. You'll find this line of visual thinking in art dating back to medieval times and older. Look at how a classical painter measures reality in front of him with different tools.