They deal with pretty noisy mediums though. How much of the input is actually transformed into what we perceive as relevant knowledge and stored long-term?
I agree that they are high bandwidth inputs but not all of it sticks. There is a saying that goes: "in one ear and out the other". Recently, I learned about "reticular activating system". As far as I can understand, RAS determines what is important to you and provides your focus there. This is pure speculation but it might be that, eyes/ears are indeed very high-bandwidth inputs that if all their data was persisted, it would overwhelm our brain.
Can you only imagine what advances we'd make in "information upload" if we invested as much in Teaching Pedagogy as we invest in Neuroscience?