There's an incredibly huge difference between text and video and how the information is processed. Your statement literally rejects the validity and significance of the director of photography's job
If I'm not mistaken, they meant that text == text, just as video == video - not that text and video are directly comparable.
There's no difference between a printed and bound book full of (for the sake of argument) Tumblr posts and the same posts online - or alternatively, a betamax video, and the same video captured and encoded in an MKV container.
The parent comment meant that there is not a big difference to them on which medium (paper book, iPad, desktop monitor) they read a text, "text" is its own category instead of paper vs electronic. Similar "video" is its own category. Watching a Football game on a gigantic 60 inch screen vs on your small phone the big screen may be more comfortable, but it doesn't fundamentally change how you process the information (compared to reading what happened on Wikipedia or hearing radio play-by-play commentary.)
The other replies have already clarified it better than I did, but from the horses mouth: Yes, of course video is different from text. I meant as the other comments suggest that text is no different from text, and video is no different from video. I called both of them their own (separate) medias.
There's no difference between a printed and bound book full of (for the sake of argument) Tumblr posts and the same posts online - or alternatively, a betamax video, and the same video captured and encoded in an MKV container.