by devenvdev 1203 days ago
Deaf people would disagree :) If you talk in sign language on Zoom, missing video parts would ruin the conversation.

I don't think it's about precision. In the case of audio vs. video: if you remove all the even columns from a video, the effect is similar to reducing quality. The same can be done with audio: removing half of the frequencies uniformly will just lower the quality.
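
To make the two operations concrete, here is a rough sketch in plain Python. The function names, the list-of-rows frame, and the list of frequency-bin magnitudes are all made up for illustration; a real pipeline would work on decoded frames and an actual transform of the audio signal.

```python
def drop_even_columns(frame):
    """Keep only the odd-indexed columns of a frame given as a list of pixel rows."""
    return [row[1::2] for row in frame]


def zero_alternate_bins(spectrum):
    """Zero every even-indexed frequency bin; the spectrum keeps its length."""
    return [0.0 if i % 2 == 0 else mag for i, mag in enumerate(spectrum)]


# Hypothetical 2x4 grayscale frame: each row loses half of its pixels.
frame = [[10, 11, 12, 13],
         [20, 21, 22, 23]]
half_width = drop_even_columns(frame)

# Hypothetical magnitudes for six frequency bins: every other bin is silenced.
spectrum = [0.5, 1.0, 0.8, 0.3, 0.1, 0.05]
thinned = zero_alternate_bins(spectrum)
```

Both functions throw away half of the samples along one axis (space for the frame, frequency for the audio), which is the symmetry the comment is pointing at.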

1 comment

That's a pretty specific case. You can get really good performance for a ton of tasks in video (video question answering, object identification and tracking, action recognition, etc.) by just sampling a frame per second or even less frequently. You definitely can't do that with audio.
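
For what it's worth, the frame sampling is about this simple. This is a sketch only: `sample_frame_indices` and its parameters are names invented here, and it assumes a constant frame rate.

```python
def sample_frame_indices(total_frames, fps, every_seconds=1.0):
    """Return indices of the frames to keep when sampling one frame per interval."""
    # Number of source frames between two kept frames; never less than 1.
    step = max(1, round(fps * every_seconds))
    return list(range(0, total_frames, step))


# A hypothetical 10-second clip at 30 fps: keep one frame per second.
indices = sample_frame_indices(total_frames=300, fps=30)

# Sampling even less frequently: one frame every two seconds.
sparse = sample_frame_indices(total_frames=300, fps=30, every_seconds=2.0)
```

At one frame per second the 300-frame clip shrinks to 10 frames, which is the kind of input those video tasks can still work with.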