HN Mirror



	by devenvdev 1203 days ago
	Deaf people would disagree :) If you talk in sign language on Zoom, missing parts of the video would ruin the conversation. I don't think it's about precision. Compare audio and video directly: removing all the even columns from a video is similar to reducing its quality, and the same can be done with audio, where uniformly removing half of the frequencies will just lower the quality.

1 comment

kelipso 1203 days ago

That's a pretty specific case. You can get really good performance on a ton of video tasks (video question answering, object identification and tracking, action recognition, etc.) by sampling just one frame per second, or even less frequently. You definitely can't do that with audio.
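
A rough sketch of the one-item-per-second sampling described above, applied to both a video and an audio stream. The helper name `sample_indices`, the 30 fps frame rate, and the 16 kHz audio rate are illustrative assumptions, not anything from the thread.

```python
def sample_indices(total_items, rate_hz, keep_per_second=1):
    """Return the indices kept when taking `keep_per_second` items per
    second from a stream recorded at `rate_hz` items per second.
    (Hypothetical helper, for illustration only.)"""
    step = rate_hz / keep_per_second
    kept = []
    i = 0
    while round(i * step) < total_items:
        kept.append(round(i * step))
        i += 1
    return kept


# Assumed example streams: 10 seconds of 30 fps video and 16 kHz audio.
video_kept = sample_indices(total_items=300, rate_hz=30)
audio_kept = sample_indices(total_items=160_000, rate_hz=16_000)

# Both streams keep the same number of items, but each kept video item is
# a complete image, while each kept audio item is one amplitude value.
print(len(video_kept), "of 300 frames kept")
print(len(audio_kept), "of 160000 audio samples kept")
```

Each kept video frame is still a full picture that a model can inspect on its own. Audio kept at one sample per second has an effective sample rate of 1 Hz, which by the Nyquist limit can only represent content below 0.5 Hz, far under the range of speech.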
