by bigmealbigmeal 1465 days ago
I'll add to this that having subtitles next to the person one is speaking to is completely transformative for hearing-impaired people. The only way you could replicate this with a 2D screen is by having them either (a) break eye contact to look down at a phone, which prevents them from being engaged with the person, or (b) hold up a phone camera to someone's face, which is obviously significantly more cumbersome and socially awkward than wearing some glasses (and please try to imagine the future of AR headsets that are becoming increasingly compact, like sunglasses, not a bulky existing HoloLens headset).

So, take that idea. It's not a novelty experience. It's not fluff. It significantly improves the lives of hearing-impaired people.

Did you even come up with this idea? If so, why were you not able to create it? Have you considered that perhaps it was due to the fact that something like this is extremely difficult to develop and can't be done by a regular team over a period of 'months'? Have you considered that AR/VR isn't just going to be made transformative within a <1 year time period of you getting your hands on it?

On the other hand, if you didn't even come up with such a practically beneficial idea as this (or were unable to see how life-changingly useful it'd be for the hearing-impaired), then the issue with all of your ideas being "fluff" was not due to the technology at hand.

This even sparks my imagination further. Right now, if someone yells at a hearing-impaired person from behind, they have no immediate way of knowing (any phone-based solution is not going to give quick information about the direction of the yell when it's in-pocket). On the other hand, an AR headset will be able to immediately inform that person that a loud voice has come from a specific direction, because it can literally show an arrow in their field of view pointing that way. That is so goddamn exciting and useful. And I simply can't comprehend how you cannot see it.

1 comment

Believe it or not, overlaid closed captions was one of the first things I came up with. It's also not that hard to do with commodity voice and face recognition. We did a POC just on a 2D phone screen in like a week. Trying to capture multiple people speaking at once is way beyond the capability of any retail headset and would require an elaborate 3D microphone array and noise filtering to pinpoint where a voice is coming from. Ours worked pretty well sitting across a table, but would struggle mightily trying to hear something across any distance in a noisy room.
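
To make the "pinpoint where a voice is coming from" part concrete, here is a rough sketch of the simplest version of the problem: a single far-field source and just two microphones, using the time difference of arrival between them. This is not the POC described above. The function names, the 15 cm spacing, and the 48 kHz sample rate are all assumptions for illustration, and it ignores noise, reverb, and overlapping speakers, which is exactly where it gets hard.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate value for air at room temperature


def best_lag(left, right, max_lag):
    """Return how many samples `right` lags behind `left` (hypothetical helper).

    Uses a brute-force cross-correlation over lags in [-max_lag, max_lag].
    Lags are tried smallest-magnitude first, so ties resolve towards 0.
    """
    best, best_score = 0, float("-inf")
    for lag in sorted(range(-max_lag, max_lag + 1), key=abs):
        score = 0.0
        for i, sample in enumerate(left):
            j = i + lag
            if 0 <= j < len(right):
                score += sample * right[j]
        if score > best_score:
            best, best_score = lag, score
    return best


def bearing_degrees(left, right, sample_rate_hz, mic_spacing_m):
    """Estimate the bearing of one far-field source from a two-microphone pair.

    0 is straight ahead, positive is towards the right microphone,
    negative is towards the left microphone. Assumes a single source.
    """
    # No physical delay can exceed the time sound takes to cross the spacing.
    max_lag = math.ceil(mic_spacing_m * sample_rate_hz / SPEED_OF_SOUND_M_S)
    lag = best_lag(left, right, max_lag)
    # If the right channel lags, the sound hit the left microphone first,
    # so the source is on the left (negative angle).
    delay_s = -lag / sample_rate_hz
    sine = SPEED_OF_SOUND_M_S * delay_s / mic_spacing_m
    sine = max(-1.0, min(1.0, sine))  # clamp against rounding overshoot
    return math.degrees(math.asin(sine))


def delayed_burst(start, length=400):
    """Synthetic test signal: a short burst beginning at sample `start`."""
    signal = [0.0] * length
    for offset, value in enumerate([0.2, 1.0, -0.6, 0.3]):
        signal[start + offset] = value
    return signal


# The burst reaches the right microphone 10 samples after the left one,
# so the estimated bearing should come out on the left (negative).
left_channel = delayed_burst(100)
right_channel = delayed_burst(110)
angle = bearing_degrees(left_channel, right_channel, 48_000, 0.15)
print(f"estimated bearing: {angle:.1f} degrees")
```

Note that a single pair like this cannot tell front from back, and one correlation peak only gives you one source. That is a big part of why separating several simultaneous talkers pushes you towards a larger array plus filtering.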