by felixarba 838 days ago
I second this! I was recently looking into a way to build something like this for my grandfather, but wasn't even sure where to start from the hardware side.

I wanted hardware that plugs into the TV receiver, generates subtitles for the live TV program, and then plays it back on the TV. The delay would likely be under a minute, but even a few minutes wouldn't really be a problem.

Many people with hearing problems would benefit from this, and with AI getting so good at speech-to-text, it could be done for quite a large population.

If anyone has a recommendation on where to start with this, I'd appreciate it! I was thinking of using Whisper for subtitle generation, but I'm not sure what hardware can take in and output HDMI and also run this software.
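
For the software half, here is a rough sketch of how transcribed segments could be turned into delayed subtitle cues. It assumes the openai-whisper Python package, whose `transcribe()` result contains a `"segments"` list with `"start"`, `"end"` and `"text"` fields. `Segment`, `srt_timestamp`, `to_srt` and `transcribe_file` are names I made up for illustration, and HDMI capture and overlay are not covered at all.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed chunk: start/end in seconds plus the recognized text."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = max(0, round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments, delay: float = 0.0) -> str:
    """Render segments as SRT text, shifting every cue by `delay` seconds.

    `delay` stands in for the playback buffer: if the picture is replayed
    that many seconds late, the captions have that long to be generated.
    """
    cues = []
    for index, seg in enumerate(segments, start=1):
        start = srt_timestamp(seg.start + delay)
        end = srt_timestamp(seg.end + delay)
        cues.append(f"{index}\n{start} --> {end}\n{seg.text.strip()}\n")
    return "\n".join(cues)


def transcribe_file(path: str, model_name: str = "base"):
    """Hypothetical helper; assumes the openai-whisper package is installed."""
    import whisper  # imported lazily so the functions above run without it

    result = whisper.load_model(model_name).transcribe(path)
    return [Segment(s["start"], s["end"], s["text"]) for s in result["segments"]]
```

Something like `to_srt(transcribe_file("chunk.wav"), delay=30)` would then produce cues for a recorded chunk replayed 30 seconds late, where `chunk.wav` is just a placeholder file name.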

1 comment

I keep thinking about something similar. Also hardware. Also for my grandparents.

My grandma is 95. Her vision is bad. Even using the phone (I'm talking old school landline) is getting hit and miss, because she can't see the buttons.

Years ago, I set her up with an Echo Show. That works well enough for her to say "Call Leo". But Alexa is dumb. Sometimes, she'll mishear something and start playing music. Or start a monthly subscription... :)

So what I'd like:

- box
- screen
- far-field mic array
- AI backend

You could do a number of things with it:

- manage a grocery shopping list (AI will notice duplicates and other oddities and ask)
- communicate with the outside world (initiate calls, send emails and faxes, including to local businesses)
- optional human oversight and/or permission settings (preventing the AI, say, from ordering groceries for more than $50 a week without a family member approving the order)
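
To make the duplicate check and the spending cap concrete, here is a minimal sketch. Everything in it is hypothetical: the `GroceryList` class, its method names, and the crude name normalization are my own assumptions, and the only number taken from above is the $50 weekly limit. A real version would hand the "is this a duplicate?" question to the AI backend instead of string matching.

```python
from dataclasses import dataclass, field


@dataclass
class GroceryList:
    """Shopping list that flags duplicates and gates orders on a weekly budget."""
    weekly_limit: float = 50.0      # dollars; the example figure from the list above
    spent_this_week: float = 0.0    # already ordered this week
    items: dict = field(default_factory=dict)  # normalized name -> price

    @staticmethod
    def _normalize(name: str) -> str:
        # Crude duplicate key: ignore case and spacing, strip a trailing "s".
        key = " ".join(name.lower().split())
        return key[:-1] if key.endswith("s") and len(key) > 3 else key

    def add(self, name: str, price: float) -> bool:
        """Add an item; return False on a duplicate so the assistant can ask."""
        key = self._normalize(name)
        if key in self.items:
            return False
        self.items[key] = price
        return True

    def order_total(self) -> float:
        return sum(self.items.values())

    def needs_approval(self) -> bool:
        """True when this order would push the week's spending over the limit."""
        return self.spent_this_week + self.order_total() > self.weekly_limit
```

The idea is that the box only places an order on its own while `needs_approval()` is false; otherwise it pings a family member first.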

Something like your "subtitle mode" could also work:

"Listen to what is currently being spoken in the room (including the TV), and display it on the screen".

My grandma has her TV running all day. So maybe one could ditch the screen and make it a "set top box". Add an IR port to it, so it can also control the TV itself. Something like that might work.
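
For the "display what is being spoken" mode, the display side could be as simple as a rolling window of the last few lines of recognized text. A small sketch, with the assumption that short lines and only a few of them on screen suit large print; `CaptionWindow` and its defaults are made up for illustration, and the speech recognition feeding it is left out.

```python
import textwrap
from collections import deque


class CaptionWindow:
    """Keeps only the most recent lines of text, wrapped to the screen width."""

    def __init__(self, width: int = 32, max_lines: int = 3):
        self.width = width
        self.lines = deque(maxlen=max_lines)  # oldest lines fall off automatically

    def push(self, text: str) -> None:
        """Wrap newly recognized text and append it to the window."""
        for line in textwrap.wrap(text, width=self.width):
            self.lines.append(line)

    def render(self) -> str:
        """Return the text currently visible, one wrapped line per row."""
        return "\n".join(self.lines)
```

Each time the recognizer emits a phrase, call `push()` and redraw the screen from `render()`.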