Hacker News new | ask | show | jobs
by jabradoodle 1041 days ago
Wouldn't cost that much if the transcribing is done on device
2 comments

This would be immediately obvious in a cursory analysis of performance. On-device transcription is not only computationally infeasible, it would also require model capabilities far beyond what is currently SOTA.

Google had (and has afaik) significant challenges implementing multiple wake-word detection for precisely this reason.

Transcribing a couple of words accurately on-device without a major performance penalty (so that it can be running in the background always) is just _barely_ coming out now.

I would have to take your word for it but my phone is able to transcribe speech with no problem and no internet connection.

Of course running it 24/7 in the background would ruin my battery, you would have to be smarter than that.

Which phone/app? I would be very surprised if a manufacturer has an entirely on-device real-time ASR model, maybe I'm behind.
rewind.ai has entered chat.
There's this weird narrative I see that "computers just aren't powerful enough" to do things I remember them already doing on Pentium 1 class machines in the 90s.