Hacker News new | ask | show | jobs
by modeless 920 days ago
I implemented this! All local models. And I packaged it up so people can install it with one click: https://apps.microsoft.com/detail/9NC624PBFGB7

The speech recognition part needs work for sure, but when it works you can see the potential. It's very different from the way it feels to talk to Siri or even ChatGPT's voice mode. It won't be long before we are having real conversations with our computers.

2 comments

Could you record a demo of this?
I really should! I'm not the type to publish videos of myself usually, but it really does need a video demo.
But how realtime is it?
The end-to-end response latency is around 1 second typically. It listens continuously, there are no buttons to press, and you can interrupt it while it's talking.