Hacker News new | ask | show | jobs
by Izkata 2136 days ago
> ~13 years ago

Beyond that, closer to 20 years ago, I remember on Win98 experimenting with an offline speech-to-text program I'd downloaded from somewhere. It required training, but I remember it being pretty accurate - I just didn't find a use for it because we had one shared desktop and I'd be annoying everyone else in the room. I think it was called Vox, or something like that...

2 comments

And 25 years ago there was that IBM card that allowed realtime voice recognition on a 486, no connection required. At the presentation I saw at IBM, the operator loaded a word processor, wrote a letter, saved it as an image, sent it as a fax, received it on a second machine and printed it without moving a finger. I also seem to remember one machine ran OS/2 Warp and the other one Windows. It wasn't that fast for sure, and she had to correct some errors, but the point is that if done on dedicated hardware (FPGAs?) the performance can be a lot higher than on software. A lot of powerful hardware can be fitted into those assistants, and unless they fully open source them, there's no way to know what they do and what they could do if instructed to.
I’m always slightly surprised by anecdotes like this from so long ago. When I tried using MacOS Classic speech recognition 20 years ago, it interpreted every command as “Tell me a joke” including the line the user was supposed to say to make the joke script continue.