Y
Hacker News
new
|
ask
|
show
|
jobs
by
nshm
2312 days ago
Why would you work with google when there are much more accurate open source speech recognizers based on Kaldi? With that specific usecase it is very easy to beat Google on accuracy.
2 comments
lunixbochs
2312 days ago
I think google as the primary engine results in stuff like this:
https://github.com/osprey-voice/osprey-starter-pack/blob/mas...
On the other hand it’s probably better at general (non command) English.
link
cjbassi
2312 days ago
Yeah it uses a lot of machine learning and context based inference, which is great for dictating phrases but less so for commands.
link
cjbassi
2312 days ago
Actually I don't think I ended up testing kaldi because it seemed difficult to set up but I'll give it a try now that you mention it.
link
nshm
2312 days ago
Ok, if you want to start with Kaldi it is probably easier to check kaldi-active-grammar mention above or
https://github.com/alphacep/vosk-api
link
cjbassi
2312 days ago
Awesome, thanks, I'll check this out and the other one too.
link
On the other hand it’s probably better at general (non command) English.