|
|
|
|
|
by lovelearning
3773 days ago
|
|
Pocketsphinx/Sphinx with a small, use-case specific dictionary showed much better accuracy for my accent and speech defects, than any of these cloud based recognition systems. I used a standard acoustic model, but it probably would have been even more accurate had I trained a custom acoustic model. For simple use cases like home automation or desktop automation, I think it's a more practical approach than depending on a cloud API. |
|
[1] https://github.com/kastnerkyle/ez-phones
[2] https://www.reddit.com/r/MachineLearning/comments/3pr4v4/are...