Hacker News new | ask | show | jobs
by mnkmnk 1535 days ago
This is not really a demo of the voice transcription quality but of the platform. It just shows how to take an open source machine learning library (deepspeech) and make it into a webapp quickly.
1 comments

Ah ok. I have done that as well. Deepspeech and Speechbrain and the other open source models for transcription are unfortunately not good. Probably because they don't have enough training data relative to the big guys. You should show CLIP - probably the best open source model I have seen as it was trained on a huge corpus.