by jononor
1775 days ago
Open-source speech recognition is doing pretty well, with projects such as VOSK, Athena, ESPnet and SpeechBrain.
These days models are the easy part of ML, and data is the hard one. So for Mozilla to focus on Common Voice over DeepSpeech seems reasonable.
|
Especially for videos with closed captions...
Is it as simple as extracting the audio and the CC text?
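
For the caption half of that question, here is a minimal sketch, assuming the captions come as an SRT file. It only turns the caption text into (start, end, text) segments that could later be paired with audio clips; `parse_srt` is a hypothetical helper name, not part of any of the projects mentioned. Extracting the audio itself would need a separate tool, and caption timings and wording are often only approximate, so the resulting pairs would likely need filtering before training.

```python
import re
from typing import List, Tuple

# SRT timestamp line, e.g. "00:00:01,000 --> 00:00:03,500"
_TIME = re.compile(
    r"(\d+):(\d{2}):(\d{2})[,.](\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2})[,.](\d{3})"
)


def _seconds(h: str, m: str, s: str, ms: str) -> float:
    """Convert the four timestamp fields to seconds."""
    return int(h) * 3600 + int(m) * 60 + int(s) + int(ms) / 1000.0


def parse_srt(srt_text: str) -> List[Tuple[float, float, str]]:
    """Hypothetical helper: parse SRT text into (start_s, end_s, text) tuples."""
    segments = []
    # Cues are separated by blank lines.
    for block in re.split(r"\n\s*\n", srt_text.strip()):
        lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
        for i, ln in enumerate(lines):
            match = _TIME.search(ln)
            if match:
                fields = match.groups()
                start = _seconds(*fields[:4])
                end = _seconds(*fields[4:])
                # Everything after the timestamp line is caption text.
                text = " ".join(lines[i + 1:])
                if text:
                    segments.append((start, end, text))
                break
    return segments


if __name__ == "__main__":
    sample = (
        "1\n00:00:01,000 --> 00:00:03,500\nHello world\n\n"
        "2\n00:00:04,000 --> 00:00:06,000\nSecond line\nwraps here\n"
    )
    for segment in parse_srt(sample):
        print(segment)
```

Each tuple gives the time window to cut from the audio track and the transcript to attach to it.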