by trowngon
1716 days ago
I believe most people have already moved to offline engines. There is no need to send your data to some random guys like this Assembly. Nemo Conformer from Nvidia, Robust Wav2Vec from Facebook, Vosk: there are a dozen options. And the cost is $0.01 per hour, not $0.89 per hour like here. Another advantage is that you can do more custom things: add words to the vocabulary, detect speakers with biometric features, detect emotions.