Y
Hacker News
new
|
ask
|
show
|
jobs
by
oidar
263 days ago
What's the best solution right now for TTS that supports speaker diarisation?
2 comments
makaimc
263 days ago
AssemblyAI (YC S17) is currently the one that stands out in the WER and accuracy benchmarks (
https://www.assemblyai.com/benchmarks
). Though its models are accessed through a web API rather than locally hosted, and speaker diarization is enabled through a parameter in the API call (
https://www.assemblyai.com/docs/speech-to-text/pre-recorded-...
).
link
xnx
263 days ago
I like this version of Whisper which has diarization built in:
https://github.com/Purfview/whisper-standalone-win
link