Hacker News new | ask | show | jobs
by jononor 1774 days ago
Have used VOSK a bit recently. The out-of-the-box experience was great compared to earlier projects (looking at you Kaldi and Sphinx...). Word-level audio segmentation was one usecase, https://stackoverflow.com/a/65370463/1967571
2 comments

Vosk is built on Kaldi.
Kdenlive supports automatic subtitles created with VOSK now btw. This makes it a lot more accessible for non-tech folks.