by drewbuschhorn 263 days ago
You should throw in some diarization; there are some pretty effective Python libraries that don't need pretraining for the voice separation.
2 comments

I would suggest 2 speaker-diarization libraries:

- https://huggingface.co/pyannote/speaker-diarization-3.1
- https://github.com/narcotic-sh/senko

I personally love senko since it runs in seconds, whereas pyannote took hours, but there is a 10% WER (word error rate) that is tough to get around.
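For anyone wondering what a 10% WER means in practice: it is roughly one word-level edit (substitution, insertion, or deletion) per ten reference words. Here is a minimal sketch of the standard edit-distance calculation. The function name `word_error_rate` is made up for illustration and is not part of senko or pyannote, and it assumes plain whitespace tokenization with no text normalization.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    ref = "the quick brown fox jumps over the lazy dog today"
    hyp = "the quick brown fox jumps over the lazy dog tomorrow"
    print(f"WER: {word_error_rate(ref, hyp):.0%}")
```

Real evaluations usually lowercase and strip punctuation before scoring, so numbers from this sketch will not match a published benchmark exactly.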

Nice suggestion, I'll look them up.