|
|
|
|
|
by artimaeis
2350 days ago
|
|
> ML multi-speaker speech-to-text every conversation Neat idea, do you know of any software that's capable of taking an audio file and producing multi-user text from it? Seems like it would be useful in a wide variety of situations. |
|
Some of that only comes across when you actually use it - when you clean up the transcription immediately after the meeting or the next day. Clicking a mistake word to edit it snaps the video and audio to that point, so its super intuitive to "scrub" through the video just by clicking around the text transcription. Very fast, very natural, very low effort.
I can only imagine how much it will be improved if it used google's newest multi-speaker transcription models. It always had some trouble whenever people started talking at the same time.
[0] https://trint.com