Hacker News new | ask | show | jobs
by bravura 623 days ago
How is transcription done?
1 comments

whisper + pyannote + custom LLM pipeline to reduce errors, extract and attach speaker labels
How do you interface with the Python ML stack from Rust?
Whisper and Pyannote via Replicate to spin up a container for transcription and diarization. It returns word-level timestamps and speaker turns.

The LLM pipeline is built into the Rust code and makes calls to the OpenAI API.

Are you running whisper on that same $7 Server?