| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by rhyme-boss 822 days ago
	I use Apple dictation heavily for transcribing interviews. I've tried all the voice-to-text services out there and none have been reliable enough *at transcribing an audio file. I've settled on playing audio in my headphones and pausing while I carefully dictate text into a document. If I could upload the audio file, get a first-pass transcription, and then go through and edit / make corrections with voice, that would be awesome. A difference in error rate from 20-something percent down to less than 5 percent sounds incredible.

4 comments

mathisd 822 days ago

Have you tried using Whisper from OpenAI ? Aiko [0] have Whisper-v2-large built-in and allow for transcription of audio file

[0] https://apps.apple.com/fr/app/aiko/id1672085276

link

LeoPanthera 822 days ago

Is there anything like this for watching foreign television (or radio)? I don't want to create a document, I just want real-time translated subtitles, but I can't do it in advance for live shows.

link

jonplackett 822 days ago

This is amazing. Just tried really mumbling a long for a while and it got every word.

link

codeptualize 822 days ago

Have you tried openai whisper? Last time I compared it was quite a bit better than all the other options.

link

hantusk 822 days ago

Check out Descript. It's been awesome when I used it in the past

link

c0brac0bra 821 days ago

Deepgram has been incredibly accurate for me.

link