Hacker News new | ask | show | jobs
by jasode 1854 days ago
Fyi if you weren't aware ... most podcasts don't have text of the audio because high-quality (accurate) transcription of podcasts costs money. Example rates: https://www.google.com/search?q=podcast+transcription+servic...

So this thread's podcast of 52 minutes of a complex technical topic with multiple speakers could cost ~$200. A programming-related podcast is already a niche topic with a tiny audience and an Array Languages podcast is an even tinier subset of that so the cost might not be justified.

I suppose podcasts could be uploaded to Youtube and let their speech-to-text algorithm do an auto-transcribe. However, the A.I. algorithm is not good at tech topics with industry jargon/acronyms and the resultant transcription will be inaccurate.

2 comments

I make transcripts of all my work using Descript. It uses Google's speech-to-text algo (same as the one in youtube presumably) and gives you a transcript you can then edit. It costs $15/month I believe, and you have to spend some time editing the transcript that realistically won't be read by many, but it works pretty well ime (no affiliation besides being a happy customer)
Right. A high-quality podcast is already lots of pre- and post-production work on just the audio. I use Rev which hires captioners on my behalf [0] but it's also expensive. I use it sparingly.

[0] https://www.rev.com/

Thanks for bringing Descript to my attention. Do you use any of the production aspects of it?
Yeah, it works really well, it's basically completely replaced what I used to use Audacity and Premiere for.
Presumably there was a script or at least a summary. Why not publish that as well as any slides used?
Why would there be? It's a recorded conversation between a group of people. Some of them may have some rough notes but maybe not even that.
There was no script or summary, nor any slides. It was a completely organic conversation.