Hacker News new | ask | show | jobs
by shlorn 4167 days ago
I don't know for sure but the transcripts are so good and for a show like TAL I imagine that they could probably have someone do them by hand.
1 comments

I don't know either, but it looks like the transcripts are initially generated from speech recognition (text dump + timing meta data?) and then hand-edited/annotated by a producer.

They'd add punctuation, sound cues, fix spelling, annotate the speakers (e.g. name + host, subject, or interviewer). Then that data's got to go somewhere...

It looks pretty labor intensive. I sure hope they have great tools!