by super256 1185 days ago
> Youtube text2speech routinely mistranscribes

Isn’t text2speech the opposite of transcribing?

Speech2text would be transcribing, and text2speech would be speech synthesis.

Anyway, assuming you meant speech2text: I've found YouTube’s transcription quite good. It even understands stuff that is inaudible to me (especially in movie snippets). Of course it’s not perfect, but neither am I.

2 comments

Thanks, I meant speech2text, specifically the YouTube auto-caption feature. Perhaps you have enjoyed it. I regularly use it due to bad hearing, and it hilariously often mistranscribes things that a human never would, simply because it does not understand context. It is a dumb system.
TTS is where I started when I was working on this sort of thing, so I commonly just say "text to speech" to mean either direction.