Hacker News new | ask | show | jobs
by ace_of_spades 3130 days ago
Just had an idea... what about call center providers? They already collect speech data for training purposes and transcription could most likely help there!
1 comments

They do, but privacy is a major concern.

The bigger problem is, of course, that you need speech data with (fairly) accurate transcripts for training ASR systems. These typically don't exist for call center calls.

Transcriptions are not really the issue here, the cost of freelance transcribers is relatively low. It is privacy that makes it so hard, most of the call center calls need to have some kind of user authentication, which means they would need to be anonymized prior to being transcribed and used as a training material.