Hacker News new | ask | show | jobs
by tiew9Vii 2245 days ago
I had a similar idea wanting to play with the AWS / Google speech to text services.

I wanted to pipe in audio of various Youtube tech conference videos then apply some basic taxonomy/tagging and provide full text search so you can find a conference talk which contains some specific technology/subject you want to view.

I ran in to difficulty in technology / software conferences uses very specific acronyms and words that are not very general. Also being international there's many accents and levels of English. This means the AWS/Google API's struggled to translate videos which was also made difficult by using compressed audio streams you get from Youtube vs wavs.

1 comments

Google offers the functionality to add your own acronyms and products on the commercial speech to text. I think there is even a manual quality feedback loop in alpha.