|
|
|
|
|
by tiew9Vii
2245 days ago
|
|
I had a similar idea wanting to play with the AWS / Google speech to text services. I wanted to pipe in audio of various Youtube tech conference videos then apply some basic taxonomy/tagging and provide full text search so you can find a conference talk which contains some specific technology/subject you want to view. I ran in to difficulty in technology / software conferences uses very specific acronyms and words that are not very general. Also being international there's many accents and levels of English. This means the AWS/Google API's struggled to translate videos which was also made difficult by using compressed audio streams you get from Youtube vs wavs. |
|