|
|
|
|
|
by lunixbochs
2202 days ago
|
|
I have an open source web service for rapidly recording lots of text prompts to flac: https://speech.talonvoice.com (right now the live site prompts for single words because I’m trying to build single word training data, but the prompts can be any length) You can set it up yourself with a bit of Python knowledge from this branch: https://github.com/talonvoice/noise/tree/speech-dataset There are keyboard shortcuts - up/down/space to move through the list and record quickly. If you want to use it on arbitrary text prompts, you can modify this function to return each line from a text file: https://github.com/talonvoice/noise/blob/speech-dataset/serv... If you use this, before recording too much, do some test recordings and make sure they sound ok. Web audio can be unreliable in some browsers. The uploaded files are named after the short name, so make sure you can correspond the short name with the original text prompts, eg with string_to_shortname(). If you aren’t easily able to do this yourself, I’d be happy to spin up an instance of it for you with text prompts of your choosing. |
|
Also, I noted the VLC demo says it doesn't use DNS! That's awesome...