Ask HN: Need for a human-powered text-to-speech API?
3 points
by leahcim
2749 days ago
We are working with a network of US-based people who are good on the phone. Is anyone interested in an API that would accept text as input and return an MP3 of someone reading the text within a couple of hours? We have a couple of US-based people who could do the job really well in a couple of minutes.

Command:

  POST /tts { text: "Hello John. Thanks for joining us today.", voice: "female", webhook: "../webhook/response" }

Webhook response (a few minutes later):

  POST /webhook/response { file: "voice.mp3", cost: 0.07 }

Cost would be something like $1 per 100 words.
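A minimal sketch of how a client might build the request body and estimate the charge under the pricing above. The function names, the JSON field layout, and the whitespace-based word count are assumptions for illustration only; the post does not say how words are counted or how requests are authenticated.

```python
import json

# From the post: "something like $1 per 100 words"
PRICE_PER_100_WORDS_USD = 1.00


def estimate_cost(text: str) -> float:
    """Estimate the charge in USD, counting words by whitespace (an assumption)."""
    words = len(text.split())
    return round(words * PRICE_PER_100_WORDS_USD / 100, 2)


def build_tts_request(text: str, voice: str, webhook: str) -> str:
    """Build a JSON body for the proposed POST /tts call (hypothetical schema)."""
    return json.dumps({"text": text, "voice": voice, "webhook": webhook})


text = "Hello John. Thanks for joining us today."
body = build_tts_request(text, "female", "../webhook/response")
cost = estimate_cost(text)
```

For the example sentence (7 words) the estimate is 0.07, which matches the cost field in the sample webhook response.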
A quick Google search suggests that voice acting rates (pay to the voice actor alone) tend to be in the range of $1/second for short, small-market bits (short bits with larger markets tend to have higher use fees on top). So it sounds like this service relies on getting people willing to work on demand for about 1/100 of market rates, with a much faster turnaround than is typical, to have any room for profit.
Sure, if you've got quality voice talent, there's a huge demand for that. OTOH, if you don't have quality voice talent, why would people pay for this instead of today's commercially available machine TTS, which is much lower latency and much cheaper (e.g., Google's premium WaveNet voices at $16/million characters, or something on the order of $1/8000 words)?
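The price comparisons in this comment can be checked with some rough arithmetic. The sketch below uses the figures quoted in the thread; the speaking rate and average word length are assumptions, not from the thread, and the results shift noticeably if they change.

```python
# Figures quoted in the thread
HUMAN_API_USD_PER_WORD = 1.00 / 100        # proposed: $1 per 100 words
VOICE_ACTOR_USD_PER_SECOND = 1.00          # commenter's rough market rate
WAVENET_USD_PER_CHAR = 16.00 / 1_000_000   # commenter's quoted WaveNet price

# Assumptions, not from the thread
WORDS_PER_MINUTE = 150   # assumed speaking rate
CHARS_PER_WORD = 6       # assumed average word length, including one space


def actor_usd_per_word(words_per_minute: float) -> float:
    """Convert a per-second voice actor rate into a per-word rate."""
    return VOICE_ACTOR_USD_PER_SECOND * 60 / words_per_minute


def wavenet_words_per_dollar(chars_per_word: float) -> float:
    """How many words one dollar buys at the quoted per-character price."""
    return 1 / (WAVENET_USD_PER_CHAR * chars_per_word)


# Proposed price as a fraction of the quoted actor rate
price_ratio = HUMAN_API_USD_PER_WORD / actor_usd_per_word(WORDS_PER_MINUTE)
machine_words = wavenet_words_per_dollar(CHARS_PER_WORD)
```

With these assumed values, the proposed price is about 1/40 of the quoted actor rate, and machine TTS works out to roughly 10,000 words per dollar. Both are the same order of magnitude as the estimates in the comment.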