Hacker News new | ask | show | jobs
by Kelteseth 1486 days ago
I really like the idea, but watching some demos at the bottom of the page, I did not like the generated voices. They are unpleasant to hear, and I constantly compare them to crappy YouTube tutorial that have similar TTS voices.
3 comments

I got it, I am sure it cannot be compared with human professional artist. However AI is improving and definitely it keeps getting better. And with WowTo you have many advantages: 1. If you do not like the tone of a speaker, there are several voices to choose from and also dialects 2. And you can instantly connect with a global audience with its multi-lingual capability. 3. Reduce your voice-over costs significantly for using with videos that does not focus on voice modulation but rather conveying the information.
I think he was comparing it with just a random native English speaker, rather than a professional voiceover artist.

I guess it might be a decent option if you have a very difficult to understand accent though.

I completely agree with this. My immediate advice because this tool is definitely more likely to be used by more people who aren't programmers, focus on quality. These types of users will have a higher bar for quality in this specific aspect. But also right now the quality of the TTS voices is too low in general imo.

Less technical users get more frustrated when their tooling can't produce things that meet their expectations. And moreso in my experience because they don't have the skillset in order to modify their tooling so it can meet their expectations.

> But also right now the quality of the TTS voices is too low in general imo.

Most youtube shorts (and I'd wager tiktoks, but I have no idea) have this incredibly irritating generic lady voice in the background. It's definitely artificial, every video has the exact same diction and tone. Clearly this is good enough for most people, so it can be done.

> Clearly this is good enough for most people

I really hope the quality improves before I'm forced to engage with such content. It's an absolute deal breaker for me. I can't reach for the back button quickly enough. I'm sure I'm far from alone in this.

You aren't. I hate the voices too. They cause me to disengage with the content instantly.
Came here to day the same thing. Those voices are an absolute blight and an immediate turn off.