Hacker News new | ask | show | jobs
by jameszhao00 1071 days ago
Have you tried Google Cloud Studio voices?

https://cloud.google.com/text-to-speech/docs/wavenet#studio_...

1 comments

Yes. I'm not saying Google's Top Cloud offerings are bad although i still think microsoft's stuff is better.

Just that

1. It's behind their current sota research

2. You can only use those voices extensively by paying for it. Microsoft offers their best stuff on edge for free. So for reading aloud a pdf or web page, microsoft is far better.

It's disappointing, but I wouldn't expect research algorithms to be available immediately unless they held it back until the product is ready. I guess Apple would do that?
By “SOTA” tts I think you mean LLM based TTS? With sound and language tokens trained GPT style?

Without going into too much details, imo they’re not really usable right now for TTS use cases.

Not necessarily LLM style. The above isn't for instance.

also Google Studio Voices is excellent. Definitely better than Microsoft's best, albeit very limited voices.