Hacker News
by smusamashah 313 days ago
The Reddit video is awesome. I don't understand how people are calling it an OK model. Under 25MB and CPU-only for this quality is amazing.

Just made a TTS tool based on Kitten TTS, fully browser-based, no Python server backend: https://quickeditvideo.com/tts/ A TTS model of this size should be industry standard!
The people calling it "OK" probably tried it for themselves. Whatever model is being demoed in that video is not the same as the 25MB model they released.
Nope, looks like the default voice is the worst and it's not in the demo. A Reddit user generated these as well https://limewire.com/d/28CRw#UPuRLynIi7
Never thought I'd see the name LimeWire again, wow
Haha interesting pivot!
It did say this was a preview release, so I'll reserve judgement until that's out the door.
Local quality is very bad
https://vocaroo.com/1njz1UwwVHCF

It doesn't sound so good. Excellent technical achievement and it may just improve more and more! But for now I can't use it for consumer-facing applications.

We are still training the model. We expect the quality to go up in the next release. This is just a preview release :)