by ricardobeat
313 days ago
Speech speed is always a tunable parameter, not something intrinsic to the model. The comparison to make is expressiveness and correct intonation on long sentences versus something like espeak. It actually sounds amazing for its size. The closest thing is probably KokoroTTS at 82M params and ~300MB.