Y
Hacker News
new
|
ask
|
show
|
jobs
by
bbminner
514 days ago
I suppose it means per speaker. And it is based on a simplified style tts 2 which from my small dive into the subject seems one of the smaller models achieving great quality.