Hacker News new | ask | show | jobs
by Havoc 521 days ago
Yeah that surprised me as well - seems low vs what is used on text llms . To be fair 100 hours of speaking is a lot of speaking though
1 comments

But it covers five? Languages so if all equal it’s just 20 hours per language.
in the linked audio sample it says the training data is mostly english. also another comment claims that the japanese quality is not good, so i'd be suspicious about all the other languages.