by yorwba
2299 days ago
> Align text to voice (the hardest part) using some private api

That's also the part that would be most interesting to have explained. Is it language-agnostic? After all, the title says "in any language", but I can't think of any text-audio alignment algorithms that don't require a language-specific model. (Unless you just count characters and assume they map linearly to time, which I'd expect to go very badly.)
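
For concreteness, the naive character-counting baseline from the parenthetical might look roughly like the sketch below. The function name `linear_align` and the word-level granularity are assumptions made for illustration; nothing here reflects the article's private API.

```python
def linear_align(words, duration):
    """Naive baseline: give each word a time span proportional to its
    character count, ignoring pauses, speaking rate, and pronunciation."""
    total_chars = sum(len(w) for w in words)
    if total_chars == 0:
        return []
    timings = []
    elapsed_chars = 0
    for w in words:
        # Start where the previous word ended, scaled to the audio length.
        start = duration * elapsed_chars / total_chars
        elapsed_chars += len(w)
        end = duration * elapsed_chars / total_chars
        timings.append((w, start, end))
    return timings
```

Because it knows nothing about silences or how long each character actually takes to say, any pause or change of pace in the recording shifts every later timestamp, which is why I'd expect it to drift badly.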