by nmca
2235 days ago
You're critically misunderstanding this: it's not "singing along", it's generating both the music and the voice. Conditioning on lyrics is optional, and it's done "unaligned", e.g. by arbitrarily encoding the lyrics and passing them as additional input.
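To make "unaligned" concrete, here's a rough toy sketch, not the actual model's code. The names `encode_lyrics` and `build_inputs`, the character-hashing encoder, and the vector size are all made up for illustration. The point is only that the lyrics get squashed into one fixed vector with no timing information and that same vector is attached to every audio step, or zeros if no lyrics are given.

```python
from typing import List, Optional


def encode_lyrics(lyrics: str, dim: int = 8) -> List[float]:
    """Hypothetical encoder: hash characters into a fixed-size, normalized vector.

    No per-word timing is kept, which is what makes the conditioning "unaligned".
    """
    vec = [0.0] * dim
    for ch in lyrics.lower():
        vec[ord(ch) % dim] += 1.0
    total = sum(vec) or 1.0  # avoid dividing by zero for empty lyrics
    return [v / total for v in vec]


def build_inputs(
    audio_context: List[List[float]],
    lyrics: Optional[str] = None,
    dim: int = 8,
) -> List[List[float]]:
    """Append the same lyrics vector to every audio step (zeros if lyrics are omitted)."""
    cond = encode_lyrics(lyrics, dim) if lyrics is not None else [0.0] * dim
    return [step + cond for step in audio_context]


# Three audio steps with two features each (placeholder numbers).
context = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
with_lyrics = build_inputs(context, "la la la")
without_lyrics = build_inputs(context)  # conditioning is optional
```

Every step in `with_lyrics` carries an identical lyrics vector, so nothing tells the generator which word belongs at which moment; it would have to work that out itself.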