by Ey7NFZ3P0nzAe
16 days ago
I don't think so. Cramming new senses into the latent space of the model is one thing, but having a model output tokens that can be detokenized into sound is completely different and requires a very different type of data.