|
|
|
|
|
by p1esk
2159 days ago
|
|
Making large transformer based models like Jukebox [1] efficient enough so that they can be trained on a single 8x GPU machine to the same level of quality as the original model, in the same amount of time. [1] https://openai.com/blog/jukebox/ |
|