|
|
|
|
|
by ipsum2
834 days ago
|
|
Fun fact, I can also train a 24 trillion parameter model on my laptop! Just need to offload weights to the cloud every layer. ... It's meaningless to say something can train a model that has 24 trillion parameters without specifying the dataset size and time it takes to train. |
|