|
|
|
|
|
by danielmarkbruce
1103 days ago
|
|
I'd like bigger GPUs. A trillion parameter model at 16 bits needs 2000gb+ for inference, more for training. All kinds of things can be done to spread it across multiple GPUs, downsize to less bits etc, but it's a lot easier to just shove a model on one GPU. We'll likely see more efficiency from bigger GPUs and hopefully more availability as a result. |
|