Hacker News
by kookamamie 1143 days ago
LLaMA is an LLM, which has very little to do with a model trying to learn MNIST. CNNs in particular gain from using a GPU (or ten), as they're optimizing weights for convolutional/spatial kernels.
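
To make the GPU point concrete, here is a rough sketch in plain Python of what a single convolutional kernel does. The function name `conv2d_valid` and the toy 3x3 input are made up for illustration, and real frameworks use heavily optimized routines rather than loops like these. The thing to notice is that every output position is its own small dot product and does not depend on any other position, which is the kind of work a GPU can spread across many cores at once.

```python
def conv2d_valid(image, kernel):
    """Naive 'valid' 2D cross-correlation (what CNN layers usually call convolution).

    image and kernel are lists of lists of numbers. Hypothetical helper for
    illustration only, not any library's API.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = [[0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            # Each (i, j) reads only its own window of the image, so all
            # out_h * out_w of these sums could be computed in parallel.
            out[i][j] = sum(
                image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh)
                for dj in range(kw)
            )
    return out


# Toy input: a 3x3 "image" and a 2x2 kernel (values chosen arbitrarily).
image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
kernel = [[1, 0],
          [0, -1]]
result = conv2d_valid(image, kernel)
print(result)
```

On MNIST-sized 28x28 inputs with dozens of kernels per layer, that inner sum is repeated a very large number of times per training step, and the same holds again for the gradient computation when the kernel weights are updated.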