Hacker News new | ask | show | jobs
by kookamamie 1143 days ago
LLAMA is a LLM, very little to do with a model trying to learn MNIST. CNNs in particular gain from using a GPU (or ten), as they're optimizing weights for convolutional/spatial kernels.