Hacker News new | ask | show | jobs
by zdragnar 7 days ago
Don't forget llama.cpp came about when meta released the weights to their LLaMa LLM. They've been in the game for awhile, just not anywhere near the top of the score board since.
1 comments

llama.cpp is great. However, Llama 4 was a misstep for them: it was too big, so was out of reach of the LocalLlama crowd and hard to train/customize into different variants like has happened with the smaller models on Hugging Face. 70B seems to be about the limit there, with smaller models being easier to run and customize.