Don't forget llama.cpp came about when meta released the weights to their LLaMa LLM. They've been in the game for awhile, just not anywhere near the top of the score board since.
llama.cpp is great. However, Llama 4 was a misstep for them: it was too big, so was out of reach of the LocalLlama crowd and hard to train/customize into different variants like has happened with the smaller models on Hugging Face. 70B seems to be about the limit there, with smaller models being easier to run and customize.