Hacker News new | ask | show | jobs
by antirez 434 days ago
Llama4 seems in many ways a cut and paste of DeepSeek. Including the shared expert and the high sparsity. It's a DeepSeek that does not work well.