Hacker News
by antirez, 434 days ago
Llama 4 seems in many ways a cut and paste of DeepSeek, including the shared expert and the high sparsity. It's a DeepSeek that does not work well.