Hacker News new | ask | show | jobs
by cjbprime 795 days ago
Fair enough, although it means we don't know whether a 1.8T MoE GPT-4 will have a "size advantage" over Llama 3 400B.