Hacker News new | ask | show | jobs
by qeternity 793 days ago
I'm pretty sure he suggested it was a 16 way 110 MoE
1 comments

The exact quote: "Sam Altman won’t tell you that GPT 4 has 220 billion parameters and is a 16 way mixture model with eight sets of weights."