Y
Hacker News
new
|
ask
|
show
|
jobs
by
qeternity
793 days ago
I'm pretty sure he suggested it was a 16 way 110 MoE
1 comments
brandall10
793 days ago
The exact quote: "Sam Altman won’t tell you that GPT 4 has 220 billion parameters and is a 16 way mixture model with eight sets of weights."
link