Hacker News
by YetAnotherNick, 1857 days ago
As I understand it, they are referring to parameter count. By that logic: BERT (the large variant) has 340M parameters and GPT-3 has 175B, so this would have 340B parameters?
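For what it's worth, here is the arithmetic behind that guess as a quick sketch. It only uses the three figures quoted in the comment; the variable names are made up for illustration, and nothing here says what the actual model size is.

```python
# Parameter counts as quoted in the comment (not independently measured here).
BERT_PARAMS = 340_000_000          # 340M, the BERT-large figure
GPT3_PARAMS = 175_000_000_000      # 175B
GUESSED_PARAMS = 340_000_000_000   # 340B, the commenter's guess

# How many times larger the guessed model would be than each reference model.
bert_multiple = GUESSED_PARAMS / BERT_PARAMS
gpt3_multiple = GUESSED_PARAMS / GPT3_PARAMS

print(f"vs BERT:  {bert_multiple:.0f}x")
print(f"vs GPT-3: {gpt3_multiple:.2f}x")
```

So the 340B guess amounts to a 1000x scale-up over BERT, which would be a bit under twice the size of GPT-3.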