Hacker News new | ask | show | jobs
by rubatuga 1857 days ago
Lol, they state that their model is a 1000 times more powerful than BERT? Under what metric?
3 comments

According to my understanding they are referring to parameter count. If we go by that logic, BERT has 340M parameters. GPT3 has 175B. So this will have 340B parameters?
That's what I was wondering! Such gibberish
Well so far the're mostly talking about what it would be able to do, so it's probably more wishful thinking than any exact metric.