Y
Hacker News
new
|
ask
|
show
|
jobs
by
dylanbyte
1913 days ago
Curious to see what parameter size of gpt3 this will end up being equivalent to. Obviously we won't know until they evaluate their models.
1 comments
sailingparrot
1913 days ago
It's trained using the same architecture, and with a very similar dataset, so it should be very close.
link
dylanbyte
1912 days ago
My experience is that replicating papers is actually nontrivial. For example someone announced they had replicated gpt2 some time back but when evals were run it turned about to be the equivalent of a much smaller model.
link