| HN Mirror



	by sailingparrot 1914 days ago
	It's trained using the same architecture, and with a very similar dataset, so it should be very close.

1 comment

dylanbyte 1914 days ago

My experience is that replicating papers is actually nontrivial. For example, someone announced some time back that they had replicated GPT-2, but when evals were run it turned out to be the equivalent of a much smaller model.
