Hacker News
by dylanbyte 1912 days ago
My experience is that replicating papers is actually nontrivial. For example, someone announced some time back that they had replicated GPT-2, but when evals were run it turned out to be the equivalent of a much smaller model.