| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jimmySixDOF 631 days ago
	tl;dr: > We are able to reproduce the model benchmark scores initially claimed and are sharing the eval code. That is a big deal considering all the accusations flying around at the time so I hope this forensic update checks out and everyone who jumped to conclusions a couple of weeks ago takes a step back to reflect.