| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by byefruit 1144 days ago
	They ran lm-evaluation-harness on both this model and the original llama weights, which is the correct way to do it. Many people have been struggling to reproduce the benchmark numbers included in the original llama paper.