| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by coder543 1022 days ago
	> scored 18.9 on HumanEval (coding) where Llama2 7B scored 12.2 The article claims 18.9 for the base model, but also claims 20.7 for the fine tuned model.