Y
Hacker News
new
|
ask
|
show
|
jobs
by
coder543
1022 days ago
> scored 18.9 on HumanEval (coding) where Llama2 7B scored 12.2
The article claims 18.9 for the base model, but also claims 20.7 for the fine tuned model.