by mnkv 1149 days ago
How did you run the benchmark: zero-shot or few-shot? I think a fair comparison would be Llama-7B, which averaged ~35% with 5-shot prompting.
1 comment

5-shot prompting.