Y
Hacker News
new
|
ask
|
show
|
jobs
by
ainch
90 days ago
pass@k means that you run the model k times and give it a pass if any of the answers is correct. I guess Lean is one of the few use cases where pass@k actually makes sense, since you can automatically validate correctness.