|
|
|
|
|
by isoprophlex
581 days ago
|
|
If your benchmark covers all possible programming tasks then you dont need an llm, you need search over your benchmark. Hypothetically let's say the benchmark contains "test divisibility of this integer by n" for all n of the form 3x+1. An extremely overfit llm won't be able to code divisibility for all n not of the form 3x+1, and your benchmark will never tell. |
|