by mewpmewp2
585 days ago
But if a model scores 60% on a benchmark, does that mean it's literally impossible to make anything more than 67% better? E.g. if someone scores 60% on a high school exam, is it impossible for anyone to be more than 67% smarter than that person at that subject? And what if you have another benchmark where GPT3.5 scores 0% but GPT4 scores 2%? Does that make GPT4 infinitely better? E.g. supposedly there was one LLM that scored 2% on FrontierMath.
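
To make the arithmetic concrete, here's a rough Python sketch of the ratio being questioned. The function name `relative_improvement` is made up for illustration, and it assumes scores are fractions between 0 and 1 and that "X% better" means relative gain over the baseline score.

```python
import math


def relative_improvement(old_score: float, new_score: float) -> float:
    """Relative gain of new_score over old_score, as a fraction (0.5 == 50%)."""
    if old_score == 0:
        # A zero baseline makes the ratio undefined; any positive gain
        # is treated here as unbounded.
        return math.inf if new_score > 0 else 0.0
    return (new_score - old_score) / old_score


# Ceiling for a 60% baseline: a perfect score is only about 67% "better".
ceiling = relative_improvement(0.60, 1.00)
print(f"60% -> 100%: {ceiling:.1%}")

# Zero baseline: going from 0% to 2% comes out as an infinite improvement.
print(f"0% -> 2%: {relative_improvement(0.0, 0.02)}")
```

So under that definition the maximum possible "improvement" depends entirely on how high the baseline already is, and blows up when the baseline is 0%.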