|
|
|
|
|
by slaterbug
499 days ago
|
|
As someone who doesn't understand anything beyond the word 'set' in that question, can anyone give an indication of how hard of a problem that actually is (within that domain)? Also I'm curious as to what percentage of the questions in this benchmark are of this type / difficulty, vs the seemingly much easier example of "In Greek mythology, who was Jason's maternal great-grandfather?". I'd imagine the latter is much easier for an LLM, and almost trivial for any LLM with access to external sources (such as deep research). |
|