by herval
541 days ago
Depends on the task, no? Do you have a sense of what kinds of tasks this benchmark includes? Are they more “general”, such that random people would fare well, or more specialized (i.e., something a STEM grad studied that isn’t common knowledge)?