|
|
|
|
|
by amelius
944 days ago
|
|
> honestly at this point I think GPT4 would to a better job than most MTurkers at these tasks... From the article: > Our experimental results support the conclusion that neither version of GPT-4 has developed robust abstraction abilities at humanlike levels. This makes the conclusion only worse for GPT-4 ... |
|
If they stuck to the average Mechanical Turk worker instead of filtering for "Master Workers," the parent's conclusions likely would've aligned with those of the study. Unfortunately, it seems the authors threw out the only data that didn't support their hypothesis as GPT-4 did, in fact, outperform the median Mechanical Turk worker, particularly in terms of instruction following.