|
|
|
|
|
by GMoromisato
232 days ago
|
|
I suspect that LLMs are better at classifying novel vs junk papers than they are at creating novel papers themselves. If so, I think the solution is obvious. (But I remind myself that all complex problems have a simple solution that is wrong.) |
|
That's without even being able to backprop through the annotator, and also with me actively trying to avoid reward hacking. If arxiv used an open model for review, it would be trivial for people to insert a few grammatical mistakes which cause them to receive max points.