Hacker News new | ask | show | jobs
by meowkit 396 days ago
> Once we have a universal automated judge that can judge any kind of human research output then sure your statement is true,

If you've noticed, most LLM interfaces have a "thumbs up" or "thumbs down" response. The prompt may provide novel data. The text generated is synthetic. You don't need an automated judge, the user is providing sufficient feedback.

Same thing goes for the other disciplines.

1 comments

I’m extremely skeptical that “thumbs up” and “thumbs down” plus replies to chatbots is sufficiently informative to train models to the same level of quality as models trained on user generated content.