Hacker News new | ask | show | jobs
by bachback 633 days ago
for coding tasks see

https://aider.chat/docs/leaderboards/

the question is how would you define "improve" and "solve". RLHF in a way delegates this to humans.