Hacker News new | ask | show | jobs
by seertaak 89 days ago
Is that really true? I would have expected by now that AI companies nowadays are doing RL on git histories, not just on the HEAD.
2 comments

I also expected this. Please run some experiments and maybe other models are different
Claude definitely does