Hacker News
The Art of Scaling Reinforcement Learning Compute for LLMs [Meta] (arxiv.org)
1 point by wavelander 251 days ago