Hacker News
Effective Reinforcement Learning for Reasoning in Language Models (arxiv.org)
4 points by obastani 381 days ago