Hacker News new | ask | show | jobs
Scaling Reinforcement Learning for Trillion-Scale Thinking Model (arxiv.org)
3 points by mountainview 239 days ago