Hacker News new | ask | show | jobs
Reinforcement Learning Teachers of Test Time Scaling (sakana.ai)
2 points by mottiden 364 days ago