Hacker News
by mountainriver 187 days ago
> "Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?"

I believe NVIDIA’s ProRL showed otherwise, right?