Hacker News new | ask | show | jobs
A Not So Gentle Introduction to PPO and GRPO (cyrilzakka.github.io)
3 points by archiv 405 days ago