Hacker News new | ask | show | jobs
TournO: Tournament Optimization for Non-Verifiable RL (github.com)
3 points by leonardtang 82 days ago