Hacker News new | ask | show | jobs
Verifiers: Environments for LLM Reinforcement Learning (github.com)
2 points by dominik-space 265 days ago