Hacker News new | ask | show | jobs
by danoandco 62 days ago
Similar but reusing lab-native CLIs like Claude Code or Codex, which they perform RL on. And so in the long-run, we believe this approach wins over custom harnesses.