Y
Hacker News
new
|
ask
|
show
|
jobs
by
danoandco
62 days ago
Similar but reusing lab-native CLIs like Claude Code or Codex, which they perform RL on. And so in the long-run, we believe this approach wins over custom harnesses.