Hacker News new | ask | show | jobs
by kcorbitt 404 days ago
For "that last 10% of reliability" RL is actually working pretty well right now too! https://openpipe.ai/blog/art-e-mail-agent