Hacker News new | ask | show | jobs
by uplifter 128 days ago
Has someone turned this into an agent benchmark? Most tokens emitted until the system rm -rf /s