Hacker News new | ask | show | jobs
by galnagli 120 days ago
General-Purpose Cyber Benchmark for AI Agents and their Models