Hacker News new | ask | show | jobs
by dcre 42 days ago
A new Mythos checkpoint improves significantly on the previous one (and beats GPT-5.5-Cyber) on this benchmark.