There is little evidence that Mythos is any better at finding bugs than any other system. Mythos appears to be impactful because people are, for the first time, using lots of resources (for free from Anthropic) to try and find security issues. The actual bugs found are mostly inconsequential, any chart showing a giant leap in fixes that doesn’t consider whether they were even using any tooling before and whether these are serious issues is junk. If you read the partner’s summaries of Mythos so far, it is a damp squib. Maybe that’ll change but at least for now there is no evidence Mythos is anything but marketing hype.
https://www.aisi.gov.uk/blog/our-evaluation-of-claude-mythos...
https://daniel.haxx.se/blog/2026/05/11/mythos-finds-a-curl-v...