by rfgplk 16 days ago
It probably isn't, at least in terms of security or memory safety. The current models can already sniff out all memory vulnerabilities with relative ease; you can't really beat that.
1 comments

Have you read Firefox's findings? They found it to be qualitatively improved over Opus, and have published several of the resulting CVEs as well as more detailed numbers.
They also seem to attribute the improvement more to the harness than to the model itself.
Really? They mention that Opus 4.7 in the same harness found like 1% of the bugs that Mythos found.