Hacker News new | ask | show | jobs
We Benchmarked Claude Code, Codex, Semgrep, CodeQL, Trent on 28 CWE-Bench CVEs (trent.ai)
6 points by geopsist 20 days ago
2 comments

Looks interesting. LLM base solutions fails when metric is strict. For security solution guess is not enough, we need reliable and robust solution to pin vulnerability and its evidence to fully judge and mitigate with appropriate fix.
I'm co-founder at trent.ai, happy to answer any questions around this.