Hacker News new | ask | show | jobs
by wslh 30 days ago
I think it's worth to look at the recent XBOW benchmark: https://xbow.com/blog/mythos-offensive-security-xbow-evaluat... they realized that ChatGPT 5.5 works better so the secret is in the architecture (including humans in the loop).
1 comments

'frontier tokens are not fungible'