Y
Hacker News
new
|
ask
|
show
|
jobs
by
wslh
30 days ago
I think it's worth to look at the recent XBOW benchmark:
https://xbow.com/blog/mythos-offensive-security-xbow-evaluat...
they realized that ChatGPT 5.5 works better so the secret is in the architecture (including humans in the loop).
1 comments
baq
30 days ago
'frontier tokens are not fungible'
link