by taneq 457 days ago
Maybe we should work on solving that problem, then? And maybe this is what working on that problem looks like?

Eval sets are not an appropriate tool for measuring progress on security problems, because the bar here is 100% correctness in the face of sustained, targeted adversarial effort.

This work largely resembles the politician's syllogism: it's something, but it's not actually addressing the problem.