Hacker News
by nicklecompte 800 days ago
It's not "open enough" to do an honest evaluation of these systems by constructing adversarial benchmarks.