Hacker News new | ask | show | jobs
by SpicyLemonZest 63 days ago
Frontier model developers try to check for memorization. But until AI interpretability is a fully solved problem, how can you really know whether it actually didn't memorize or your memorization check wasn't right?