Hacker News new | ask | show | jobs
by techcam 89 days ago
I’ve been noticing the same — a lot of failures aren’t obvious “jailbreaks,” they’re just subtle prompt structure issues that only show up in production.