Hacker News new | ask | show | jobs
by err4nt 366 days ago
How does it know the difference?
3 comments

I'm still on the AI-skeptic side of the spectrum (though shifting more towards "it has some useful applications"), but, I think the easy answer is - if different models/prompts are used in generation than in quality-/correctness-checking.
This might not always work, but whenever possible, a working exploit could be demanded, working in a form that can be automatically verified to work.
I think Claude, given enough time to mull it over, could probably come up with some sort of bug severity score.