Hacker News new | ask | show | jobs
by knollimar 35 days ago
Don't you think "consider if anything is missing" leads them into adding something with sycophancy RL training and "if anything is not needed" making it remove something?

Or does "verify all claims in report" counteract that?

1 comments

It can indeed cause some models to try too hard to come up stuff, but the next verification prompt does counteract it.

E.g. some findings first classified as moderate priority often get reclassified as low priority even if the finding itself is correct.

The exact phrasing doesn't seem to matter as much as keeping the prompts short, simple and to the point.

However some models seem to do a bit better when adding ", if any" to prompts such as "List potential improvements".