Hacker News
by PrayagS 421 days ago
Apparently Claude does that too. Give it a hint and it’ll lie about having used that hint.

They talk about it here: https://www.anthropic.com/news/tracing-thoughts-language-mod...