Hacker News
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult (simonwillison.net)
6 points by jonesn11 199 days ago
1 comment

Prompt injections + context window engineering are the combined Achilles' heel of the "agentic revolution".