| Well here's some: Confabulation/Hallucination - https://github.com/lechmazur/confabulations Failure to read context - https://georggrab.net/content/opus46retrieval.html Deleting tests to make them pass - https://www.linkedin.com/posts/jasongorman_and-after-it-did-... Going rogue and deleting data - https://x.com/jasonlk/status/1946069562723897802 Agent security nightmares because they are not in fact intelligent assistants - https://x.com/theonejvo/status/2015401219746128322 Failure to read or generate structured data - https://support.google.com/gemini/thread/390981629/llm-ignor... There are many, many examples, mostly caused by people thinking LLMs are intelligent and reasoning and giving them too much power (e.g. treating them as agents, not text generators). I'm sure they're all fixed in whatever new version came out this week though. |