Hacker News new | ask | show | jobs
by kundan_s__r 161 days ago
Interesting reflection — but I’d push back on treating surface similarities between human conversational quirks and LLM failure modes as evidence they’re really the same thing. The article lists things like “not stopping generating,” “small context window,” and “repeating mistakes,” and frames them as if current LLM patterns are just human behaviors seen through a new lens.

The key difference is mechanism. For humans, wandering off topic or repeating a point usually stems from attention, memory, or social dynamics. For LLMs, similar-looking behavior often arises from architectural and statistical limitations — e.g., limited context windows causing context drift that makes the model lose track of earlier facts over long conversations. That’s been documented as a core technical barrier in conversational AI systems.

Likewise, what the article calls “persistent hallucination” isn’t just “being confidently wrong like a human might be” — it’s a systemic property of how LLMs predict the next token based on learned patterns rather than grounding in external truth. Hallucinations are literally outputs that are factually incorrect or fabricated even if they read fluently. That’s why engineering teams talk about hallucination detection, grounding, or external verification layers for production use.

Framing them as the same as human flaws risks obscuring the real challenge: deploying LLMs in systems where correctness, consistency, and adherence to explicit intent matter. Those aren’t just philosophical problems; they’re system-design constraints that require mechanism-aware validation and monitoring, not just metaphors to human behavior.