Hacker News
by nomel 567 days ago
This is actually why I don't use Gemini. I've noticed that it gets nonsensical when it gets into what I assume is sparser latent space. Claude and ChatGPT will stay coherent/consistent within the context of what they're saying (even if wrong). Worse, when Gemini starts doing this, it seems mostly irrecoverable, like the "nonsense" poisons the context window.
1 comments

I suppose that nonsense in the training data is often accompanied by yet more nonsense, so that’s what it might be trained to emit.