Hacker News new | ask | show | jobs
by user_7832 7 days ago
On the topic of older (Claude) models being better... anyone knows anything close to 3.5 (or 3.6) era Sonnet? It was by far the best LLM I had ever asked my doubts too. It actually explained in a human way, not like some AI I need to re read thrice to understand.

(I've used modern Gemini 3.1 pro & claude too. Modern ChatGPT is just as useless, I've never heard a human speak in points. The human brain never encounters that irl.)

3 comments

This was obviously a conscious choice from the leadership at he frontier labs, and especially OpenAI, considering how 4o turned out.

I don't think they expected the ELIZA effect [0] to explode as much as it did when they started including feedback directly from users into posttraining the next generation, so to be safe they've likely added several regimens of synthetic data ensuring ChatGPT tries to steer away from ELIZA.

[0]: https://en.wikipedia.org/wiki/ELIZA_effect

I'd have to see representative examples but there are thousands of models available, obliterated, remixed, distilled, cloned, compressed, and so many more.

I really liked the way copilot was last year, but I switched to deepseek because I don't trust MS.

Grok cracks me up, but I refuse to give elon more money than I'm already forced to by circumstance outside my control and budget.

It is hard to say because there is "affection" memory that it was better than what we had before so it seems it was better.

In my humble opinion that serves nothing, it improved gradually, not exponentially up to 4.5

4.6 seems to be a minor step and the latest 2 are pure rubbish