Hacker News
by _bin_ 431 days ago
Cursor results are going to depend heavily on the model; Gemini 2.5 Pro exp seems the strongest overall. You’re probably defaulting to 3.7 Sonnet, which is completely unusable; it was good at first, but I am convinced Anthropic “updated” (degraded) it behind the scenes to lower their inference costs. OpenAI did the same with GPT-4o for a bit, a while back, before making it better again.

3.7 also seems to have converged more on the hybrid Reddit user/NPR listener/HR lady tone and manner of speaking that makes me want to punch a wall. Genuinely, people could probably increase LLM usage just by fixing this problem and banning r*fit from the training set.

1 comment

I've seen evidence that suggests this is false, and that it's more likely that Cursor degraded the experience by trimming its context window to save on costs.

To my knowledge, there's no evidence that the date-stamped models have ever changed or degraded. Aider ran a test for this as well.