| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by alex_sf 63 days ago
	Everytime I've tried a local model, and I have tried lots for a couple years now, they just seem like they were overtrained on benchmarks. They consistently perform dramatically worse than even older models from Anthropic/OAI/Google.

1 comments

slopinthebag 63 days ago

You're just using them wrong.

link

eloisant 62 days ago

That might be true, but still: with Claude Opus I can give a task with 2 lines and it will just do it, with a local Qwen I have to use plan mode for everything even small tasks.

link