| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by solenoid0937 2 days ago
	Most of HN is stuck in this fantasyland where they insist their local LLM setup is comparable to Opus 4.8 or GPT 5.5. It's like a collective delusion, I've never seen anything like it.

2 comments

written-beyond 2 days ago

You can get really good results with Chinese models. You're putting Opus and GPT on too high of a pedestal.

link

solenoid0937 2 days ago

I use Chinese models (for simple personal projects), they just don't compare to GPT or Opus for any serious work.

I do not know why every Chinese model fan thinks that people that aren't impressed by them simply don't use them.

link

SXX 1 day ago

Wast majority of software engineers do very little except of moving JSONs around and building CRUDs.

It's quite obvious that when you dont try to do something particularly complex there will be literally no difference between GPT, Claude, Gemini and Deepseek.

Fot many things I'm doing in gamedev Gemini 2.5 Pro was already good enough even though it released more than year ago.

Once you pass certain threshold it's just enough.

link

Vetch 1 day ago

What constitutes serious work and how seriously have you tried to do serious work with them? While those trying to claim a 30B dense model can match Opus 4.6 are engaging in either beyond over-excessive over-exaggeration or performing rather routine tasks, it's disingenuous in the other direction to claim the latest open 1T models are not useful for serious work. I find those making such claims have rarely spent more than a few minutes on halfhearted attempts and often on recently obsoleted models.

Openweight models turned a corner around kimi 2.6, deepseek v4 pro/flash, hy3 and mimo 2.5 pro. Similar to how closed LLMs turned a corner around gpt 5.2 and opus 4.5.

While they remain a step behind closed frontier models, for real world tasks ranging across functional reactive programming, distributed systems, mathematical modeling, to-the-millisecond highly optimized spatial data-structures, complex compute shaders and shader effects and non-trivial systems involving parser combinators and algebraic effect systems, I can say that open models have very recently gone from useless to productive. For my work, mimo v2.5 pro is hands down better than sonnet 4.6.

link

bigbadfeline 2 days ago

Some of the new and open models are very capable now, The truth is, the value of the model is in the mind of the user - the big names are impressive to those who know little and are dazed by little, but they are bound to end up wrong regardless of how good the model is.

link

jatora 2 days ago

This is ridiculous. How about the rational users who use the best current model regardless of brand? The value of the model is in the quality of the output over time. I give every major model a chance. Coding and scripts in the chat are nothing compared to the power of agentic SWEEEEEEEEE. And nothing is remotely close to claude and gpt. If you're comfortable with being well behind SOTA intelligence, then good for you, but some of us prefer to be efficient with our time and resources. With your mindset, you will never truly SWEEEEEEEEEEEEEEEEEEEE

link

jpfromlondon 1 day ago

that isn't rational, rational is using the model that can best solve your current problem in the timeliest cost considered manner.

I'm not working on the frontier problems, I don't need god-in-a-box for $600 per month.

link

jatora 1 day ago

its not god in a box and its not $600 per month

and almost nobody is working on frontier problems. they just want frontier intelligence to solve their given problems in a superior manner.

you're minimizing and exaggerating all of the wrong things. cope more i guess - more compute for us!

link

jpfromlondon 1 day ago

Your comment makes it pretty clear that mine went over your head and that's fine, these tools are for people like you, godspeed.

link