| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by usernamed7 2 hours ago
	Let us hope this only accelerates the proliferation of local models

3 comments

baq 2 hours ago

Serving barely useful GLM 5.2 costs what? $15k? Actually useful is like $50k? You’ll never recoup the cost unless you ‘locally’ means ‘inference provider is not the model provider’?

link

dgellow 31 minutes ago

Yes they mean open weight models offered by various providers

link

fractorial 1 hour ago

Not "local" in the literal sense, but I set it up to serve at half quant for $23/hr and full quant for $35/hr.

You don't need to have it always on? This is a far cry from "$200/month," but I do not think it's $50k for "useful." Do you see it differently?

link

dakolli 27 minutes ago

This is probably the dumbest possible way to do it. Just buy tokens through open router and you could run it all month 24/7 at 100tps for practically nothing. There are tons of ways to pay for things without giving your personal information.

link

verdverm 1 hour ago

$15k or $50k is pretty cheap all things considered (a year ago it would have been more expensive, one person can spend that in a month or two)

I bought my spark and the models have already improved in that time (qwen3.6, speculative decoding 2x tgen, diffusion gemma 4x tgen) and I expect this to improve. Look out another 2-3 years, local is going to be very competitive.

link

polski-g 2 hours ago

You can recoup the costs quicker if you resell access to your local LLM on a reselling service.

link

baq 25 minutes ago

Cheaper to just buy T-bills when I saw the numbers last time

link

nairboon 2 hours ago

It will. Moves like this will only lead to a drift of brains and talents to tweak & tune open harnesses and open models.

link

forgetfreeman 2 hours ago

There is the undocumented 3rd option of simply shrugging and moving on without LLMs, you know, business as usual.

link

baq 2 hours ago

That ship has sailed. Even if you never even tab complete in cursor, if you don’t let LLMs review your code you’re very, very behind unless you’re in a deeply specialized domain which doesn’t have any public training data available. Anything remotely public and you’re just outpaced.

link

inigyou 2 hours ago

Mythos found one low-severity vulnerability in curl.

link

forgetfreeman 1 hour ago

Is this your first tech industry hype cycle or something?

link

baq 26 minutes ago

No, it’s my experience from the past 6 months

link

jckahn 2 hours ago

That's not the option most are going to take.

link

forgetfreeman 1 hour ago

shrug Not really a me problem, but I'd counsel taking an afternoon to reflect on what part of any of this is actually inevitable. You know, maybe come up for air for a minute and examine the industry hype from 30,000 ft.

link

usernamed7 1 hour ago

That's a choice you are free to make, just like you're free to shrug and not use the internet or computers.

link

forgetfreeman 47 minutes ago

eyeroll If you truly had the courage of your convictions you would have gone all in here and told me to stop using electricity.

link

i2km 2 hours ago

Ridiculous. Haven't you heard? All critical thinking skills have long since been sacrificed on the altars of the AI gods and it's inconceivable that we write any code the old way. If you actually understand your code it means you're a luddite and are going to be left behind. /s

link