| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by chis 16 days ago
	It's also shockingly twitter-nerd-coded. "The cure for token anxiety", it advertises. To be honest it's hard to see why anyone would buy this product so maybe they decided to take a wild swing with the marketing. The only use is people who really, really, want to run models locally vs getting a much cheaper and higher performance result from a cloud host.

1 comments

illiac786 16 days ago

Higher performance, I agree, but cheaper?

I didn’t check how much this costs, but if you use AI locally a lot, it’s going to be amortised pretty quickly. Burning 100$ a month on tokens has become insanely easy. I remember when it was unimaginable for me…

link

chis 14 days ago

This won’t be able to run any of the cutting edge models. And the models it can run can be served from cloud providers for very cheap - like <$1 per million tokens for the latest deepseek.

It’d take many years to break even on your $6000 investment, meanwhile better and better models will come out that the DGX can’t run.

link

girvo 15 days ago

You top out at 20 tokens per second on hardware with memory bandwidth this low for any local model actually worth using. Doing the maths, it’s not financially worth it. Only worth it for privacy and control reasons.

I do love my GB10 Asus Spark-like though still!

link

illiac786 15 days ago

I don’t understand your calculation, can you elaborate? At 25USD/Mtk output, assuming your 20tk/s, I generated/saved (minus power costs) ~15k$ in a year.

Granted, it won’t run 24/7, but over a couple of years, this is definitely cheaper.

link

chis 14 days ago

This can’t run any models that cost $25/mtok lol. I think the fastest model it’ll reasonably run will be GPT-OSS 120B which costs $.05/mtok.

This is a laptop for CUDA devs and AI larpers.

link

chis 15 days ago

Speculation is it’ll cost at least $6k for 128GB

link