| HN Mirror



	by fny 2 hours ago
	The RAM requirements are still pretty painful.


yieldcrv 2 hours ago

equilibrium in one or two more years on the consumer/prosumer side

think Apple M6 or M7 with a denser memory technology nobody foresees yet, 256 GB of RAM

a couple of inference or cache improvements on the algorithmic side, using less RAM for context windows and doubling token speed again

denser open-source models, packing in more experts with smaller active layers

it'll still be expensive, but more like $8,000 to $13,000 instead of $450,000 worth of B200s


stingraycharles 1 hour ago

Fairly certain that model sizes and computational requirements will grow as the price for LLM compute drops.


3stacks 29 minutes ago

Maybe there's a conversation to be had about how much is enough... Unless something beyond my imagination happens, I would be happy enough with Opus 4.5 levels of productivity.


yieldcrv 27 minutes ago

have you seen the open-source LLM space? people fill every niche, and there are active communities at every range of RAM, all looking for the most capable model in their respective range

a lot of innovation occurring
