| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by embedding-shape 109 days ago
	> but I've been buying studios with as much ram as possible Why though? Are you managing a fleet of headless Mac Studios? You mention LLMs, but today's Mac hardware really isn't great as soon as you use bit larger contexts, so I'm guessing it's not that either?

1 comments

taylorhou 108 days ago

We combo frontier coding models with last frontier for more admin stuff.

link

embedding-shape 107 days ago

But with macOS hardware? What kind of calculations did you do that shows that this makes sense? 6-7 months ago I looked into the same, getting a bunch of Apple hardware to do local LLM inference (for a client), but the bad performance + high pricing just makes it completely infeasible compared to nvidia hardware.

Now I'm really curious how you got that to make sense in reality.

link

taylorhou 107 days ago

250+ employees on the lowest paid subscriptions of claude or openai is $5k/month. their usage of AI is chatbot most of the time which local models/inference can easily handle. so just being able to cancel those subscriptions with local hardware makes my break even on a single $10k mac studio 2 months and honestly the mac studio for their use is overkill.

link