Hacker News
by weitendorf 5 days ago
It’s not cheaper to run Claude on your own GPUs than to pay the $200/mo for certain workloads. For a large portion of what I work on, the bottleneck is my time, not tokens. You certainly could throw more tokens at it, but if you need it to work a certain way for certain reasons, and your plan/goals are beyond the scope of what the top-capability models can do, then throwing tokens at the problem just bogs you down in extra cruft and reviews/iteration. You would be more effective as the primary driver of the work.
1 comment

Sure, you can keep paying $200/mo to Anthropic forever, and accept heavy censorship on the types of tasks you can do (e.g. malware research), accept no privacy, and accept rate limiting and the requirement of internet access at all times.

Or buy $2400 of GPU today to get something within 10% of Opus 4.6 on coding benchmarks. It pays for itself in 1 year, AND you can work with private code and data offline as you like, with no censorship or restrictions.

The value proposition of Anthropic is comically bad to anyone who understands how to insert PCIe cards into a motherboard and install Linux.
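
For anyone who wants to check the payback claim, here is a rough sketch of the arithmetic: $2400 of hardware against a $200/mo subscription. The function name and the `running_cost_per_month` parameter are made up for illustration, and the $40/mo electricity figure is a hypothetical placeholder, not a number from the thread.

```python
def break_even_months(hardware_cost, subscription_per_month, running_cost_per_month=0.0):
    """Months until a one-time hardware purchase matches cumulative subscription fees.

    running_cost_per_month is an assumed extra (e.g. electricity) that the
    thread does not account for; it defaults to zero.
    """
    monthly_saving = subscription_per_month - running_cost_per_month
    if monthly_saving <= 0:
        # The hardware never pays for itself if it costs as much to run
        # as the subscription it replaces.
        return float("inf")
    return hardware_cost / monthly_saving


# Figures from the thread: $2400 of GPU vs. $200/mo.
print(break_even_months(2400, 200))  # 12.0

# Same figures with a hypothetical $40/mo electricity bill.
print(break_even_months(2400, 200, running_cost_per_month=40))  # 15.0
```

With zero running costs the numbers match the "1 year" in the comment. Any nonzero running cost pushes the break-even point later, and the calculation says nothing about the time cost raised in the parent comment.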