Hacker News new | ask | show | jobs
by jonahbenton 26 days ago
There are many markets. Qwen 3.6 27b at a high enough quant is good enough for many use cases. But enterprise-consumed tokens come with legal/data protection agreements. They have just gotten comfortable with BYOD- there is no BYOD equivalent set of practices and protections for local LLMs (BYOLLM). So some enterprises are getting back into prem GPU capacity.
1 comments

On prem GPU capacity - or decent enough devices for core engineering team - lends itself pretty nicely to local LLMs too. And you own the whole stack this way. Why pay premiums to Anthropic and fuel its trillion dollar valuation?
Yeah...pay opex to Anthropic or your capex to NVidia- whose Blackwell gen prices are now up 25% from launch, with more increases to come.