

by funfunfunction 284 days ago

Creator of inference.net / schematron here.

There is growing emphasis on efficiency as more companies adopt and scale with LLMs in their products.

Developers might be fine paying GPT-5-Super-AGI-Thinking-Max prices to use the very best models in Cursor, but despite what some may think about Silicon Valley, businesses do care about efficiency.

And if you can fine-tune an 8B-parameter Llama model on GPT-5 data in under 48 hours and save $100k/mo, you're going to take that opportunity.