Hacker News new | ask | show | jobs
by csoham 4 days ago
Cool stuff! I suspect we will now see a bunch of startups coming up focused on helping companies reduce their AI usage, similar to how we have companies focused on optimizing cloud costs.
1 comments

Definitely - when you consider how varied inference workloads will be, and the different ways to minimize costs - better prompting, SLMs, different chips, batching, etc, there will be tons of opportunity