Hacker News new | ask | show | jobs
by halJordan 47 days ago
No one is losing money on inference these days. Google's vertical integration means that they have some of the lowest inference costs in the industry in any event.
1 comments

Microsoft recently announced changes to copilot because, apparently, it was losing money on inference.
They were charging a flat rate per query no matter how many tokens it consumed. People naturally got very good at writing prompts that used as many tokens as possible.
They were loosing money giving absurdly generous agentic usage on expensive models to people with $10 to $40 flat rate subscriptions.

They weren't selling inference.