by moralestapia 615 days ago
No one here gets it, even though @sama has said it countless times.

I will write it explicitly for you once again:

The plan is to make inference so cheap it's negligible.

3 comments

If inference is so cheap it's negligible, then we'll be able to run the models on an average computer, which means they have no business model (assuming Meta stays generous and keeps publishing Llama for free).
There is no way that running a data center full of any current or prospective Nvidia offering will cost anything close to negligible.
So... ad-funded?
I think they mean the cost of running inference, brought down by either more efficient/powerful hardware or more efficient software.

No one thinks about the cost of a DB query anymore, but I'm sure people did back in the day (well, I suppose with cloud stuff, people now do need to think about it again, haha).

Anybody with billions of database queries thinks about them.
Yeah, but GP said one.
Nobody is paying for the training, so either you pay for the inference or the ads do.