by satvikpendem 6 days ago
By that logic, any software you run that isn't fully built by yourself is "third party", so you shouldn't run anything at all on your machine, thus obviating the need for it entirely.

But practically, AI inference requires substantial local computing resources. It's not some web app; it needs an order of magnitude more compute.
Hopefully now you understand why people want smaller models.
Not really. I run a production service on a basic server using these Gemma models, and the server is weaker than my MacBook. Most people's laptops and even phones can actually run local models; most people simply don't know how. Run Unsloth Studio and you'll see how easy it is.

As the sibling says, this is why people want smaller but still performant models.