Hacker News new | ask | show | jobs
by wlesieutre 35 days ago
But if you're running it on your own hardware you might only generate tokens when you have something useful to do with them, instead of every time you load a Google search results page because Google decided the future is stuffing Gemini-generated answers down your eyeballs instead of letting you read it yourself from the primary source for 0.1 watts.
2 comments

Whether I'm using Google or not is completely unrelated to whether I use OpenAI (for example) API or run LLM locally
Don't worry, capitalism takes care of that.