Hacker News new | ask | show | jobs
by yincrash 59 days ago
Still needs a server. You could run a server locally if you had a model that your device could handle then point aide to the localhost URL.
1 comments

New phones can run Gemma 4 quants pretty nicely. It's a surprisingly good model. Google's Edge Gallery also offers some choice to try.
Missed the window for edit: I agree that ideally I'd have a tiny local MOE-kind of model, able to establish the complexity of the request, route simple local requests to the instantly available local agent, and route all the rest outside (to one of several models).