Hacker News new | ask | show | jobs
by subscribed 59 days ago
New phones can run Gemma 4 quants pretty nicely. It's a surprisingly good model. Google's Edge Gallery also offers some choice to try.
1 comments

Missed the window for edit: I agree that ideally I'd have a tiny local MOE-kind of model, able to establish the complexity of the request, route simple local requests to the instantly available local agent, and route all the rest outside (to one of several models).