Hacker News new | ask | show | jobs
by adam_arthur 31 days ago
I recently found Gemma 4 e4b surprisingly effective for small "classification" style tasks for something I'm doing at work.

In this case, picking out "semantic" css classes on single dom nodes.

Was able to run it on my 4(?) year old M2 mbp with 16GB of ram and it runs in only 100ms or so per query. Probably it can run much faster, but haven't experimented with batching etc

With tight and targeted context control, you can use extremely small models for useful things. Ideally with problems where the harness can be mostly deterministic and you have known bounds on what you're trying to do