Gemma 4 31b is better for coding than Gemini in my limited testing on a small C project single source file project, less than 1000 lines. Setting temperature to 0 gives better results for me. It seems like Gemini ignores the system prompt more and the default reasoning output seems more incoherent.
Their open weight on device models are really impressive. Partly because I think they are the only ones out of all the frontier labs even working on local models.
Yaah I feel the same way. Gemini is great at and Django and AI backends, OpenAI better at making something visually pleasing in React and Claude for everything else or across frontend and backend.
At least, that's my heuristic that tends to work for my workflow. I use a combination of Gemini-CLI, Claude Code, and Github Copilot, but across those, the underlying model choice works best according to which part of the applicaiton I am messing with