That looks great! I also thought about calling the Gemini Nano model built into Chrome (only extensions can do that). But after some testing on smaller models, I found that anything smaller than 9B can’t really handle the complex tool-call schema I use.
Qwen3.5 4B is quite good but still produces malformed JSON quite often. It’s very promising, though!
Maybe after one more model iteration or some fine-tuning we can go fully embedded?
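
To illustrate the kind of guard that helps with messy output from a small model, here is a minimal sketch of a tolerant parser. It is not the schema or code I actually use: the `extract_tool_call` helper and the `name` / `arguments` keys are hypothetical stand-ins. It scans the raw reply for the first JSON object that parses cleanly and has the required keys, so surrounding chatter or code fences are ignored.

```python
import json


def extract_tool_call(raw, required_keys=("name", "arguments")):
    """Return the first JSON object in `raw` that has every required key, else None.

    The key names are placeholders for illustration, not a real tool-call schema.
    """
    decoder = json.JSONDecoder()
    idx = raw.find("{")
    while idx != -1:
        try:
            # raw_decode parses one JSON value starting at idx and ignores trailing text.
            obj, _ = decoder.raw_decode(raw, idx)
        except json.JSONDecodeError:
            obj = None
        if isinstance(obj, dict) and all(key in obj for key in required_keys):
            return obj
        # Not a usable object here; try the next opening brace.
        idx = raw.find("{", idx + 1)
    return None


reply = 'Sure! Here is the call:\n```json\n{"name": "search", "arguments": {"q": "cats"}}\n```\nHope that helps.'
call = extract_tool_call(reply)
```

A `None` result means the reply was unusable (for example, truncated mid-object), at which point the caller could retry or fall back to a larger model.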