|
|
|
|
|
by c0brac0bra
406 days ago
|
|
What tasks have you found the 0.6B model useful for? The hallucination that's apparent during its thinking process put up a big red flag for me. Conversely, the 4B model actually seemed to work really well and gave results comparable to Gemini 2.0 Flash (at least in my simple tests). |
|