Hacker News
by iamnotagenius 284 days ago
They are different. Gemma 3 12b excels at natural language but is terrible at long context. Pixtral 12b is better at long context (though not stellar), but worse at natural language.