Hacker News
by samtheprogram 374 days ago
It's not only the quantization: what's available via ollama (for local inference) is magistral-small, not the -medium variant.