Hacker News new | ask | show | jobs
by pzo 461 days ago
Maybe model is sensitive to quantization, by default ollama quantize it significantly.
1 comments

I tried ollama fp16 and it had the same issues.