|
|
|
|
|
by redman25
375 days ago
|
|
Not the parent but I would say bad defaults or naming. There are countless posts from newbies wondering why a model doesn’t work as well as it should. It’s usually either because the context size is set very low by default or they didn’t realize that they weren’t running the full model (ollama uses the distilled version in place of the full version but names it after the full version). There’s also been some controversy over not giving proper credit to llama.cpp which ollama is/was a wrapper around. |
|
I've never used ollama, but perhaps you mean quantized and not distilled? Or do they actually use distilled versions?