I usually build llama.cpp from source and download quantized (GGUF) models from Huggingface, haven’t used Ollama this far.