Hacker News
by krzyk 4 days ago
Ollama is a wrapper on top of llama.cpp, and it makes llama.cpp slower. Why use it?

Also, Ollama has other issues (like forgetting what it really is: a wrapper).