Hacker News new | ask | show | jobs
by yvbbrjdr 245 days ago
I see! Do you know what's causing the slowdown for ollama? They should be using the same backend..
1 comments

Dude, ggerganov is the creator of llama.cpp. Kind of a legend. And of course he is right, you should've used llama.cpp.

Or you can just ask the ollama people about the ollama problems. Ollama is (or was) just a Go wrapper around llama.cpp.

Was. They've been diverging.