Y
Hacker News
new
|
ask
|
show
|
jobs
by
mehdibl
116 days ago
Ollama is quite a bad example here. Despite popular, it's a simple wrapper and more and more pushed by the app it wraps llama.cpp.
Don't understand here the parallel.
3 comments
kossisoroyce
116 days ago
TBVH I didn't think about naming it too much. I defaulted to Ollama because of the perceive simplicity and I wanted that same perceived simplicity to help adoption.
link
eleventyseven
116 days ago
This is the vLLM of classic ML, not Ollama.
link
ekianjo
116 days ago
I guess the parallel is "Ollama serve" which provides you with a direct REST API to interact with a LLM.
link
sieve
116 days ago
llama-cpp provides an API server as well via llama-server (and a competent webgui too).
link