| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mehdibl 116 days ago
	Ollama is quite a bad example here. Despite popular, it's a simple wrapper and more and more pushed by the app it wraps llama.cpp. Don't understand here the parallel.

3 comments

kossisoroyce 116 days ago

TBVH I didn't think about naming it too much. I defaulted to Ollama because of the perceive simplicity and I wanted that same perceived simplicity to help adoption.

link

eleventyseven 116 days ago

This is the vLLM of classic ML, not Ollama.

link

ekianjo 116 days ago

I guess the parallel is "Ollama serve" which provides you with a direct REST API to interact with a LLM.

link

sieve 116 days ago

llama-cpp provides an API server as well via llama-server (and a competent webgui too).

link