Y
Hacker News
new
|
ask
|
show
|
jobs
by
rpdillon
317 days ago
llama.cpp is what you want. It offers both a web UI and an API on the same port. I use llama.cpp's webui with gpt-oss-20b, and I also leverage it as an OpenAI-compatible server with gptel for Emacs. Very good product.