Hacker News new | ask | show | jobs
by fortyseven 66 days ago
Strangely, I haven't had a lot of luck with vLLM; I finally ended up ditching Ollama and going straight to the tap with llama-serve in llamacpp. No regrets.
1 comments

Good job. llama.cpp is already much better.