Y
Hacker News
new
|
ask
|
show
|
jobs
by
fortyseven
66 days ago
Strangely, I haven't had a lot of luck with vLLM; I finally ended up ditching Ollama and going straight to the tap with llama-serve in llamacpp. No regrets.
1 comments
magic_hamster
62 days ago
Good job. llama.cpp is already much better.
link