|
|
|
|
|
by spmurrayzzz
770 days ago
|
|
> having to install Ollama + Cuda to get locally working LLM didn't felt right to me when there's all what's needed in the browser Was there something specifically about the install that didn't feel right? I ask because ollama is just a thin go wrapper around llama.cpp (its actually starting a modified version of the llama.cpp server in the background, not even going through the go ffi, likely for perf reasons). In that that sense, you could just install the CUDA toolkit via your package manager and calling `make LLAMA_CUDA=1; ./server` from the llama.cpp repo root to get effectively the same thing in two simple steps with no extra overhead. |
|
Most people value convenience at the expense of almost everything else when it comes to technology.