Y
Hacker News
new
|
ask
|
show
|
jobs
by
bugglebeetle
914 days ago
Unified memory and optimizations in llama.cpp (which Ollama wraps).
1 comments
ithkuil
914 days ago
Is that using the GPU?
link
bugglebeetle
914 days ago
It can be variably configured. There are details in the repo, but llama.cpp makes use of Metal.
link