Hacker News new | ask | show | jobs
by kgeist 743 days ago
Llama.cpp can run on CPU, on GPU, or in mixed mode (some layers run on CPU and some on GPU if you don't have enough VRAM).