Y
Hacker News
new
|
ask
|
show
|
jobs
by
mayankchhabra
1040 days ago
Ah yes, running on GPU isn't supported at the moment. But CUDA (for Nvidia GPUs) and Metal support is on the roadmap!
1 comments
samspenc
1040 days ago
Ah fascinating, just curious, what's the technical blocker? I thought most of the Llama models were optimized to run on GPUs?
link
mayankchhabra
1040 days ago
It's fairly straightforward to add GPU support when running on the host, but LlamaGPT runs inside a Docker container, and that's where it gets a bit challenging.
link
stavros
1040 days ago
It shouldn't, nVidia provides a CUDA Docker plugin that lets you expose your GPU to the container, and it works quite well.
link
dicriseg
1039 days ago
See above if you're interested in that. It does work quite well, even with nested virtualization (WSL2).
link
stavros
1039 days ago
I am, thanks!
link