| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mayankchhabra 1040 days ago
	Ah yes, running on GPU isn't supported at the moment. But CUDA (for Nvidia GPUs) and Metal support is on the roadmap!

1 comments

samspenc 1040 days ago

Ah fascinating, just curious, what's the technical blocker? I thought most of the Llama models were optimized to run on GPUs?

link

mayankchhabra 1040 days ago

It's fairly straightforward to add GPU support when running on the host, but LlamaGPT runs inside a Docker container, and that's where it gets a bit challenging.

link

stavros 1040 days ago

It shouldn't, nVidia provides a CUDA Docker plugin that lets you expose your GPU to the container, and it works quite well.

link

dicriseg 1039 days ago

See above if you're interested in that. It does work quite well, even with nested virtualization (WSL2).

link

stavros 1039 days ago

I am, thanks!

link