by mythz 281 days ago
Also less power efficient, takes up more PCI slots, and a lot of software doesn't support GPU clustering. I already have 4x 16GB GPUs, which are unable to run large models exceeding 16GB.

Currently running them in different VMs to be able to make full use of them. I used to run them in different Docker containers, but OOM exceptions would frequently bring down the whole server, which running them in VMs helped resolve.

1 comment

What’s your application for high VRAM that doesn’t leverage multiple GPUs?