Hacker News new | ask | show | jobs
by nathanasmith 637 days ago
The system has 512 GB of RAM so while it'll be slower at inference, he really has about 704 GB at his disposal to run the model assuming he distributes the weights across the VRAM and system RAM.