Hacker News
by johndough 137 days ago
I was wondering whether multiple GPUs make it go appreciably faster when limited by VRAM. Do you have some tokens/sec numbers for text generation?