Multi-GPU Inference with Accelerate (bengubler.com)
14 points by nebrelbug 1099 days ago
1 comment

A quick tutorial on using Accelerate to run LLM inference in parallel across multiple GPUs.