by nebrelbug 1100 days ago
Quick tutorial on how to use Accelerate to run inference on LLMs in parallel