Hacker News new | ask | show | jobs
by zaptrem 471 days ago
Thanks! We used SkyPilot (an open source cloud GPU worker management tool) to help out with both our small (single node) and large (many node) training runs.