Hacker News new | ask | show | jobs
Pipeline Parallelism: Distributed Training via Model Partitioning (siboehm.com)
2 points by ml_basics 887 days ago