Hacker News new | ask | show | jobs
by wenyuanyu 482 days ago
No, besides accessing training data, there is also logging and checkpointing... When you run k8s over it, and there are multiple training jobs... isolated local storage is a nightmare...