by nravic 723 days ago
How were you handling GPU state with PyTorch? We added some custom code around CRIU to enable GPU checkpointing, FWIW: https://docs.cedana.ai/setup/gpu-checkpointing/

Not at all. I forked before I used anything with CUDA. I didn't need it, but I guessed that forking after touching CUDA could cause all kinds of weird problems.