by highd
3450 days ago
Yes! I'm thinking of building something like this for my neural net training (1-2 days on AWS, 16 GPUs/processes on the job). In this case the "state" that I'd like to access is all the parameters of the model and training history, so I'm thinking I'll probably store an mmapped file so I can use other processes to poke at it while it's running. That way I can decouple the write-test-debug loops for the training code and the viz code.
That seems to run a substantial risk of seeing it in an inconsistent state, yeah?
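One common way to reduce that risk is a seqlock-style version counter in front of the mapped data: the writer makes the counter odd while it is updating and even when it is done, and a reader retries whenever it sees an odd counter or a counter that changed during its copy. Below is a minimal sketch of that idea, not what either poster actually built. The file layout (an 8-byte counter followed by a fixed-size payload) and the names `create_shared`, `open_shared`, `write_snapshot`, and `read_snapshot` are all assumptions made up for illustration, and it assumes a single writer.

```python
import mmap
import struct

# Hypothetical layout: an 8-byte little-endian sequence counter at offset 0,
# followed by a fixed-size payload (e.g. serialized parameters).
HEADER = struct.Struct("<Q")


def create_shared(path, payload_size):
    """Create a zero-filled file holding the header plus the payload."""
    with open(path, "wb") as f:
        f.write(b"\x00" * (HEADER.size + payload_size))


def open_shared(path):
    """Map the whole file read/write; the mapping outlives the file object."""
    with open(path, "r+b") as f:
        return mmap.mmap(f.fileno(), 0)


def write_snapshot(buf, payload):
    """Writer side (single writer assumed): odd while writing, even when done."""
    (seq,) = HEADER.unpack_from(buf, 0)
    HEADER.pack_into(buf, 0, seq + 1)  # odd: update in progress
    buf[HEADER.size:HEADER.size + len(payload)] = payload
    HEADER.pack_into(buf, 0, seq + 2)  # even: update complete


def read_snapshot(buf, payload_size, max_tries=1000):
    """Reader side: retry until the counter is even and unchanged by the copy."""
    for _ in range(max_tries):
        (before,) = HEADER.unpack_from(buf, 0)
        if before % 2:  # writer is mid-update, try again
            continue
        data = bytes(buf[HEADER.size:HEADER.size + payload_size])
        (after,) = HEADER.unpack_from(buf, 0)
        if before == after:
            return data
    raise RuntimeError("no consistent snapshot after max_tries")
```

The training process would call `write_snapshot` after each checkpointable step, and the viz process would call `read_snapshot` on its own mapping of the same file. Python gives no explicit memory-ordering guarantees here, so treat this as an illustration of the protocol rather than a hardened implementation; with very large parameter buffers the reader's copy can also take long enough that retries become frequent.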