Hacker News
Show HN: Slack bot to monitor/stop/restart ML model training remotely (github.com)
2 points by rahuldan 1303 days ago
I built this personal project so I could monitor, stop, and restart my ML model training from Slack. I've wasted a lot of GPU hours on runs with the wrong hyperparameters, or on training that got stuck for some random reason, only to notice long afterwards. So I thought a tool like this would come in handy for keeping an eye on training from my phone.

I showed it to a few people and they liked the idea, so I open-sourced it and am posting it here.