Hacker News
by gcommer 2664 days ago
A few slightly more advanced GNU Parallel features that I've used:

- `--joblog` writes out a detailed logfile of the jobs, which can be used to resume interrupted runs with `--resume` or `--resume-failed`

- `--slf filename` can be used to provide a list of ssh logins for remote worker nodes to run jobs on. Importantly, parallel will automatically reread this list when it changes. This lets you very easily distribute batch jobs across preemptible gcloud VMs (or EC2 spot instances) and gracefully handle worker nodes appearing and disappearing with just a few lines of bash: https://gist.github.com/gpittarelli/5e14fb772ce0230a3c40ffad...

- When used with bash, parallel can run bash functions if you export them with `export -f functionName`.

2 comments

Yeah, --joblog is a very handy feature. I once hacked a small Python script to produce an ASCII time plot from its output:

https://github.com/ziotom78/plot_joblog

Those are all really good tips, thank you for sharing them.