|
|
|
|
|
by stabbles
1971 days ago
|
|
The problem with slurm is how it's typically used: ssh into a shared login node with a shared file system, authorization is tightly coupled to linux users on that node, submit jobs with sbatch. Kubernetes deployment feels much more modern and safe. I have worked with containers + slurm, where the vendor libmpi is injected in the container runtime [1] by a hook, which gives you close to bare metal performance with some container goodness in terms of isolation and deployment. [1] https://github.com/eth-cscs/sarus |
|
P.S. I also think the K8s scheduler isn't great.