SlurmTUI: A Terminal UI for Managing Slurm Jobs
During my PhD, I spent a lot of time running experiments on large HPC clusters, we’re talking thousands of GPU and CPU nodes, massive hyperparameter sweeps, and job arrays that would balloon into hundreds of entries in the queue. If you’ve ever had to babysit Slurm jobs by repeatedly typing squeue -u $USER and squinting at the output, you know the pain.
The existing options were either too barebones (raw Slurm commands) or too heavy (web-based dashboards that the cluster admins may or may not have set up). I wanted something I could just SSH into any login node and run immediately, so I built SlurmTUI.