SchedMD is the core company behind the Slurm workload manager software, a free open-source workload manager designed specifically to satisfy the demanding needs of high performance computing.
The purpose of this presentation is to raise awareness about some directions in the field of HPC workload management. The areas of focus are scalability, data management and new architectures. Specifically, in the area of scalability we will talk about issues and features such as large node and core count, power management, failure management and federated clusters. In the area of data management, we’ll focus on Burst Buffer, which is a high-speed data store. Finally, in the area of new architectures we’ll talk about the KNL (Intel Knights Landing) support in the Slurm workload manager.