Talks > 20-21/04/2016

Introduction of a near real-time monitoring plugin in the Slurm open source software

With the increasing number of cpu cores in compute nodes of high performance clusters, proper monitoring tools become essential to understand the usage and the behavior of the applications running in the cluster.

In this work a new approach to near real-time monitoring is presented, using the Slurm profiling plugin to display resource usage information for each of the processes running in the cluster. This data improves the understanding of the applications running and can help in highlighting to the user any application-related issue.


Related Talks

Fernando Galindo & Gabriel Verdejo

Asynchronous Job Operator (AJO)

13-14/01/2014

Alejandro Sanchez

Directions in Workload Management

20-21/04/2016

Carlos Blanco

DRM4G: an open source framework for distributed computing

03-04/02/2015

Visit our forum

One of the main goals of this project is to motivate new initiatives and collaborations in the HPC field. Visit our forum to share your knowledge and discuss with other HPC experts!

About us

HPCKP (High-Performance Computing Knowledge Portal) is an Open Knowledge project focused on technology transfer and knowledge sharing in the HPC Science field.

HPC Knowledge Portal is supported and maintained by HPCNow!