With the advent of modern package managers for HPC (EasyBuild, Spack, etc.) automated building of large amounts of software is becoming easier, quickly giving rise to issues related to life cycle management of applications. This makes tracking the applications and libraries that actually get used considerably more important. Existing solutions (module load hooks, launch wrappers) do not account for user-built software, are hard to deploy or produce inconclusive results.
OGRT introduces a way to track the execution of programs and the shared objects they load in a lightweight manner and without launch wrappers. It supports watermarking of binaries, capturing the environment of tracked processes and is transparent to the user. Data is aggregated and persisted into configurable backends (currently Elasticsearch/Splunk).
OGRT is a versatile tool, which can be used to:
- provide a census of used software (including user-built)
- troubleshoot problems with user’s programs picking up unexpected shared libraries
- retroactively inform users about buggy libraries
- contribute to reproducibility of application runs
This presentation will show how easy it is to deploy OGRT and give a demo of the capabilities of OGRT when plugged into an Elasticsearch backend. Also the production deployment within a bioinformatics focused environment and the insights gained from analyzing data obtained through OGRT will be discussed.