
FROOM: A Framework of Operators for OTF2 Modification

Proceedings of the 2023 SC Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis (SC Workshops 2023), pages 1403–1411. Association for Computing Machinery (ACM), New York, United States of America (Nov 12, 2023). Publisher Copyright: © 2023 Owner/Author.
DOI: 10.1145/3624062.3624209

Abstract

In recent years, High Performance Computing (HPC) has become increasingly important for many industries and research areas besides ‘classic’ applications. As new domains emerge, applications, implementations and frameworks become more diverse. Generic performance analysis tools often cannot keep up with the development speed of new approaches for workload distribution, offloading, and communication. Some of the new approaches employ their own performance monitoring, which is difficult to integrate into generic tools designed for traditional HPC. Performance measurements often result in a collection of separate performance logs that logically form a unit but cannot intuitively be investigated together with established performance tools. In this paper, we present a tool library that can be used to combine separate performance logs and separately recorded metrics into one single performance log, enabling investigation of such performance data as a unit. Use cases from Big Data processing and AI show the broad applicability of our approach.
