Publications

Detailed Information

SnuHPL: High Performance LINPACK for Heterogeneous GPUs

Cited 0 time in Web of Science Cited 5 time in Scopus
Authors

Kim, Jinpyo; Kwon, Hyungdal; Kang, Jintaek; Park, Jihwan; Lee, Seungwook; Lee, Jaejin

Issue Date
2022-06
Publisher
Association for Computing Machinery
Citation
Proceedings of the International Conference on Supercomputing, p. 18
Abstract
© 2022 ACM.These days, it is typical for a large-scale cluster system to have different kinds of GPUs. However, HPL (High-Performance LINPACK), the de-facto standard LINPACK implementation for evaluating the performance of a cluster system, is originally designed to work only for homogeneous CPU-only systems. In this paper, we develop SnuHPL, an optimized HPL for clusters of modern heterogeneous GPUs. To optimize SnuHPL for the heterogeneous GPUs, we design a performance model, a SnuHPL simulator based on the model, and a greedy heuristic algorithm based on the simulator. The algorithm generates the best data distribution for a given cluster configuration by considering computing power, memory capacity, and network performance altogether. We also present a simple technique to increase the energy efficiency of HPL by adjusting the core clock frequency of the GPUs. The evaluation of the data distribution algorithm on small clusters of different GPU combinations shows that it outperforms well-known other data distribution strategies. We show the effectiveness of SnuHPL on a cluster of 1,760 NVIDIA A100-80GB GPUs and 440 A100-40GB GPUs. We also show the effectiveness of the proposed energy optimization technique on a cluster of 144 A100-80GB GPUs.
URI
https://hdl.handle.net/10371/200912
DOI
https://doi.org/10.1145/3524059.3532370
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share