Publications

Detailed Information

SnuHPL: High Performance LINPACK for Heterogeneous GPUs

DC Field Value Language
dc.contributor.authorKim, Jinpyo-
dc.contributor.authorKwon, Hyungdal-
dc.contributor.authorKang, Jintaek-
dc.contributor.authorPark, Jihwan-
dc.contributor.authorLee, Seungwook-
dc.contributor.authorLee, Jaejin-
dc.date.accessioned2024-05-03T07:36:47Z-
dc.date.available2024-05-03T07:36:47Z-
dc.date.created2022-07-18-
dc.date.issued2022-06-
dc.identifier.citationProceedings of the International Conference on Supercomputing, p. 18-
dc.identifier.urihttps://hdl.handle.net/10371/200912-
dc.description.abstract© 2022 ACM.These days, it is typical for a large-scale cluster system to have different kinds of GPUs. However, HPL (High-Performance LINPACK), the de-facto standard LINPACK implementation for evaluating the performance of a cluster system, is originally designed to work only for homogeneous CPU-only systems. In this paper, we develop SnuHPL, an optimized HPL for clusters of modern heterogeneous GPUs. To optimize SnuHPL for the heterogeneous GPUs, we design a performance model, a SnuHPL simulator based on the model, and a greedy heuristic algorithm based on the simulator. The algorithm generates the best data distribution for a given cluster configuration by considering computing power, memory capacity, and network performance altogether. We also present a simple technique to increase the energy efficiency of HPL by adjusting the core clock frequency of the GPUs. The evaluation of the data distribution algorithm on small clusters of different GPU combinations shows that it outperforms well-known other data distribution strategies. We show the effectiveness of SnuHPL on a cluster of 1,760 NVIDIA A100-80GB GPUs and 440 A100-40GB GPUs. We also show the effectiveness of the proposed energy optimization technique on a cluster of 144 A100-80GB GPUs.-
dc.language영어-
dc.publisherAssociation for Computing Machinery-
dc.titleSnuHPL: High Performance LINPACK for Heterogeneous GPUs-
dc.typeArticle-
dc.identifier.doi10.1145/3524059.3532370-
dc.citation.journaltitleProceedings of the International Conference on Supercomputing-
dc.identifier.wosid001086201800012-
dc.identifier.scopusid2-s2.0-85132819494-
dc.citation.startpage18-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorLee, Jaejin-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
dc.subject.keywordAuthorCluster-
dc.subject.keywordAuthorGPU-
dc.subject.keywordAuthorHeterogeneous computing-
dc.subject.keywordAuthorHigh performance LINPACK-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share