Publications

Detailed Information

BBOS: Efficient HPC Storage Management via Burst Buffer Over-Subscription

Cited 7 time in Web of Science Cited 6 time in Scopus
Authors

Sung, Hanul; Bang, Jiwoo; Kim, Chungyong; Kim, Hyung-Sin; Sim, Alexander; Lockwood, Glenn K.; Eom, Hyeonsang

Issue Date
2020-05
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings - 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGRID 2020, pp.142-151
Abstract
© 2020 IEEE.To avoid access to PFS, dedicated BB allocation is preferred despite of severe BB underutilization. Recently, new all-flash HPC storage systems with integrated BB and PFS are proposed, which speed up access to PFS. For this reason, we adopt BB over-subscription allocation method by allowing HPC applications to use BB only for I/O phase for improving BB utilization. Unfortunately, BB over-subscription aggravates I/O interference and demotion overhead from BB to PFS, resulting in degraded performance. To minimize the performance degradation, we develop an I/O scheduler to prevent I/O congestion and a new transparent data management system based on checkpoint/restart characteristics of HPC applications. With the proposed approach, not only the BB utilization can be improved, but also high performance of applications is achieved. In our experiments, we find that BB utilization is improved at least 2.2x, and more stable and higher checkpoint performance is guaranteed compared to other approaches. Besides, we achieve up to 96.4% hit ratio of restart requests on BB and up to 3.1x higher restart performance than others.
URI
https://hdl.handle.net/10371/186537
DOI
https://doi.org/10.1109/CCGrid49817.2020.00-79
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share