Implementing CUDA Unified Memory in the PyTorch Framework

Cited 0 times in Web of Science; cited 4 times in Scopus
Authors

Choi, Jake; Yeom, Heon Young; Kim, Yoonhee

Issue Date
2021-01
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings - 2021 IEEE International Conference on Autonomic Computing and Self-Organizing Systems Companion, ACSOS-C 2021, pp.20-25
Abstract
© 2021 IEEE. Popular deep learning frameworks like PyTorch rely heavily on GPUs for training and suffer from out-of-memory (OOM) errors if memory is not managed properly. In this paper, we propose a modification that uses CUDA Unified Memory (UM) to extend GPU memory into the available host memory space, which makes the framework more practical for the programmer and ensures that OOM errors do not occur for any workload. We also pinpoint performance issues that result from our modifications to the framework, and we outline future plans to improve our design, such as reducing redundant memory copies, prefetching, and memory advising. Our implementation shows that the performance overhead of PyTorch UM is minimal when the data footprint is below GPU memory capacity.
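As a rough illustration of the mechanism the abstract refers to, the following is a minimal standalone sketch of a CUDA Unified Memory allocation. It is not the authors' PyTorch modification. The kernel name `scale`, the buffer size, and the launch configuration are assumptions chosen for illustration. A single pointer returned by `cudaMallocManaged` is written on the host and then read and written by a kernel, and the driver migrates pages between host and device on demand.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel standing in for real training work:
// multiplies every element of the buffer by a constant factor.
__global__ void scale(float* data, size_t n, float factor) {
    size_t i = blockIdx.x * static_cast<size_t>(blockDim.x) + threadIdx.x;
    if (i < n) {
        data[i] *= factor;
    }
}

int main() {
    const size_t n = 1 << 20;  // illustrative size, not taken from the paper
    float* data = nullptr;

    // Managed allocation: the same pointer is valid on host and device.
    cudaError_t err = cudaMallocManaged(&data, n * sizeof(float));
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMallocManaged failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // First touch happens on the host.
    for (size_t i = 0; i < n; ++i) {
        data[i] = 1.0f;
    }

    // The kernel uses the same pointer; no explicit cudaMemcpy is issued.
    const int threads = 256;
    const int blocks = static_cast<int>((n + threads - 1) / threads);
    scale<<<blocks, threads>>>(data, n, 2.0f);

    err = cudaGetLastError();  // catches launch errors
    if (err == cudaSuccess) {
        err = cudaDeviceSynchronize();  // wait before touching data on the host
    }
    if (err != cudaSuccess) {
        fprintf(stderr, "kernel failed: %s\n", cudaGetErrorString(err));
        cudaFree(data);
        return 1;
    }

    printf("data[0] = %f\n", data[0]);
    cudaFree(data);
    return 0;
}
```

On platforms that support demand paging for managed memory, an allocation of this kind can exceed physical GPU memory, which is the property the abstract relies on to avoid OOM errors. The prefetching and memory advising mentioned as future work correspond to the CUDA runtime calls `cudaMemPrefetchAsync` and `cudaMemAdvise`. They are left out of the sketch because their parameter lists differ between CUDA toolkit versions.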
URI
https://hdl.handle.net/10371/183790
DOI
https://doi.org/10.1109/ACSOS-C52956.2021.00029