Publications

Detailed Information

GradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent

DC Field Value Language
dc.contributor.authorKim, Heesu-
dc.contributor.authorPark, Hanmin-
dc.contributor.authorKim, Taehyun-
dc.contributor.authorCho, Kwanheum-
dc.contributor.authorLee, Eojin-
dc.contributor.authorRyu, Soojung-
dc.contributor.authorLee, Hyuk-Jae-
dc.contributor.authorChoi, Kiyoung-
dc.contributor.authorLee, Jinho-
dc.date.accessioned2024-05-02T05:42:47Z-
dc.date.available2024-05-02T05:42:47Z-
dc.date.created2024-01-18-
dc.date.created2024-01-18-
dc.date.issued2021-
dc.identifier.citation2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), Vol.2021-February, pp.249-262-
dc.identifier.issn1530-0897-
dc.identifier.urihttps://hdl.handle.net/10371/200477-
dc.description.abstractIn this paper, we present GradPIM, a processing-in-memory architecture which accelerates parameter updates of deep neural networks training. As one of processing-in-memory techniques that could be realized in the near future, we propose an incremental, simple architectural design that does not invade the existing memory protocol. Extending DDR4 SDRAM to utilize bank-group parallelism makes our operation designs in processing-in-memory (PIM) module efficient in terms of hardware cost and performance. Our experimental results show that the proposed architecture can improve the performance of DNN training and greatly reduce memory bandwidth requirement while posing only a minimal amount of overhead to the protocol and DRAM area.-
dc.language영어-
dc.publisherIEEE COMPUTER SOC-
dc.titleGradPIM: A Practical Processing-in-DRAM Architecture for Gradient Descent-
dc.typeArticle-
dc.identifier.doi10.1109/HPCA51647.2021.00030-
dc.citation.journaltitle2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021)-
dc.identifier.wosid000671076000019-
dc.identifier.scopusid2-s2.0-85104983900-
dc.citation.endpage262-
dc.citation.startpage249-
dc.citation.volume2021-February-
dc.description.isOpenAccessY-
dc.contributor.affiliatedAuthorChoi, Kiyoung-
dc.contributor.affiliatedAuthorLee, Jinho-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Related Researcher

  • College of Engineering
  • Department of Electrical and Computer Engineering
Research Area AI Accelerators, Distributed Deep Learning, Neural Architecture Search

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share