Publications

Detailed Information

McDRAM: Low Latency and Energy-Efficient Matrix Computation in DRAM

DC Field Value Language
dc.contributor.advisor유승주-
dc.contributor.author신현승-
dc.date.accessioned2018-05-29T03:32:44Z-
dc.date.available2018-05-29T03:32:44Z-
dc.date.issued2018-02-
dc.identifier.other000000149545-
dc.identifier.urihttps://hdl.handle.net/10371/141557-
dc.description학위논문 (석사)-- 서울대학교 대학원 : 공과대학 컴퓨터공학부, 2018. 2. 유승주.-
dc.description.abstractNeural networks are characterized by massively parallel computation and high memory bandwidth. In particular, memory bandwidth severely limits performance and increases power consumption. In order to overcome memory bottleneck of neural network applications, we propose a novel memory architecture called McDRAM where DRAM dies are equipped with a large number of multiplier-accumulator (MAC) units to perform neural networks internally. Each bank of DRAM memory has multiple MACs as much as the size of memory pre-fetch data, thereby fully utilizing internal bandwidth of DRAM which far larger than external memory bandwidth. McDRAM broadcast data efficiently to all bank without any modifications of DRAM data bus, and it performs MAC operations in the all banks with a single DRAM command. McDRAM is implemented based on the state-of-the-art commercial memory architecture, HBM2, and it equips thousands of MACs (up to 6,144 in HBM2) in a single DRAM package. According to our experiments with in-house memory models based on commercial JEDEC HBM2 simulator, McDRAM achieves 18.68x TOPS/W performance compared to the state-of-the-art hardware accelerator (Google TPU) in LSTM.-
dc.description.tableofcontents1. Introduction 1
2. Backgound and Motivation 8
3. McDRAM Architecture 16
4. McDRAM Scheduling 28
5. Evaluation Methodology 36
6. Evaluation Results 41
7. Related Work 50
8. Conclusion 55
-
dc.formatapplication/pdf-
dc.format.extent1063568 bytes-
dc.format.mediumapplication/pdf-
dc.language.isoen-
dc.publisher서울대학교 대학원-
dc.subjectNeural Network-
dc.subjectDRAM-
dc.subjectRNN-
dc.subjectLSTM-
dc.subjectMLP-
dc.subjectMAC-
dc.subjectHBM2-
dc.subject.ddc621.39-
dc.titleMcDRAM: Low Latency and Energy-Efficient Matrix Computation in DRAM-
dc.typeThesis-
dc.description.degreeMaster-
dc.contributor.affiliation공과대학 컴퓨터공학부-
dc.date.awarded2018-02-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share