Publications
Detailed Information
McDRAM: Low Latency and Energy-Efficient Matrix Computation in DRAM
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 유승주 | - |
dc.contributor.author | 신현승 | - |
dc.date.accessioned | 2018-05-29T03:32:44Z | - |
dc.date.available | 2018-05-29T03:32:44Z | - |
dc.date.issued | 2018-02 | - |
dc.identifier.other | 000000149545 | - |
dc.identifier.uri | https://hdl.handle.net/10371/141557 | - |
dc.description | 학위논문 (석사)-- 서울대학교 대학원 : 공과대학 컴퓨터공학부, 2018. 2. 유승주. | - |
dc.description.abstract | Neural networks are characterized by massively parallel computation and high memory bandwidth. In particular, memory bandwidth severely limits performance and increases power consumption. In order to overcome memory bottleneck of neural network applications, we propose a novel memory architecture called McDRAM where DRAM dies are equipped with a large number of multiplier-accumulator (MAC) units to perform neural networks internally. Each bank of DRAM memory has multiple MACs as much as the size of memory pre-fetch data, thereby fully utilizing internal bandwidth of DRAM which far larger than external memory bandwidth. McDRAM broadcast data efficiently to all bank without any modifications of DRAM data bus, and it performs MAC operations in the all banks with a single DRAM command. McDRAM is implemented based on the state-of-the-art commercial memory architecture, HBM2, and it equips thousands of MACs (up to 6,144 in HBM2) in a single DRAM package. According to our experiments with in-house memory models based on commercial JEDEC HBM2 simulator, McDRAM achieves 18.68x TOPS/W performance compared to the state-of-the-art hardware accelerator (Google TPU) in LSTM. | - |
dc.description.tableofcontents | 1. Introduction 1
2. Backgound and Motivation 8 3. McDRAM Architecture 16 4. McDRAM Scheduling 28 5. Evaluation Methodology 36 6. Evaluation Results 41 7. Related Work 50 8. Conclusion 55 | - |
dc.format | application/pdf | - |
dc.format.extent | 1063568 bytes | - |
dc.format.medium | application/pdf | - |
dc.language.iso | en | - |
dc.publisher | 서울대학교 대학원 | - |
dc.subject | Neural Network | - |
dc.subject | DRAM | - |
dc.subject | RNN | - |
dc.subject | LSTM | - |
dc.subject | MLP | - |
dc.subject | MAC | - |
dc.subject | HBM2 | - |
dc.subject.ddc | 621.39 | - |
dc.title | McDRAM: Low Latency and Energy-Efficient Matrix Computation in DRAM | - |
dc.type | Thesis | - |
dc.description.degree | Master | - |
dc.contributor.affiliation | 공과대학 컴퓨터공학부 | - |
dc.date.awarded | 2018-02 | - |
- Appears in Collections:
- Files in This Item:
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.