Publications

Detailed Information

Circuit and Architecture Optimization Techniques for Emerging Technologies of High-Speed Computing : 고성능 컴퓨팅 이머징 기술을 위한 회로 및 아키텍쳐 최적화 기법

DC Field Value Language
dc.contributor.advisor김태환-
dc.contributor.author박희천-
dc.date.accessioned2018-05-28T16:20:52Z-
dc.date.available2018-05-28T16:20:52Z-
dc.date.issued2018-02-
dc.identifier.other000000149859-
dc.identifier.urihttps://hdl.handle.net/10371/140669-
dc.description학위논문 (박사)-- 서울대학교 대학원 : 공과대학 전기·컴퓨터공학부, 2018. 2. 김태환.-
dc.description.abstractFor system-on-chip (SoC) design improvement in more-than-Moore scale, entirely new high-speed computing technologies beyond conventional optimizations are often
proposed. They can solve many limitations of existing designs from different angle of the view. However, as engineers are not mature with such new technologies, these emerging technologies are facing many problems in realization. Even larger implementation overhead than benefits prevents these technologies from being used in real world. This dissertation presents some of the emerging high-speed computing technologies and their limitations in realization, and proposes a solution to each of the technologies to be used in industry with lower technology entry barrier.
Firstly, TSV(Through-Silicon Via)-based 3-D IC is introduced for denser chip design by stacking dies vertically. In this dissertation, we solved TSV reliability problem in the clock tree of 3-D IC with a full solution of designing and synthesizing a TSV fault-tolerant 3-D clock tree.
Secondly, as clock tree synthesis becomes more complicated under recent design environment of low supply voltage and large variations, asynchronous circuit design have been considered as an alternative to the clock-based synchronous design. In the dissertation, we proposed a new structure of single-rail/dual-rail hybrid asynchronous design that achieves both robustness against variations and low implementation overhead.
This design is also devised to be compatible with conventional standard cell libraries and computer-aided design (CAD) tools for productivity and practicality.
Lastly, with the aid of clock-less circuit design, biologically inspired neuromorphic computing architecture is emerged to overcome the memory-computation gap in the traditional von Neumann architecture. We improved the performance of the architecture with a new approach of cross optimization of multiple synapse networks in implementation of a deep neural network (DNN).
-
dc.description.tableofcontents1 INTRODUCTION 1
1.1 TSV based 3-D IC 1
1.2 Asynchronous Circuit Design 2
1.3 Neuromorphic Computing Architecture 3
1.4 Contributions of this Dissertation 4
2 FAULT-TOLERANT CLOCK TREE SYNTHESIS FOR TSV-BASED 3-D IC 6
2.1 Preliminaries 6
2.2 Motivations 11
2.2.1 Limitations of Previous TFU Designs 11
2.2.2 Limitations for Application of Previous TFUs 13
2.3 3-D TSV Fault-Tolerant Clock Tree Synthesis 16
2.3.1 Slew-Controlled TSV Fault-tolerant Unit (SC-TFU) 16
2.3.2 Fault-Tolerant 3-D Clock Tree Synthesis 23
2.4 Experimental Results 33
2.5 Summary 42
3 PRACTICAL ASYNCHRONOUS CIRCUIT DESIGN WITH HYBRID STRUCTURE 45
3.1 Preliminaries 45
3.1.1 Quasi-Delay Insensitive Model 45
3.1.2 Single-rail vs. Dual-rail Asynchronous Circuits 46
3.1.3 Dynamic vs. Static Logic based Asynchronous Circuits 50
3.2 Synthesis of Hybrid Asynchronous Circuits 51
3.2.1 Overview of Hybrid Asynchronous Circuit Structure 51
3.2.2 Design Details for Hybrid Asynchronous Circuits 54
3.2.3 Transformation to the Hybrid Asynchronous Circuit 58
3.2.4 Overall Design Flow 63
3.3 Experimental Results 64
3.3.1 Experiment Setup and Design Models 64
3.3.2 Assessment of Area Overhead 66
3.3.3 Assessment of Power Consumption 68
3.3.4 Assessment of Circuit Performance 72
3.4 Summary 74
4 NEUROMORPHIC COMPUTING ARCHITECTURE OPTIMIZATION SCHEME 76
4.1 Motivation 76
4.2 Preliminaries 77
4.3 Structure Optimization of Neuromorphic Inter-core Architecture 80
4.3.1 Zero Wait Inter-core Architecture 80
4.3.2 Resource Configuration of Inter-core Architecture 84
4.3.3 Using Denaxo-driven Inter-core in Large DNNs 86
4.4 Experimental Results 87
4.4.1 Experiment Setups and Design Models 87
4.4.2 Evaluation of Computation Speed 89
4.4.3 Evaluation of Cell Area 93
4.4.4 Experiments with MNIST Dataset 95
4.5 Summary 96
5 CONCLUSION 97
5.1 Chapter 2 97
5.2 Chapter 3 97
5.3 Chapter 4 98
Abstract (In Korean) 108
-
dc.formatapplication/pdf-
dc.format.extent5762536 bytes-
dc.format.mediumapplication/pdf-
dc.language.isoen-
dc.publisher서울대학교 대학원-
dc.subjectEmerging technologies-
dc.subject3-D IC-
dc.subjectThrough-silicon via-
dc.subjectFault-tolerance-
dc.subjectClock tree-
dc.subjectAsynchronous design-
dc.subjectSingle-rail-
dc.subjectDual-rail-
dc.subjectDynamic logic-
dc.subjectStatic logic-
dc.subjectNeuromorphic computing-
dc.subjectDeep neural network-
dc.subjectSynaptic network-
dc.subjectDendritic-based-
dc.subjectAxonal-based-
dc.subject.ddc621.3-
dc.titleCircuit and Architecture Optimization Techniques for Emerging Technologies of High-Speed Computing-
dc.title.alternative고성능 컴퓨팅 이머징 기술을 위한 회로 및 아키텍쳐 최적화 기법-
dc.typeThesis-
dc.contributor.AlternativeAuthorHeechun Park-
dc.description.degreeDoctor-
dc.contributor.affiliation공과대학 전기·컴퓨터공학부-
dc.date.awarded2018-02-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share