Optimizing Memory Management Systems for High Performance and Scalability

박성재

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Optimizing Memory Management Systems for High Performance and Scalability : 높은 성능과 확장성을 위한 메모리 관리 시스템 최적화

DC Field	Value	Language
dc.contributor.advisor	염헌영	-
dc.contributor.author	박성재	-
dc.date.accessioned	2019-10-21T02:27:33Z	-
dc.date.available	2019-10-21T02:27:33Z	-
dc.date.issued	2019-08	-
dc.identifier.other	000000156341	-
dc.identifier.uri	https://hdl.handle.net/10371/162025	-
dc.identifier.uri	http://dcollection.snu.ac.kr/common/orgView/000000156341	ko_KR
dc.description	학위논문(박사)--서울대학교 대학원 :공과대학 컴퓨터공학부,2019. 8. 염헌영.	-
dc.description.abstract	One common characteristic of modern workloads which appeared with recent computing paradigms including cloud, big data and machine learning is memory intensiveness. Such workloads usually have huge working sets that cannot be fully accommodated in DRAM in many case. Those also tend to show only low locality so that the small CPU cache cannot hide DRAM or lower level memory access overhead. Meanwhile, computing hardware has also evolved to keep pace with this change. (1) Computing systems are increasing the size of their main memory so that those could accommodate more of the huge working sets. As a result, data center servers utilizing few hundreds of gigabytes of DRAM have been common and even terabytes of DRAM equipped systems exist. (2) Massive parallelism is becomming common and essential. CPU vendors have started to increase the number of CPU cores instead of the CPU frequency due to the heat dissipation and power consumption problem since the early 2000s. Prevalent datacenter systems provide few hundreds of CPU cores; Few thousands of CPU cores are not rare. Such many-core systems are normally constructed in non-uniform memory access (NUMA) architecture. Therefore, efficient, effective and NUMA-awared use of this parallelism is especially important for the memory intensive workloads. Compared to these rapid changes of workload characteristics and hardware, memory management system software has not sufficiently optimized. Consequently, the memory management system software has been a bottleneck. In other words, the memory intensive modern workloads cannot fully utilize the evolved modern hardware unless the underlying memory management system is completely optimized. This paper provides an overview of a few limitations in existing memory management systems and introduces two optimization approaches for high performance and scalability of the memory management systems. The first approach improves the performance of the memory systems by guaranteeing huge page utilization under memory fragmentation situation. For the guarantee, we introduce a contiguous memory allocator that guarantees success and low latency of its allocations. The second approach intends to optimize the NUMA-aware system scalability. For that, we optimize virtual memory address space management system by substituting virtual memory area (VMA) managing red-black tree protection from global reader-writer locking to an RCU extension. Because no RCU extension including state-of-the-arts are NUMA oblivious, we also designed new RCU extension that provides NUMA-aware scalable update-side synchronization.	-
dc.description.tableofcontents	Abstract 1 Chapter 1 Introduction 6 1.1 Motivation 6 1.2 Approaches 7 1.2.1 An Optimization for High Performance 7 1.2.2 An Optimization for High Scalability 9 1.3 Dissertation Structure 10 Chapter 2 Guaranteed Transparent Huge Pages Allocations 12 2.1 Introduction 12 2.2 Background 16 2.2.1 Devices using DMA 16 2.2.2 Huge Pages 17 2.2.3 Buddy Allocator 20 2.2.4 Memory Reservation 21 2.2.5 Contiguous Memory Allocator 21 2.3 Guaranteed CMA 22 2.3.1 Secondary Class Clients of GCMA 23 2.3.2 Limitations and Optimizations 26 2.4 Implementation 27 2.4.1 Contiguous Memory Allocation 29 2.4.2 DMEM: Discardable Memory 30 2.5 Guaranteed THP 30 2.6 Evaluation 32 2.6.1 Evaluation on a Mobile System 32 2.6.2 Evaluation on a Server System 38 2.7 Related Work 45 2.8 Conclusion 47 Chapter 3 A Scalable Virtual Address Space Protected by an HTM-based NUMA-aware RCU Extension 48 3.1 Introduction 48 3.2 Background and Related Work 50 3.2.1 Read-Copy Update 50 3.2.2 Hardware Transactional Memory 53 3.2.3 Related Work 54 3.3. An RCU Extension for NUMA Systems 57 3.3.1 Root Cause of HTM Performance Degradation on NUMA systems 57 3.3.2 Design of RCX 62 3.3.3 Implementation 70 3.4 Evaluation 71 3.4.1 Evaluation Setup 71 3.4.2 Micro-benchmarks 72 3.4.3 Macro-benchmark 76 3.5 Conclusion. 80 Chapter 4 Conculsion 81	-
dc.language.iso	eng	-
dc.publisher	서울대학교 대학원	-
dc.subject	Multicore	-
dc.subject	Parallelism	-
dc.subject	RCU	-
dc.subject	Fragmentation	-
dc.subject	Memory	-
dc.subject	Operating System	-
dc.subject.ddc	621.39	-
dc.title	Optimizing Memory Management Systems for High Performance and Scalability	-
dc.title.alternative	높은 성능과 확장성을 위한 메모리 관리 시스템 최적화	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	SeongJae Park	-
dc.contributor.department	공과대학 컴퓨터공학부	-
dc.description.degree	Doctor	-
dc.date.awarded	2019-08	-
dc.identifier.uci	I804:11032-000000156341	-
dc.identifier.holdings	000000000040▲000000000041▲000000156341▲	-

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Computer Science and Engineering (컴퓨터공학부)
  - Theses (Ph.D. / Sc.D._컴퓨터공학부)

Files in This Item:

000000156341.pdf 2.29 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share