S-Space College of Natural Sciences (자연과학대학) Dept. of Mathematical Sciences (수리과학부) Theses (Master's Degree_수리과학부)
Implementing general matrix-matrix multiplication algorithm on the Intel Xeon Phi Knights Landing Processor
Intel Xeon Phi Knights Landing 프로세서에서의 일반 행렬 곱셈 알고리즘 구현
- 자연과학대학 수리과학부
- Issue Date
- 서울대학교 대학원
- 학위논문 (석사)-- 서울대학교 대학원 : 자연과학대학 수리과학부, 2018. 2. 신동우.
- This paper presents the design and implementation of general matrix-matrix multiplication (GEMM) algorithm for the second generation Intel Xeon Phi processor codenamed Knights Landing (KNL). We illustrate several developing guidelines to achieve optimal performance with C programming language and the Advanced Vector Extensions (AVX-512) instruction set. Further, we present several environment variable issues associated with parallelization on the KNL. On a single core of the KNL, our double-precision GEMM (DGEMM) implementation achieves up to 99 percent of DGEMM performance using the Intel MKL, which is the current state-of-the-art library. Our parallel implementation for 68 cores of the KNL also achieves good scaling results, up to 93 percent of DGEMM performance using the Intel MKL.