Cheetah: Optimizing and Accelerating Homomorphic Encryption for Private Inference

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Cheetah: Optimizing and Accelerating Homomorphic Encryption for Private Inference

Cited 47 time in Web of Science Cited 56 time in Scopus

Authors: Reagen, Brandon; Choi, Woo-Seok; Ko, Yeongil; Lee, Vincent T.; Lee, Hsien-Hsin S.; Wei, Gu-Yeon; Brooks, David

Citation: IEEE High-Performance Computer Architecture Symposium Proceedings, Vol.2021-February, pp.26-39

Abstract: As the application of deep learning continues to grow, so does the amount of data used to make predictions. While traditionally big-data deep learning was constrained by computing performance and off-chip memory bandwidth, a new constraint has emerged: privacy. One solution is homomorphic encryption (HE). Applying HE to the client-cloud model allows cloud services to perform inferences directly on clients' encrypted data. While HE can meet privacy constraints it introduces enormous computational challenges and remains impractically slow on current systems. This paper introduces Cheetah, a set of algorithmic and hardware optimizations for server-side HE DNN inference. Cheetah proposes HE-parameter tuning and operator scheduling optimizations, which together deliver up to 79 x speedup over the state-of-the-art. However, HE inference still falls short of real-time inference speeds by nearly four orders of magnitude. Cheetah further proposes an accelerator architecture to understand the degree of speedup hardware can provide and whether it can bridge HE's real-time performance gap. We evaluate several DNNs and find that privacy-preserving HE inference for ResNet50 can approach real-time speeds with a 587mm(2) accelerator dissipating 30W in 5nm.