FARNN: FPGA-GPU Hybrid Acceleration Platform for Recurrent Neural Networks

Cho, Hyungmin; Lee, Jeesoo; Lee, Jaejin

doi:10.1109/TPDS.2021.3124125

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

FARNN: FPGA-GPU Hybrid Acceleration Platform for Recurrent Neural Networks

DC Field	Value	Language
dc.contributor.author	Cho, Hyungmin	-
dc.contributor.author	Lee, Jeesoo	-
dc.contributor.author	Lee, Jaejin	-
dc.date.accessioned	2024-05-03T07:36:31Z	-
dc.date.available	2024-05-03T07:36:31Z	-
dc.date.created	2021-12-14	-
dc.date.issued	2022-07-01	-
dc.identifier.citation	IEEE Transactions on Parallel and Distributed Systems, Vol.33 No.7, pp.1725-1738	-
dc.identifier.issn	1045-9219	-
dc.identifier.uri	https://hdl.handle.net/10371/200911	-
dc.description.abstract	GPU-based platforms provide high computation throughput for large mini-batch deep neural network computations. However, a large batch size may not be ideal for some situations, such as aiming at low latency, training on edge/mobile devices, partial retraining for personalization, and having irregular input sequence lengths. GPU performance suffers from low utilization especially for small-batch recurrent neural network (RNN) applications where sequential computations are required. In this article, we propose a hybrid architecture, called FARNN, which combines a GPU and an FPGA to accelerate RNN computation for small batch sizes. After separating RNN computations into GPU-efficient and GPU-inefficient tasks, we design special FPGA computation units that accelerate the GPU-inefficient RNN tasks. FARNN off-loads the GPU-inefficient tasks to the FPGA. We evaluate FARNN with synthetic RNN layers of various configurations on the Xilinx UltraScale+ FPGA and the NVIDIA P100 GPU in addition to evaluating it with real RNN applications. The evaluation result indicates that FARNN outperforms the P100 GPU platform for RNN training by up to 4.2x with small batch sizes, long input sequences, and many RNN cells per layer.	-
dc.language	영어	-
dc.publisher	Institute of Electrical and Electronics Engineers	-
dc.title	FARNN: FPGA-GPU Hybrid Acceleration Platform for Recurrent Neural Networks	-
dc.type	Article	-
dc.identifier.doi	10.1109/TPDS.2021.3124125	-
dc.citation.journaltitle	IEEE Transactions on Parallel and Distributed Systems	-
dc.identifier.wosid	000719558900002	-
dc.identifier.scopusid	2-s2.0-85118639502	-
dc.citation.endpage	1738	-
dc.citation.number	7	-
dc.citation.startpage	1725	-
dc.citation.volume	33	-
dc.description.isOpenAccess	N	-
dc.contributor.affiliatedAuthor	Lee, Jaejin	-
dc.type.docType	Article	-
dc.description.journalClass	1	-
dc.subject.keywordAuthor	FPGA	-
dc.subject.keywordAuthor	GPU	-
dc.subject.keywordAuthor	hybrid platform	-
dc.subject.keywordAuthor	RNN	-

Appears in Collections:

Graduate School of Data Science (데이터사이언스 대학원)
- Journal Papers (저널논문_데이터사이언스학과)

Files in This Item:: There are no files associated with this item.

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share