Publications

Detailed Information

Genesis: A Hardware Acceleration Framework for Genomic Data Analysis

Cited 14 time in Web of Science Cited 14 time in Scopus
Authors

Ham, Tae Jun; Bruns-Smith, David; Sweeney, Brendan; Lee, Yejin; Seo, Seong Hoon; Song, U Gyeong; Oh, Young H.; Asanovic, Krste; Lee, Jae Wook; Wills, Lisa Wu

Issue Date
2020-05
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings - International Symposium on Computer Architecture, Vol.2020-May, pp.254-267
Abstract
© 2020 IEEE.In this paper, we describe our vision to accelerate algorithms in the domain of genomic data analysis by proposing a framework called Genesis (genome analysis) that contains an interface and an implementation of a system that processes genomic data efficiently. This framework can be deployed in the cloud and exploit the FPGAs-as-a-service paradigm to provide cost-efficient secondary DNA analysis. We propose conceptualizing genomic reads and associated read attributes as a very large relational database and using extended SQL as a domain-specific language to construct queries that form various data manipulation operations. To accelerate such queries, we design a Genesis hardware library which consists of primitive hardware modules that can be composed to construct a dataflow architecture specialized for those queries. As a proof of concept for the Genesis framework, we present the architecture and the hardware implementation of several genomic analysis stages in the secondary analysis pipeline corresponding to the best known software analysis toolkit, GATK4 workflow proposed by the Broad Institute. We walk through the construction of genomic data analysis operations using a sequence of SQL-style queries and show how Genesis hardware library modules can be utilized to construct the hardware pipelines designed to accelerate such queries. We exploit parallelism and data reuse by utilizing a dataflow architecture along with the use of on-chip scratchpads as well as non-blocking APIs to manage the accelerators, allowing concurrent execution of the accelerator and the host. Our accelerated system deployed on the cloud FPGA performs up to $ 19.3× better than GATK4 running on a commodity multi-core Xeon server and obtains up to $ 15× better cost savings. We believe that if a software algorithm can be mapped onto a hardware library to utilize the underlying accelerator(s) using an already-standardized software interface such as SQL, while allowing the efficient mapping of such interface to primitive hardware modules as we have demonstrated here, it will expedite the acceleration of domainspecific algorithms and allow the easy adaptation of algorithm changes.
ISSN
1063-6897
URI
https://hdl.handle.net/10371/186539
DOI
https://doi.org/10.1109/ISCA45697.2020.00031
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share