Publications

Detailed Information

Integrated genome sizing (IGS) approach for the parallelization of whole genome analysis

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors

Sona, Peter; Hong, Jong Hui; Lee, Sunho; Kim, Byong Joon; Hong, Woon-Young; Jung, Jongcheol; Kim, Han-Na; Kim, Hyung-Lae; Christopher, David; Herviou, Laurent; Im, Young Hwan; Lee, Kwee-Yum; Kim, Tae Soon; Jung, Jongsun

Issue Date
2018-12-03
Publisher
BioMed Central
Citation
BMC Bioinformatics, 19(1):462
Keywords
Genome sizingSequencingGenome analysisStatisticsInfrastructureStorageWhole genome
Abstract
Background
The use of whole genome sequence has increased recently with rapid progression of next-generation sequencing (NGS) technologies. However, storing raw sequence reads to perform large-scale genome analysis pose hardware challenges. Despite advancement in genome analytic platforms, efficient approaches remain relevant especially as applied to the human genome. In this study, an Integrated Genome Sizing (IGS) approach is adopted to speed up multiple whole genome analysis in high-performance computing (HPC) environment. The approach splits a genome (GRCh37) into 630 chunks (fragments) wherein multiple chunks can simultaneously be parallelized for sequence analyses across cohorts.

Results
IGS was integrated on Maha-Fs (HPC) system, to provide the parallelization required to analyze 2504 whole genomes. Using a single reference pilot genome, NA12878, we compared the NGS process time between Maha-Fs (NFS SATA hard disk drive) and SGI-UV300 (solid state drive memory). It was observed that SGI-UV300 was faster, having 32.5 mins of process time, while that of the Maha-Fs was 55.2 mins.

Conclusions
The implementation of IGS can leverage the ability of HPC systems to analyze multiple genomes simultaneously. We believe this approach will accelerate research advancement in personalized genomic medicine. Our method is comparable to the fastest methods for sequence alignment.
ISSN
1471-2105
Language
English
URI
https://hdl.handle.net/10371/146963
DOI
https://doi.org/10.1186/s12859-018-2499-1
Files in This Item:
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share