Publications

Detailed Information

Integrated genome sizing (IGS) approach for the parallelization of whole genome analysis

DC Field Value Language
dc.contributor.authorSona, Peter-
dc.contributor.authorHong, Jong Hui-
dc.contributor.authorLee, Sunho-
dc.contributor.authorKim, Byong Joon-
dc.contributor.authorHong, Woon-Young-
dc.contributor.authorJung, Jongcheol-
dc.contributor.authorKim, Han-Na-
dc.contributor.authorKim, Hyung-Lae-
dc.contributor.authorChristopher, David-
dc.contributor.authorHerviou, Laurent-
dc.contributor.authorIm, Young Hwan-
dc.contributor.authorLee, Kwee-Yum-
dc.contributor.authorKim, Tae Soon-
dc.contributor.authorJung, Jongsun-
dc.date.accessioned2019-03-12T00:41:22Z-
dc.date.available2019-03-12T09:57:49Z-
dc.date.issued2018-12-03-
dc.identifier.citationBMC Bioinformatics, 19(1):462ko_KR
dc.identifier.issn1471-2105-
dc.identifier.urihttps://hdl.handle.net/10371/146963-
dc.description.abstractBackground
The use of whole genome sequence has increased recently with rapid progression of next-generation sequencing (NGS) technologies. However, storing raw sequence reads to perform large-scale genome analysis pose hardware challenges. Despite advancement in genome analytic platforms, efficient approaches remain relevant especially as applied to the human genome. In this study, an Integrated Genome Sizing (IGS) approach is adopted to speed up multiple whole genome analysis in high-performance computing (HPC) environment. The approach splits a genome (GRCh37) into 630 chunks (fragments) wherein multiple chunks can simultaneously be parallelized for sequence analyses across cohorts.

Results
IGS was integrated on Maha-Fs (HPC) system, to provide the parallelization required to analyze 2504 whole genomes. Using a single reference pilot genome, NA12878, we compared the NGS process time between Maha-Fs (NFS SATA hard disk drive) and SGI-UV300 (solid state drive memory). It was observed that SGI-UV300 was faster, having 32.5 mins of process time, while that of the Maha-Fs was 55.2 mins.

Conclusions
The implementation of IGS can leverage the ability of HPC systems to analyze multiple genomes simultaneously. We believe this approach will accelerate research advancement in personalized genomic medicine. Our method is comparable to the fastest methods for sequence alignment.
ko_KR
dc.description.sponsorshipThis work was supported by the INNOPOLIS Foundation, a grant-in-aid from the Korean government through Syntekabio, Inc. [grant number A2014DD101]; the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI); and the Ministry of Health & Welfare, Republic of Korea [grant number HI14C0072] The funding bodies had no role in the design, collection, analysis, or interpretation of data in this study.ko_KR
dc.language.isoenko_KR
dc.publisherBioMed Centralko_KR
dc.subjectGenome sizingko_KR
dc.subjectSequencingko_KR
dc.subjectGenome analysisko_KR
dc.subjectStatisticsko_KR
dc.subjectInfrastructureko_KR
dc.subjectStorageko_KR
dc.subjectWhole genomeko_KR
dc.titleIntegrated genome sizing (IGS) approach for the parallelization of whole genome analysisko_KR
dc.typeArticleko_KR
dc.contributor.AlternativeAuthor홍정휘-
dc.contributor.AlternativeAuthor이순호-
dc.contributor.AlternativeAuthor김병준-
dc.contributor.AlternativeAuthor홍운영-
dc.contributor.AlternativeAuthor정종철-
dc.contributor.AlternativeAuthor김한나-
dc.contributor.AlternativeAuthor김형래-
dc.contributor.AlternativeAuthor임영환-
dc.contributor.AlternativeAuthor이귀염-
dc.contributor.AlternativeAuthor김태순-
dc.contributor.AlternativeAuthor정종선-
dc.identifier.doi10.1186/s12859-018-2499-1-
dc.language.rfc3066en-
dc.rights.holderThe Author(s).-
dc.date.updated2018-12-09T04:25:46Z-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share