Browse

Risk prediction using common and rare genetic variants: application to Type 2 diabetes

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors
배성환
Advisor
박태성
Major
자연과학대학 협동과정 생물정보학전공
Issue Date
2018-02
Publisher
서울대학교 대학원
Keywords
Whole exome sequencing (WES)Risk prediction modelType 2 diabetes (T2D)Penalized regression methodsStepwise selectionSupport vector machine (SVM)
Description
학위논문 (석사)-- 서울대학교 대학원 : 자연과학대학 협동과정 생물정보학전공, 2018. 2. 박태성.
Abstract
Genome-wide association studies (GWAS) have identified many disease-related common variants.Common genetic variants are being diagnosed and treated.Furthermore,using common genetic variants, there have been several prediction models suggested based on penalized regression or statistical learning methods. However, the common variant is not sufficient to explain the phenotype. One way to solve this problem is to consider rare variations. This is because rare variants has a large impact on disease. A recent development of next generation sequencing technology (NGS) has identified several disease-related rare genetic variants. However only a few studies have compared predictive models using both common and rare variants. The aim of our study is to compare the performance of prediction models systematically by using common and rare variants from the Whole Exome Sequencing (WES) data of Type 2 Diabetes Genetic Exploration by Next-generation sequencing in Ethnic Samples (T2D-GENES) Consortium. We first constructed risk prediction models, such as stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), Elastic-Net (EN) and support vector machine (SVM). We then compared prediction accuracy by calculating the area under the curve (AUC). Our results show that the performance using both common and rare variants was better than using either the common variants only or the rare variants only. Although the AUC values were different depending on the variant sets, the AUC values of SVM prediction models were always larger than those of other prediction models.Among the four rare variant sets, AUC value was larger at ptv_ns set.
Language
English
URI
https://hdl.handle.net/10371/142486
Files in This Item:
Appears in Collections:
College of Natural Sciences (자연과학대학)Program in Bioinformatics (협동과정-생물정보학전공)Theses (Master's Degree_협동과정-생물정보학전공)
  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse