Publications
Detailed Information
Practical issues for screening and variable selection method in a Genome-Wide Association Analysis : 전장유전체 연관분석에서의 변수 선별과 변수 선택 방법의 현실적 사안들
DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 박태성 | - |
dc.contributor.author | 홍성연 | - |
dc.date.accessioned | 2017-07-19T08:43:28Z | - |
dc.date.available | 2017-07-19T08:43:28Z | - |
dc.date.issued | 2013-02 | - |
dc.identifier.other | 000000009611 | - |
dc.identifier.uri | https://hdl.handle.net/10371/131267 | - |
dc.description | 학위논문 (석사)-- 서울대학교 대학원 : 통계학과, 2013. 2. 박태성. | - |
dc.description.abstract | Variable selection plays an important role in high dimensional statistical modeling analysis. Computational cost and estimation accuracy are two main concerns for statistical inference of high dimensional data. Recently, many high dimensional data have been generated in biomedical science such as microarray data and single nucleotide polymorphism (SNP) data. Especially, the genome-wide association studies (GWAS) which focus on identifying SNPs associated with a disease of interest, have produced ultra-high dimensional data. Numerous methods have been proposed to handle GWAS data. Most statistical methods have adopted a two-stage approach: (1) pre-screening for dimensional reduction, (2) variable selection for identification of causal SNPs. The pre-screening step selects SNPs in terms of their p-values or absolute value of regression coefficients in single SNP analysis. Penalized regression such as Ridge, Lasso, adaptive Lasso and Elastic-net are commonly used for the variable selection step. In this paper, we investigate which combination of prescreening method and penalized regression performs best on continuous type response variable via real GWA data containing 327,872 SNPs from 8842 individuals. | - |
dc.description.tableofcontents | Contents
Abstract i Chapter 1. Introduction 1 1.1 Background 1 1.2 Overview 3 Chapter 2. Methods 5 2.1 Standardization 5 2.2 Pre-screening 6 2.3 Variable Selection 6 2.4 Ordering 10 Chapter 3. Analysis 11 3.1 KARE data 11 3.2 Pre-screening 12 3.3 Variable Selection 15 3.4 Comparison Study 19 Chapter 4. Discussion 21 Bibliography 23 초 록 26 List of Figures [Figure 1] 14 [Figure 2] 17 [Figure 3] 18 | - |
dc.format | application/pdf | - |
dc.format.extent | 508937 bytes | - |
dc.format.medium | application/pdf | - |
dc.language.iso | en | - |
dc.publisher | 서울대학교 대학원 | - |
dc.subject.ddc | 519 | - |
dc.title | Practical issues for screening and variable selection method in a Genome-Wide Association Analysis | - |
dc.title.alternative | 전장유전체 연관분석에서의 변수 선별과 변수 선택 방법의 현실적 사안들 | - |
dc.type | Thesis | - |
dc.contributor.AlternativeAuthor | Sung-Yeon Hong | - |
dc.description.degree | Master | - |
dc.citation.pages | 27 | - |
dc.contributor.affiliation | 자연과학대학 통계학과 | - |
dc.date.awarded | 2013-02 | - |
- Appears in Collections:
- Files in This Item:
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.