Browse

A Modified Least Angle Regression Algorithm for Hierarchical Interaction
계층적 교호작용을 고려한 수정된 HLARS 알고리즘 개발

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors
김우성
Advisor
김용대
Major
자연과학대학 통계학과
Issue Date
2016-02
Publisher
서울대학교 대학원
Keywords
High dimensional regressionTwo way interaction modelLASSOLARSMassage passing interface
Description
학위논문 (박사)-- 서울대학교 대학원 : 통계학과, 2016. 2. 김용대.
Abstract
Variable selection is important in high dimensional regression. Traditional variable selection methods such as stepwise selection are unstable which means that the set of the selected variables is sensitive according to the change of data sets. As an alternative to those methods, a series of sparse penalized methods are used for estimation and variable selection simultaneously. The full set of LASSO solutions can be calculated by a minor modification of the LARS algorithm.
In many important practical problems, the main effect alone may not be enough to capture the relationship between the response and predictors, and high-order interactions are often of interest to scientific researchers. In considering two-way interaction models with a large number of covariates, we often would like to determine a smaller subset that exhibits strong effects on the response variable have been suggested.
Considering all possible interactions, however, is almost impossible due to computational burden when the number of covariate is large. To resolve this problem, the heredity structure between the main and interaction effect can be considered, algorithms for LASSO with heredity structure. However these algorithms cannot be executed if the number of main effects is large since still computational burden is large. To resolve this issue, we suggest a hierarchical LARS algorithm which can be parallelized easily with MPI.
The proposed hierarchical LARS is a modified version of LARS, but it is more faster than LARS and has comparable prediction accuracy. It can be scaled up since it is possible to be executed in parallel process. MPI is a well known parallel model and we suggest a MPI-version of hierarchical LARS.
Language
English
URI
http://hdl.handle.net/10371/121159
Files in This Item:
Appears in Collections:
College of Natural Sciences (자연과학대학)Dept. of Statistics (통계학과)Theses (Ph.D. / Sc.D._통계학과)
  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse