Evolutionary Machine Learning of Higher Order Relationships in Genome-wide Sequence Analysis

이제근

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Evolutionary Machine Learning of Higher Order Relationships in Genome-wide Sequence Analysis : 유전체 서열 분석에서 고차 관계의 진화적 기계학습

Cited 0 time in Web of Science Cited 0 time in Scopus

Export

Authors: 이제근

Advisor: 장병탁

Major: 자연과학대학 협동과정 생물정보학전공

Issue Date: 2014-02

Publisher: 서울대학교 대학원

Keywords: Higher-order interaction ; Evolutionary computation ; Genome-wide sequence analysis ; Machine learning ; Genomics ; Epigenomics

Description: 학위논문 (박사)-- 서울대학교 대학원 : 협동과정 생물정보학전공, 2014. 2. 장병탁.

Abstract: One of the basic research goals in life science is to understand the complex relationships between biological factors and phenotypes, and to identify the various factors affecting the phenotype. In particular, genomic sequences play a significant role in determining the phenotype, such as gene expression and a susceptibility to disease, so the studies for the fundamental information stored in genome is essential to understanding biological processes. Previous genomic sequence analyses mainly focused on identification of a single associated factor or pairwise relationships with significant effects. Recent development of high-throughput technologies has made it possible to identify the causal factors by carrying out genome-wide analysis. However, it still remains as a challenge to discover higher-order interactions of multiple factors because this involves huge search spaces and computational costs.

In this dissertation, we develop effective methods for identifying the higher-order relationships of sequence elements affecting the phenotype, by combining statistical learning with evolutionary computation. The methods are applied to finding the associated combinatorial factors and dysfunctional modules in various genome-wide sequence analysis problems. Firstly, we show statistical learning-based methods to detect co-regulatory sequence motifs and to investigate combinatorial effects of DNA methylation, affecting on
downstream gene expression. Next, to examine the sequence datasets with a huge number of attributes on human genome, we apply evolutionary computation approaches. Our methods search the problem feature space based on machine learning techniques using training datasets in evolutionary computation processes and are able to find candidate solution well in computationally expensive optimization problems. The experimental results show that the approaches are useful to find the higher-order relationships associated to disease using genomic and epigenomic datasets. In conclusion, our studies would provide practical methods to analyze complex interactions among sequence elements in genomic/epigenomic studies.

Language: English

URI: https://hdl.handle.net/10371/125374

Files in This Item:

000000017660.pdf 11.08 MB

Appears in Collections:

College of Natural Sciences (자연과학대학)
- Program in Bioinformatics (협동과정-생물정보학전공)
  - Theses (Ph.D. / Sc.D._협동과정-생물정보학전공)

Altmetrics

Item View & Download Count

Show Full Item Record

Find it @ SNU

트윗하기

SNS Share