Browse

Predicting disease predisposition patterns of the personal genome based on disease hierarchy : 질병 계층 기반 개인 유전체의 질병 위험도 예측

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors
나영지
Advisor
김주한
Major
자연과학대학 협동과정 생물정보학전공
Issue Date
2013-02
Publisher
서울대학교 대학원
Keywords
next-generation sequencingMeSH tree structuredisease riskpersonal genome
Description
학위논문 (박사)-- 서울대학교 대학원 : 협동과정 생물정보학전공, 2013. 2. 김주한.
Abstract
The advent of next-generation sequencing (NGS) technologies has had a huge impact upon functional genomics. The NGS technologies generate millions of short sequence reads per run, making it possible to sequence entire human genomes in a matter of weeks. These NGS technologies have already been employed to sequence the constitutional genomes of several individuals. Ambitious efforts like the 1000 Genomes Project and the Personal Genomes Project hope to add thousands more. The first five cancer genomes revealed thousands of novel somatic mutations and implicated new genes in tumor development and progression. Current knowledge of the genetic variants that underlie disease susceptibility, treatment response and other phenotypes will continually improve as these studies expand the catalog of DNA sequence variation in humans.
As the cost of sequencing continues to freefall, the challenge of solving the data analysis and storage problems becomes more pressing. But those issues are nothing compared to the challenge facing the clinical community who are seeking to mine the genome for clinically actionable information. However, present analytical methods are insufficient to make genetic data accessible in a clinical context, and the clinical usefulness of these data for individual patients has not been formally assessed. Here, I focus on evaluating individual predispositions to specific phenotypic traits given their genetic backgrounds.
In this dissertation, I present a computational method for associating variants in the personal genome sequencing data with predispositions to disease. The method works by ranking all variants in the personal genome as potential disease risks, and reporting MeSH terms that are significantly associated with highly ranked genes. To identify genetic variants associated diseases, I obtained high-throughput sequencing data in several cancer types (acute myeloid leukemia, bladder cancer, breast cancer, colon cancer, glioblastoma multiforme, kidney cancer, lung adenocarcinoma, lung squamous cell carcinoma, malignant melanoma, ovarian serous cystadenocarcinoma and prostate cancer) and non-cancer types (Crohns disease, focal segmental glomerulosclerosis, and retinitis pigmentosa). From disease-gene association in the OMIM, I reconstructed relations of diseases and genes in the MeSH tree structures in order to consider the human disease hierarchical structure of human disease ontology.
The results showed the distribution of mutual information in the MeSH disease category differs according to the population in the healthy people. It suggests that in order to interpret personal genome properly, we may consider population information together. In addition, MeSH disease terms are more highly ranked in the patients than healthy people. Disease-enrichment analysis showed Cancer, Neurological, Endocrine, and Immunological categories were over-represented in the patients as well as healthy people. Namely, it is possible to speculate systemic response patterns to diseases: Neuro-Endocrine-Immune Circuitry. In conclusion, although this study could not answer accurately the disease risk assessment, this study can provide data analysis scheme for the personal genome sequencing data. The scheme of this method has extendibility in genomic-based knowledge: drug-gene, environmental factor-gene and so on.
Language
English
URI
https://hdl.handle.net/10371/125372
Files in This Item:
Appears in Collections:
College of Natural Sciences (자연과학대학)Program in Bioinformatics (협동과정-생물정보학전공)Theses (Ph.D. / Sc.D._협동과정-생물정보학전공)
  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Browse