Publications

Detailed Information

Spatial rank-based multifactor dimensionality reduction to detect gene–gene interactions for multivariate phenotypes

DC Field Value Language
dc.contributor.authorPark, Mira-
dc.contributor.authorJeong, Hoe-Bin-
dc.contributor.authorLee, Jong-Hyun-
dc.contributor.authorPark, Taesung-
dc.date.accessioned2022-02-07T05:04:17Z-
dc.date.available2022-02-07T14:23:30Z-
dc.date.issued2021-10-04-
dc.identifier.citationBMC Bioinformatics. 2021 Oct 04;22(1):480ko_KR
dc.identifier.issn1471-2105-
dc.identifier.urihttps://hdl.handle.net/10371/176929-
dc.description.abstractBackground
Identifying interaction effects between genes is one of the main tasks of genome-wide association studies aiming to shed light on the biological mechanisms underlying complex diseases. Multifactor dimensionality reduction (MDR) is a popular approach for detecting gene–gene interactions that has been extended in various forms to handle binary and continuous phenotypes. However, only few multivariate MDR methods are available for multiple related phenotypes. Current approaches use Hotellings T2 statistic to evaluate interaction models, but it is well known that Hotellings T2 statistic is highly sensitive to heavily skewed distributions and outliers.

Results
We propose a robust approach based on nonparametric statistics such as spatial signs and ranks. The new multivariate rank-based MDR (MR-MDR) is mainly suitable for analyzing multiple continuous phenotypes and is less sensitive to skewed distributions and outliers. MR-MDR utilizes fuzzy k-means clustering and classifies multi-locus genotypes into two groups. Then, MR-MDR calculates a spatial rank-sum statistic as an evaluation measure and selects the best interaction model with the largest statistic. Our novel idea lies in adopting nonparametric statistics as an evaluation measure for robust inference. We adopt tenfold cross-validation to avoid overfitting. Intensive simulation studies were conducted to compare the performance of MR-MDR with current methods. Application of MR-MDR to a real dataset from a Korean genome-wide association study demonstrated that it successfully identified genetic interactions associated with four phenotypes related to kidney function. The R code for conducting MR-MDR is available at
https://github.com/statpark/MR-MDR

Conclusions
Intensive simulation studies comparing MR-MDR with several current methods showed that the performance of MR-MDR was outstanding for skewed distributions. Additionally, for symmetric distributions, MR-MDR showed comparable power. Therefore, we conclude that MR-MDR is a useful multivariate non-parametric approach that can be used regardless of the phenotype distribution, the correlations between phenotypes, and sample size.
ko_KR
dc.description.sponsorshipThis work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government(MSIT) (2013M3A9C4078158, NRF-2021R1A2C1007788).ko_KR
dc.language.isoenko_KR
dc.publisherBMCko_KR
dc.subjectFuzzy clustering-
dc.subjectGene–gene interaction-
dc.subjectMultifactor dimensionality reduction-
dc.subjectSpatial rank statistic-
dc.titleSpatial rank-based multifactor dimensionality reduction to detect gene–gene interactions for multivariate phenotypesko_KR
dc.typeArticleko_KR
dc.contributor.AlternativeAuthor박미라-
dc.contributor.AlternativeAuthor정회빈-
dc.contributor.AlternativeAuthor이종현-
dc.contributor.AlternativeAuthor박태성-
dc.identifier.doihttps://doi.org/10.1186/s12859-021-04395-y-
dc.citation.journaltitleBMC Bioinformaticsko_KR
dc.language.rfc3066en-
dc.rights.holderThe Author(s)-
dc.date.updated2021-10-10T03:11:43Z-
dc.citation.number1ko_KR
dc.citation.startpage480ko_KR
dc.citation.volume22ko_KR
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share