Gene set analysis of genome-wide association studies: methodological issues and perspectives |
| |
Authors: | Wang Lily Jia Peilin Wolfinger Russell D Chen Xi Zhao Zhongming |
| |
Affiliation: | a Department of Biostatistics, Vanderbilt University, Nashville, TN 37232, USAb Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN 37232, USAc Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, TN 37232, USAd SAS Institute Inc., Cary NC 27513, USAe Division of Cancer Biostatistics, Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN 37232, USAf Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN 37232, USA |
| |
Abstract: | Recent studies have demonstrated that gene set analysis, which tests disease association with genetic variants in a group of functionally related genes, is a promising approach for analyzing and interpreting genome-wide association studies (GWAS) data. These approaches aim to increase power by combining association signals from multiple genes in the same gene set. In addition, gene set analysis can also shed more light on the biological processes underlying complex diseases. However, current approaches for gene set analysis are still in an early stage of development in that analysis results are often prone to sources of bias, including gene set size and gene length, linkage disequilibrium patterns and the presence of overlapping genes. In this paper, we provide an in-depth review of the gene set analysis procedures, along with parameter choices and the particular methodology challenges at each stage. In addition to providing a survey of recently developed tools, we also classify the analysis methods into larger categories and discuss their strengths and limitations. In the last section, we outline several important areas for improving the analytical strategies in gene set analysis. |
| |
Keywords: | Genome-wide association study Gene set Pathway Gene-set enrichment analysis Statistical significance Complex disease |
本文献已被 ScienceDirect PubMed 等数据库收录! |
|