首页 | 本学科首页   官方微博 | 高级检索  
   检索      


GPS: Identification of disease genes by rank aggregation of multi-genomic scoring schemes
Authors:Alireza Meshkin  Azadeh Shakery  Ali Masoudi-Nejad
Institution:1. Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran;2. School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
Abstract:In solving the gene prioritization problem, ranking candidate genes from most to least promising is attempted before further experimental validation. Integrating the results of various data sources and methods tends to result in a better performance when solving the gene prioritization problem. Therefore, a wide range of datasets and algorithms was investigated; these included topological features of protein networks, physicochemical characteristics and blast similarity scores of protein sequences, gene ontology, biological pathways, and tissue-based data sources. The novelty of this study lies in how the best-performing methods and reliable multi-genomic data sources were applied in an efficient two-step approach. In the first step, various multi-genomic data sources and algorithms were evaluated and seven best-performing rankers were then applied to prioritize candidate genes in different ways. In the second step, global prioritization was obtained by aggregating several scoring schemes.The results showed that protein networks, functional linkage networks, gene ontology, and biological pathway data sources have a significant impact on the quality of the gene prioritization approach. The findings also demonstrated a direct relationship between the degree of genes and the ranking quality of the evaluated tools. This approach outperformed previously published algorithms (e.g., DIR, GPEC, GeneDistiller, and Endeavour) in all evaluation metrices and led to the development of GPS software. Its user-friendly interface and accuracy makes GPS a powerful tool for the identification of human disease genes. GPS is available at http://gpsranker.com and http://LBB.ut.ac.ir.
Keywords:Corresponding authors    Candidate genes  Integrative genomics  Data fusion  Rank aggregation  Gene prioritization
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号