首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Towards better accuracy for missing value estimation of epistatic miniarray profiling data by a novel ensemble approach
Authors:Pan Xiao-Yong  Tian Ye  Huang Yan  Shen Hong-Bin
Institution:a Department of Automation, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, 200240, China;b School of Aerospace and Astronautic Engineering, Shanghai Jiaotong University, 800 Dongchuan Road, Shanghai, 200240, China;c National Laboratory for Infrared Physics, Shanghai Institute of Technical Physics, Chinese Academy of Science, 200083 Shanghai, China
Abstract:Epistatic miniarray profiling (E-MAP) is a powerful tool for analyzing gene functions and their biological relevance. However, E-MAP data suffers from large proportion of missing values, which often results in misleading and biased analysis results. It is urgent to develop effective missing value estimation methods for E-MAP. Although several independent algorithms can be applied to achieve this goal, their performance varies significantly on different datasets, indicating different algorithms having their own advantages and disadvantages. In this paper, we propose a novel ensemble approach EMDI based on the high-level diversity to impute missing values that consists of two global and four local base estimators. Experimental results on five E-MAP datasets show that EMDI outperforms all single base algorithms, demonstrating an appropriate combination providing complementarity among different methods. Comparison results between several fusion strategies also demonstrate that the proposed high-level diversity scheme is superior to others. EMDI is freely available at www.csbio.sjtu.edu.cn/bioinf/EMDI/.
Keywords:Epistatic miniarray profiling  Missing value estimation  Matrix completion  Ensemble estimator  EMDI
本文献已被 ScienceDirect PubMed 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号