首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Discovering General Multidimensional Associations
Authors:Ben Murrell  Daniel Murrell  Hugh Murrell
Institution:1. Department of Medicine, University of California San Diego, San Diego, United States of America.; 2. Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom.; 3. Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa.; Memorial Sloan Kettering Cancer Center, UNITED STATES,
Abstract:When two variables are related by a known function, the coefficient of determination (denoted R2) measures the proportion of the total variance in the observations explained by that function. For linear relationships, this is equal to the square of the correlation coefficient, ρ. When the parametric form of the relationship is unknown, however, it is unclear how to estimate the proportion of explained variance equitably—assigning similar values to equally noisy relationships. Here we demonstrate how to directly estimate a generalised R2 when the form of the relationship is unknown, and we consider the performance of the Maximal Information Coefficient (MIC)—a recently proposed information theoretic measure of dependence. We show that our approach behaves equitably, has more power than MIC to detect association between variables, and converges faster with increasing sample size. Most importantly, our approach generalises to higher dimensions, estimating the strength of multivariate relationships (Y against A, B, …) as well as measuring association while controlling for covariates (Y against X controlling for C). An R package named matie (“Measuring Association and Testing Independence Efficiently”) is available (http://cran.r-project.org/web/packages/matie/).
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号