Assessing among‐lineage variability in phylogenetic imputation of functional trait datasets
Authors:Rafael Molina‐Venegas  Juan Carlos Moreno‐Saiz  Isabel Castro Parga  T. Jonathan Davies  Pedro R. Peres‐Neto  Miguel Á. Rodríguez
Affiliation:1. http://orcid.org/0000‐0001‐5801‐0736;2. Depto de Ciencias de la Vida, Univ. de Alcalá, Madrid, Spain;3. Inst. of Plant Sciences, Univ. of Bern, Bern, Switzerland;4. Depto de Biología (Botánica), Univ. Autónoma de Madrid, Madrid, Spain;5. Depto de Ecología, Univ. Autónoma de Madrid, Madrid, Spain;6. Dept of Biology, McGill Univ., Montreal, QC, Canada;7. Dept of Biology, Concordia Univ., Montreal, QC, Canada
Abstract:Phylogenetic imputation has recently emerged as a potentially powerful tool for predicting missing data in functional trait datasets. As such, understanding the limitations of phylogenetic modelling in predicting trait values is critical if the imputed values are to be used in subsequent analyses. Previous studies have focused on the relationship between phylogenetic signal and clade‐level prediction accuracy, yet variability in prediction accuracy among individual tips of phylogenies remains largely unexplored. Here, we used simulations of trait evolution along the branches of phylogenetic trees to show how the accuracy of phylogenetic imputations is influenced by the combined effects of 1) the amount of phylogenetic signal in the traits and 2) the branch length of the tips to be imputed. Specifically, we conducted cross‐validation trials to estimate the variability in prediction accuracy among individual tips on the phylogenies (hereafter ‘tip‐level accuracy’). We found that under a Brownian motion model of evolution (BM, Pagel's λ = 1), tip‐level accuracy rapidly decreased with increasing tip branch‐lengths, and only tips of approximately 10% or less of the total height of the trees showed consistently accurate predictions (i.e. cross‐validation R‐squared >0.75). When phylogenetic signal was weak, the effect of tip branch‐length was reduced, becoming negligible for traits simulated with λ < 0.7, where accuracy was in any case low. Our study shows that variability in prediction accuracy among individual tips of the phylogeny should be considered when evaluating the reliability of phylogenetically imputed trait values. To address this challenge, we describe a Monte Carlo‐based method that allows one to estimate the expected tip‐level accuracy of phylogenetic predictions for continuous traits. Our approach identifies gaps in functional trait datasets for which phylogenetic imputation performs poorly, and will help ecologists to design more efficient trait collection campaigns by focusing resources on lineages whose trait values are more uncertain.
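
The idea of Monte Carlo tip‐level accuracy can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a hypothetical four‐tip ultrametric tree of height 1 written directly as a covariance matrix, applies Pagel's λ by scaling the off‐diagonal covariances, uses a standard leave‐one‐out generalized least squares predictor, and scores each tip with a prediction R‐squared across simulations. The function names (pagel_lambda, tip_level_accuracy), the tree, and the accuracy definition are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical ultrametric tree ((A:0.1,B:0.1):0.9,(C:0.5,D:0.5):0.5), height 1.
# Under Brownian motion the trait covariance of two tips equals their shared
# path length from the root, and each tip's variance equals the tree height.
C = np.array([
    [1.0, 0.9, 0.0, 0.0],
    [0.9, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.5],
    [0.0, 0.0, 0.5, 1.0],
])
tip_branch = np.array([0.1, 0.1, 0.5, 0.5])  # tip branch lengths of A, B, C, D


def pagel_lambda(cov, lam):
    """Scale off-diagonal covariances by lambda, keeping tip variances."""
    out = cov * lam
    np.fill_diagonal(out, np.diag(cov))
    return out


def tip_level_accuracy(cov, lam=1.0, n_sims=2000, seed=0):
    """Leave-one-out prediction R-squared per tip across simulated traits.

    Illustrative definition (an assumption, not taken from the source):
    R2_i = 1 - SSE_i / SST_i, with sums taken over simulations for tip i.
    """
    rng = np.random.default_rng(seed)
    V = pagel_lambda(cov, lam)
    n = V.shape[0]
    # Each row is one simulated trait, with a true root value of zero.
    X = rng.multivariate_normal(np.zeros(n), V, size=n_sims)
    r2 = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i
        S_inv = np.linalg.inv(V[np.ix_(keep, keep)])
        s = V[i, keep]
        ones = np.ones(n - 1)
        # GLS estimate of the root value from the remaining tips.
        w_mu = S_inv @ ones / (ones @ S_inv @ ones)
        mu_hat = X[:, keep] @ w_mu
        # Conditional expectation of the held-out tip given the others.
        pred = mu_hat + (X[:, keep] - mu_hat[:, None]) @ (S_inv @ s)
        obs = X[:, i]
        r2[i] = 1.0 - np.sum((obs - pred) ** 2) / np.sum((obs - obs.mean()) ** 2)
    return r2


if __name__ == "__main__":
    for lam in (1.0, 0.5, 0.0):
        acc = tip_level_accuracy(C, lam=lam)
        print(f"lambda={lam}: tip branch lengths={tip_branch}, R2={np.round(acc, 2)}")
```

In this toy setting, the short‐branched tips (A, B) should be predicted more accurately than the long‐branched tips (C, D) when λ = 1, and the difference should shrink as λ decreases. Real analyses would use an empirical phylogeny and many more tips.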
Keywords:branch-lengths  missing data  phylogenetic signal