Assessing among‐lineage variability in phylogenetic imputation of functional trait datasets

Authors:	Rafael Molina‐Venegas, Juan Carlos Moreno‐Saiz, Isabel Castro Parga, T. Jonathan Davies, Pedro R. Peres‐Neto, Miguel Á. Rodríguez

Affiliation:	1. http://orcid.org/0000‐0001‐5801‐0736; 2. Depto de Ciencias de la Vida, Univ. de Alcalá, Madrid, Spain; 3. Inst. of Plant Sciences, Univ. of Bern, Bern, Switzerland; 4. Depto de Biología (Botánica), Univ. Autónoma de Madrid, Madrid, Spain; 5. Depto de Ecología, Univ. Autónoma de Madrid, Madrid, Spain; 6. Dept of Biology, McGill Univ., Montreal, QC, Canada; 7. Dept of Biology, Concordia Univ., Montreal, QC, Canada

Abstract:	Phylogenetic imputation has recently emerged as a potentially powerful tool for predicting missing data in functional trait datasets. Understanding the limitations of phylogenetic modelling in predicting trait values is therefore critical if imputed values are to be used in subsequent analyses. Previous studies have focused on the relationship between phylogenetic signal and clade‐level prediction accuracy, yet variability in prediction accuracy among individual tips of phylogenies remains largely unexplored. Here, we used simulations of trait evolution along the branches of phylogenetic trees to show how the accuracy of phylogenetic imputations is influenced by the combined effects of 1) the amount of phylogenetic signal in the traits and 2) the branch length of the tips to be imputed. Specifically, we conducted cross‐validation trials to estimate the variability in prediction accuracy among individual tips on the phylogenies (hereafter ‘tip‐level accuracy’). We found that under a Brownian motion model of evolution (BM, Pagel's λ = 1), tip‐level accuracy rapidly decreased with increasing tip branch‐length, and only tips whose branch length was approximately 10% or less of the total height of the tree showed consistently accurate predictions (i.e. cross‐validation R‐squared > 0.75). When phylogenetic signal was weak, the effect of tip branch‐length was reduced, becoming negligible for traits simulated with λ < 0.7, where accuracy was in any case low. Our study shows that variability in prediction accuracy among individual tips of the phylogeny should be considered when evaluating the reliability of phylogenetically imputed trait values. To address this challenge, we describe a Monte Carlo‐based method that allows one to estimate the expected tip‐level accuracy of phylogenetic predictions for continuous traits. Our approach identifies gaps in functional trait datasets for which phylogenetic imputation performs poorly, and will help ecologists to design more efficient trait collection campaigns by focusing resources on lineages whose trait values are more uncertain.
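
The dependence of tip‐level accuracy on tip branch‐length and on λ can be illustrated with a small Monte Carlo sketch in Python. This is not the authors' method or code: it assumes a toy ultrametric tree made of independent sister pairs joined at the root, a predictive R‐squared computed per tip across simulated traits, and leave‐one‐out prediction from the conditional mean of a multivariate normal with a GLS‐estimated root value. All function names (`bm_covariance`, `loo_predict`, `tip_level_r2`) are hypothetical.

```python
import numpy as np

def bm_covariance(tip_lengths, height=1.0, lam=1.0):
    """Tip covariance for a toy tree of sister pairs (an assumption).

    Pair k has two tips of branch length tip_lengths[k]; the sisters share
    height - tip_lengths[k] of history, and different pairs share none.
    Pagel's lambda rescales the off-diagonal (shared-history) entries.
    """
    n = 2 * len(tip_lengths)
    C = np.zeros((n, n))
    for k, t in enumerate(tip_lengths):
        i, j = 2 * k, 2 * k + 1
        C[i, i] = C[j, j] = height
        C[i, j] = C[j, i] = lam * (height - t)
    return C

def loo_predict(C, x, i):
    """Predict tip i from all other tips (conditional multivariate normal mean)."""
    keep = np.arange(len(x)) != i
    C_oo = C[np.ix_(keep, keep)]           # covariance among observed tips
    c_io = C[i, keep]                      # covariance of tip i with observed tips
    w = np.linalg.solve(C_oo, np.ones(keep.sum()))
    mu = w @ x[keep] / w.sum()             # GLS estimate of the root value
    return mu + c_io @ np.linalg.solve(C_oo, x[keep] - mu)

def tip_level_r2(C, n_sims=1000, seed=0):
    """Monte Carlo estimate of predictive R-squared for every tip."""
    rng = np.random.default_rng(seed)
    n = C.shape[0]
    X = rng.multivariate_normal(np.zeros(n), C, size=n_sims)  # simulated traits
    pred = np.array([[loo_predict(C, x, i) for i in range(n)] for x in X])
    sse = ((X - pred) ** 2).sum(axis=0)
    sst = ((X - X.mean(axis=0)) ** 2).sum(axis=0)
    return 1.0 - sse / sst

# Tip branch lengths as fractions of total tree height (illustrative values).
tip_lengths = [0.05, 0.1, 0.3, 0.6, 0.9]
r2 = tip_level_r2(bm_covariance(tip_lengths))                 # BM, lambda = 1
r2_weak = tip_level_r2(bm_covariance(tip_lengths, lam=0.5))   # weaker signal

for t, a, b in zip(tip_lengths, r2[::2], r2_weak[::2]):
    print(f"tip length {t:.2f}: R2 (lambda=1) = {a:.2f}, R2 (lambda=0.5) = {b:.2f}")
```

In this toy setting a tip is predicted almost entirely from its sister, so accuracy falls as the tip branch lengthens and as λ decreases. The numbers depend on the assumed tree shape and should not be read as a reproduction of the published results.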

Keywords:	branch-lengths; missing data; phylogenetic signal
