Considering external information to improve the phylogenetic comparison of microbial communities: a new approach based on constrained Double Principal Coordinates Analysis (cDPCoA) |
| |
Authors: | S Dray S Pavoine D Aguirre de Cárcer |
| |
Institution: | 1. Université de Lyon, Lyon, France;2. CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, Villeurbanne, France;3. Muséum national d'Histoire naturelle, Département Ecologie et Gestion de la Biodiversité, UMR 7204‐CNRS‐UPMC, Paris, France;4. Mathematical Ecology Research Group, Department of Zoology, University of Oxford, Oxford, UK;5. Department of Virology and Microbiology, Centro de Biología Molecular Severo Ochoa CSIC/UAM, Madrid, Spain |
| |
Abstract: | The use of next‐generation sequencing technologies is revolutionizing microbial ecology by allowing a deep phylogenetic coverage of tens to thousands of samples simultaneously. Double Principal Coordinates Analysis (DPCoA) is a multivariate method, developed in community ecology, able to integrate a distance matrix describing differences among species (e.g. phylogenetic distances) in the analysis of a species abundance matrix. This ordination technique has been used recently to describe microbial communities taking into account phylogenetic relatedness. In this work, we extend DPCoA to integrate the information of external variables measured on communities. The constrained Double Principal Coordinates Analysis (cDPCoA) is able to enforce a priori classifications to retrieve subtle differences and (or) remove the effect of confounding factors. We describe the main principles of this new approach and demonstrate its usefulness by providing application examples based on published 16S rRNA gene data sets. |
| |
Keywords: | 16S 18S DPCoA microbial community analysis QIIME Unifrac |
|
|