Parallel tagged amplicon sequencing reveals major lineages and phylogenetic structure in the North American tiger salamander (Ambystoma tigrinum) species complex |
| |
Authors: | Eric M. O'Neill Rachel Schwartz C. Thomas Bullock Joshua S. Williams H. Bradley Shaffer X. Aguilar‐Miguel Gabriela Parra‐Olea David W. Weisrock |
| |
Affiliation: | 1. Department of Biology, University of Kentucky, , Lexington, KY, 40506 USA;2. Department of Biology, Colorado State University, , Fort Collins, CO, 80523 USA;3. The Biodesign Institute, Arizona State University, , Tempe, AZ, 85287 USA;4. Department of Computer Science, University of Kentucky, , Lexington, KY, 40506 USA;5. Department of Ecology and Evolutionary Biology, University of California, , Los Angeles, CA, 90095 USA;6. La Kretz Center for California Conservation Science, Institute of the Environment and Sustainability, University of California, , Los Angeles, CA, 90095‐14966 USA;7. CIRB, Facultad de Ciencias, Universidad Autónoma del Estado de México, , Toluca, Edo. de México, México;8. Instituto de Biología, Universidad Nacional Autónoma de México, , Distrito Federal, México |
| |
Abstract: | Modern analytical methods for population genetics and phylogenetics are expected to provide more accurate results when data from multiple genome‐wide loci are analysed. We present the results of an initial application of parallel tagged sequencing (PTS) on a next‐generation platform to sequence thousands of barcoded PCR amplicons generated from 95 nuclear loci and 93 individuals sampled across the range of the tiger salamander (Ambystoma tigrinum) species complex. To manage the bioinformatic processing of this large data set (344 330 reads), we developed a pipeline that sorts PTS data by barcode and locus, identifies high‐quality variable nucleotides and yields phased haplotype sequences for each individual at each locus. Our sequencing and bioinformatic strategy resulted in a genome‐wide data set with relatively low levels of missing data and a wide range of nucleotide variation. structure analyses of these data in a genotypic format resulted in strongly supported assignments for the majority of individuals into nine geographically defined genetic clusters. Species tree analyses of the most variable loci using a multi‐species coalescent model resulted in strong support for most branches in the species tree; however, analyses including more than 50 loci produced parameter sampling trends that indicated a lack of convergence on the posterior distribution. Overall, these results demonstrate the potential for amplicon‐based PTS to rapidly generate large‐scale data for population genetic and phylogenetic‐based research. |
| |
Keywords: | barcode bioinformatic gene tree next‐generation sequencing nuclear DNA |
|
|