首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 46 毫秒
1.
Angiosperm systematics has progressed to the point where it is now expected that multiple, independent markers be used in phylogenetic studies. Universal primers for amplifying informative regions of the chloroplast genome are readily available, but in the faster-evolving nuclear genome it is challenging to discover priming sites that are conserved across distantly related taxa. With goals including the identification of informative markers in rosids, and perhaps other angiosperms, we screened 141 nuclear primer combinations for phylogenetic utility in two distinct groups of rosids at different taxonomic levels-Psiguria (Cucurbitaceae) and Geraniaceae. We discovered three phylogenetically informative regions in Psiguria and two in Geraniaceae, but none that were useful in both groups. Extending beyond rosids, we combined our findings with those of another recent effort testing these primer pairs in Asteraceae, Brassicaceae, and Orchidaceae. From this comparison, we identified 32 primer combinations that amplified regions in representative species of at least two of the five distantly related angiosperm families, giving some prior indication about phylogenetic usefulness of these markers in other flowering plants. This reduced set of primer pairs for amplifying low-copy nuclear markers along with a recommended experimental strategy provide a framework for identifying phylogenetically informative regions in angiosperms.  相似文献   

2.
3.
Aquatic larvae of many Rhithrogena mayflies (Ephemeroptera) inhabit sensitive Alpine environments. A number of species are on the IUCN Red List and many recognized species have restricted distributions and are of conservation interest. Despite their ecological and conservation importance, ambiguous morphological differences among closely related species suggest that the current taxonomy may not accurately reflect the evolutionary diversity of the group. Here we examined the species status of nearly 50% of European Rhithrogena diversity using a widespread sampling scheme of Alpine species that included 22 type localities, general mixed Yule-coalescent (GMYC) model analysis of one standard mtDNA marker and one newly developed nDNA marker, and morphological identification where possible. Using sequences from 533 individuals from 144 sampling localities, we observed significant clustering of the mitochondrial (cox1) marker into 31 GMYC species. Twenty-one of these could be identified based on the presence of topotypes (expertly identified specimens from the species' type locality) or unambiguous morphology. These results strongly suggest the presence of both cryptic diversity and taxonomic oversplitting in Rhithrogena. Significant clustering was not detected with protein-coding nuclear PEPCK, although nine GMYC species were congruent with well supported terminal clusters of nDNA. Lack of greater congruence in the two data sets may be the result of incomplete sorting of ancestral polymorphism. Bayesian phylogenetic analyses of both gene regions recovered four of the six recognized Rhithrogena species groups in our samples as monophyletic. Future development of more nuclear markers would facilitate multi-locus analysis of unresolved, closely related species pairs. The DNA taxonomy developed here lays the groundwork for a future revision of the important but cryptic Rhithrogena genus in Europe.  相似文献   

4.
Microsatellite loci are ideal for testing hypotheses relating to genetic segregation at fine spatio‐temporal scales. They are also conserved among closely related species, making them potentially useful for clarifying interspecific relationships between recently diverged taxa. However, mutations at primer binding sites may lead to increased nonamplification, or disruptions that may result in decreased polymorphism in nontarget species. Furthermore, high mutation rates and constraints on allele size may also with evolutionary time, promote an increase in convergently evolved allele size classes, biasing measures of interspecific genetic differentiation. Here, we used next‐generation sequencing to develop microsatellite markers from a shotgun genome sequence of the sub‐Antarctic seabird, the thin‐billed prion (Pachyptila belcheri), that we tested for cross‐species amplification in other Pachyptila and related sub‐Antarctic species. We found that heterozygosity decreased and the proportion of nonamplifying loci increased with phylogenetic distance from the target species. Surprisingly, we found that species trees estimated from interspecific FST provided better approximations of mtDNA relationships among the studied species than those estimated using DC, even though FST was more affected by null alleles. We observed a significantly nonlinear second order polynomial relationship between microsatellite and mtDNA distances. We propose that the loss of linearity with increasing mtDNA distance stems from an increasing proportion of homoplastic allele size classes that are identical in state, but not identical by descent. Therefore, despite high cross‐species amplification success and high polymorphism among the closely related Pachyptila species, we caution against the use of microsatellites in phylogenetic inference among distantly related taxa.  相似文献   

5.
We present a strategy for phylogenetic marker development in non-model systems. Rather than using the traditional approach of comparing distantly related taxa to develop conserved primers for unknown species, we explore an alternative strategy that builds primers directly from a single, relatively well characterized species and applies those primers to increasingly distantly related taxa. We develop and test our protocol with turtles. Using a single BAC end-sequence library consisting of 3461 sequences totaling 2.43 million base pairs of data, we outline a procedure to flag repeat elements, followed by a BLAST approach to categorize sequences into high, low, and no similarity compartments compared to GenBank sequences. We developed and tested a panel of 96 primer pairs with a set of turtle tissues that forms a series of increasingly distantly related taxa with respect to the BAC reference species. Finally, we sequenced 11 of these newly discovered markers across a diverse set of 18 turtle species that spans the 210 million years of chelonian crown-group history and that includes representatives of most of the major clades of extant turtles. Our results indicate that large numbers of new, phylogenetically informative markers can be developed quickly and inexpensively from a single BAC, EST, or similar genomic resource, and that those markers provide reliable phylogenetic information across both shallow and deep levels of phylogenetic history. Our results also highlight the importance of screening for and managing repetitive elements found in randomly sequenced DNA fragments. We presume that our strategy should work well across any similarly divergent clade, suggesting that many-marker datasets can be developed quickly and efficiently for phylogenetic analysis.  相似文献   

6.
Inferring phylogenetic relationships between closely related taxa can be hindered by three factors: (1) the lack of informative molecular variation at short evolutionary timescale; (2) the lack of established markers in poorly studied taxa; and (3) the potential phylogenetic conflicts among different genomic regions due to incomplete lineage sorting or introgression. In this context, Restriction site Associated DNA sequencing (RAD‐seq) seems promising as this technique can generate sequence data from numerous DNA fragments scattered throughout the genome, from a large number of samples, and without preliminary knowledge on the taxa under study. However, divergence beyond the within‐species level will necessarily reduce the number of conserved and non‐duplicated restriction sites, and therefore the number of loci usable for phylogenetic inference. Here, we assess the suitability of RAD‐seq for phylogeny using a simulated experiment on the 12 Drosophila genomes, with divergence times ranging from 5 to 63 million years. These simulations show that RAD‐seq allows the recovery of the known Drosophila phylogeny with strong statistical support, even for relatively ancient nodes. Notably, this conclusion is robust to the potentially confounding effects of sequencing errors, heterozygosity, and low coverage. We further show that clustering RAD‐seq data using the BLASTN and SiLiX programs significantly improves the recovery of orthologous RAD loci compared with previously proposed approaches, especially for distantly related species. This study therefore validates the view that RAD sequencing is a powerful tool for phylogenetic inference.  相似文献   

7.
Microsatellites or simple sequence repeats (SSRs) are highly useful molecular markers for plant improvement. Expressed sequence tag (EST)-SSR markers have a higher rate of transferability across species than genomic SSR markers and are thus well suited for application in cross-species phylogenetic studies. Our objectives were to examine the amplification of tall fescue EST-SSR markers in 12 grass species representing 8 genera of 4 tribes from 2 subfamilies of Poaceae and the applicability of these markers for phylogenetic analysis of grass species. About 43% of the 145 EST-SSR primer pairs produced PCR bands in all 12 grass species and had high levels of polymorphism in all forage grasses studied. Thus, these markers will be useful in a variety of forage grass species, including the ones tested in this study. SSR marker data were useful in grouping genotypes within each species. Lolium temulentum, a potential model species for cool-season forage grasses, showed a close relation with the major Festuca-Lolium species in the study. Tall wheat grass was found to be closely related to hexaploid wheat, thereby confirming the known taxonomic relations between these species. While clustering of closely related species was found, the effectiveness of such data in evaluating distantly related species needs further investigations. The phylogenetic trees based on DNA sequences of selected SSR bands were in agreement with the phylogenetic relations based on length polymorphism of SSRs markers. Tall fescue EST-SSR markers depicted phylogenetic relations among a wide range of cool-season forage grass species and thus are an important resource for researchers working with such grass species.  相似文献   

8.
Recently, we reported the chloroplast genome‐wide association of oligonucleotide repeats, indels and nucleotide substitutions in aroid chloroplast genomes. We hypothesized that the distribution of oligonucleotide repeat sequences in a single representative genome can be used to identify mutational hotspots and loci suitable for population genetic, phylogenetic and phylogeographic studies. Using information on the location of oligonucleotide repeats in the chloroplast genome of taro (Colocasia esculenta), we designed 30 primer pairs to amplify and sequence polymorphic loci. The primers have been tested in a range of intra‐specific to intergeneric comparisons, including ten taro samples (Colocasia esculenta) from diverse geographical locations, four other Colocasia species (C. affinis, C. fallax, C. formosana, C. gigantea) and three other aroid genera (represented by Remusatia vivipara, Alocasia brisbanensis and Amorphophallus konjac). Multiple sequence alignments for the intra‐specific comparison revealed nucleotide substitutions (point mutations) at all 30 loci and microsatellite polymorphisms at 14 loci. The primer pairs reported here reveal levels of genetic variation suitable for high‐resolution phylogeographic and evolutionary studies of taro and other closely related aroids. Our results confirm that information on repeat distribution can be used to identify loci suitable for such studies, and we expect that this approach can be used in other plant groups.  相似文献   

9.
Chloroplast genomes supply indispensable information that helps improve the phylogenetic resolution and even as organelle‐scale barcodes. Next‐generation sequencing technologies have helped promote sequencing of complete chloroplast genomes, but compared with the number of angiosperms, relatively few chloroplast genomes have been sequenced. There are two major reasons for the paucity of completely sequenced chloroplast genomes: (i) massive amounts of fresh leaves are needed for chloroplast sequencing and (ii) there are considerable gaps in the sequenced chloroplast genomes of many plants because of the difficulty of isolating high‐quality chloroplast DNA, preventing complete chloroplast genomes from being assembled. To overcome these obstacles, all known angiosperm chloroplast genomes available to date were analysed, and then we designed nine universal primer pairs corresponding to the highly conserved regions. Using these primers, angiosperm whole chloroplast genomes can be amplified using long‐range PCR and sequenced using next‐generation sequencing methods. The primers showed high universality, which was tested using 24 species representing major clades of angiosperms. To validate the functionality of the primers, eight species representing major groups of angiosperms, that is, early‐diverging angiosperms, magnoliids, monocots, Saxifragales, fabids, malvids and asterids, were sequenced and assembled their complete chloroplast genomes. In our trials, only 100 mg of fresh leaves was used. The results show that the universal primer set provided an easy, effective and feasible approach for sequencing whole chloroplast genomes in angiosperms. The designed universal primer pairs provide a possibility to accelerate genome‐scale data acquisition and will therefore magnify the phylogenetic resolution and species identification in angiosperms.  相似文献   

10.
Chloroplast genome information helps improve the phylogenetic resolution and can act as organelle-scale barcodes in recently radiated plant groups. Previously we reported that nine universal primer pairs could amplify angiosperm whole chloroplast genomes by long-range polymerase chain reaction and using next-generation sequencing. Although these primers show high universality and efficiency for sequencing whole chloroplast genomes in angiosperms, they did not fully resolve the following two issues surrounding sequencing angiosperm chloroplast genomes: (i) approximately 30% of angiosperms cannot be amplified successfully; and (ii) only fresh leaves can be applied. In this study, we designed another set of 15 universal primer pairs for amplifying angiosperm whole chloroplast genomes to complement the original nine primer pairs. Furthermore, we designed a primer pair for nuclear ribosomal DNAs (nrDNAs). To validate the functionality of the primers, we tested 44 species with silica gel-dried leaves and 15 species with fresh leaves that have been shown to not be amplified with the original nine primer pairs. The result showed that, in 65.9% and 88.6% of the 44 species with silica gel-dried leaves, the whole chloroplast genome and nrDNAs could be amplified, respectively. In addition, all 15 fresh leaf samples could have the whole chloroplast genome successfully amplified. The nrDNAs comprise partial sequences of 18S and 26S, along with the complete sequence of 5.8S and the internal transcribed spacers ITS1 and ITS2. The mean size of nrDNA was 5800 bp. This study shows that the 15 universal primer set is an indispensable tool for amplifying whole chloroplast genomes in angiosperms, and these are an important supplement to the nine reported primer pairs.  相似文献   

11.
Although massively parallel sequencing has facilitated large-scale DNA sequencing, comparisons among distantly related species rely upon small portions of the genome that are easily aligned. Methods are needed to efficiently obtain comparable DNA fragments prior to massively parallel sequencing, particularly for biologists working with non-model organisms. We introduce a new class of molecular marker, anchored by ultraconserved genomic elements (UCEs), that universally enable target enrichment and sequencing of thousands of orthologous loci across species separated by hundreds of millions of years of evolution. Our analyses here focus on use of UCE markers in Amniota because UCEs and phylogenetic relationships are well-known in some amniotes. We perform an in silico experiment to demonstrate that sequence flanking 2030 UCEs contains information sufficient to enable unambiguous recovery of the established primate phylogeny. We extend this experiment by performing an in vitro enrichment of 2386 UCE-anchored loci from nine, non-model avian species. We then use alignments of 854 of these loci to unambiguously recover the established evolutionary relationships within and among three ancient bird lineages. Because many organismal lineages have UCEs, this type of genetic marker and the analytical framework we outline can be applied across the tree of life, potentially reshaping our understanding of phylogeny at many taxonomic levels.  相似文献   

12.
MAVID is a multiple alignment program suitable for many large genomic regions. The MAVID web server allows biomedical researchers to quickly obtain multiple alignments for genomic sequences and to subsequently analyse the alignments for conserved regions. MAVID has been successfully used for the alignment of closely related species such as primates and also for the alignment of more distant organisms such as human and fugu. The server is fast, capable of aligning hundreds of kilobases in less than a minute. The multiple alignment is used to build a phylogenetic tree for the sequences, which is subsequently used as a basis for identifying conserved regions in the alignment. The server can be accessed at http://baboon.math.berkeley.edu/mavid/.  相似文献   

13.
MOTIVATION: The quality of a model structure derived from a comparative modeling procedure is dictated by the accuracy of the predicted sequence-template alignment. As the sequence-template pairs are increasingly remote in sequence relationship, the prediction of the sequence-template alignments becomes increasingly problematic with sequence alignment methods. Structural information of the template, used in connection with the sequence relationship of the sequence-template pair, could significantly improve the accuracy of the sequence-template alignment. In this paper, we describe a sequence-template alignment method that integrates sequence and structural information to enhance the accuracy of sequence-template alignments for distantly related protein pairs. RESULTS: The structure-dependent sequence alignment (SDSA) procedure was optimized for coverage and accuracy on a training set of 412 protein pairs; the structures for each of the training pairs are similar (RMSD< approximately 4A) but the sequence relationship is undetectable (average pair-wise sequence identity = 8%). The optimized SDSA procedure was then applied to extend PSI-BLAST local alignments by calculating the global alignments under the constraint of the residue pairs in the local alignments. This composite alignment procedure was assessed with a testing set of 1421 protein pairs, of which the pair-wise structures are similar (RMSD< approximately 4A) but the sequences are marginally related at best in each pair (average pair-wise sequence identity = 13%). The assessment showed that the composite alignment procedure predicted more aligned residues pairs with an average of 27% increase in correctly aligned residues over the standard PSI-BLAST alignments for the protein pairs in the testing set.  相似文献   

14.
Position-specific substitution matrices, known as profiles,derived from multiple sequence alignments are currently usedto search sequence databases for distantly related members ofprotein families. The performance of the database searches isenhanced by using (i) a sequence weighting scheme which assignshigher weights to more distantly related sequences based onbranch lengths derived from phylogenetic trees, (ii) exclusionof positions with mainly padding characters at sites of insertionsor deletions and (iii) the BLOSUM62 residue comparison matrix.A natural consequence of these modifications is an improvementin the alignment of new sequences to the profiles. However,the accuracy of the alignments can be further increased by employinga similarity residue comparison matrix. These developments areimplemented in a program called PROFILEWEIGHT which runs onUnix and Vax computers. The only input required by the programis the multiple sequence alignment. The output from PROFILEWEIGHTis a profile designed to be used by existing searching and alignmentprograms. Test results from database searches with four differentfamilies of proteins show the improved sensitivity of the weightedprofiles.  相似文献   

15.
Three approaches — microsatellite library screening, consensus primer PCR (polymerase chain reaction) and sequencing with arbitrary primer pairs (SWAPP) — were used to develop single‐copy nuclear DNA (scnDNA) markers for log‐dwelling beetles Apasis puncticeps and Adelium calosomoides. We are unaware of other nuclear markers for Adeliini. We tested > 70 primer pairs per species, but despite exhaustive optimization, we obtained only five polymorphic markers. Nonetheless, the markers are valuable in detection of effects of habitat fragmentation.  相似文献   

16.
Single nucleotide polymorphisms (SNP) are the ideal marker for characterizing genomic variation but can be difficult to find in nonmodel species. We explored the usefulness of the dog genome for finding SNPs in distantly related nonmodel canids and evaluated so-ascertained SNPs. Using 40 primer pairs designed from randomly selected bacterial artificial chromosome clones from the dog genome, we successfully sequenced 80-88% of loci in a coyote (Canis latrans), grey fox (Urocyon cinereoargenteus), and red fox (Vulpes vulpes), which compared favourably to a 60% success rate for each species using 10 primer pairs conserved across mammals. Loci were minimally heterogeneous with respect to SNP density, which was similar, overall, in a discovery panel of nine red foxes to that previously reported for a panel of eight wolves (Canis lupus). Additionally, individual heterozygosity was similar across the three canids in this study. However, the proportion of SNP sites shared with the dog decreased with phylogenetic divergence, with no SNPs shared between red foxes and dogs. Density of interspecific SNPs increased approximately linearly with divergence time between species. Using red foxes from three populations, we estimated F(ST) based on each of 42 SNPs and 14 microsatellites and simulated null distributions conditioned on each marker type. Relative to SNPs, microsatellites systematically underestimated F(ST) and produced biased null distributions, indicating that SNPs are superior markers for these functions. By reconstituting the frequency spectrum of SNPs discovered in nine red foxes, we discovered an estimated 77-89% of all SNPs (within the region screened) present in North American red foxes. In sum, these findings indicate that information from the dog genome enables easy ascertainment of random and gene-linked SNPs throughout the Canidae and illustrate the value of SNPs in ecological and evolutionary genetics.  相似文献   

17.
Data from a large‐scale restriction site‐associated DNA sequencing (RAD‐Seq) study of nine butterflyfish species in the Red Sea and Arabian Sea provided a means to test the utility of a recently published draft genome (Chaetodon austriacus) and assess apparent bias in this method of isolating nuclear loci. We here processed double‐digest restriction site‐associated DNA (ddRAD) sequencing data to identify single nucleotide polymorphism (SNP) markers and their associated function with and without our reference genome to see whether it improves the quality of RAD‐Seq. Our analyses indicate (i) a modest gap between the number of nonannotated versus annotated SNPs across all species, (ii) an advantage of using genomic resources for closely related but not distantly related butterflyfish species based on the ability to assign putative gene function to SNPs and (iii) an enrichment of genes among sister butterflyfish taxa related to calcium transmembrane transport and binding. The latter result highlights the potential for this approach to reveal insights into adaptive mechanisms in populations inhabiting challenging coral reef environments such as the Red Sea, Arabian Sea and Arabian Gulf with further study.  相似文献   

18.
A major bottleneck in comparative modeling is the alignment quality; this is especially true for proteins whose distant relationships could be reliably recognized only by recent advances in fold recognition. The best algorithms excel in recognizing distant homologs but often produce incorrect alignments for over 50% of protein pairs in large fold-prediction benchmarks. The alignments obtained by sequence-sequence or sequence-structure matching algorithms differ significantly from the structural alignments. To study this problem, we developed a simplified method to explicitly enumerate all possible alignments for a pair of proteins. This allowed us to estimate the number of significantly different alignments for a given scoring method that score better than the structural alignment. Using several examples of distantly related proteins, we show that for standard sequence-sequence alignment methods, the number of significantly different alignments is usually large, often about 10(10) alternatives. This distance decreases when the alignment method is improved, but the number is still too large for the brute force enumeration approach. More effective strategies were needed, so we evaluated and compared two well-known approaches for searching the space of suboptimal alignments. We combined their best features and produced a hybrid method, which yielded alignments that surpassed the original alignments for about 50% of protein pairs with minimal computational effort.  相似文献   

19.
Fast and reliable sexing of prosimian and human DNA   总被引:1,自引:0,他引:1  
Molecular sexing of mammals is normally done by PCR amplification of Y chromosomal fragments, or coamplification of homologous fragments from both sex chromosomes. Existing primers are often unreliable for distantly related species due to mutations in primer regions. Currently there are no published primers for the sexing of prosimian DNA. We show that an existing method (using the zinc finger protein) based on a size difference between the X and Y homologs does not work in prosimians. Multiple alignments of distantly related mammalian species from Genbank and genome databases enabled us to identify conserved regions in the amelogenin gene. Using these conserved regions, we can target species that have no sequence information. We designed a single, conserved primer pair that is useful for fast and reliable molecular sexing of prosimian primates. A single PCR yields two fragments in males and only one in females, which are easily separated with the use of agarose gels. Amplification of separable fragments was successful in seven species of lemurs, as well as humans.  相似文献   

20.
MUSTANG: a multiple structural alignment algorithm   总被引:1,自引:0,他引:1  
Multiple structural alignment is a fundamental problem in structural genomics. In this article, we define a reliable and robust algorithm, MUSTANG (MUltiple STructural AligNment AlGorithm), for the alignment of multiple protein structures. Given a set of protein structures, the program constructs a multiple alignment using the spatial information of the C(alpha) atoms in the set. Broadly based on the progressive pairwise heuristic, this algorithm gains accuracy through novel and effective refinement phases. MUSTANG reports the multiple sequence alignment and the corresponding superposition of structures. Alignments generated by MUSTANG are compared with several handcurated alignments in the literature as well as with the benchmark alignments of 1033 alignment families from the HOMSTRAD database. The performance of MUSTANG was compared with DALI at a pairwise level, and with other multiple structural alignment tools such as POSA, CE-MC, MALECON, and MultiProt. MUSTANG performs comparably to popular pairwise and multiple structural alignment tools for closely related proteins, and performs more reliably than other multiple structural alignment methods on hard data sets containing distantly related proteins or proteins that show conformational changes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号