
1.

The adaptive evolution database (TAED) (total citations: 2; self-citations: 0; citations by others: 2)

Liberles DA Schreiber DR Govindarajan S Chamberlin SG Benner SA 《Genome biology》2001,2(8):research0028.1-research0028.6

Background

The Master Catalog is a collection of evolutionary families, including multiple sequence alignments, phylogenetic trees and reconstructed ancestral sequences, for all protein-sequence modules encoded by genes in GenBank. It can therefore support large-scale genomic surveys, one of which we present here: The Adaptive Evolution Database (TAED). In TAED, potential examples of positive adaptation are identified by high values for the normalized ratio of nonsynonymous to synonymous nucleotide substitution rates (K_A/K_S values) on branches of an evolutionary tree between nodes representing reconstructed ancestral sequences.

Results

Evolutionary trees and reconstructed ancestral sequences were extracted from the Master Catalog for every subtree containing proteins from the Chordata only or the Embryophyta only. Branches with high K_A/K_S values were identified. These represent candidate episodes in the history of the protein family when the protein may have undergone positive selection, where the mutant form conferred more fitness than the ancestral form. Such episodes are frequently associated with change in function. An unexpectedly large number of families (between 10% and 20% of those families examined) were found to have at least one branch with high K_A/K_S values above arbitrarily chosen cut-offs (1 and 0.6). Most of these survived a robustness test and were collected into TAED.
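
The branch screen described above can be sketched as follows. This is a minimal illustration, not the TAED pipeline: the `Branch` record, the function names and the toy K_A and K_S values are all hypothetical, and the only details taken from the abstract are the two cut-offs (1 and 0.6) and the idea of flagging families with at least one high-ratio branch.

```python
from dataclasses import dataclass


@dataclass
class Branch:
    """One branch between two reconstructed nodes (hypothetical record layout)."""
    family: str
    name: str
    ka: float  # nonsynonymous substitutions per nonsynonymous site
    ks: float  # synonymous substitutions per synonymous site


def ka_ks(branch):
    """Return K_A/K_S, or None when K_S is zero and the ratio is undefined."""
    if branch.ks <= 0:
        return None
    return branch.ka / branch.ks


def families_with_high_branches(branches, cutoff):
    """Return the set of families with at least one branch above the cut-off."""
    flagged = set()
    for branch in branches:
        ratio = ka_ks(branch)
        if ratio is not None and ratio > cutoff:
            flagged.add(branch.family)
    return flagged


# Toy values invented for illustration only.
branches = [
    Branch("famA", "n1-n2", 0.12, 0.10),  # ratio about 1.2
    Branch("famA", "n2-n3", 0.02, 0.20),  # ratio about 0.1
    Branch("famB", "n1-n2", 0.07, 0.10),  # ratio about 0.7
    Branch("famC", "n1-n2", 0.01, 0.00),  # undefined, skipped
]

print(sorted(families_with_high_branches(branches, 1.0)))  # ['famA']
print(sorted(families_with_high_branches(branches, 0.6)))  # ['famA', 'famB']
```

Branches with K_S of zero are skipped here rather than treated as infinitely high; how such branches were handled in TAED is not stated in the abstract.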

Conclusions

TAED is a raw resource for bioinformaticists interested in data mining and for experimental evolutionists seeking candidate examples of adaptive evolution for further experimental study. It can be expanded to include other evolutionary information (for example changes in gene regulation or splicing) placed in a phylogenetic perspective.

2.

pico‐PLAZA, a genome database of microbial photosynthetic eukaryotes

Guilhem Richard Sofie Van Landeghem Bram Verhelst Hervé Moreau Yves Van de Peer Nigel Grimsley Gwenael Piganeau 《Environmental microbiology》2013,15(8):2147-2153

3.

A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database

Paramvir S Dehal Jeffrey L Boore 《BMC bioinformatics》2006,7(1):201-9

Background

We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context together constitute an important resource for the scientific community.

4.

iGTP: A software package for large-scale gene tree parsimony analysis

Ruchi Chaudhary Mukul S Bansal André Wehe David Fernández-Baca Oliver Eulenstein 《BMC bioinformatics》2010,11(1):574

Background

The ever-increasing wealth of genomic sequence information provides an unprecedented opportunity for large-scale phylogenetic analysis. However, species phylogeny inference is obfuscated by incongruence among gene trees due to evolutionary events such as gene duplication and loss, incomplete lineage sorting (deep coalescence), and horizontal gene transfer. Gene tree parsimony (GTP) addresses this issue by seeking a species tree that requires the minimum number of evolutionary events to reconcile a given set of incongruent gene trees. Despite its promise, the use of gene tree parsimony has been limited by the fact that existing software is either not fast enough to tackle large data sets or is restricted in the range of evolutionary events it can handle.

5.

Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data

Hartmann S Helm C Nickel B Meyer M Struck TH Tiedemann R Selbig J Bleidorn C 《PloS one》2012,7(1):e29843

6.

Understanding phylogenetic incongruence: lessons from phyllostomid bats

Liliana M. Dávalos Andrea L. Cirranello Jonathan H. Geisler Nancy B. Simmons 《Biological reviews of the Cambridge Philosophical Society》2012,87(4):991-1024

All characters and trait systems in an organism share a common evolutionary history that can be estimated using phylogenetic methods. However, differential rates of change and the evolutionary mechanisms driving those rates result in pervasive phylogenetic conflict. These drivers need to be uncovered because mismatches between evolutionary processes and phylogenetic models can lead to high confidence in incorrect hypotheses. Incongruence between phylogenies derived from morphological versus molecular analyses, and between trees based on different subsets of molecular sequences has become pervasive as datasets have expanded rapidly in both characters and species. For more than a decade, evolutionary relationships among members of the New World bat family Phyllostomidae inferred from morphological and molecular data have been in conflict. Here, we develop and apply methods to minimize systematic biases, uncover the biological mechanisms underlying phylogenetic conflict, and outline data requirements for future phylogenomic and morphological data collection. We introduce new morphological data for phyllostomids and outgroups and expand previous molecular analyses to eliminate methodological sources of phylogenetic conflict such as taxonomic sampling, sparse character sampling, or use of different algorithms to estimate the phylogeny. We also evaluate the impact of biological sources of conflict: saturation in morphological changes and molecular substitutions, and other processes that result in incongruent trees, including convergent morphological and molecular evolution. Methodological sources of incongruence play some role in generating phylogenetic conflict, and are relatively easy to eliminate by matching taxa, collecting more characters, and applying the same algorithms to optimize phylogeny. 
The evolutionary patterns uncovered are consistent with multiple biological sources of conflict, including saturation in morphological and molecular changes, adaptive morphological convergence among nectar‐feeding lineages, and incongruent gene trees. Applying methods to account for nucleotide sequence saturation reduces, but does not completely eliminate, phylogenetic conflict. We ruled out paralogy, lateral gene transfer, and poor taxon sampling and outgroup choices among the processes leading to incongruent gene trees in phyllostomid bats. Uncovering and countering the possible effects of introgression and lineage sorting of ancestral polymorphism on gene trees will require great leaps in genomic and allelic sequencing in this species‐rich mammalian family. We also found evidence for adaptive molecular evolution leading to convergence in mitochondrial proteins among nectar‐feeding lineages. In conclusion, the biological processes that generate phylogenetic conflict are ubiquitous, and overcoming incongruence requires better models and more data than have been collected even in well‐studied organisms such as phyllostomid bats.

7.

The Use (and Misuse) of Phylogenetic Trees in Comparative Behavioral Analyses

Luca Pozzi Christina M. Bergey Andrew S. Burrell 《International journal of primatology》2014,35(1):32-54

Phylogenetic comparative methods play a critical role in our understanding of the adaptive origin of primate behaviors. To incorporate evolutionary history directly into comparative behavioral research, behavioral ecologists rely on strong, well-resolved phylogenetic trees. Phylogenies provide the framework on which behaviors can be compared and homologies can be distinguished from similarities due to convergent or parallel evolution. Phylogenetic reconstructions are also of critical importance when inferring the ancestral state of behavioral patterns and when suggesting the evolutionary changes that behavior has undergone. Improvements in genome sequencing technologies have increased the amount of data available to researchers. Recently, several primate phylogenetic studies have used multiple loci to produce robust phylogenetic trees that include hundreds of primate species. These trees are now commonly used in comparative analyses and there is a perception that we have a complete picture of the primate tree. But how confident can we be in those phylogenies? And how reliable are comparative analyses based on such trees? Herein, we argue that even recent molecular phylogenies should be treated cautiously because they rely on many assumptions and have many shortcomings. Most phylogenetic studies do not model gene tree diversity and can produce misleading results, such as strong support for an incorrect species tree, especially in the case of rapid and recent radiations. We discuss implications that incorrect phylogenies can have for reconstructing the evolution of primate behaviors and we urge primatologists to be aware of the current limitations of phylogenetic reconstructions when applying phylogenetic comparative methods.

8.

The Adaptive Evolution Database (TAED)

Liberles DA Schreiber DR Govindarajan S Chamberlin SG Benner SA 《Genome biology》2001,2(4):preprint00-18

Background

Developing an understanding of the molecular basis for the divergence of species lies at the heart of biology. The Adaptive Evolution Database (TAED) serves as a starting point to link events that occur at the same time in the evolutionary history (tree of life) of species, based upon coding sequence evolution analyzed with the Master Catalog. The Master Catalog is a collection of evolutionary models, including multiple sequence alignments, phylogenetic trees, and reconstructed ancestral sequences, for all independently evolving protein sequence modules encoded by genes in GenBank [1].

9.

Insights from human/mouse genome comparisons (total citations: 4; self-citations: 0; citations by others: 4)

Len A. Pennacchio 《Mammalian genome》2003,14(7):429-436

Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). Although only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.

10.

Comparative genomic analysis of eutherian interferon genes

《Genomics》2020,112(6):4749-4759

11.

Phylogeny and evolution of Cervidae based on complete mitochondrial genomes

Zhang WQ Zhang MH 《Genetics and molecular research : GMR》2012,11(1):628-635

Mitochondrial DNA sequences can be used to estimate phylogenetic relationships among animal taxa and for molecular phylogenetic evolution analysis. With the development of sequencing technology, more and more mitochondrial sequences have been made available in public databases, including whole mitochondrial DNA sequences. These data have been used for phylogenetic analysis of animal species, and for studies of evolutionary processes. We performed phylogenetic analyses of 19 species of Cervidae, with Bos taurus as the outgroup. We used neighbor joining, maximum likelihood, maximum parsimony, and Bayesian inference methods on whole mitochondrial genome sequences. The consensus phylogenetic trees supported monophyly of the family Cervidae; it was divided into two subfamilies, Plesiometacarpalia and Telemetacarpalia, and four tribes, Cervinae, Muntiacinae, Hydropotinae, and Odocoileinae. The divergence times in these families were estimated by Bayesian phylogenetic analysis with a relaxed molecular clock; the results were consistent with those of previous studies. We concluded that the evolutionary structure of the family Cervidae can be reconstructed by phylogenetic analysis based on whole mitochondrial genomes; this method could be used broadly in phylogenetic evolutionary analysis of animal taxa.

12.

Why Are Some Evolutionary Trees in Natural History Museums Prone to Being Misinterpreted?

Erica Torrens Ana Barahona 《Evolution》2012,5(1):76-100

Today, the picture of an evolutionary tree is a very well-known visual image. It is almost impossible to think of the ancestry and relationships of living beings without it. As natural history museums play a major role in the public understanding of evolution, they often present a wide variety of evolutionary trees. However, many studies have shown (Baum and Offner 2008; Baum et al. 2005; Catley and Novick 2008; Evans 2009; Gregory 2008; Matuk 2007; Meir et al. 2007b; Padian 2008) that even though evolutionary trees have the potential to engage visitors of natural history museums with the phenomena of evolution, many of them unwittingly might lead to misunderstandings about the process. As valuable research and educational institutions, one of the museum’s important missions should be the careful design of their exhibits on evolution considering, for example, common preconceptions visitors often bring, such as the notion that evolution is oriented from simple toward complex organisms (incarnating the idea of a single ladder of life amidst the extraordinary diversity of organisms) and that humans are at the pinnacle of the evolutionary story, as well as naïve interpretations of phylogenies. Our aim in this article is to show from history where many of these misunderstandings come from and to determine whether five important Western natural history museums inadvertently present “problematic” evolutionary trees (which might lead to non-scientific notions).

13.

HaMStR: Profile hidden markov model based search for orthologs in ESTs

Ingo Ebersberger Sascha Strauss Arndt von Haeseler 《BMC evolutionary biology》2009,9(1):157-9

Background

EST sequencing is a versatile approach for rapidly gathering protein coding sequences. ESTs provide direct access to an organism's gene repertoire, bypassing the still error-prone procedure of gene prediction from genomic data. Therefore, ESTs are often the only source for biological sequence data from taxa outside mainstream interest. The widespread use of ESTs in evolutionary studies and particularly in molecular systematics studies is still hindered by the lack of efficient and reliable approaches for automated ortholog predictions in ESTs. Existing methods either depend on a known species tree or cannot cope with redundancy in EST data.

14.

Protein co-evolution, co-adaptation and interactions (total citations: 2; self-citations: 0; citations by others: 2)

Pazos F Valencia A 《The EMBO journal》2008,27(20):2648-2655

Co-evolution has an important function in the evolution of species and it is clearly manifested in certain scenarios such as host–parasite and predator–prey interactions, symbiosis and mutualism. The extrapolation of the concepts and methodologies developed for the study of species co-evolution at the molecular level has prompted the development of a variety of computational methods able to predict protein interactions through the characteristics of co-evolution. Particularly successful have been those methods that predict interactions at the genomic level based on the detection of pairs of protein families with similar evolutionary histories (similarity of phylogenetic trees: mirrortree). Future advances in this field will require a better understanding of the molecular basis of the co-evolution of protein families. Thus, it will be important to decipher the molecular mechanisms underlying the similarity observed in phylogenetic trees of interacting proteins, distinguishing direct specific molecular interactions from other general functional constraints. In particular, it will be important to separate the effects of physical interactions within protein complexes ('co-adaptation') from other forces that, in a less specific way, can also create general patterns of co-evolution.
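
The tree-similarity idea behind mirrortree-style methods can be illustrated with a short sketch. It assumes that each protein family is summarized by a pairwise distance matrix over the same species in the same order, and it scores similarity as the Pearson correlation between the off-diagonal entries of the two matrices. The function names and the toy matrices are hypothetical, and the abstract itself does not spell out these details.

```python
import math


def upper_triangle(matrix):
    """Flatten the entries above the diagonal of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant distances")
    return cov / math.sqrt(var_x * var_y)


def tree_similarity(dist_a, dist_b):
    """Correlate two families' distance matrices (same species, same order)."""
    return pearson(upper_triangle(dist_a), upper_triangle(dist_b))


# Toy matrices over four species; family B evolves twice as fast as family A.
family_a = [[0, 1, 2, 3], [1, 0, 2, 3], [2, 2, 0, 3], [3, 3, 3, 0]]
family_b = [[2 * d for d in row] for row in family_a]

score = tree_similarity(family_a, family_b)
```

A score close to 1 indicates that the two families have similar evolutionary histories up to a rate difference. As the abstract notes, such similarity alone does not separate direct physical interaction from more general shared constraints.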

15.

Complete Mitochondrial Genome Sequences of the South American and the Australian Lungfish: Testing of the Phylogenetic Performance of Mitochondrial Data Sets for Phylogenetic Problems in Tetrapod Relationships

Brinkmann H Denk A Zitzler J Joss JJ Meyer A 《Journal of molecular evolution》2004,59(6):834-848

We determined the complete nucleotide sequences (16403 and 16572 base pairs, respectively) of the mitochondrial genomes of the South American lungfish, Lepidosiren paradoxa, and the Australian lungfish, Neoceratodus forsteri (Sarcopterygii, Dipnoi). The mitochondrial DNA sequences were established in an effort to resolve the debated evolutionary positions of the lungfish and the coelacanth relative to land vertebrates. Previous molecular phylogenetic studies based on complete mtDNA sequences, including only the African lungfish, Protopterus dolloi, sequence were able to strongly reject the traditional textbook hypothesis that coelacanths are the closest relatives of land vertebrates. However, these studies were unable to statistically significantly distinguish between the two remaining scenarios: lungfish as the closest relatives to land vertebrates and lungfish and coelacanths jointly as their sister group (Cao et al. 1998; Zardoya et al. 1998; Zardoya and Meyer 1997a). Lungfish, coelacanths, and the fish ancestors of the tetrapod lineage all originated within a short time window of about 20 million years, back in the early Devonian (about 380 to 400 million years ago). This short divergence time makes the determination of the phylogenetic relationships among these three lineages difficult. In this study, we attempted to break the long evolutionary branch of lungfish, in an effort to better resolve the phylogenetic relationships among the three extant sarcopterygian lineages. The gene order of the mitochondrial genomes of the South American and Australian lungfish conforms to the consensus gene order among gnathostome vertebrates. The phylogenetic analyses of the complete set of mitochondrial proteins (without ND6) suggest that the lungfish are the closest relatives of the tetrapods, although the support in favor of this scenario is not statistically significant. 
The two other smaller data sets (tRNA and rRNA genes) give inconsistent results depending on the different reconstruction methods applied and cannot significantly rule out any of the three alternative hypotheses. Nuclear protein-coding genes, which might be better phylogenetic markers for this question, support the lungfish–tetrapod sister-group relationship (Brinkmann et al. 2004). This article contains online supplementary material. Reviewing Editor: Dr. Rafael Zardoya

16.

Genomic duplication problems for unrooted gene trees

Paszek Jarosław Górecki Paweł 《BMC genomics》2016,17(1):165-175

Background

Discovering the location of gene duplications and multiple gene duplication episodes is a fundamental issue in evolutionary molecular biology. The problem introduced by Guigó et al. in 1996 is to map gene duplication events from a collection of rooted, binary gene family trees onto their corresponding rooted binary species tree in such a way that the total number of multiple gene duplication episodes is minimized. There are several models in the literature that specify how gene duplications from gene families can be interpreted as one duplication episode. However, in all duplication episode problems gene trees are rooted. This restriction limits the applicability, since unrooted gene family trees are frequently inferred by phylogenetic methods.

Results

In this article we show the first solution to the open problem of episode clustering where the input gene family trees are unrooted. In particular, by using theoretical properties of unrooted reconciliation, we show an efficient algorithm that reduces this problem into the episode clustering problems defined for rooted trees. We show theoretical properties of the reduction algorithm and evaluation of empirical datasets.

Conclusions

We provided algorithms and tools that were successfully applied to several empirical datasets. In particular, our comparative study shows that we can improve known results on genomic duplication inference from real datasets.

17.

Choosing among alternative trees of multigene families (total citations: 4; self-citations: 0; citations by others: 4)

Martin AP 《Molecular phylogenetics and evolution》2000,16(3):430-439

Estimation of gene trees is the first step in testing alternative hypotheses about the evolution of multigene families. The standard practice for inferring gene family history is to construct trees that meet some objective criteria based on the fit of the character state changes (nucleotide or amino acid changes) to the gene tree. Unfortunately, analysis of character state data can be misleading. In addition, this approach ignores information about the relationships of the species from which the genes have been sampled. In this paper I explore using statistics of fit between the character data and gene trees and the reconciliation of the gene and species trees for choosing among alternative evolutionary hypotheses of gene families. In particular, I advocate a two-pronged strategy for choosing among alternative gene trees. First, the character data are used to define a set of acceptable gene trees (i.e., trees that are not significantly different from the minimum length tree). Next, the set of acceptable gene trees is reconciled with a known species tree, and the gene tree requiring the fewest gene duplications and losses is adopted as the best estimate of evolutionary history. The approach is illustrated using three gene families: BMP, EGR, and LDH.
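
The second step of this strategy can be sketched with a standard LCA-mapping reconciliation. The sketch makes several assumptions: trees are rooted, binary and written as nested tuples; each gene-tree leaf is labelled with the species it was sampled from; and, for brevity, only duplications are counted, whereas the abstract ranks trees by duplications and losses. All function names are hypothetical.

```python
def leaf_set(tree):
    """Set of leaf labels under a node; a leaf is a plain string."""
    if isinstance(tree, str):
        return frozenset([tree])
    left, right = tree
    return leaf_set(left) | leaf_set(right)


def clades(tree):
    """All clades of a rooted binary tree, each as a frozenset of leaf names."""
    if isinstance(tree, str):
        return [frozenset([tree])]
    left, right = tree
    return [leaf_set(tree)] + clades(left) + clades(right)


def lca(species_clades, species):
    """Smallest species-tree clade containing every species in the set."""
    return min((c for c in species_clades if species <= c), key=len)


def count_duplications(gene_tree, species_clades):
    """Count gene-tree nodes that map to the same species node as a child."""
    if isinstance(gene_tree, str):
        return 0
    left, right = gene_tree
    here = lca(species_clades, leaf_set(gene_tree))
    is_dup = here in (
        lca(species_clades, leaf_set(left)),
        lca(species_clades, leaf_set(right)),
    )
    return (
        int(is_dup)
        + count_duplications(left, species_clades)
        + count_duplications(right, species_clades)
    )


def best_gene_tree(candidates, species_tree):
    """Pick the acceptable gene tree that implies the fewest duplications."""
    species_clades = clades(species_tree)
    return min(candidates, key=lambda g: count_duplications(g, species_clades))


species_tree = (("human", "mouse"), "chicken")
congruent = (("human", "mouse"), "chicken")    # needs no duplication
incongruent = (("human", "chicken"), "mouse")  # needs one duplication
```

Here `best_gene_tree([incongruent, congruent], species_tree)` returns the congruent tree. Adding a loss count would bring the score closer to the criterion described in the abstract.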

18.

Evolution of the RNA polymerase B' subunit gene (rpoB') in Halobacteriales: a complementary molecular marker to the SSU rRNA gene

Walsh DA Bapteste E Kamekura M Doolittle WF 《Molecular biology and evolution》2004,21(12):2340-2351

Many prokaryotes have multiple ribosomal RNA operons. Generally, sequence differences between small subunit (SSU) rRNA genes are minor (<1%) and cause little concern for phylogenetic inference or environmental diversity studies. For Halobacteriales, an order of extremely halophilic, aerobic Archaea, within-genome SSU rRNA sequence divergence can exceed 5%, rendering phylogenetic assignment problematic. The RNA polymerase B' subunit gene (rpoB') is a single-copy conserved gene that may be an appropriate alternative phylogenetic marker for Halobacteriales. We sequenced a fragment of the rpoB' gene from 21 species, encompassing 15 genera of Halobacteriales. To examine the utility of rpoB' as a phylogenetic marker in Halobacteriales, we investigated three properties of rpoB' trees: the variation in resolution between trees inferred from the rpoB' DNA and RpoB' protein alignment, the degree of mutational saturation between taxa, and congruence with the SSU rRNA tree. The rpoB' DNA and protein trees were for the most part congruent and consistently recovered two well-supported monophyletic groups, the clade I and clade II haloarchaea, within a collection of less well resolved Halobacteriales lineages. A comparison of observed versus inferred numbers of substitution revealed mutational saturation in the rpoB' DNA data set, particularly between more distant species. Thus, the RpoB' protein sequence may be more reliable than the rpoB' DNA sequence for inferring Halobacteriales phylogeny. AU tests of tree selection indicated the trees inferred from rpoB' DNA and protein alignments were significantly incongruent with the SSU rRNA tree. We discuss possible explanations for this incongruence, including tree reconstruction artifact, differential paralog sampling, and lateral gene transfer. This is the first study of Halobacteriales evolution based on a marker other than the SSU rRNA gene. 
In addition, we present a valuable phylogenetic framework encompassing a broad diversity of Halobacteriales, in which novel sequences can be inserted for evolutionary, ecological, or taxonomic investigations.
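
The comparison of observed versus inferred substitutions mentioned in this abstract can be illustrated as follows. The abstract does not say which substitution model was used, so the sketch assumes the Jukes-Cantor (JC69) correction purely for illustration; the function names are hypothetical.

```python
import math


def p_distance(seq_a, seq_b):
    """Observed proportion of differing sites, ignoring gapped columns."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(a, b) for a, b in zip(seq_a, seq_b) if a != "-" and b != "-"]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    return sum(a != b for a, b in pairs) / len(pairs)


def jc69_distance(p):
    """Jukes-Cantor estimate of substitutions per site from observed p."""
    if p >= 0.75:
        return math.inf  # fully saturated: the correction is undefined
    return -0.75 * math.log(1 - 4 * p / 3)


def saturation_gap(seq_a, seq_b):
    """Inferred minus observed distance; larger gaps suggest more hidden change."""
    p = p_distance(seq_a, seq_b)
    return jc69_distance(p) - p
```

For closely related sequences the observed and corrected distances nearly coincide. As divergence grows, the observed distance levels off while the inferred one keeps rising, which is the pattern read as mutational saturation.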

19.

Random sampling of constrained phylogenies: conducting phylogenetic analyses when the phylogeny is partially known

Housworth EA Martins EP 《Systematic biology》2001,50(5):628-639

Statistical randomization tests in evolutionary biology often require a set of random, computer-generated trees. For example, earlier studies have shown how large numbers of computer-generated trees can be used to conduct phylogenetic comparative analyses even when the phylogeny is uncertain or unknown. These methods were limited, however, in that (in the absence of molecular sequence or other data) they allowed users to assume that no phylogenetic information was available or that all possible trees were known. Intermediate situations where only a taxonomy or other limited phylogenetic information (e.g., polytomies) are available are technically more difficult. The current study describes a procedure for generating random samples of phylogenies while incorporating limited phylogenetic information (e.g., four taxa belong together in a subclade). The procedure can be used to conduct comparative analyses when the phylogeny is only partially resolved or can be used in other randomization tests in which large numbers of possible phylogenies are needed.
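
One simple way to generate random trees that respect a single subclade constraint is sketched below. This is not the procedure from the paper: it joins randomly chosen pairs of lineages, builds the constrained clade first and then treats it as one lineage, and it makes no claim about matching the sampling distribution the authors derive. The function names and the taxon labels are hypothetical.

```python
import random


def random_tree(items, rng):
    """Join randomly chosen pairs until one rooted binary tree remains."""
    nodes = list(items)
    if not nodes:
        raise ValueError("need at least one taxon")
    while len(nodes) > 1:
        i, j = rng.sample(range(len(nodes)), 2)
        joined = (nodes[i], nodes[j])
        nodes = [n for k, n in enumerate(nodes) if k not in (i, j)]
        nodes.append(joined)
    return nodes[0]


def random_constrained_tree(taxa, clade, rng):
    """Random tree on taxa in which the members of clade form a subclade."""
    clade = set(clade)
    inside = [t for t in taxa if t in clade]
    outside = [t for t in taxa if t not in clade]
    subtree = random_tree(inside, rng)  # resolve the constrained clade first
    return random_tree(outside + [subtree], rng)  # then treat it as one lineage


def leaf_set(tree):
    """Set of leaf labels under a node; a leaf is a plain string."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaf_set(child) for child in tree))


def has_clade(tree, clade):
    """True if some node of the tree has exactly this set of leaves."""
    if leaf_set(tree) == clade:
        return True
    if isinstance(tree, str):
        return False
    return any(has_clade(child, clade) for child in tree)


rng = random.Random(0)
taxa = ["A", "B", "C", "D", "E", "F", "G"]
tree = random_constrained_tree(taxa, ["A", "B", "C", "D"], rng)
```

Every tree produced this way contains the four constrained taxa as a subclade, so repeated calls yield a sample of partially constrained phylogenies for use in a randomization test.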

20.

Dissecting plant genomes with the PLAZA comparative genomics platform

Van Bel M Proost S Wischnitzki E Movahedi S Scheerlinck C Van de Peer Y Vandepoele K 《Plant physiology》2012,158(2):590-600

With the arrival of low-cost, next-generation sequencing, a multitude of new plant genomes are being publicly released, providing unseen opportunities and challenges for comparative genomics studies. Here, we present PLAZA 2.5, a user-friendly online research environment to explore genomic information from different plants. This new release features updates to previous genome annotations and a substantial number of newly available plant genomes as well as various new interactive tools and visualizations. Currently, PLAZA hosts 25 organisms covering a broad taxonomic range, including 13 eudicots, five monocots, one lycopod, one moss, and five algae. The available data consist of structural and functional gene annotations, homologous gene families, multiple sequence alignments, phylogenetic trees, and colinear regions within and between species. A new Integrative Orthology Viewer, combining information from different orthology prediction methodologies, was developed to efficiently investigate complex orthology relationships. Cross-species expression analysis revealed that the integration of complementary data types extended the scope of complex orthology relationships, especially between more distantly related species. Finally, based on phylogenetic profiling, we propose a set of core gene families within the green plant lineage that will be instrumental to assess the gene space of draft or newly sequenced plant genomes during the assembly or annotation phase.