首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Wnt genes encode a conserved family of secreted signaling proteins that play many roles in arthropod and vertebrate development. We have investigated both the phylogenetic history and molecular evolution of this gene family. We have identified a novel Wnt gene in a diversity of arthropods that it is likely an orthologue of the vertebrate Wnt-10 group. Wnt-10 is one of only two cases in which orthology between protostome and deuterostome genes could be consistently assigned based on our analyses. Despite difficulties in assessing orthologies, all of our trees suggest that the most recent common ancestor of protostomes and deuterostomes possessed more than the five Wnt genes known from either arthropods or nematodes. This suggests that Wnt gene loss has occurred during protostome evolution. In addition, we examined the rate of amino acid evolution in the two arthropod/deuterostome orthology groups we identified. We found little rate variation across taxa, with the exception that Drosophila Wnt-1 is evolving more rapidly than all vertebrate and most arthropod orthologues.  相似文献   

2.
Accurate genome-wide identification of orthologs is a central problem in comparative genomics, a fact reflected by the numerous orthology identification projects developed in recent years. However, only a few reports have compared their accuracy, and indeed, several recent efforts have not yet been systematically evaluated. Furthermore, orthology is typically only assessed in terms of function conservation, despite the phylogeny-based original definition of Fitch. We collected and mapped the results of nine leading orthology projects and methods (COG, KOG, Inparanoid, OrthoMCL, Ensembl Compara, Homologene, RoundUp, EggNOG, and OMA) and two standard methods (bidirectional best-hit and reciprocal smallest distance). We systematically compared their predictions with respect to both phylogeny and function, using six different tests. This required the mapping of millions of sequences, the handling of hundreds of millions of predicted pairs of orthologs, and the computation of tens of thousands of trees. In phylogenetic analysis or in functional analysis where high specificity is required, we find that OMA and Homologene perform best. At lower functional specificity but higher coverage level, OrthoMCL outperforms Ensembl Compara, and to a lesser extent Inparanoid. Lastly, the large coverage of the recent EggNOG can be of interest to build broad functional grouping, but the method is not specific enough for phylogenetic or detailed function analyses. In terms of general methodology, we observe that the more sophisticated tree reconstruction/reconciliation approach of Ensembl Compara was at times outperformed by pairwise comparison approaches, even in phylogenetic tests. Furthermore, we show that standard bidirectional best-hit often outperforms projects with more complex algorithms. First, the present study provides guidance for the broad community of orthology data users as to which database best suits their needs. Second, it introduces new methodology to verify orthology. And third, it sets performance standards for current and future approaches.  相似文献   

3.
Orthology is a powerful refinement of homology that allows us to describe more precisely the evolution of genomes and understand the function of the genes they contain. However, because orthology is not concerned with genomic position, it is limited in its ability to describe genes that are likely to have equivalent roles in different genomes. Because of this limitation, the concept of 'positional orthology' has emerged, which describes the relation between orthologous genes that retain their ancestral genomic positions. In this review, we formally define this concept, for which we introduce the shorter term 'toporthology', with respect to the evolutionary events experienced by a gene's ancestors. Through a discussion of recent studies on the role of genomic context in gene evolution, we show that the distinction between orthology and toporthology is biologically significant. We then review a number of orthology prediction methods that take genomic context into account and thus that may be used to infer the important relation of toporthology.  相似文献   

4.
Evolution and function of the extended miR-2 microRNA family   总被引:1,自引:0,他引:1  
  相似文献   

5.
6.
That natural selection affects molecular evolution at synonymous sites in protein-coding sequences is well established and is thought to predominantly reflect selection for translational efficiency/accuracy mediated through codon bias. However, a recently developed maximum likelihood framework, when applied to 18 coding sequences in 3 species of Drosophila, confirmed an earlier report that the Notch gene in Drosophila melanogaster was evolving under selection in favor of those codons defined as unpreferred in this species. This finding opened the possibility that synonymous sites may be subject to a variety of selective pressures beyond weak selection for increased frequencies of the codons currently defined as "preferred" in D. melanogaster. To further explore patterns of synonymous site evolution in Drosophila in a lineage-specific manner, we expanded the application of the maximum likelihood framework to 8,452 protein coding sequences with well-defined orthology in D. melanogaster, Drosophila sechellia, and Drosophila yakuba. Our analyses reveal intragenomic and interspecific variation in mutational patterns as well as in patterns and intensity of selection on synonymous sites. In D. melanogaster, our results provide little statistical evidence for recent selection on synonymous sites, and Notch remains an outlier. In contrast, in D. sechellia our findings provide evidence in support of selection predominantly in favor of preferred codons. However, there is a small subset of genes in this species that appear to be evolving under selection in favor of unpreferred codons, which indicates that selection on synonymous sites is not limited to the preferential fixation of mutations that enhance the speed or accuracy of translation in this species.  相似文献   

7.
Fridman E  Zamir D 《Plant physiology》2003,131(2):603-609
Comparative analysis of complex developmental pathways depends on our ability to resolve the function of members of gene families across taxonomic groups. LIN5, which belongs to a small gene family of apoplastic invertases in tomato (Lycopersicon esculentum), is a quantitative trait locus that modifies fruit sugar composition. We have compared the genomic organization and expression of this gene family in the two distantly related species: tomato and Arabidopsis. Invertase family members reside on segmental duplications in the near-colinear genomes of tomato and potato (Solanum tuberosum). These chromosomal segments are syntenically duplicated in the model plant Arabidopsis. On the basis of phylogenetic analysis of genes in the microsyntenic region, we conclude that these segmental duplications arose independently after the separation of the tomato/potato clade from Arabidopsis. Rapid regulatory divergence is characteristic of the invertase family. Interestingly, although the processes of gene duplication and specialization of expression occurred separately in the two species, synteny-based orthologs from both clades acquired similar organ-specific expression. This similar expression pattern of the genes is evidence of comparable evolutionary constraints (parallel evolution) rather than of functional orthology. The observation that functional orthology cannot be identified through analysis of expression similarity highlights the caution that needs to be exercised in extrapolating developmental networks from a model organism.  相似文献   

8.
Baines JF  Parsch J  Stephan W 《Genetics》2004,166(1):237-242
Recent advances in experimental analyses of the evolution of RNA secondary structures suggest a more complex scenario than that typically considered by Kimura's classical model of compensatory evolution. In this study, we examine one such case in more detail. Previous experimental analysis of long-range compensatory interactions between the two ends of Drosophila Adh mRNA failed to fit the classical model of compensatory evolution. To further investigate and verify long-range pairing in Drosophila Adh with respect to models of compensatory evolution and its potential functional role, we introduced site-directed mutations in the Drosophila melanogaster Adh gene. We explore two alternative hypotheses for why previous analysis of long-range compensatory interactions failed to fit the classical model. Specifically, we investigate whether the disruption of a conserved short-range pairing within Adh exon 2 has an effect on Adh expression or if there is a dual functional role of a conserved sequence in the 3'-UTR in both long-range pairing and the negative regulation of Adh expression. We find that a classical result was not observed due to the pleiotropic effect of changing a nucleotide involved in both long-range base pairing and the negative regulation of gene expression.  相似文献   

9.
Owing to its special mode of evolution and central role in the adaptive immune system, the major histocompatibility complex (MHC) has become the focus of diverse disciplines such as immunology, evolutionary ecology, and molecular evolution. MHC evolution has been studied extensively in diverse vertebrate lineages over the last few decades, and it has been suggested that birds differ from the established mammalian norm. Mammalian MHC genes evolve independently, and duplication history (i.e., orthology) can usually be traced back within lineages. In birds, this has been observed in only 3 pairs of closely related species. Here we report strong evidence for the persistence of orthology of MHC genes throughout an entire avian order. Phylogenetic reconstructions of MHC class II B genes in 14 species of owls trace back orthology over tens of thousands of years in exon 3. Moreover, exon 2 sequences from several species show closer relationships than sequences within species, resembling transspecies evolution typically observed in mammals. Thus, although previous studies suggested that long-term evolutionary dynamics of the avian MHC was characterized by high rates of concerted evolution, resulting in rapid masking of orthology, our results question the generality of this conclusion. The owl MHC thus opens new perspectives for a more comprehensive understanding of avian MHC evolution.  相似文献   

10.
The major histocompatibility complex (MHC) plays a central role in the adaptive immune system and provides a good model with which to understand the evolutionary processes underlying functional genes. Trans-species polymorphism and orthology are both commonly found in MHC genes; however, mammalian MHC class I genes tend to cluster by species. Concerted evolution has the potential to homogenize different loci, whereas birth-and-death evolution can lead to the loss of orthologs; both processes result in monophyletic groups within species. Studies investigating the evolution of MHC class I genes have been biased toward a few particular taxa and model species. We present the first study of MHC class I genes in a species from the superfamily Musteloidea. The European badger (Meles meles) exhibits moderate variation in MHC class I sequences when compared to other carnivores. We identified seven putatively functional sequences and nine pseudogenes from genomic (gDNA) and complementary (cDNA) DNA, signifying at least two functional class I loci. We found evidence for separate evolutionary histories of the α1 and α2/α3 domains. In the α1 domain, several sequences from different species were more closely related to each other than to sequences from the same species, resembling orthology or trans-species polymorphism. Balancing selection and probable recombination maintain genetic diversity in the α1 domain, evidenced by the detection of positive selection and a recombination event. By comparison, two recombination breakpoints indicate that the α2/α3 domains have most likely undergone concerted evolution, where recombination has homogenized the α2/α3 domains between genes, leading to species-specific clusters of sequences. Our findings highlight the importance of analyzing MHC domains separately.  相似文献   

11.
The main reaction pathways in the living cell are carried out by functional modules--namely, macromolecular machines with compact structure or ensembles that change their composition and/or organization during function. Modules define themselves by spatial sequestration, chemical specificity and a characteristic time domain within which their function proceeds. On receiving a specific input, modules go through functional cycles, with phases of increasing and decreasing complexity of molecular interactions. Here, we discuss how such modules are formed and the experimental and theoretical approaches that can be used to investigate them, using examples from polynucleotide-protein interactions, vesicle transport and signal transduction to illustrate the underlying principles. Further progress in this field, where systems biology and biochemistry meet, will depend on iterative validation of the experimental and theoretical approaches.  相似文献   

12.
Two recent studies demonstrated a positive correlation between divergence in gene expression and protein sequence in Drosophila. This correlation could be driven by positive selection or variation in functional constraint. To distinguish between these alternatives, we compared patterns of molecular evolution for 1,862 genes with two previously reported estimates of expression divergence in Drosophila. We found a slight negative trend (nonsignificant) between positive selection on protein sequence and divergence in expression levels between Drosophila melanogaster and Drosophila simulans. Conversely, shifts in expression patterns during Drosophila development showed a positive association with adaptive protein evolution, though as before the relationship was weak and not significant. Overall, we found no strong evidence for an increase in the incidence of positive selection on protein-coding regions in genes with divergent expression in Drosophila, suggesting that the previously reported positive association between protein and regulatory divergence primarily reflects variation in functional constraint.  相似文献   

13.
MOTIVATION: Orthologous proteins in different species are likely to have similar biochemical function and biological role. When annotating a newly sequenced genome by sequence homology, the most precise and reliable functional information can thus be derived from orthologs in other species. A standard method of finding orthologs is to compare the sequence tree with the species tree. However, since the topology of phylogenetic tree is not always reliable one might get incorrect assignments. RESULTS: Here we present a novel method that resolves this problem by analyzing a set of bootstrap trees instead of the optimal tree. The frequency of orthology assignments in the bootstrap trees can be interpreted as a support value for the possible orthology of the sequences. Our method is efficient enough to analyze data in the scale of whole genomes. It is implemented in Java and calculates orthology support levels for all pairwise combinations of homologous sequences of two species. The method was tested on simulated datasets and on real data of homologous proteins.  相似文献   

14.
Heterochromatin is the gene-poor, satellite-rich eukaryotic genome compartment that supports many essential cellular processes. The functional diversity of proteins that bind and often epigenetically define heterochromatic DNA sequence reflects the diverse functions supported by this enigmatic genome compartment. Moreover, heterogeneous signatures of selection at chromosomal proteins often mirror the heterogeneity of evolutionary forces that act on heterochromatic DNA. To identify new such surrogates for dissecting heterochromatin function and evolution, we conducted a comprehensive phylogenomic analysis of the Heterochromatin Protein 1 gene family across 40 million years of Drosophila evolution. Our study expands this gene family from 5 genes to at least 26 genes, including several uncharacterized genes in Drosophila melanogaster. The 21 newly defined HP1s introduce unprecedented structural diversity, lineage-restriction, and germline-biased expression patterns into the HP1 family. We find little evidence of positive selection at these HP1 genes in both population genetic and molecular evolution analyses. Instead, we find that dynamic evolution occurs via prolific gene gains and losses. Despite this dynamic gene turnover, the number of HP1 genes is relatively constant across species. We propose that karyotype evolution drives at least some HP1 gene turnover. For example, the loss of the male germline-restricted HP1E in the obscura group coincides with one episode of dramatic karyotypic evolution, including the gain of a neo-Y in this lineage. This expanded compendium of ovary- and testis-restricted HP1 genes revealed by our study, together with correlated gain/loss dynamics and chromosome fission/fusion events, will guide functional analyses of novel roles supported by germline chromatin.  相似文献   

15.
By using in silico methods in a previous study, we identified 100 oocyte-specific genes and 150 genes, enriched in the mouse oocyte. Interestingly, approximately half of the oocyte-specific genes tend to cluster on mouse chromosomes as if they have recently duplicated during evolution. In this study, we focused our attention on mouse BRDT, which belongs to a family of four structurally related proteins characterized by two N-terminal bromodomains and one C-terminal extraterminal domain (ET domain), defining the BET family. In mammals, BRD2, -3, and -4 are ubiquitously expressed, whereas BRDT expression was shown to be restricted to the testis. We were interested to know whether there was a correlation between the evolutionary rate and the specificity of expression of these four paralogous genes. First, we show by RT-PCR and in situ hybridization that BRDT is also expressed in mouse oocyte. Moreover, phylogenetic analyses show that the BRDT germ cell-specific orthology group clearly evolves faster than its ubiquitously expressed paralogs BRD2, BRD3, and BRD4. This suggests that there is a relationship between the evolution of these four groups of orthology and their tissue specificity of expression.  相似文献   

16.
MOTIVATION: Comparative genomics in general and orthology analysis in particular are becoming increasingly important parts of gene function prediction. Previously, orthology analysis and reconciliation has been performed only with respect to the parsimony model. This discards many plausible solutions and sometimes precludes finding the correct one. In many other areas in bioinformatics probabilistic models have proven to be both more realistic and powerful than parsimony models. For instance, they allow for assessing solution reliability and consideration of alternative solutions in a uniform way. There is also an added benefit in making model assumptions explicit and therefore making model comparisons possible. For orthology analysis, uncertainty has recently been addressed using parsimonious reconciliation combined with bootstrap techniques. However, until now no probabilistic methods have been available. RESULTS: We introduce a probabilistic gene evolution model based on a birth-death process in which a gene tree evolves 'inside' a species tree. Based on this model, we develop a tool with the capacity to perform practical orthology analysis, based on Fitch's original definition, and more generally for reconciling pairs of gene and species trees. Our gene evolution model is biologically sound (Nei et al., 1997) and intuitively attractive. We develop a Bayesian analysis based on MCMC which facilitates approximation of an a posteriori distribution for reconciliations. That is, we can find the most probable reconciliations and estimate the probability of any reconciliation, given the observed gene tree. This also gives a way to estimate the probability that a pair of genes are orthologs. The main algorithmic contribution presented here consists of an algorithm for computing the likelihood of a given reconciliation. To the best of our knowledge, this is the first successful introduction of this type of probabilistic methods, which flourish in phylogeny analysis, into reconciliation and orthology analysis. The MCMC algorithm has been implemented and, although not yet being in its final form, tests show that it performs very well on synthetic as well as biological data. Using standard correspondences, our results carry over to allele trees as well as biogeography.  相似文献   

17.
18.
The function of unknown genes is often inferred from comparisons to well-characterized homologs. In this paper, we show that, even if all of the homologs of a gene are unannotated, its function may be deduced through phylogenetic profiling. We have designed a series of algorithms that make functional predictions of genes based on orthology and set theory, but our approach to predicting gene function requires no previous knowledge of homolog function. With this technique, we successfully identified 94% of the clusters of orthologous groups that are known to be involved in flagella development or function. As a test, we removed the function of three putative flagellar genes that had been previously uncharacterized in Bacillus subtilis. We observed a motility phenotype for two of these three genes. Thus, these algorithms allow for high-throughput functional prediction of genes beyond that provided by simple orthology-based annotation endeavors.  相似文献   

19.
Ortholog identification is used in gene functional annotation, species phylogeny estimation, phylogenetic profile construction and many other analyses. Bioinformatics methods for ortholog identification are commonly based on pairwise protein sequence comparisons between whole genomes. Phylogenetic methods of ortholog identification have also been developed; these methods can be applied to protein data sets sharing a common domain architecture or which share a single functional domain but differ outside this region of homology. While promiscuous domains represent a challenge to all orthology prediction methods, overall structural similarity is highly correlated with proximity in a phylogenetic tree, conferring a degree of robustness to phylogenetic methods. In this article, we review the issues involved in orthology prediction when data sets include sequences with structurally heterogeneous domain architectures, with particular attention to automated methods designed for high-throughput application, and present a case study to illustrate the challenges in this area.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号