Similar articles
1.
Are intron positions correlated with regions of high amino acid conservation? For a set of ancient conserved proteins, with intronless prokaryotic but intron-containing eukaryotic homologs, multiple sequence alignments identified residues invariant throughout evolution. Intron positions between codons show no preferences. However, introns lying after the first base of a codon prefer conserved regions, markedly in glycines. Because glycines are in excess in conserved regions, this behavior could reflect phase-one introns entering glycine residues randomly in the ancestral sequences. Examination of intron positions within codons of evolutionarily invariable amino acids showed that roughly 50% of these introns are bordered by guanines at both 5'- and 3'-ends, 25% have a G only before the intron, and 5% have a G only after the intron, whereas about 20% are bordered by nonguanine bases.

2.
Analysis of evolution of exon-intron structure of eukaryotic genes   Cited by: 10 (self-citations: 0, citations by others: 10)
The availability of multiple, complete eukaryotic genome sequences allows one to address many fundamental evolutionary questions on a genome scale. One such important, long-standing problem is the evolution of the exon-intron structure of eukaryotic genes. Analysis of orthologous genes from completely sequenced genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists. The data on shared and lineage-specific intron positions were used as the starting point for evolutionary reconstruction with parsimony and maximum-likelihood approaches. Parsimony methods produce reconstructions with intron-rich ancestors but also infer lineage-specific, and in many cases high, levels of intron loss and gain. Different probabilistic models gave opposite results, apparently depending on model parameters and assumptions, from domination of intron loss, with extremely intron-rich ancestors, to dramatic excess of gains, to the point of denying any true conservation of intron positions among deep eukaryotic lineages. Development of models with adequate, realistic parameters and assumptions seems to be crucial for obtaining more definitive estimates of intron gain and loss in different eukaryotic lineages. Many shared intron positions were detected in ancestral eukaryotic paralogues which evolved by duplication prior to the divergence of extant eukaryotic lineages. These findings indicate that numerous introns were present in eukaryotic genes already at the earliest stages of evolution of eukaryotes and are compatible with the hypothesis that the original, catastrophic intron invasion accompanied the emergence of the eukaryotic cells. Comparison of various features of old and younger introns starts shedding light on probable mechanisms of intron insertion, indicating that propagation of old introns is unlikely to be a major mechanism for origin of new ones.
The existence and structure of ancestral protosplice sites were addressed by examining the context of introns inserted within codons that encode amino acids conserved in all eukaryotes and, accordingly, are not subject to selection for splicing efficiency. It was shown that introns indeed predominantly insert into or are fixed in specific protosplice sites which have the consensus sequence (A/C)AG|Gt.
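
As a rough illustration of the consensus quoted above, the following Python sketch scans a coding sequence for the (A/C)AG|G core of a protosplice site. The function name `protosplice_positions` and the choice to ignore the weaker lowercase `t` position are assumptions made here for illustration; this is not the procedure used in the study.

```python
import re

# Core of the protosplice consensus (A/C)AG|G. The weakly preferred "t" that
# follows the G is deliberately ignored in this sketch (an assumption).
# A lookahead is used so that overlapping candidate sites are all reported.
PROTOSPLICE_CORE = re.compile(r"(?=[AC]AGG)")


def protosplice_positions(cds: str) -> list[int]:
    """Return 0-based indices of the base immediately after the '|'.

    `cds` is a nucleotide string; the function name and return convention
    are hypothetical and chosen only for this example.
    """
    seq = cds.upper()
    # The match starts at the (A/C); the insertion point lies 3 bases later.
    return [m.start() + 3 for m in PROTOSPLICE_CORE.finditer(seq)]


if __name__ == "__main__":
    # Candidate insertion points in a short made-up sequence.
    print(protosplice_positions("ATGCAGGTTAAGGCC"))
```

Each returned index marks where an intron would sit if it were inserted at that candidate site; whether a real intron is present there has to come from gene annotation.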

3.
Many spliceosomal introns exist in the eukaryotic nuclear genome. Despite much research, the evolution of spliceosomal introns remains poorly understood. In this paper, we tried to gain insights into intron evolution from a novel perspective by comparing the gene structures of cytoplasmic ribosomal proteins (CRPs) and mitochondrial ribosomal proteins (MRPs), which are held to be of archaeal and bacterial origin, respectively. We analyzed 25 homologous pairs of CRP and MRP genes that together had a total of 527 intron positions. We found that all 12 of the intron positions shared by CRP and MRP genes resulted from parallel intron gains and none could be considered to be “conserved,” i.e., descendants of the same ancestor. This was supported further by the high frequency of proto-splice sites at these shared positions; proto-splice sites are proposed to be sites for intron insertion. Although we could not definitively disprove that spliceosomal introns were already present in the last universal common ancestor, our results lend more support to the idea that introns were gained late. At least, our results show that MRP genes were intronless at the time of endosymbiosis. The parallel intron gains between CRP and MRP genes accounted for 2.3% of total intron positions, which should provide a reliable estimate for future inferences of intron evolution.

4.
Eukaryotes and archaea both possess multiple genes coding for family B DNA polymerases. In animals and fungi, three family B DNA polymerases, alpha, delta, and epsilon, are responsible for replication of nuclear DNA. We used a PCR-based approach to amplify and sequence phylogenetically conserved regions of these three DNA polymerases from Giardia intestinalis and Trichomonas vaginalis, representatives of early-diverging eukaryotic lineages. Phylogenetic analysis of eukaryotic and archaeal paralogs suggests that the gene duplications that gave rise to the three replicative paralogs occurred before the divergence of the earliest eukaryotic lineages, and that all eukaryotes are likely to possess these paralogs. One eukaryotic paralog, epsilon, consistently branches within archaeal sequences to the exclusion of other eukaryotic paralogs, suggesting that an epsilon-like family B DNA polymerase was ancestral to both archaea and eukaryotes. Because crenarchaeote and euryarchaeote paralogs do not form monophyletic groups in phylogenetic analysis, it is possible that archaeal family B paralogs themselves evolved by a series of gene duplications independent of the gene duplications that gave rise to eukaryotic paralogs.

5.
Many intron positions are conserved in varying subsets of eukaryotic genomes and, consequently, comprise a potentially informative class of phylogenetic characters. Roy and Gilbert developed a method of phylogenetic reconstruction using the patterns of intron presence-absence in eukaryotic genes and, applying this method to the analysis of animal phylogeny, obtained support for an Ecdysozoa clade (Roy SW, Gilbert W. 2005. Resolution of a deep animal divergence by the pattern of intron conservation. Proc Natl Acad Sci USA. 102:4403-4408). The critical assumption in the method was the independence of intron loss in different branches of the phylogenetic tree. Here, this assumption is refuted by showing that the branch-specific intron loss rates are strongly correlated. We show that different tree topologies are obtained, in each case with a significant statistical support, when different subsets of intron positions are analyzed. The analysis of the conserved intron positions supports the Coelomata topology, that is, a clade comprised of arthropods and chordates, whereas the analysis of more variable intron positions favors the Ecdysozoa topology, that is, a clade of arthropods and nematodes. We show, however, that the support for Ecdysozoa is fully explained by parallel loss of introns in nematodes and arthropods, a factor that does not contribute to the analysis of the conserved introns. The developed procedure for the identification and analysis of conserved introns and other characters with minimal or no homoplasy is expected to be useful for resolving many hard phylogenetic problems.

6.
Conservation versus parallel gains in intron evolution   Cited by: 10 (self-citations: 1, citations by others: 9)
Orthologous genes from distant eukaryotic species, e.g. animals and plants, share up to 25–30% intron positions. However, the relative contributions of evolutionary conservation and parallel gain of new introns to this pattern remain unknown. Here, the extent of independent insertion of introns in the same sites (parallel gain) in orthologous genes from phylogenetically distant eukaryotes is assessed within the framework of the protosplice site model. It is shown that protosplice sites are no more conserved during evolution of eukaryotic gene sequences than random sites. Simulation of intron insertion into protosplice sites with the observed protosplice site frequencies and intron densities shows that parallel gain can account for only a small fraction (5–10%) of shared intron positions in distantly related species. Thus, the presence of numerous introns in the same positions in orthologous genes from distant eukaryotes, such as animals, fungi and plants, appears to reflect mostly bona fide evolutionary conservation.
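
The kind of simulation described above can be sketched as a small Monte Carlo experiment in Python. This is a deliberately simplified toy model and not the authors' procedure: it assumes that two orthologs share the same pool of `n_sites` candidate protosplice sites and that introns insert uniformly at random into that pool. All names and parameter values are hypothetical.

```python
import random


def expected_shared(n_sites: int, n_introns_a: int, n_introns_b: int) -> float:
    """Analytic expectation of coinciding positions under uniform insertion."""
    return n_introns_a * n_introns_b / n_sites


def simulate_shared_by_chance(n_sites: int, n_introns_a: int, n_introns_b: int,
                              trials: int = 10_000, seed: int = 0) -> float:
    """Mean number of intron positions shared purely by parallel gain.

    Assumption (not from the source): both genes draw insertion sites
    independently and without replacement from the same n_sites candidates.
    """
    rng = random.Random(seed)  # fixed seed so repeated runs are reproducible
    sites = range(n_sites)
    total = 0
    for _ in range(trials):
        gene_a = set(rng.sample(sites, n_introns_a))
        gene_b = set(rng.sample(sites, n_introns_b))
        total += len(gene_a & gene_b)  # positions hit in both genes
    return total / trials


if __name__ == "__main__":
    # Made-up numbers: 100 candidate sites, 10 introns in each ortholog.
    print(simulate_shared_by_chance(100, 10, 10), expected_shared(100, 10, 10))
```

Comparing the simulated number of coincidences with the number of shared positions actually observed gives a rough sense of how much sharing parallel gain alone could explain. A realistic model would also need the observed protosplice site frequencies and intron densities mentioned in the abstract.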

7.
A new twist in trypanosome RNA metabolism: cis-splicing of pre-mRNA   Cited by: 6 (self-citations: 1, citations by others: 5)
It has been known for almost a decade and a half that in trypanosomes all mRNAs are trans-spliced by addition to the 5' end of the spliced leader (SL) sequence. During the same time period the conviction developed that classical cis-splicing introns are not present in the trypanosome genome and that the trypanosome gene arrangement is highly compact with small intergenic regions separating one gene from the next. We have now discovered that these tenets are no longer true. Poly(A) polymerase (PAP) genes in Trypanosoma brucei and Trypanosoma cruzi are split by intervening sequences of 653 and 302 nt, respectively. The intervening sequences occur at identical positions in both organisms and obey the GT/AG rule of cis-splicing introns. PAP mRNAs are trans-spliced at the very 5' end as well as internally at the 3' splice site of the intervening sequence. Interestingly, 11 nucleotide positions past the actual 5' splice site are conserved between the T. brucei and T. cruzi introns. Point mutations in these conserved positions, as well as in the AG dinucleotide of the 3' splice site, abolish intron removal in vivo. Our results, together with the recent discovery of cis-splicing introns in Euglena gracilis, suggest that both trans- and cis-splicing are ancient acquisitions of the eukaryotic cell.

8.
Theories regarding the evolution of spliceosomal introns differ in the extent to which the distribution of introns reflects either a formative role in the evolution of protein-coding genes or the adventitious gain of genetic elements. Here, systematic methods are used to assess the causes of the present-day distribution of introns in 10 families of eukaryotic protein-coding genes comprising 1,868 introns in 488 distinct alignment positions. The history of intron evolution inferred using a probabilistic model that allows ancestral inheritance of introns, gain of introns, and loss of introns reveals that the vast majority of introns in these eukaryotic gene families were not inherited from the most recent common ancestral genes, but were gained subsequently. Furthermore, among inferred events of intron gain that meet strict criteria of reliability, the distribution of sites of gain with respect to reading-frame phase shows a 5:3:2 ratio of phases 0, 1 and 2, respectively, and exhibits a nucleotide preference for MAG GT (positions -3 to +2 relative to the site of gain). The nucleotide preferences of intron gain may prove to be the ultimate cause for the phase bias. The phase bias of intron gain is sufficient to account quantitatively for the well-known 5:3:2 bias in phase frequencies among extant introns, a conclusion that holds even when taxonomic heterogeneity in phase patterns is considered. Thus, intron gain accounts for the vast majority of extant introns and for the bias toward phase 0 introns that previously was interpreted as evidence for ancient formative introns.
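
To make the phase terminology concrete, the following Python sketch computes intron phase from the number of coding nucleotides that precede an intron and tallies phase proportions. The function names and the input positions are invented for illustration and are not data from the study. Phase 0 means the intron lies between codons, phase 1 after the first base of a codon, and phase 2 after the second base.

```python
from collections import Counter


def intron_phase(coding_bases_before: int) -> int:
    """Phase of an intron preceded by this many coding nucleotides."""
    return coding_bases_before % 3


def phase_proportions(positions: list[int]) -> dict[int, float]:
    """Fraction of introns in phases 0, 1 and 2 for a list of positions."""
    if not positions:
        raise ValueError("at least one intron position is required")
    counts = Counter(intron_phase(p) for p in positions)
    total = len(positions)
    return {phase: counts.get(phase, 0) / total for phase in (0, 1, 2)}


if __name__ == "__main__":
    # Hypothetical positions chosen so that the phases fall in a 5:3:2 ratio.
    example = [3, 6, 9, 12, 15, 4, 7, 10, 5, 8]
    print(phase_proportions(example))
```

A tally like this over inferred gain events, compared with the same tally over extant introns, is the sort of comparison the abstract summarizes with the 5:3:2 ratio.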

9.
While it is widely accepted that most animals (Metazoa) do not have endogenous cellulases, relying instead on intestinal symbionts for cellulose digestion, the glycosyl hydrolase family 9 (GHF9) cellulases found in the genomes of termites, abalone, and sea squirts could be an exception. Using information from expressed sequence tags, we show that GHF9 genes (subgroup E2) are widespread in Metazoa because at least 11 classes in five phyla have expressed GHF9 cellulases. We also demonstrate that eukaryotic GHF9 gene families are ancient, forming distinct monophyletic groups in plants and animals. Because several intron positions are also conserved among four metazoan phyla, GHF9 genes must derive from an ancient ancestor, contrary to the still widespread belief that cellulases were horizontally transferred to animals relatively recently. We also found that sequences isolated from the same animal phylum tend to group together, and in some deuterostomes, GHF9 genes are characterized by substitutions in catalytically important sites. Several paralogous subfamilies of GHF9 can be identified in plants, and genes from primitive species tend to arise basally to angiosperm representatives. In contrast, GHF9 subgroup E2 genes are relatively rare in bacteria.

10.
Spliceosomal introns play a key role in eukaryotic genome evolution and protein diversity. A large Rab GTPase family has been identified in a unicellular eukaryote Trichomonas vaginalis. However, the characteristics of introns in Rab genes of T. vaginalis have not been investigated previously. In this study, we identified a 25-bp spliceosomal intron in the T. vaginalis Rab1a (TvRab1a) gene, the smallest intron in T. vaginalis to be characterized to date. This intron contains a canonical splice site at both 5' (GT) and 3' (AG) ends, and a putative branch-point sequence (TCTAAC) that matches the Trichomonad consensus sequence of ACTAAC except for the first nucleotide. The position and phase of the TvRab1a intron are evolutionarily conserved in Rab1 homologous genes across at least five eukaryotic supergroups, including Opisthokonta, Amoebozoa, Excavata, Chromalveolata, and Plantae. These results strongly suggest that the TvRab1a intron is likely to be an ancient spliceosomal intron, and it can therefore be used as a phylogenetic marker to evaluate particular eukaryotic groupings. Identification and characterization of the TvRab1a intron may provide an insight into the evolution of the large Rab repertoire in T. vaginalis.

11.
The primary structures of two leghemoglobin genes from soybean   Cited by: 18 (self-citations: 8, citations by others: 18)
We present the complete nucleotide sequences of two leghemoglobin genes isolated from soybean DNA. Both genes contain three intervening sequences which interrupt the two coding sequences in identical positions. The 5' and 3' flanking sequences in both genes contain conserved sequences similar to those found in corresponding positions in other eukaryotic genes. Thus, the general DNA sequence organization of these plant genes is similar to that of other eukaryotic genes.

12.
Many genes for calmodulin-like domain protein kinases (CDPKs) have been identified in plants and Alveolate protists. To study the molecular evolution of the CDPK gene family, we performed a phylogenetic analysis of CDPK genomic sequences. Analysis of introns supports the phylogenetic analysis; CDPK genes with similar intron/exon structure are grouped together on the phylogenetic tree. Conserved introns support a monophyletic origin for plant CDPKs, CDPK-related kinases, and phosphoenolpyruvate carboxylase kinases. Plant CDPKs divide into two major branches. Plant CDPK genes on one branch share common intron positions with protist CDPK genes. The introns shared between protist and plant CDPKs presumably originated before the divergence of plants from Alveolates. Additionally, the calmodulin-like domains of protist CDPKs have intron positions in common with animal and fungal calmodulin genes. These results, together with the presence of a highly conserved phase zero intron located precisely at the beginning of the calmodulin-like domain, suggest that the ancestral CDPK gene could have originated from the fusion of protein kinase and calmodulin genes facilitated by recombination of ancient introns. Received: 11 July 2000 / Accepted: 18 April 2001

13.
Mammalian G protein-coupled receptor (GPCR) genes are characterised by a large proportion of intronless genes or a lower density of introns when compared with GPCRs of invertebrates. It is unclear which mechanisms have influenced intron density in this protein family, which is one of the largest in mammalian genomes. We used a combination of Hidden Markov Models (HMM) and BLAST searches to establish the comprehensive repertoire of Rhodopsin GPCRs from seven species and performed overall alignments and phylogenetic analysis using the maximum parsimony method for over 1400 receptors in 12 subgroups. We identified 14 different Ancestral Receptor Groups (ARGs) that have members in both vertebrate and invertebrate species. We found a remarkable difference in intron density between ancestral and new Rhodopsin GPCRs. The intron density among ARG members was more than 3.5-fold higher than that among non-ARG members and more than 2-fold higher when considering only the 7TM region. This suggests that the new GPCR genes were predominantly formed intronless while the ancestral receptors likely accumulated introns during their evolution. Many of the intron positions found in mammalian ARG receptor sequences were also present in orthologous invertebrate receptors, suggesting that these intron positions are ancient. This analysis also revealed that one intron position is much more frequent than any other position and is common to a number of phylogenetically different Rhodopsin GPCR groups. This intron position lies within a functionally important, conserved DRY motif, which may form a proto-splice site that could contribute to positional intron insertion. Moreover, we have found that other receptor motifs, similar to DRY, also contain introns between the second and third nucleotide of the arginine codon, which also forms a proto-splice site.
Our analysis presents compelling evidence that there was not a major loss of introns in mammalian GPCRs and that the formation of new GPCRs among mammals explains why these have fewer introns compared to invertebrate GPCRs. We also discuss and speculate about the possible role of different RNA- and DNA-based mechanisms of intron insertion and loss.

14.
Spo11 is a meiotic protein of fundamental importance as it is a conserved meiosis-specific transesterase required for meiotic recombination initiation in fungi, animals, and plants. Spo11 is homologous to the archaebacterial topoisomerase VIA (Top6A) gene, and its homologs are broadly distributed among eukaryotes, with some eukaryotes having more than one homolog. However, the evolutionary relationships among these genes are unclear, with some debate as to whether eukaryotic homologs originated by lateral gene transfer. We have identified and characterized protist Spo11 homologs by degenerate polymerase chain reaction (PCR) and sequencing and by analyses of sequences from public databases. Our phylogenetic analyses show that Spo11 homologs evolved by two ancient eukaryotic gene duplication events prior to the last common ancestor of extant eukaryotes, resulting in three eukaryotic paralogs: Spo11-1, Spo11-2, and Spo11-3. Spo11-1 orthologs encode meiosis-specific proteins and are distributed broadly among eukaryotic lineages, though Spo11-1 is absent from some protists. This absence coincides with the presence of Spo11-2 orthologs, which are meiosis-specific in Arabidopsis and are found in plants, red algae, and some protists but absent in animals and fungi. Spo11-3 encodes a Top6A subunit that interacts with topoisomerase VIB (Top6B) subunits, which together play a role in vegetative growth in Arabidopsis. We identified Spo11-3 (Top6A) and Top6B homologs in plants, red algae, and a few protists, establishing a broader distribution of these genes among eukaryotes, indicating their likely vertical descent followed by lineage-specific loss.

15.
16.
17.
The spliceosome, a sophisticated molecular machine involved in the removal of intervening sequences from the coding sections of eukaryotic genes, appeared and subsequently evolved rapidly during the early stages of eukaryotic evolution. The last eukaryotic common ancestor (LECA) had both complex spliceosomal machinery and some spliceosomal introns, yet little is known about the early stages of evolution of the spliceosomal apparatus. The Sm/Lsm family of proteins has been suggested as one of the earliest components of the emerging spliceosome and hence provides a first in-depth glimpse into the evolving spliceosomal apparatus. An analysis of 335 Sm and Sm-like genes from 80 species across all three kingdoms of life reveals two significant observations. First, the eukaryotic Sm/Lsm family underwent two rapid waves of duplication with subsequent divergence resulting in 14 distinct genes. Each wave resulted in a more sophisticated spliceosome, reflecting a possible jump in the complexity of the evolving eukaryotic cell. Second, an unusually high degree of conservation in intron positions is observed within individual orthologous Sm/Lsm genes and between some of the Sm/Lsm paralogs. This suggests that functional spliceosomal introns existed before the emergence of the complete Sm/Lsm family of proteins; hence, spliceosomal machinery with considerably fewer components than today's spliceosome was already functional.

18.
Previous evolutionary reconstructions have concluded that early eukaryotic ancestors, including the last common ancestor of eukaryotes and that of all fungi, had intron-rich genomes. By contrast, some extant eukaryotes have few introns, underscoring the complex histories of intron–exon structures, and raising the question as to why these few introns are retained. Here, we have used recently available fungal genomes to address a variety of questions related to intron evolution. Evolutionary reconstruction of intron presence and absence using 263 diverse fungal species supports the idea that massive intron reduction through intron loss has occurred in multiple clades. The intron densities estimated in various fungal ancestors range from zero to 7.6 introns per 1 kb of protein-coding sequence. Massive intron loss has occurred not only in microsporidian parasites and saccharomycetous yeasts, but also in diverse smuts and allies. To investigate the roles of the remaining introns in highly-reduced species, we have searched for their special characteristics in eight intron-poor fungi. Notably, the introns of ribosome-associated genes RPL7 and NOG2 have conserved positions; both intron-containing genes encoding snoRNAs. Furthermore, both the proteins and snoRNAs are involved in ribosome biogenesis, suggesting that the expression of the protein-coding genes and noncoding snoRNAs may be functionally coordinated. Indeed, these introns are also conserved in three-quarters of fungal species. Our study shows that fungal introns have a complex evolutionary history and underappreciated roles in gene expression.

19.
20.
Summary: Both the mouse cytosolic malate dehydrogenase gene and its mitochondrial counterpart contain eight introns, of which two are present at identical positions between the isozyme genes. The probability that the two intron positions coincide by chance between the two genes has been shown to be significantly small (= 1.3 × 10⁻³), suggesting that the conservation of the intron positions has a biological significance. On the basis of a rooted phylogenetic tree inferred from a comparison of these isozymes and lactate dehydrogenases, we have shown that the origins of the conserved introns are very old, possibly going back to a date before the divergence of eubacteria, archaebacteria, and eukaryotes. In the aspartate aminotransferase isozyme genes, five of the introns are at identical places. The origins of the five conserved introns, however, are not obvious at present. It remains possible that some or all of the conserved introns have evolved after the divergence of eubacteria and eukaryotes.
