首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Exon-intron structure and evolution of the Lipocalin gene family   总被引:6,自引:0,他引:6  
The Lipocalins are an ancient protein family whose expression is currently confirmed in bacteria, protoctists, plants, arthropods, and chordates. The evolution of this protein family has been assessed previously using amino acid sequence phylogenies. In this report we use an independent set of characters derived from the gene structure (exon-intron arrangement) to infer a new lipocalin phylogeny. We also present the novel gene structure of three insect lipocalins. The position and phase of introns are well preserved among lipocalin clades when mapped onto a protein sequence alignment, suggesting the homologous nature of these introns. Because of this homology, we use the intron position and phase of 23 lipocalin genes to reconstruct a phylogeny by maximum parsimony and distance methods. These phylogenies are very similar to the phylogenies derived from protein sequence. This result is confirmed by congruence analysis, and a consensus tree shows the commonalities between the two source trees. Interestingly, the intron arrangement phylogeny shows that metazoan lipocalins have more introns than other eukaryotic lipocalins, and that intron gains have occurred in the C-termini of chordate lipocalins. We also analyze the relationship of intron arrangement and protein tertiary structure, as well as the relationship of lipocalins with members of the proposed structural superfamily of calycins. Our congruence analysis validates the gene structure data as a source of phylogenetic information and helps to further refine our hypothesis on the evolutionary history of lipocalins.  相似文献   

2.
Analysis of evolution of exon-intron structure of eukaryotic genes   总被引:10,自引:0,他引:10  
The availability of multiple, complete eukaryotic genome sequences allows one to address many fundamental evolutionary questions on genome scale. One such important, long-standing problem is evolution of exon-intron structure of eukaryotic genes. Analysis of orthologous genes from completely sequenced genomes revealed numerous shared intron positions in orthologous genes from animals and plants and even between animals, plants and protists. The data on shared and lineage-specific intron positions were used as the starting point for evolutionary reconstruction with parsimony and maximum-likelihood approaches. Parsimony methods produce reconstructions with intron-rich ancestors but also infer lineage-specific, in many cases, high levels of intron loss and gain. Different probabilistic models gave opposite results, apparently depending on model parameters and assumptions, from domination of intron loss, with extremely intron-rich ancestors, to dramatic excess of gains, to the point of denying any true conservation of intron positions among deep eukaryotic lineages. Development of models with adequate, realistic parameters and assumptions seems to be crucial for obtaining more definitive estimates of intron gain and loss in different eukaryotic lineages. Many shared intron positions were detected in ancestral eukaryotic paralogues which evolved by duplication prior to the divergence of extant eukaryotic lineages. These findings indicate that numerous introns were present in eukaryotic genes already at the earliest stages of evolution of eukaryotes and are compatible with the hypothesis that the original, catastrophic intron invasion accompanied the emergence of the eukaryotic cells. Comparison of various features of old and younger introns starts shedding light on probable mechanisms of intron insertion, indicating that propagation of old introns is unlikely to be a major mechanism for origin of new ones. The existence and structure of ancestral protosplice sites were addressed by examining the context of introns inserted within codons that encode amino acids conserved in all eukaryotes and, accordingly, are not subject to selection for splicing efficiency. It was shown that introns indeed predominantly insert into or are fixed in specific protosplice sites which have the consensus sequence (A/C)AG|Gt.  相似文献   

3.
4.
Several protein-coding genes from land plant chloroplasts have been shown to contain introns. The majority of these introns resemble the fungal mitochondrial group II introns due to considerable nucleotide sequence homology at their 5 and 3 ends and they can readily be folded to form six hairpins characteristic of the predicted secondary structure of the mitochondrial group II introns. Recently it has been demonstrated that some mitochondrial group II introns are capable of self-splicing in vitro in the absence of protein co-factors. However evidence presented in this overview suggests that this is probably not the case for chloroplast introns and that trans-acting factors are almost certainly involved in their processing reactions.Abbreviations kop kilobase pairs - ORF Open Reading Frame - pre-RNA precursor ribonucleic acid  相似文献   

5.
The complete nucleotide sequences of the mitochondrial (mt) genomes of three cephalopods, Octopus vulgaris (Octopodiformes, Octopoda, Incirrata), Todarodes pacificus (Decapodiformes, Oegopsida, Ommastrephidae), and Watasenia scintillans (Decapodiformes, Oegopsida, Enoploteuthidae), were determined. These three mt genomes encode the standard set of metazoan mt genes. However, W. scintillans and T. pacificus mt genomes share duplications of the longest noncoding region, three cytochrome oxidase subunit genes and two ATP synthase subunit genes, and the tRNA(Asp) gene. Southern hybridization analysis of the W. scintillans mt genome shows that this single genome carries both duplicated regions. The near-identical sequence of the duplicates suggests that there are certain concerted evolutionary mechanisms, at least in cephalopod mitochondria. Molecular phylogenetic analyses of mt protein genes are suggestive, although not statistically significantly so, of a monophyletic relationship between W. scintillans and T. pacificus.  相似文献   

6.
Long M  Deutsch M  Wang W  Betrán E  Brunet FG  Zhang J 《Genetica》2003,118(2-3):171-182
Exon shuffling is an essential molecular mechanism for the formation of new genes. Many cases of exon shuffling have been reported in vertebrate genes. These discoveries revealed the importance of exon shuffling in the origin of new genes. However, only a few cases of exon shuffling were reported from plants and invertebrates, which gave rise to the assertion that the intron-mediated recombination mechanism originated very recently. We focused on the origin of new genes by exon shuffling and retroposition. We will first summarize our experimental work, which revealed four new genes in Drosophila, plants, and humans. These genes are 106 to 108 million years old. The recency of these genes allows us to directly examine the origin and evolution of genes in detail. These observations show firstly the importance of exon shuffling and retroposition in the rapid creation of new gene structures. They also show that the resultant chimerical structures appearing as mosaic proteins or as retroposed coding structures with novel regulatory systems, often confer novel functions. Furthermore, these newly created genes appear to have been governed by positive Darwinian selection throughout their history, with rapid changes of amino acid sequence and gene structure in very short periods of evolution. We further analyzed the distribution of intron phases in three non-vertebrate species, Drosophila melanogaster, Caenorhabditis elegans, and Arabidosis thaliana, as inferred from their genome sequences. As in the case of vertebrate genes, we found that intron phases in these species are unevenly distributed with an excess of phase zero introns and a significant excess of symmetric exons. Both findings are consistent with the requirements for the molecular process of exon shuffling. Thus, these non-vertebrate genomes may have also been strongly impacted by exon shuffling in general.  相似文献   

7.
The organization of ribosomal proteins in 16 prokaryotic genomes was studied as an example of comparative genome analyses of gene systems. Hypothetical ribosomal protein-containing operons were constructed. These operons also contained putative genes and other non-ribosomal genes. The correspondences among these genes across different organisms were clarified by sequence homology computations. In this way a cross tabulation of 70 ribosomal proteins genes was constructed. On average, these were organized into 9-14 operons in each genome. There were also 25 non-ribosomal or putative genes in these mainly ribosomal protein operons. Hence the table contains 95 genes in total. It was found that: (i) the conservation of the block of about 20 r-proteins in the L3 and L4 operons across almost the entire eubacteria and archaebacteria is remarkable; (ii) some operons only belong to eubacteria or archaebacteria; (iii) although the ribosomal protein operons are highly conserved within domain, there are fine variations in some operons across different organisms within each domain, and these variations are informative on the evolutionary relations among the organisms. This method provides a new potential for studying the origin and evolution of old species.  相似文献   

8.
The organization of ribosomal proteins in 16 prokaryotic genomes was studied as an example of comparative genome analyses of gene systems. Hypothetical ribosomal protein-containing operons were constructed. These operons also contained putative genes and other non-ribosomal genes. The correspondences among these genes across different organisms were clarified by sequence homology computations. In this way a cross tabulation of 70 ribosomal proteins genes was constructed. On average, these were organized into 9-14 operons in each genome. There were also 25 non-ribosomal or putative genes in these mainly ribosomal protein operons. Hence the table contains 95 genes in total. It was found that: (i) the conservation of the block of about 20 r-proteins in the L3 and L4 operons across almost the entire eubacteria and ar-chaebacteria is remarkable; (ii) some operons only belong to eubacteria or archaebacte-ria; (iii) although the ribosomal protein operons are highly conserved within domain, there are fine variat  相似文献   

9.
蜱螨线粒体基因组研究进展   总被引:2,自引:0,他引:2  
袁明龙  王进军 《昆虫学报》2012,55(4):472-481
蜱螨亚纲包括蜱类和螨类, 是节肢动物中物种多样性最高的类群之一。本文综述了当前已测序的28种蜱螨线粒体基因组的研究成果。概括起来, 蜱螨线粒体基因组具有以下特点: (1)大小变异显著, 其中柑橘全爪螨Panonychus citri线粒体基因组在目前已测节肢动物中最小(13 077 bp); (2)一般碱基组成偏向A和T, 但6种蜱螨具有相反的GC-偏斜(正值); (3)基因组的碱基组成及A+T富集区的位置、 长度和拷贝数等变异显著, 其中4种叶螨的A+T含量最高, 其A+T富集区在目前已测节肢动物中最短(44~57 bp); (4)基因高度重排, 特别是真螨总目的种类, 但重排与高分类阶元无相关性; (5)真螨总目部分螨类的tRNA基因极度缩短, 不能形成经典的三叶草二级结构。作者建议要进一步测定更多蜱螨的线粒体基因组, 验证蜱螨非典型tRNA基因的生物学功能性, 分析蜱螨线粒体基因组的分子进化机制, 开展蜱螨线粒体转录组研究等。  相似文献   

10.
Yu JF  Xiao K  Jiang DK  Guo J  Wang JH  Sun X 《DNA research》2011,18(6):435-449
The falsely annotated protein-coding genes have been deemed one of the major causes accounting for the annotating errors in public databases. Although many filtering approaches have been designed for the over-annotated protein-coding genes, some are questionable due to the resultant increase in false negative. Furthermore, there is no webserver or software specifically devised for the problem of over-annotation. In this study, we propose an integrative algorithm for detecting the over-annotated protein-coding genes in microorganisms. Overall, an average accuracy of 99.94% is achieved over 61 microbial genomes. The extremely high accuracy indicates that the presented algorithm is efficient to differentiate the protein-coding genes from the non-coding open reading frames. Abundant analyses show that the predicting results are reliable and the integrative algorithm is robust and convenient. Our analysis also indicates that the over-annotated protein-coding genes can cause the false positive of horizontal gene transfers detection. The webserver of the proposed algorithm can be freely accessible from www.cbi.seu.edu.cn/RPGM.  相似文献   

11.
Horizontal gene transfer (HGT), a process through which genomes acquire genetic materials from distantly related organisms, is believed to be one of the major forces in prokaryotic genome evolution.However, systematic investigation is still scarce to clarify two basic issues about HGT: (1) what types of genes are transferred; and (2) what influence HGT events over the organization and evolution of biological pathways. Genome-scale investigations of these two issues will advance the systematical understanding of HGT in the context of prokaryotic genome evolution. Having investigated 82 genomes, we constructed an HGT database across broad evolutionary timescales. We identified four function categories containing a high proportion of horizontally transferred genes: cell envelope, energy metabolism, regulatory functions, and transport/binding proteins. Such biased function distribution indicates that HGT is not completely random;instead, it is under high selective pressure, required by function restraints in organisms. Furthermore, we mapped the transferred genes onto the connectivity structure map of organism-specific pathways listed in Kyoto Encyclopedia of Genes and Genomes (KEGG). Our results suggest that recruitment of transferred genes into pathways is also selectively constrained because of the tuned interaction between original pathway members. Pathway organization structures still conserve well through evolution even with the recruitment of horizontally transferred genes. Interestingly, in pathways whose organization were significantly affected by HGT events, the operon-like arrangement of transferred genes was found to be prevalent. Such results suggest that operon plays an essential and directional role in the integration of alien genes into pathways.  相似文献   

12.
13.
The structure and nucleotide sequence of the murine lactotransferrin-encoding gene (LTF) deduced partly by direct sequencing of genomic clones in the λ phage vector and partly by enzymatic amplification of genomic DNA segments primed with the oligodeoxyribonucleic primers homologous to the cDNA sequence. The λ phage clones contained the 5′ half of the gene corresponding to the first eight exons and an incomplete ninth exon interrupted by eight introns. Genomic clones corresponding to the 3′ half of the LTF gene could not be obtained on repeated attempts from two different mouse genomic libraries, suggesting the possible presence of unclonable sequences in this part of the gene. Hence, PCR was used to clone the rest of the gene. Four out of the presumed eight remaining introns were cloned along with the flanking exons using PCR. Comparison of the structure of the LTF gene with those of the two other known transferrin-encoding genes, human serum transferrin-encoding gene and chicken ovotransferrin-encoding gene reveals that all three genes have a very similar intron-exon distribution pattern. The hypothesis that the present-day transferrin-encoding genes have originated from duplication of a common ancestral gene is confirmed here at the gene level. An interesting finding is the identification of a region of shared nucleotides between the 5′ flanking regions of the murine LTF and myeloperoxidase-encoding genes, the two genes expressed specifically in neutrophilic granulocytes.  相似文献   

14.
The evolution of spliceosomal introns remains intensely debated. We studied 96 Entamoeba histolytica genes previously identified as having been laterally transferred from prokaryotes, which were presumably intronless at the time of transfer. Ninety out of the 96 are also present in the reptile parasite Entamoeba invadens, indicating lateral transfer before the species' divergence approximately 50 MYA. We find only 2 introns, both shared with E. invadens. Thus, no intron gains have occurred in approximately 50 Myr, implying a very low rate of intron gain of less than one gain per gene per approximately 4.5 billion years. Nine other predicted introns are due to annotation errors reflecting apparent mistakes in the E. histolytica genome assembly. These results underscore the massive differences in intron gain rates through evolution.  相似文献   

15.
16.
Previous evolutionary reconstructions have concluded that early eukaryotic ancestors including both the last common ancestor of eukaryotes and of all fungi had intron-rich genomes. By contrast, some extant eukaryotes have few introns, underscoring the complex histories of intron–exon structures, and raising the question as to why these few introns are retained. Here, we have used recently available fungal genomes to address a variety of questions related to intron evolution. Evolutionary reconstruction of intron presence and absence using 263 diverse fungal species supports the idea that massive intron reduction through intron loss has occurred in multiple clades. The intron densities estimated in various fungal ancestors differ from zero to 7.6 introns per 1 kb of protein-coding sequence. Massive intron loss has occurred not only in microsporidian parasites and saccharomycetous yeasts, but also in diverse smuts and allies. To investigate the roles of the remaining introns in highly-reduced species, we have searched for their special characteristics in eight intron-poor fungi. Notably, the introns of ribosome-associated genes RPL7 and NOG2 have conserved positions; both intron-containing genes encoding snoRNAs. Furthermore, both the proteins and snoRNAs are involved in ribosome biogenesis, suggesting that the expression of the protein-coding genes and noncoding snoRNAs may be functionally coordinated. Indeed, these introns are also conserved in three-quarters of fungi species. Our study shows that fungal introns have a complex evolutionary history and underappreciated roles in gene expression.  相似文献   

17.
The determination and analysis of complete genome sequences has led to the suggestion that horizontal gene transfer may be much more extensive than previously appreciated. Many of these studies, however, rely on evidence that could be generated by forces other than gene transfer including selection, variable evolutionary rates, and biased sampling.  相似文献   

18.
19.
20.
Essential genes, indispensable genes for an organism’s survival, encode functions that are considered a foundation of life. Based on those experimentally determined for 10 bacteria, we find that essential genes are more preferentially situated at the leading strand than at the lagging strand, for all the 10 genomes studied, confirming previous findings based on either smaller datasets or putatively assigned ones by homology search. Furthermore, we find that rather than all essential genes, only those with the COG functional category of information storage and process (J, K and L), and subcategories D (cell cycle control), M (cell wall biogenesis), O (posttranslational modification), C (energy production and conversion), G (carbohydrate transport and metabolism), E (amino acid transport and metabolism) and F (nucleotide transport and metabolism) are preferentially situated at the leading strand. In contrast, the strand-bias for essential genes in other COG functional subcategories is not statistically significant. These results suggest that the remarkable strand-bias of the distribution of essential genes is mainly relevant to the aforementioned functionalities, which, therefore, likely play a key role in shaping the gene strand-bias in bacterial genomes.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号