首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Exon shuffling has been characterized as one of the major evolutionary forces shaping both the genome and the proteome of eukaryotes. This mechanism was particularly important in the creation of multidomain proteins during animal evolution, bringing a number of functional genetic novelties. Here, genome information from a variety of eukaryotic species was used to address several issues related to the evolutionary history of exon shuffling. By comparing all protein sequences within each species, we were able to characterize exon shuffling signatures throughout metazoans. Intron phase (the position of the intron regarding the codon) and exon symmetry (the pattern of flanking introns for a given exon or block of adjacent exons) were features used to evaluate exon shuffling. We confirmed previous observations that exon shuffling mediated by phase 1 introns (1-1 exon shuffling) is the predominant kind in multicellular animals. Evidence is provided that such pattern was achieved since the early steps of animal evolution, supported by a detectable presence of 1-1 shuffling units in Trichoplax adhaerens and a considerable prevalence of them in Nematostella vectensis. In contrast, Monosiga brevicollis, one of the closest relatives of metazoans, and Arabidopsis thaliana, showed no evidence of 1-1 exon or domain shuffling above what it would be expected by chance. Instead, exon shuffling events are less abundant and predominantly mediated by phase 0 introns (0-0 exon shuffling) in those non-metazoan species. Moreover, an intermediate pattern of 1-1 and 0-0 exon shuffling was observed for the placozoan T. adhaerens, a primitive animal. Finally, characterization of flanking intron phases around domain borders allowed us to identify a common set of symmetric 1-1 domains that have been shuffled throughout the metazoan lineage.  相似文献   

2.
We conducted a multi-genome analysis correlating protein domain organization with the exon-intron structure of genes in nine eukaryotic genomes. We observed a significant correlation between the borders of exons and domains on a genomic scale for both invertebrates and vertebrates. In addition, we found that the more complex organisms displayed consistently stronger exon-domain correlation, with substantially more significant correlations detected in vertebrates compared with invertebrates. Our observations concur with the principles of exon shuffling theory, including the prediction of predominantly symmetric phase of introns flanking the borders of correlating exons. These results suggest that extensive exon shuffling events during evolution significantly contributed to the shaping of eukaryotic proteomes.  相似文献   

3.
Eukaryotes encode numerous proteins that either have no detectable homologs in prokaryotes or have only distant homologs. These molecular innovations of eukaryotes may be classified into three categories: proteins and domains inherited from prokaryotic precursors without drastic changes in biochemical function, but often recruited for novel roles in eukaryotes; new superfamilies or distinct biochemical functions emerging within pre-existing protein folds; and domains with genuinely new folds, apparently 'invented' at the outset of eukaryotic evolution. Most new folds emerging in eukaryotes are either alpha-helical or stabilized by metal chelation. Comparative genomics analyses point to an early phase of rapid evolution, and dramatic changes between the origin of the eukaryotic cell and the advent of the last common ancestor of extant eukaryotes. Extensive duplication of numerous genes, with subsequent functional diversification, is a distinctive feature of this turbulent era. Evolutionary analysis of ancient eukaryotic proteins is generally compatible with a two-symbiont scenario for eukaryotic origin, involving an alpha-proteobacterium (the ancestor of the mitochondria) and an archaeon, as well as key contributions from their selfish elements.  相似文献   

4.
Directed evolution of proteins by exon shuffling   总被引:18,自引:0,他引:18  
Evolution of eukaryotes is mediated by sexual recombination of parental genomes. Crossovers occur in random, but homologous, positions at a frequency that depends on DNA length. As exons occupy only 1% of the human genome and introns about 24%, by far most of the crossovers occur between exons, rather than inside. The natural process of creating new combinations of exons by intronic recombination is called exon shuffling. Our group is developing in vitro formats for exon shuffling and applying these to the directed evolution of proteins. Based on the splice frame junctions, nine classes of exons and three classes of introns can be distinguished. Splice frame diagrams of natural genes show how the splice frame rules govern exon shuffling. Here, we review various approaches to constructing libraries of exon-shuffled genes. For example, exon shuffling of human pharmaceutical proteins can generate libraries in which all of the sequences are fully human, without the point mutations that raise concerns about immunogenicity.  相似文献   

5.
It has been long debated whether spliceosomal introns originated in the common ancestor of eukaryotes and prokaryotes. In this study, we tested the possibility that extant introns were inherited from the common ancestor of eukaryotes and prokaryotes using in silico simulation. We first identified 21 intron positions that are shared among different families of the P-Type ATPase superfamily, some of which are known to have diverged before the separation of prokaryotes and eukaryotes. Theoretical estimates of the expected number of intron positions shared by different genes suggest that the introns at those 21 positions were inserted independently. There seems to be no intron that arose from before the diversification of the P-Type ATPase superfamily. Namely, the present introns were inserted after the separation of eukaryotes and prokaryotes.  相似文献   

6.
Complete structure of the chicken alpha 2(VI) collagen gene   总被引:4,自引:0,他引:4  
Type VI collagen is a hybrid molecule consisting of a short triple helix flanked by two large globular domains. These globular domains are composed of several homologous repeats which show a striking similarity to the collagen-binding motifs found in von Willebrand factor. The alpha 2(VI) subunit contains three of these homologous repeats termed D1, D2 and D3. We have isolated and characterized the entire gene for chicken alpha 2(VI) collagen. This gene, which is present as a single copy in the chicken genome, is 26 kbp long and comprises 28 exons. All exons can be classified in three groups. (a) The triple-helical domain is encoded by 19 short exons (27-90 bp) separated by introns of phase class 0. These exons are multiples of 9 bp and encode an integral number of collagenous Gly-Xaa-Yaa triplets. (b) The homologous repeats D1-D3 are encoded by one or two very long exons each (153-1578 bp). These exons are separated by introns of phase class 1. (c) The homologous repeats and the collagen sequence are linked to each other by three short adapter segments which are each encoded by a single exon (21-46 bp). The modular nature of the polypeptide is thus clearly reflected by the mosaic structure of its gene. The size of the exons and the phase class of the introns suggest that the alpha 2(VI) gene evolved by duplication and shuffling of two different primordial exons, one of 9 bp encoding a collagen Gly-Xaa-Yaa triplet and one of 600 bp encoding the precursor of the homologous repeats.  相似文献   

7.
Long M  Deutsch M  Wang W  Betrán E  Brunet FG  Zhang J 《Genetica》2003,118(2-3):171-182
Exon shuffling is an essential molecular mechanism for the formation of new genes. Many cases of exon shuffling have been reported in vertebrate genes. These discoveries revealed the importance of exon shuffling in the origin of new genes. However, only a few cases of exon shuffling were reported from plants and invertebrates, which gave rise to the assertion that the intron-mediated recombination mechanism originated very recently. We focused on the origin of new genes by exon shuffling and retroposition. We will first summarize our experimental work, which revealed four new genes in Drosophila, plants, and humans. These genes are 106 to 108 million years old. The recency of these genes allows us to directly examine the origin and evolution of genes in detail. These observations show firstly the importance of exon shuffling and retroposition in the rapid creation of new gene structures. They also show that the resultant chimerical structures appearing as mosaic proteins or as retroposed coding structures with novel regulatory systems, often confer novel functions. Furthermore, these newly created genes appear to have been governed by positive Darwinian selection throughout their history, with rapid changes of amino acid sequence and gene structure in very short periods of evolution. We further analyzed the distribution of intron phases in three non-vertebrate species, Drosophila melanogaster, Caenorhabditis elegans, and Arabidosis thaliana, as inferred from their genome sequences. As in the case of vertebrate genes, we found that intron phases in these species are unevenly distributed with an excess of phase zero introns and a significant excess of symmetric exons. Both findings are consistent with the requirements for the molecular process of exon shuffling. Thus, these non-vertebrate genomes may have also been strongly impacted by exon shuffling in general.  相似文献   

8.
Phyletic distributions of eukaryotic signalling domains were studied using recently developed sensitive methods for protein sequence analysis, with an emphasis on the detection and accurate enumeration of homologues in bacteria and archaea. A major difference was found between the distributions of enzyme families that are typically found in all three divisions of cellular life and non-enzymatic domain families that are usually eukaryote-specific. Previously undetected bacterial homologues were identified for# plant pathogenesis-related proteins, Pad1, von Willebrand factor type A, src homology 3 and YWTD repeat-containing domains. Comparisons of the domain distributions in eukaryotes and prokaryotes enabled distinctions to be made between the domains originating prior to the last common ancestor of all known life forms and those apparently originating as consequences of horizontal gene transfer events. A number of transfers of signalling domains from eukaryotes to bacteria were confidently identified, in contrast to only a single case of apparent transfer from eukaryotes to archaea.  相似文献   

9.
Gudlaugsdottir S  Boswell DR  Wood GR  Ma J 《Genetica》2007,131(3):299-306
Since it was first recognised that eukaryotic genes are fragmented into coding segments (exons) separated by non-coding segments (introns), the reason for this phenomenon has been debated. There are two dominant theories: that the piecewise arrangement of genes allows functional protein domains, represented by exons, to recombine by shuffling to form novel proteins with combinations of functions; or that introns represent parasitic DNA that can infest the eukaryotic genome because it does not interfere grossly with the fitness of its host. Differing distributions of exon lengths are predicted by these two theories. In this paper we examine distributions of exon lengths for six different organisms and find that they offer empirical evidence that both theories may in part be correct.  相似文献   

10.
Summary The evolutionary history of the intracellular calcium-binding protein superfamily is well documented. The members of this gene family are all believed to be derived from a common ancestor, which, itself, was the product of two successive gene duplications. In this study, we have compared and analyzed the structures of the recently described genes coding for these proteins. We propose a series of evolutionary events, which include exon shuffling and intron insertion, that could account for the evolutionary origin of all the members of this super-family. According to this hypothesis, the ancestral gene, a product of two successive duplications, consisted of at least four exons. Each exon coding for a peptide (a calcium-binding domain) was separated by an intron that had mediated the duplication. Each distinct lineage evolved from this ancestor by genomic rearrangement, with insertion of introns being a prominent feature.  相似文献   

11.
Many proteins, especially in eukaryotes, contain tandem repeats of several domains from the same family. These repeats have a variety of binding properties and are involved in protein–protein interactions as well as binding to other ligands such as DNA and RNA. The rapid expansion of protein domain repeats is assumed to have evolved through internal tandem duplications. However, the exact mechanisms behind these tandem duplications are not well-understood. Here, we have studied the evolution, function, protein structure, gene structure, and phylogenetic distribution of domain repeats. For this purpose we have assigned Pfam-A domain families to 24 proteomes with more sensitive domain assignments in the repeat regions. These assignments confirmed previous findings that eukaryotes, and in particular vertebrates, contain a much higher fraction of proteins with repeats compared with prokaryotes. The internal sequence similarity in each protein revealed that the domain repeats are often expanded through duplications of several domains at a time, while the duplication of one domain is less common. Many of the repeats appear to have been duplicated in the middle of the repeat region. This is in strong contrast to the evolution of other proteins that mainly works through additions of single domains at either terminus. Further, we found that some domain families show distinct duplication patterns, e.g., nebulin domains have mainly been expanded with a unit of seven domains at a time, while duplications of other domain families involve varying numbers of domains. Finally, no common mechanism for the expansion of all repeats could be detected. We found that the duplication patterns show no dependence on the size of the domains. Further, repeat expansion in some families can possibly be explained by shuffling of exons. However, exon shuffling could not have created all repeats.  相似文献   

12.
Protein S is a vitamin K dependent plasma protein and a cofactor to activated protein C, a serine protease that regulates blood coagulation. The haploid genome contains two protein S genes (alpha and beta) with the protein S alpha-gene corresponding to the cloned cDNA. We have now isolated and mapped overlapping genomic clones that cover an area of 50 kilobases of the protein S alpha-gene which code for the 3' part of the gene, i.e., the thrombin-sensitive region, the four domains that are homologous to the epidermal growth factor (EGF) precursor, the COOH-terminal part of protein S that is homologous to a plasma sex hormone binding globulin (SHBG), and, finally, the 3' untranslated region. The thrombin-sensitive region and the EGF-like domains are each coded on a separate exon. The sizes of the exons coding for the COOH-terminal half of protein S and the location of the introns are nearly identical with those in the homologous SHBG gene. Furthermore, the phase class of the splice junctions is the same in these two genes. We have also isolated and mapped genomic clones that cover 25 kilobases of the protein S beta-gene, which was found to contain stop codons and a 2 bp deletion which introduces a frame shift, suggesting that it is a pseudogene. The structure of the two protein S genes and a comparison with the vitamin K dependent clotting factors support a model for their origin by exon shuffling and recruitment of the 3' part of the gene from an ancestor shared with the sex hormone binding globulin.  相似文献   

13.
The structure of the gene encoding a chicken liver receptor, the chicken hepatic lectin, which mediates endocytosis of glycoproteins has been established. The coding sequence is divided into six exons separated by five introns. The first three exons correspond to separate functional domains of the receptor polypeptide (cytoplasmic tail, transmembrane sequence, and extracellular neck region), while the final three exons encode the Ca(2+)-dependent carbohydrate-recognition domain. These results, as well as computer-assisted multiple sequence comparisons, establish this receptor as the evolutionary homolog of the mammalian asialoglycoprotein receptors. It is interesting that the chicken receptor falls into a subfamily of proteins along with the mammalian asialoglycoprotein receptors, since the saccharide-binding specificity of the chicken receptor resembles more closely that of a different set of calcium-dependent animal lectins, which includes the mannose-binding proteins. The portions of the genes encoding the carbohydrate-recognition domains of these proteins lack introns. The results suggest that divergence of intron-containing and intron-lacking carbohydrate-recognition domains preceded shuffling events in which other functional domains were associated with the carbohydrate-recognition domains. This was followed by further divergence, generating a variety of saccharide-binding specificities.  相似文献   

14.

Background  

Ever since the discovery of 'genes in pieces' and mRNA splicing in eukaryotes, origin and evolution of spliceosomal introns have been considered within the conceptual framework of the 'introns early' versus 'introns late' debate. The 'introns early' hypothesis, which is closely linked to the so-called exon theory of gene evolution, posits that protein-coding genes were interrupted by numerous introns even at the earliest stages of life's evolution and that introns played a major role in the origin of proteins by facilitating recombination of sequences coding for small protein/peptide modules. Under this scenario, the absence of spliceosomal introns in prokaryotes is considered to be a result of "genome streamlining". The 'introns late' hypothesis counters that spliceosomal introns emerged only in eukaryotes, and moreover, have been inserted into protein-coding genes continuously throughout the evolution of eukaryotes. Beyond the formal dilemma, the more substantial side of this debate has to do with possible roles of introns in the evolution of eukaryotes.  相似文献   

15.
The human erythrocyte alpha-spectrin gene which spans 80 kbp has been cloned from human genomic DNA as overlapping lambda recombinants. The exon-intron junctions were identified and the exons mapped. The gene is encoded by 52 exons whose sizes range from 684 bp to the smallest of 18 bp. The donor and acceptor splice site sequences match the splice site consensus sequences, with the exception of one splice site where a donor sequence begins with -GC. The size and location of exons do not correlate with the 106-amino-acid repeat, except in three locations where the surrounding codons are conserved as well. The lack of correspondence between exons and 106-amino-acid repeat is interpreted to reflect the appearance of a spectrin-like gene from a minigene early in the evolution of eukaryotes. Since current evidence indicates that introns were present in genes before the divergence of prokaryotes and eukaryotes, it is possible that the original distribution of introns within the minigene has been lost by the random deletion of introns from the spectrin gene.  相似文献   

16.
《Gene》1997,192(2):215-225
The chicken neural glycoprotein F11 is a cell recognition molecule implicated in neurohistogenesis, in particular in the context of neurite outgrowth and fasciculation. F11 is a glycosyl-phosphatidylinositol-linked member of the immunoglobulin superfamily that is also termed contactin or F3 in humans and rodents, respectively. In this study, we report the complete structure of the F11 gene. It is composed of 23 exons distributed over more than 100 kb of genomic DNA and each of the ten domains of the F11 protein is encoded by two exons. The sizes of the introns vary by two orders of magnitude ranging from 150 bp to more than 15 kb. All interdomain introns are in phase one, i.e. are inserted after the first nucleotide of a codon, being consistent with assembly of a F11 progenitor gene via exon shuffling. The intradomain introns are localized at variable sites within the domains and have different intron phases. This study reveals a remarkable similarity of the F11 gene with the gene of axonin-1, a related neural immunoglobulin superfamily member which is also implicated in neurite outgrowth and fasciculation. The intron positions with respect to the protein domain organization are found to be identical, strongly suggesting that both genes are derived from a common ancestor that already had this exon-intron structure.  相似文献   

17.
All chlorophyll (Chl)-binding proteins constituting the photosynthetic apparatus of both prokaryotes and eukaryotes possess hydrophobic domains, corresponding to membrane-spanning alpha-helices (MSHs). Hydrophobic cluster analysis of representative members of the different Chl protein superfamilies revealed that all Chl proteins except the five-helix reaction center II proteins and the small subunits of photosystem I possess related domains. As a major conclusion, we found that the eukaryotic antennae likely share a common precursor with the prokaryotic Chl a/b antennae from Chl-b-containing oxyphotobacteria. From these data, we propose a global scheme for the evolution of these proteins from a one-MSH ancestor.  相似文献   

18.
19.
BACKGROUND: Lateral gene transfer (LGT) is an important evolutionary mechanism among prokaryotes. The situation in eukaryotes is less clear; the human genome sequence failed to give strong support for any recent transfers from prokaryotes to vertebrates, yet a number of LGTs from prokaryotes to protists (unicellular eukaryotes) have been documented. Here, we perform a systematic analysis to investigate the impact of LGT on the evolution of diplomonads, a group of anaerobic protists.RESULTS: Phylogenetic analyses of 15 genes present in the genome of the Atlantic Salmon parasite Spironucleus barkhanus and/or the intestinal parasite Giardia lamblia show that most of these genes originated via LGT. Half of the genes are putatively involved in processes related to an anaerobic lifestyle, and this finding suggests that a common ancestor, which most probably was aerobic, of Spironucleus and Giardia adapted to an anaerobic environment in part by acquiring genes via LGT from prokaryotes. The sources of the transferred diplomonad genes are found among all three domains of life, including other eukaryotes. Many of the phylogenetic reconstructions show eukaryotes emerging in several distinct regions of the tree, strongly suggesting that LGT not only involved diplomonads, but also involved other eukaryotic groups.CONCLUSIONS: Our study shows that LGT is a significant evolutionary mechanism among diplomonads in particular and protists in general. These findings provide insights into the evolution of biochemical pathways in early eukaryote evolution and have important implications for studies of eukaryotic genome evolution and organismal relationships. Furthermore, "fusion" hypotheses for the origin of eukaryotes need to be rigorously reexamined in the light of these results.  相似文献   

20.
Sato Y  Niimura Y  Yura K  Go M 《Gene》1999,238(1):93-101
Xylanases are classified into two families, numbered F/10 and G/11 according to the similarity of amino acid sequences of their catalytic domain (Henrissat, B., Bairoch, A., 1993. New families in the classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem. J. 293, 781-788). Three-dimensional structure of the catalytic domain of the family F/10 xylanase was reported (White, A., Withers, S.G., Gilkes, N.R., Rose, D.R., 1994. Crystal structure of the catalytic domain of the beta-1,4-glycanase Cex from Cellulomonas fimi. Biochemistry 33, 12546-12552). The domain was decomposed into 22 modules by centripetal profiles (Go, M., Nosaka, M., 1987. Protein architecture and the origin of introns. Cold Spring Harbor Symp. Quant. Biol. 52, 915-924; Noguti, T., Sakakibara, H., Go, M., 1993. Localization of hydrogen-bonds within modules in barnase. Proteins 16, 357-363). A module is a contiguous polypeptide segment of amino acid residues having a compact conformation within a globular domain. Collected 31 intron sites of the family F/10 xylanase genes from fungus were found to be correlated to module boundaries with considerable statistical force (p values <0.001). The relationship between the intron locations and protein structures provides supporting evidence for the ancient origin of introns, because such a relationship cannot be expected by random insertion of introns into eukaryotic genes, but it rather suggests pre-existence of introns in the ancestral genes of prokaryotes and eukaryotes. A phylogenetic tree of the fungal and bacterial xylanase sequences made two clusters; one includes both the bacterial and fungal genes, but the other consists of only fungal genes. The mixed cluster of bacterial genes without introns and the fungal genes with introns further supports the ancient origin of introns. Comparison of the conserved base sequences of introns indicates that sliding of a splice site occurred in Aspergillus kawachii gene by one base from the ancestral position. Substrate-binding sites of xylanase are localized on eight modules, and introns are found at both termini of six out of these functional modules. This result suggests that introns might play a functional role in shuffling the exons encoding the substrate-binding modules.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号