首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Many proteins, especially in eukaryotes, contain tandem repeats of several domains from the same family. These repeats have a variety of binding properties and are involved in protein–protein interactions as well as binding to other ligands such as DNA and RNA. The rapid expansion of protein domain repeats is assumed to have evolved through internal tandem duplications. However, the exact mechanisms behind these tandem duplications are not well-understood. Here, we have studied the evolution, function, protein structure, gene structure, and phylogenetic distribution of domain repeats. For this purpose we have assigned Pfam-A domain families to 24 proteomes with more sensitive domain assignments in the repeat regions. These assignments confirmed previous findings that eukaryotes, and in particular vertebrates, contain a much higher fraction of proteins with repeats compared with prokaryotes. The internal sequence similarity in each protein revealed that the domain repeats are often expanded through duplications of several domains at a time, while the duplication of one domain is less common. Many of the repeats appear to have been duplicated in the middle of the repeat region. This is in strong contrast to the evolution of other proteins that mainly works through additions of single domains at either terminus. Further, we found that some domain families show distinct duplication patterns, e.g., nebulin domains have mainly been expanded with a unit of seven domains at a time, while duplications of other domain families involve varying numbers of domains. Finally, no common mechanism for the expansion of all repeats could be detected. We found that the duplication patterns show no dependence on the size of the domains. Further, repeat expansion in some families can possibly be explained by shuffling of exons. However, exon shuffling could not have created all repeats.  相似文献   

2.
Structure and evolution of the human involucrin gene   总被引:37,自引:0,他引:37  
R L Eckert  H Green 《Cell》1986,46(4):583-589
Involucrin is a keratinocyte protein that first appears in the cell cytosol, but ultimately becomes cross-linked to membrane proteins by transglutaminase. The gene for human involucrin has now been cloned and sequenced. The central segment of the coding region contains 39 repeats of a 30 nucleotide sequence whose ten encoded amino acids include three glutamines and two glutamic acids. This segment must have originated by successive duplications. Later duplications of modified sequences within the central segment can also be identified. Flanking the central segment lie shorter coding segments, a part of which must have given rise to the central segment. The flanking segments also show homology to a simpler 30 nucleotide sequence from which they likely originated. The evolution of involucrin as a substrate of transglutaminase and an envelope precursor was evidently made possible by this process of repeated mutation and duplication.  相似文献   

3.
Summary The evolutionary history of the intracellular calcium-binding protein superfamily is well documented. The members of this gene family are all believed to be derived from a common ancestor, which, itself, was the product of two successive gene duplications. In this study, we have compared and analyzed the structures of the recently described genes coding for these proteins. We propose a series of evolutionary events, which include exon shuffling and intron insertion, that could account for the evolutionary origin of all the members of this super-family. According to this hypothesis, the ancestral gene, a product of two successive duplications, consisted of at least four exons. Each exon coding for a peptide (a calcium-binding domain) was separated by an intron that had mediated the duplication. Each distinct lineage evolved from this ancestor by genomic rearrangement, with insertion of introns being a prominent feature.  相似文献   

4.
Protein domain repeats are common in proteins that are central to the organization of a cell, in particular in eukaryotes. They are known to evolve through internal tandem duplications. However, the understanding of the underlying mechanisms is incomplete. To shed light on repeat expansion mechanisms, we have studied the evolution of the muscle protein Nebulin, a protein that contains a large number of actin-binding nebulin domains.Nebulin proteins have evolved from an invertebrate precursor containing two nebulin domains. Repeat regions have expanded through duplications of single domains, as well as duplications of a super repeat (SR) consisting of seven nebulins. We show that the SR has evolved independently into large regions in at least three instances: twice in the invertebrate Branchiostoma floridae and once in vertebrates.In-depth analysis reveals several recent tandem duplications in the Nebulin gene. The events involve both single-domain and multidomain SR units or several SR units. There are single events, but frequently the same unit is duplicated multiple times. For instance, an ancestor of human and chimpanzee underwent two tandem duplications. The duplication junction coincides with an Alu transposon, thus suggesting duplication through Alu-mediated homologous recombination.Duplications in the SR region consistently involve multiples of seven domains. However, the exact unit that is duplicated varies both between species and within species. Thus, multiple tandem duplications of the same motif did not create the large Nebulin protein.Finally, analysis of segmental duplications in the human genome reveals that duplications are more common in genes containing domain repeats than in those coding for nonrepeated proteins. In fact, segmental duplications are found three to six times more often in long repeated genes than expected by chance.  相似文献   

5.
Molecular evolution before the origin of species   总被引:2,自引:0,他引:2  
  相似文献   

6.
Zhang L  Ma H 《The New phytologist》2012,195(1):248-263
? Plants and animals possess very different developmental processes, yet share conserved epigenetic regulatory mechanisms, such as histone modifications. One of the most important forms of histone modification is methylation on lysine residues of the tails, carried out by members of the SET protein family, which are widespread in eukaryotes. ? We analyzed molecular evolution by comparative genomics and phylogenetics of the SET genes from plant and animal genomes, grouping SET genes into several subfamilies and uncovering numerous gene duplications, particularly in the Suv, Ash, Trx and E(z) subfamilies. ? Domain organizations differ between different subfamilies and between plant and animal SET proteins in some subfamilies, and support the grouping of SET genes into seven main subfamilies, suggesting that SET proteins have acquired distinctive regulatory interactions during evolution. We detected evidence for independent evolution of domain organization in different lineages, including recruitment of new domains following some duplications. ? More recent duplications in both vertebrates and land plants are probably the result of whole-genome or segmental duplications. The evolution of the SET gene family shows that gene duplications caused by segmental duplications and other mechanisms have probably contributed to the complexity of epigenetic regulation, providing insights into the evolution of the regulation of chromatin structure.  相似文献   

7.
A new algorithm is introduced for analyzing gene-duplication-independent (orthologous) and gene-duplication-dependent amino acid sequence similarities between proteins of different species. It is based on the calculation of an autocorrelation function D(x) as a Fourier series analogous to that used in crystal analysis by x-ray diffraction. The primary structure of the protein is decomposed into "homopolypeptide-defective sequences" containing identical or similar amino acid residues and vacancies corresponding to the missing amino acid residues. The Fourier transforms F(h) simulating the diffraction patterns of defective linear gratings corresponding to the defective homopolypeptide sequences are calculated. The squared F(h) values are then used as coefficients of Fourier series corresponding to the autocorrelation functions D(x). A peak of D(x) corresponds to a vector of length x, which is the distance between two identical amino acid residues. It is pointed out that optical diffraction methods, instead of computer methods, would also be useful. It is shown through a number of examples that this method allows satisfactory pattern recognition of homologies and internal duplications of an initial segment of the polypeptide chain. In the latter case the value of the above method may be seen from the fact that it detects repeated duplications in proteins such as spinach ferredoxin and myoglobin, for which other methods had either failed or given inconclusive results. The above approach appears most promising for studies of molecular evolution and structure-sequence correlations.  相似文献   

8.
The complete sequence of 595 amino acids of the alpha-I domain of human erythrocyte spectrin has been determined. Peptides derived from three different protease cleavages were purified using high performance liquid chromatography and subjected to automated amino acid sequence analysis. These data along with sequences of the cyanogen bromide and large tryptic peptides (Speicher, D.W., Davis, G., Yurchenco, P.D., and Marchesi, V.T. (1983) J. Biol. Chem. 258, 14931-14937) represent most or all of the sequence of spectrin alpha-I. The single remaining ambiguity is the precise termination of the COOH terminus of the alpha-I domain. The sequence data suggest that the 595 residues presented here represent the complete sequence of the alpha-I domain, but the apparent size of the COOH-terminal CNBr fragment suggests the existence of an additional 38 residues at the end of the domain. The sequence of the alpha-I domain contains a single type of internal homology composed of multiple 106-amino acid repeats consistent with the occurrence of multiple gene duplications during the course of spectrin evolution. The only portion of the alpha-I sequence which does not appear to contain this sequence repeat is the segment containing the NH2-terminal 17 residues. This unique segment may be part of the oligomer binding site. No disulfide bonds appear to be involved in the structure of alpha-I and cysteine is not highly conserved. Calculations of secondary structure suggest the presence of short helices which fold into triple helical segments approximately 50 A in length. There is little beta sheet structure. A model of spectrin structure incorporating the repeat unit and proposed secondary structure is presented. A computer search of alpha-I sequence with the National Biomedical Research Foundation database of 2145 protein sequences did not detect any significant relationships. Spectrin is apparently the first member of a new class of proteins to be structurally characterized.  相似文献   

9.
Using several consensus sequences for the 106 amino acid residue alpha-spectrin repeat segment as probes we searched animal sequence databases using the BLAST program in order to find proteins revealing limited, but significant similarity to spectrin. Among many spectrins and proteins from the spectrin-alpha-actinin-dystrophin family as well as sequences showing a rather high degree of similarity in very short stretches, we found seven homologous animal sequences of low overall similarity to spectrin but showing the presence of one or more spectrin-repeat motifs. The homology relationship of these sequences to alpha-spectrin was further analysed using the SEMIHOM program. Depending on the probe, these segments showed the presence of 6 to 26 identical amino acid residues and a variable number of semihomologous residues. Moreover, we found six protein sequences, which contained a sequence fragment sharing the SH3 (sarc homology region 3) domain homology of 42-59% similarity. Our data indicate the occurrence of motifs of significant homology to alpha-spectrin repeat segments among animal proteins, which are not classical members of the spectrin-alpha-actinin-dystrophin family. This might indicate that these segments together with the SH3 domain motif are conserved in proteins which possibly at the early stage of evolution were close cognates of spectrin-alpha-actinin-dystrophin progenitors but then evolved separately.  相似文献   

10.
In higher plants, the superfamily of carboxyl-CoA ligases and related proteins, collectively called acyl activating enzymes (AAEs), has evolved to provide enzymes for many pathways of primary and secondary metabolism and for the conjugation of hormones to amino acids. Across the superfamily there is only limited sequence similarity, but a series of highly conserved motifs, including the AMP-binding domain, make it easy to identify members. These conserved motifs are best understood in terms of the unique domain-rotation architecture that allows AAE enzymes to catalyze the two distinct steps of the CoA ligase reaction. Arabidopsis AAE sequences were used to identify the AAE gene families in the sequenced genomes of green algae, mosses, and trees; the size of the respective families increased with increasing degree of organismal cellular complexity, size, and generation time. Large-scale genome duplications and small-scale tandem gene duplications have contributed to AAE gene family complexity to differing extents in each of the multicellular species analyzed. Gene duplication and evolution of novel functions in Arabidopsis appears to have occurred rapidly, because acquisition of new substrate specificity is relatively easy in this class of proteins. Convergent evolution has also occurred between members of distantly related clades. These features of the AAE superfamily make it difficult to use homology searches and other genomics tools to predict enzyme function.  相似文献   

11.
Zimek A  Weber K 《Gene》2002,288(1-2):187-193
We have isolated full length cDNAs encoding a cytoplasmic intermediate filament (IF) protein of a priapulid (Priapulus caudatus) and of a hemichordate (Saccoglossus kowalevskii) and determined the organisation of the hemichordate gene. As expected, the priapulid protein shows the long coil 1b subdomain and the lamin tail homology segment already known for cytoplasmic IF proteins from 11 other protostomic phyla. Surprisingly, the hemichordate protein follows in domain organisation much more closely the protostomic IF proteins than the chordate IF proteins. Thus, the lack of a lamin tail homology segment together with a deletion of 42 residues in the coil 1b domain is a molecular feature restricted to the chordates. We propose that the known IF subfamilies I to IV developed by gene duplications and sequence drift after the deletion in coil 1b arose at the base of the chordate branch.  相似文献   

12.
We now know that the evolution of multidomain proteins has frequently involved genetic duplication events. These, however, are sometimes difficult to trace because of low sequence similarity between duplicated segments. Spectrin, the major component of the membrane skeleton that provides elasticity to the cell, contains tandemly repeated sequences of 106 amino acid residues. The same repeats are also present in α-actinin, dystrophin and utrophin. Sequence alignments and phylogenetic trees of these domains allow us to interpret the evolutionary relationship between these proteins, concluding that spectrin evolved from α-actinin by an elongation process that included two duplications of a block of seven repeats. This analysis shows how a modular protein unit can be used in the evolution of large cytoskeletal structures.  相似文献   

13.
We have investigated the mechanism and the evolutionary pathway of protein dimerization through analysis of experimental structures of dimers. We propose that the evolution of dimers may have multiple pathways, including (1) formation of a functional dimer directly without going through an ancestor monomer, (2) formation of a stable monomer as an intermediate followed by mutations of its surface residues, and (3), a domain swapping mechanism, replacing one segment in a monomer by an equivalent segment from an identical chain in the dimer. Some of the dimers which are governed by a domain swapping mechanism may have evolved at an earlier stage of evolution via the second mechanism. Here, we follow the theory that the kinetic pathway reflects the evolutionary pathway. We analyze the structure-kinetics-evolution relationship for a collection of symmetric homodimers classified into three groups: (1) 14 dimers, which were referred to as domain swapping dimers in the literature; (2) nine 2-state dimers, which have no measurable intermediates in equilibrium denaturation; and (3), eight 3-state dimers, which have stable intermediates in equilibrium denaturation. The analysis consists of the following stages: (i) The dimer is divided into two structural units, which have twofold symmetry. Each unit contains a contiguous segment from one polypeptide chain of the dimer, and its complementary contiguous segment from the other chain. (ii) The division is repeated progressively, with different combinations of the two segments in each unit. (iii) The coefficient of compactness is calculated for the units in all divisions. The coefficients obtained for different cuttings of a dimer form a compactness profile. The profile probes the structural organization of the two chains in a dimer and the stability of the monomeric state. We describe the features of the compactness profiles in each of the three dimer groups. The profiles identify the swapping segments in domain swapping dimers, and can usually predict whether a dimer has domain swapping. The kinetics of dimerization indicates that some dimers which have been assigned in the literature as domain swapping cases, dimerize through the 2-state kinetics, rather than through swapping segments of performed monomers. The compactness profiles indicate a wide spectrum in the kinetics of dimerization: dimers having no intermediate stable monomers; dimers having an intermediate with a stable monomer structure; and dimers having an intermediate with a stable structure in part of the monomer. These correspond to the multiple evolutionary pathways for dimer formation. The evolutionary mechanisms proposed here for dimers are applicable to other oligomers as well.  相似文献   

14.
Animals evolved a variety of gene families involved in cell–cell communication and developmental control by gene duplication and domain shuffling. Each family is made up of several subtypes or subfamilies with distinct structures and functions, which diverged by gene duplications and domain shufflings before the divergence of parazoans and eumetazoans. Since the separation from protostomes, vertebrates expanded the multiplicity of members (isoforms) in the same subfamily by further gene duplications in their early evolution before the fish–tetrapod split. To know the dates of isoform duplications more closely, we have conducted isolation and sequencing cDNAs encoding the fibroblast growth factor receptor, Eph, src, and platelet-derived growth factor receptor subtypes belonging to the protein tyrosine kinase family from Branchiostoma belcheri, an amphioxus, Eptatretus burgeri, a hagfish, and Lampetra reissneri, a lamprey. From a phylogenetic tree of each subfamily inferred from a maximum likelihood (ML) method, together with a bootstrap analysis based on the ML method, we have shown that the isoform duplications frequently occurred in the early evolution of vertebrates around or just before the divergence of cyclostomes and gnathostomes by gene duplications and possibly chromosomal duplications. Received: 28 April 1998 / Accepted: 30 June 1999  相似文献   

15.
Most eukaryotic proteins consist of multiple domains created through gene fusions or internal duplications. The most frequent change of a domain architecture (DA) is insertion or deletion of a domain at the N or C terminus. Still, the mechanisms underlying the evolution of multidomain proteins are not very well studied.Here, we have studied the evolution of multidomain architectures (MDA), guided by evolutionary information in the form of a phylogenetic tree. Our results show that Pfam domain families and MDAs have been created with comparable rates (0.1-1 per million years (My)). The major changes in DA evolution have occurred in the process of multicellularization and within the metazoan lineage. In contrast, creation of domains seems to have been frequent already in the early evolution. Furthermore, most of the architectures have been created from older domains or architectures, whereas novel domains are mainly found in single-domain proteins. However, a particular group of exon-bordering domains may have contributed to the rapid evolution of novel multidomain proteins in metazoan organisms. Finally, MDAs have evolved predominantly through insertions of domains, whereas domain deletions are less common.In conclusion, the rate of creation of multidomain proteins has accelerated in the metazoan lineage, which may partly be explained by the frequent insertion of exon-bordering domains into new architectures. However, our results indicate that other factors have contributed as well.  相似文献   

16.
Chromosomal stability is safeguarded by a mitotic checkpoint, of which BUB1 and Mad3/BUBR1 are core components. These paralogs have similar, but not identical, domain organization. We show that Mad3/BUBR1 and BUB1 paralogous pairs arose by nine independent gene duplications throughout evolution, followed by parallel subfunctionalization in which preservation of the ancestral, amino-terminal KEN box or kinase domain was mutually exclusive. In one exception, vertebrate BUBR1-defined by the KEN box-preserved the kinase domain but allowed nonconserved degeneration of catalytic motifs. Although BUBR1 evolved to a typical pseudokinase in some vertebrates, it retained the catalytic triad in humans. However, we show that putative catalysis by human BUBR1 is dispensable for error-free chromosome segregation. Instead, residues that interact with ATP in conventional kinases are essential for conformational stability in BUBR1. We propose that parallel evolution of BUBR1 orthologs rendered its kinase function dispensable in vertebrates, producing an unusual, triad-containing pseudokinase.  相似文献   

17.
Evolution of Chitin-Binding Proteins in Invertebrates   总被引:11,自引:0,他引:11  
Analysis of a group of invertebrate proteins, including chitinases and peritrophic matrix proteins, reveals the presence of chitin-binding domains that share significant amino acid sequence similarity. The data suggest that these domains evolved from a common ancestor which may be a protein containing a single chitin-binding domain. The duplication and transposition of this chitin-binding domain may have contributed to the functional diversification of chitin-binding proteins. Sequence comparisons indicated that invertebrate and plant chitin binding domains do not share significant amino acid sequence similarity, suggesting that they are not coancestral. However, both the invertebrate and the plant chitin-binding domains are cysteine-rich and have several highly conserved aromatic residues. In plants, cysteines have been elucidated in maintaining protein folding and aromatic amino acids in interacting with saccharides [Wright HT, Sanddrasegaram G, Wright CS (1991) J Mol Evol 33:283–294]. It is likely that these residues perform similar functions in invertebrates. We propose that the invertebrate and the plant chitin-binding domains share similar mechanisms for folding and saccharide binding and that they evolved by convergent evolution. Furthermore, we propose that the disulfide bonds and aromatic residues are hallmarks for saccharide-binding proteins. Received: 2 March 1998 / Accepted: 17 July 1998  相似文献   

18.
Duplications are a major driving force behind evolution. Most duplicates are believed to fix through genetic drift, but it is not clear whether this process affects all duplications equally or whether there are certain gene families that are expected to show neutral expansions under certain circumstances. Here, we analyse the neutrality of duplications in different functional classes of signalling proteins based on their effects on response dynamics. We find that duplications involving intermediary proteins in a signalling network are neutral more often than those involving receptors. Although the fraction of neutral duplications in all functional classes increase with decreasing population size and selective pressure on dynamics, this effect is most pronounced for receptors, indicating a possible expansion of receptors in species with small population size. In line with such an expectation, we found a statistically significant increase in the number of receptors as a fraction of genome size in eukaryotes compared with prokaryotes. Although not confirmative, these results indicate that neutral processes can be a significant factor in shaping signalling networks and affect proteins from different functional classes differently.  相似文献   

19.
The complete amino acid sequence of the 61-kDa calmodulin-dependent, cyclic nucleotide phosphodiesterase (CaM-PDE) from bovine brain has been determined. The native protein is a homodimer of N alpha-acetylated, 529-residue polypeptide chains, each of which has a calculated molecular weight of 60,755. The structural organization of this CaM-PDE has been investigated with use of limited proteolysis and synthetic peptide analogues. A site capable of interacting with CaM has been identified, and the position of the catalytic domain has been mapped. A fully active, CaM-independent fragment (Mr = 36,000), produced by limited tryptic cleavage in the absence of CaM, represents a functional catalytic domain. N-Terminal sequence and size indicate that this 36-kDa fragment is comprised of residues 136 to approximately 450 of the CaM-PDE. This catalytic domain encompasses a approximately 250 residue sequence that is conserved among PDE isozymes of diverse size, phylogeny, and function. CaM-PDE and its PDE homologues comprise a unique family of proteins, each having a catalytic domain that evolved from a common progenitor. A search of the sequence for potential CaM-binding sites revealed only one 15-residue segment with both a net positive charge and the ability to form an amphiphilic alpha-helix. Peptide analogues that include this amphiphilic segment were synthesized. Each was found to inhibit the CaM-dependent activation of the enzyme and to bind directly to CaM with high affinity in a calcium-dependent manner. This site is among the sequences cleaved from a 45-kDa chymotryptic fragment that has the complete catalytic domain but no longer binds CaM. These results indicate that residues located between position 23 and 41 of the native enzyme contribute significantly to the binding of CaM although the involvement of residues from additional sites is not excluded.  相似文献   

20.
An examination of fragments of the human dystrophin rod domain, corresponding to a single structural repeating unit, showed that a critical chain length, defined with a precision of one residue at the C-terminal end, is required for formation of the native tertiary fold. We report here that extending the chain by six residues beyond this minimum results in a large increase in conformational stability. This is not related to a change in association state of the polypeptide. The results support the conjecture that successive repeating units in the rod domain of the spectrinlike proteins form a nested structure, in which the N-terminal part of the three-helix bundle of one repeat packs into the overlapping structure of the preceding repeat. This would be expected to affect functional characteristics related to flexibility of the dystrophin rod domain.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号