首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The subunit interfaces of 122 homodimers of known three-dimensional structure are analyzed and dissected into sets of surface patches by clustering atoms at the interface; 70 interfaces are single-patch, the others have up to six patches, often contributed by different structural domains. The average interface buries 1,940 A2 of the surface of each monomer, contains one or two patches burying 600-1,600 A2, is 65% nonpolar and includes 18 hydrogen bonds. However, the range of size and of hydrophobicity is wide among the 122 interfaces. Each interface has a core made of residues with atoms buried in the dimer, surrounded by a rim of residues with atoms that remain accessible to solvent. The core, which constitutes 77% of the interface on average, has an amino acid composition that resembles the protein interior except for the presence of arginine residues, whereas the rim is more like the protein surface. These properties of the interfaces in homodimers, which are permanent assemblies, are compared to those of protein-protein complexes where the components associate after they have independently folded. On average, subunit interfaces in homodimers are twice larger than in complexes, and much less polar due to the large fraction belonging to the core, although the amino acid compositions of the cores are similar in the two types of interfaces.  相似文献   

2.
Chakrabarti P  Janin J 《Proteins》2002,47(3):334-343
The recognition sites in 70 pairwise protein-protein complexes of known three-dimensional structure are dissected in a set of surface patches by clustering atoms at the interface. When the interface buries <2000 A2 of protein surface, the recognition sites usually form a single patch on the surface of each component protein. In contrast, larger interfaces are generally multipatch, with at least one pair of patches that are equivalent in size to a single-patch interface. Each recognition site, or patch within a site, contains a core made of buried interface atoms, surrounded by a rim of atoms that remain accessible to solvent in the complex. A simple geometric model reproduces the number and distribution of atoms within a patch. The rim is similar in composition to the rest of the protein surface, but the core has a distinctive amino acid composition, which may help in identifying potential protein recognition sites on single proteins of known structures.  相似文献   

3.
The strongly conserved amino acid sequences of the P8 outer capsid proteins of Rice dwarf virus (RDV) and Rice gall dwarf virus (RGDV) and the distribution of electrostatic potential on the proteins at the interfaces between structural proteins suggested the possibility that P8-trimers of RGDV might bind to the 3-fold symmetrical axes of RDV core particles, with vertical interaction between heterologous P3 and P8 proteins and lateral binding of homologous P8 proteins, thereby allowing formation of the double-layered capsids that are characteristic of viruses that belong to the family Reoviridae. We proved this hypothesis using chimeric virus-like particles composed of the P3 core capsid protein of RDV and the P8 outer capsid protein of RGDV, which were co-expressed in a baculovirus expression system. This is the first report on the molecular biological proof of the mechanism of the assembly of the double-layered capsids with disparate icosahedral lattices.  相似文献   

4.
Bahadur RP  Janin J 《Proteins》2008,71(1):407-414
To evaluate the evolutionary constraints placed on viral proteins by the structure and assembly of the capsid, we calculate Shannon entropies in the aligned sequences of 45 polypeptide chains in 32 icosahedral viruses, and relate these entropies to the residue location in the three-dimensional structure of the capsids. Three categories of residues have entropies lower than the chain average implying that they are better conserved than average: residues that are buried within a subunit (the protein core), residues that contain atoms buried at an interface between subunits (the interface core), and residues that contribute to several such interfaces. The interface core is also conserved in homomeric proteins and in transient protein-protein complexes, which have only one interface whereas capsids have many. In capsids, the subunit interfaces implicate most of the polypeptide chain: on average, 66% of the capsid residues are at an interface, 34% at more than one, and 47% at the interface core. Nevertheless, we observe that the degree of residue conservation can vary widely between interfaces within a capsid and between regions within an interface. The interfaces and regions of interfaces that show a low sequence variability are likely to play major roles in the self-assembly of the capsid, with implications on its mechanism that we discuss taking adeno-associated virus as an example.  相似文献   

5.
We selected 49 icosahedral virus capsids whose crystal structures are reported in the Protein Data Bank. They belong to the T=1, T=3, pseudo T=3 and other lattice types. We identified in them 779 unique interfaces between pairs of subunits, all repeated by icosahedral symmetry. We analyzed the geometric and physical chemical properties of these interfaces and compared with interfaces in protein-protein complexes and homodimeric proteins, and with crystal packing contacts. The capsids contain one to 16 subunits implicated in three to 66 unique interfaces. Each subunit loses 40-60% of its accessible surface in contacts with an average of 8.5 neighbors. Many of the interfaces are very large with a buried surface area (BSA) that can exceed 10,000 A(2), yet 39% are small with a BSA<800 A(2) comparable to crystal packing contacts. Pairwise capsid interfaces overlap, so that one-third of the residues are part of more than one interface. Those with a BSA>800 A(2) resemble homodimer interfaces in their chemical composition. Relative to the protein surface, they are non-polar, enriched in aliphatic residues and depleted of charged residues, but not of neutral polar residues. They contain one H-bond per about 200 A(2) BSA. Small capsid interfaces (BSA<800 A(2)) are only slightly more polar. They have a similar amino acid composition, but they bury fewer atoms and contain fewer H-bonds for their size. Geometric parameters that estimate the quality of the atomic packing suggest that the small capsid interfaces are loosely packed like crystal packing contacts, whereas the larger interfaces are close-packed as in protein-protein complexes and homodimers. We discuss implications of these findings on the mechanism of capsid assembly, assuming that the larger interfaces form first to yield stable oligomeric species (capsomeres), and that medium-size interfaces allow the stepwise addition of capsomeres to build larger intermediates.  相似文献   

6.
7.
Recent advances in DNA sequencing techniques have identified rare single‐nucleotide variants with less than 1% minor allele frequency. Despite the growing interest and physiological importance of rare variants in genome sciences, less attention has been paid to the allele frequency of variants in protein sciences. To elucidate the characteristics of genetic variants on protein interaction sites, from the viewpoints of the allele frequency and the structural position of variants, we mapped about 20,000 human SNVs onto protein complexes. We found that variants are less abundant in protein interfaces, and specifically the core regions of interfaces. The tendency to “avoid” the interfacial core is stronger among common variants than rare variants. As amino acid substitutions, the trend of mutating amino acids among rare variants is consistent in different interfacial regions, reflecting the fact that rare variants result from random mutations in DNA sequences, whereas amino acid changes of common variants vary between the interfacial core and rim regions, possibly due to functional constraints on proteins. This study illustrated how the allele frequency of variants relates to the protein structural regions and the functional sites in general and will lead to deeper understanding of the potential deleteriousness of rare variants at the structural level. Exceptional cases of the observed trends will shed light on the limitations of structural approaches to evaluate the functional impacts of variants.  相似文献   

8.
Protein crystals contain two different types of interfaces: biologically relevant ones, observed in protein–protein complexes and oligomeric proteins, and nonspecific ones, corresponding to crystal lattice contacts. Because of the increasing complexity of the objects being tackled in structural biology, distinguishing biological contacts from crystal contacts is not always a trivial task and can lead to wrong interpretation of macromolecular structures. We devised an approach (CRK, core‐rim Ka/Ks ratio) for distinguishing biologically relevant interfaces from nonspecific ones. Given a protein–protein interface, CRK finds a set of homologs to the sequences of the proteins involved in the interface, retrieves and aligns the corresponding coding sequences, on which it carries out a residue‐by‐residue Ka/Ks ratio (ω) calculation. It divides interface residues into a “rim” and a “core” set and analyzes the selection pressure on the residues belonging to the two sets. We developed and tested CRK on different datasets and test cases, consisting of biologically relevant contacts, nonspecific ones or of both types. The method proves very effective in distinguishing the two categories of interfaces, with an overall accuracy rate of 84%. As it relies on different principles when compared with existing tools, CRK is optimally suited to be used in combination with them. In addition, CRK has potential applications in the validation of structures of oligomeric proteins and protein complexes. Proteins 2010. © 2010 Wiley‐Liss, Inc.  相似文献   

9.
Molecular principles of the interactions of disordered proteins   总被引:6,自引:0,他引:6  
Thorough knowledge of the molecular principles of protein-protein recognition is essential to our understanding of protein function at the cellular level. Whereas interactions of ordered proteins have been analyzed in great detail, complexes of intrinsically unstructured/disordered proteins (IUPs) have hardly been addressed so far. Here, we have collected a database of 39 complexes of experimentally verified IUPs, and compared their interfaces with those of 72 complexes of ordered, globular proteins. The characteristic differences found between the two types of complexes suggest that IUPs represent a distinct molecular implementation of the principles of protein-protein recognition. The interfaces do not differ in size, but those of IUPs cover a much larger part of the surface of the protein than for their ordered counterparts. Moreover, IUP interfaces are significantly more hydrophobic relative to their overall amino acid composition, but also in absolute terms. They rely more on hydrophobic-hydrophobic than on polar-polar interactions. Their amino acids in the interface realize more intermolecular contacts, which suggests a better fit with the partner due to induced folding upon binding that results in a better adaptation to the partner. The two modes of interaction also differ in that IUPs usually use only a single continuous segment for partner binding, whereas the binding sites of ordered proteins are more segmented. Probably, all these features contribute to the increased evolutionary conservation of IUP interface residues. These noted molecular differences are also manifested in the interaction energies of IUPs. Our approximation of these by low-resolution force-fields shows that IUPs gain much more stabilization energy from intermolecular contacts, than from folding, i.e. they use their binding energy for folding. Overall, our findings provide a structural rationale to the prior suggestions that many IUPs are specialized for functions realized by protein-protein interactions.  相似文献   

10.
The repressor proteins of the LacI/GalR family exhibit significant similarity in their secondary and tertiary structures despite less than 35% identity in their primary sequences. Furthermore, the core domains of these oligomeric repressors, which mediate dimerization, are homologous with the monomeric periplasmic binding proteins, extending the issue of plasticity to quaternary structure. To elucidate the determinants of assembly, a structure-based alignment has been created for three repressors and four periplasmic binding proteins. Contact maps have also been constructed for the three repressor interfaces to distinguish any conserved interactions. These analyses show few strict requirements for assembly of the core N-subdomain interface. The interfaces of repressor core C-subdomains are well conserved at the structural level, and their primary sequences differ significantly from the monomeric periplasmic binding proteins at positions equivalent to LacI 281 and 282. However, previous biochemical and phenotypic analyses indicate that LacI tolerates many mutations at 281. Mutations at LacI 282 were shown to abrogate assembly, but for Y282D this could be compensated by a second-site mutation in the core N-subdomain at K84 to L or A. Using the link between LacI assembly and function, we have further identified 22 second-site mutations that compensate the Y282D dimerization defect in vivo. The sites of these mutations fall into several structural regions, each of which may influence assembly by a different mechanism. Thus, the 360-amino acid scaffold of LacI allows plasticity of its quaternary structure. The periplasmic binding proteins may require only minimal changes to facilitate oligomerization similar to the repressor proteins.  相似文献   

11.
We analyzed subunit interfaces in 315 homodimers with an X-ray structure in the Protein Data Bank, validated by checking the literature for data that indicate that the proteins are dimeric in solution and that, in the case of the “weak” dimers, the homodimer is in equilibrium with the monomer. The interfaces of the 42 weak dimers, which are smaller by a factor of 2.4 on average than in the remainder of the set, are comparable in size with antibody-antigen or protease-inhibitor interfaces. Nevertheless, they are more hydrophobic than in the average transient protein-protein complex and similar in amino acid composition to the other homodimer interfaces. The mean numbers of interface hydrogen bonds and hydration water molecules per unit area are also similar in homodimers and transient complexes. Parameters related to the atomic packing suggest that many of the weak dimer interfaces are loosely packed, and we suggest that this contributes to their low stability. To evaluate the evolutionary selection pressure on interface residues, we calculated the Shannon entropy of homologous amino acid sequences at 60% sequence identity. In 93% of the homodimers, the interface residues are better conserved than the residues on the protein surface. The weak dimers display the same high degree of interface conservation as other homodimers, but their homologs may be heterodimers as well as homodimers. Their interfaces may be good models in terms of their size, composition, and evolutionary conservation for the labile subunit contacts that allow protein assemblies to share and exchange components, allosteric proteins to undergo quaternary structure transitions, and molecular machines to operate in the cell.  相似文献   

12.
Hafumi Nishi  Motonori Ota 《Proteins》2010,78(6):1563-1574
Despite similarities in their sequence and structure, there are a number of homologous proteins that adopt various oligomeric states. Comparisons of these homologous protein pairs, in terms of residue substitutions at the protein–protein interfaces, have provided fundamental characteristics that describe how proteins interact with each other. We have prepared a dataset composed of pairs of related proteins with different homo‐oligomeric states. Using the protein complexes, the interface residues were identified, and using structural alignments, the shadow‐interface residues have been defined as the surface residues that align with the interface residues. Subsequently, we investigated residue substitutions between the interfaces and the shadow interfaces. Based on the degree of the contributions to the interactions, the aligned sites of the interfaces and shadow interfaces were divided into primary and secondary sites; the primary sites are the focus of this work. The primary sites were further classified into two groups (i.e. exposed and buried) based on the degree to which the residue is buried within the shadow interfaces. Using these classifications, two simple mechanisms that mediate the oligomeric states were identified. In the primary‐exposed sites, the residues on the shadow interfaces are replaced by more hydrophobic or aromatic residues, which are physicochemically favored at protein–protein interfaces. In the primary‐buried sites, the residues on the shadow interfaces are replaced by larger residues that protrude into other proteins. These simple rules are satisfied in 23 out of 25 Structural Classification of Proteins (SCOP) families with a different‐oligomeric‐state pair, and thus represent a basic strategy for modulating protein associations and dissociations. Proteins 2010. © 2009 Wiley‐Liss, Inc.  相似文献   

13.
Recently, renewed interest in the evolution of oligomeric proteins has seen new approaches explored and new hypotheses proposed. The model systems chosen are generally made up of pairs of homologous proteins, each composed of a monomer and a dimeric counterpart, but the question has been also approached by comparing statistically significant structural patterns in sets of monomeric and oligomeric proteins. Here the tools of genetics and chemistry potentially available to the evolution of oligomeric proteins are discussed, as well as the possible effects of environments on the early attempts to oligomerization. Traces of an ancestral monomeric status of oligomers may be detected in the significant presence of polar and charged residues at intersubunit interfaces, and by the recognition that, besides the hydrophobic effect, a 'hydrophilic' effect has also had a role in the construction of these interfaces. The traditional 'mutation' model is described and found to be based on a hierarchy of mutations, crowned by a 'primary' mutation, one that could prime oligomerization by irreversibly altering the structure of an ancestral monomer. The mechanism of oligomerization based on the exchange or 'swap' of structural elements between monomers is discussed. The possibility is also discussed that the main steps in the folding pathway of an oligomeric protein reiterate the main steps in its evolution.  相似文献   

14.
Despite the discovery of Epstein-Barr virus more than 35 years ago, a thorough understanding of gammaherpesvirus capsid composition and structure has remained elusive. We approached this problem by purifying capsids from Kaposi's sarcoma-associated herpesvirus (KSHV), the only other known human gammaherpesvirus. The results from our biochemical and imaging analyses demonstrate that KSHV capsids possess a typical herpesvirus icosahedral capsid shell composed of four structural proteins. The hexameric and pentameric capsomers are composed of the major capsid protein (MCP) encoded by open reading frame 25. The heterotrimeric complexes, forming the capsid floor between the hexons and pentons, are each composed of one molecule of ORF62 and two molecules of ORF26. Each of these proteins has significant amino acid sequence homology to capsid proteins in alpha- and betaherpesviruses. In contrast, the fourth protein, ORF65, lacks significant sequence homology to its structural counterparts from the other subfamilies. Nevertheless, this small, basic, and highly antigenic protein decorates the surface of the capsids, as does, for example, the even smaller basic capsid protein VP26 of herpes simplex virus type 1. We have also found that, as with the alpha- and betaherpesviruses, lytic replication of KSHV leads to the formation of at least three capsid species, A, B, and C, with masses of approximately 200, 230, and 300 MDa, respectively. A capsids are empty, B capsids contain an inner array of a fifth structural protein, ORF17.5, and C capsids contain the viral genome.  相似文献   

15.
Many soluble and membrane proteins form homooligomeric complexes in a cell which are responsible for the diversity and specificity of many pathways, may mediate and regulate gene expression, activity of enzymes, ion channels, receptors, and cell adhesion processes. The evolutionary and physical mechanisms of oligomerization are very diverse and its general principles have not yet been formulated. Homooligomeric states may be conserved within certain protein subfamilies and might be important in providing specificity to certain substrates while minimizing interactions with other unwanted partners. Moreover, recent studies have led to a greater awareness that transitions between different oligomeric states may regulate protein activity and provide the switch between different pathways. In this paper we summarize the biological importance of homooligomeric assemblies, physico-chemical properties of their interfaces, experimental and computational methods for their identification and prediction. We particularly focus on homooligomer evolution and describe the mechanisms to develop new specificities through the formation of different homooligomeric complexes. Finally, we discuss the possible role of oligomeric transitions in the regulation of protein activity and compile a set of experimental examples with such regulatory mechanisms.  相似文献   

16.
Protein–protein interactions are essential to all aspects of life. Specific interactions result from evolutionary pressure at the interacting interfaces of partner proteins. However, evolutionary pressure is not homogeneous within the interface: for instance, each residue does not contribute equally to the binding energy of the complex. To understand functional differences between residues within the interface, we analyzed their properties in the core and rim regions. Here, we characterized protein interfaces with two evolutionary measures, conservation and coevolution, using a comprehensive dataset of 896 protein complexes. These scores can detect different selection pressures at a given position in a multiple sequence alignment. We also analyzed how the number of interactions in which a residue is involved influences those evolutionary signals. We found that the coevolutionary signal is higher in the interface core than in the interface rim region. Additionally, the difference in coevolution between core and rim regions is comparable to the known difference in conservation between those regions. Considering proteins with multiple interactions, we found that conservation and coevolution increase with the number of different interfaces in which a residue is involved, suggesting that more constraints (i.e., a residue that must satisfy a greater number of interactions) allow fewer sequence changes at those positions, resulting in higher conservation and coevolution values. These findings shed light on the evolution of protein interfaces and provide information useful for identifying protein interfaces and predicting protein–protein interactions.  相似文献   

17.
The crystal structure of a novel Cre-Lox synapse was solved using phases from multiple isomorphous replacement and anomalous scattering, and refined to 2.05 A resolution. In this complex, a symmetric protein trimer is bound to a Y-shaped three-way DNA junction, a marked departure from the pseudo-4-fold symmetrical tetramer associated with Cre-mediated LoxP recombination. The three-way DNA junction was accommodated by a simple kink without significant distortion of the adjoining DNA duplexes. Although the mean angle between DNA arms in the Y and X structures was similar, adjacent Cre trimer subunits rotated 29 degrees relative to those in the tetramers. This rotation was accommodated at the protein-protein and DNA-DNA interfaces by interactions that are "quasi-equivalent" to those in the tetramer, analogous to packing differences of chemically identical viral subunits at non-equivalent positions in icosahedral capsids. This structural quasi-equivalence extends to function as Cre can bind to, cleave and perform strand transfer with a three-way Lox substrate. The structure explains the dual recognition of three and four-way junctions by site-specific recombinases as being due to shared structural features between the differently branched substrates and plasticity of the protein-protein interfaces. To our knowledge, this is the first direct demonstration of quasi-equivalence in both the assembly and function of an oligomeric enzyme.  相似文献   

18.
Structure-based prediction of DNA target sites by regulatory proteins   总被引:15,自引:0,他引:15  
Kono H  Sarai A 《Proteins》1999,35(1):114-131
Regulatory proteins play a critical role in controlling complex spatial and temporal patterns of gene expression in higher organism, by recognizing multiple DNA sequences and regulating multiple target genes. Increasing amounts of structural data on the protein-DNA complex provides clues for the mechanism of target recognition by regulatory proteins. The analyses of the propensities of base-amino acid interactions observed in those structural data show that there is no one-to-one correspondence in the interaction, but clear preferences exist. On the other hand, the analysis of spatial distribution of amino acids around bases shows that even those amino acids with strong base preference such as Arg with G are distributed in a wide space around bases. Thus, amino acids with many different geometries can form a similar type of interaction with bases. The redundancy and structural flexibility in the interaction suggest that there are no simple rules in the sequence recognition, and its prediction is not straightforward. However, the spatial distributions of amino acids around bases indicate a possibility that the structural data can be used to derive empirical interaction potentials between amino acids and bases. Such information extracted from structural databases has been successfully used to predict amino acid sequences that fold into particular protein structures. We surmised that the structures of protein-DNA complexes could be used to predict DNA target sites for regulatory proteins, because determining DNA sequences that bind to a particular protein structure should be similar to finding amino acid sequences that fold into a particular structure. Here we demonstrate that the structural data can be used to predict DNA target sequences for regulatory proteins. Pairwise potentials that determine the interaction between bases and amino acids were empirically derived from the structural data. These potentials were then used to examine the compatibility between DNA sequences and the protein-DNA complex structure in a combinatorial "threading" procedure. We applied this strategy to the structures of protein-DNA complexes to predict DNA binding sites recognized by regulatory proteins. To test the applicability of this method in target-site prediction, we examined the effects of cognate and noncognate binding, cooperative binding, and DNA deformation on the binding specificity, and predicted binding sites in real promoters and compared with experimental data. These results show that target binding sites for several regulatory proteins are successfully predicted, and our data suggest that this method can serve as a powerful tool for predicting multiple target sites and target genes for regulatory proteins.  相似文献   

19.
Macromolecular oligomeric assemblies are involved in many biochemical processes of living organisms. The benefits of such assemblies in crowded cellular environments include increased reaction rates, efficient feedback regulation, cooperativity and protective functions. However, an atom‐level structural determination of large assemblies is challenging due to the size of the complex and the difference in binding affinities of the involved proteins. In this study, we propose a novel combinatorial greedy algorithm for assembling large oligomeric complexes from information on the approximate position of interaction interfaces of pairs of monomers in the complex. Prior information on complex symmetry is not required but rather the symmetry is inferred during assembly. We implement an efficient geometric score, the transformation match score, that bypasses the model ranking problems of state‐of‐the‐art scoring functions by scoring the similarity between the inferred dimers of the same monomer simultaneously with different binding partners in a (sub)complex with a set of pregenerated docking poses. We compiled a diverse benchmark set of 308 homo and heteromeric complexes containing 6 to 60 monomers. To explore the applicability of the method, we considered 48 sets of parameters and selected those three sets of parameters, for which the algorithm can correctly reconstruct the maximum number, namely 252 complexes (81.8%) in, at least one of the respective three runs. The crossvalidation coverage, that is, the mean fraction of correctly reconstructed benchmark complexes during crossvalidation, was 78.1%, which demonstrates the ability of the presented method to correctly reconstruct topology of a large variety of biological complexes. Proteins 2015; 83:1887–1899. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.  相似文献   

20.
Protein-DNA interactions are crucial for many biological processes. Attempts to model these interactions have generally taken the form of amino acid-base recognition codes or purely sequence-based profile methods, which depend on the availability of extensive sequence and structural information for specific structural families, neglect side-chain conformational variability, and lack generality beyond the structural family used to train the model. Here, we take advantage of recent advances in rotamer-based protein design and the large number of structurally characterized protein-DNA complexes to develop and parameterize a simple physical model for protein-DNA interactions. The model shows considerable promise for redesigning amino acids at protein-DNA interfaces, as design calculations recover the amino acid residue identities and conformations at these interfaces with accuracies comparable to sequence recovery in globular proteins. The model shows promise also for predicting DNA-binding specificity for fixed protein sequences: native DNA sequences are selected correctly from pools of competing DNA substrates; however, incorporation of backbone movement will likely be required to improve performance in homology modeling applications. Interestingly, optimization of zinc finger protein amino acid sequences for high-affinity binding to specific DNA sequences results in proteins with little or no predicted specificity, suggesting that naturally occurring DNA-binding proteins are optimized for specificity rather than affinity. When combined with algorithms that optimize specificity directly, the simple computational model developed here should be useful for the engineering of proteins with novel DNA-binding specificities.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号