首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
G-protein coupled receptors (GPCRs) belong to biologically important and functionally diverse and largest super family of membrane proteins. GPCRs retain a characteristic membrane topology of seven alpha helices with three intracellular, three extracellular loops and flanking N' and C' terminal residues. Subtle differences do exist in the helix boundaries (TM-domain), loop lengths, sequence features such as conserved motifs, and substituting amino acid patterns and their physiochemical properties amongst these sequences (clusters) at intra-genomic and inter-genomic level (please re-phrase into 2 statements for clarity). In the current study, we employ prediction of helix boundaries and scores derived from amino acid substitution exchange matrices to identify the conserved amino acid residues (motifs) as consensus in aligned set of homologous GPCR sequences. Co-clustered GPCRs from human and other genomes, organized as 32 clusters, were employed to study the amino acid conservation patterns and species-specific or cluster-specific motifs. Critical analysis on sequence composition and properties provide clues to connect functional relevance within and across genome for vast practical applications such as design of mutations and understanding of disease-causing genetic abnormalities.  相似文献   

3.
The three-dimensional structures of homologous proteins are usually conserved during evolution, as are critical residues in a few short sequence motifs that often constitute the active site in enzymes. The precise spatial organization of such sites depends on the lengths and positions of the secondary structural elements connecting the motifs. We show how members of protein superfamilies, such as kinesins, myosins, and G(alpha) subunits of trimeric G proteins, are identified and classed by simply counting the number of amino acid residues between important sequence motifs in their nucleotide triphosphate-hydrolyzing domains. Subfamily-specific landmark patterns (motif to motif scores) are principally due to inserts and gaps in surface loops. Unusual protein sequences and possible sequence prediction errors are detected.  相似文献   

4.
A suite of FORTRAN programs, PREF, is described for calculating preference functions from the data base of known protein structures and for comparing smoothed profiles of sequence-dependent preferences in proteins of unknown structure. Amino acid preferences for a secondary structure are considered as functions of a sequence environment. Sequence environment of amino acid residue in a protein is defined as an average over some physical, chemical, or statistical property of its primary structure neighbors. The frequency distribution of sequence environments in the data base of soluble protein structures is approximately normal for each amino acid type of known secondary conformation. An analytical expression for the dependence of preferences on sequence environment is obtained after each frequency distribution is replaced by corresponding Gaussian function. The preference for the α-helical conformation increases for each amino acid type with the increase of sequence environment of buried solvent-accessible surface areas. We show that a set of preference functions based on buried surface area is useful for predicting folding motifs in α-class proteins and in integral membrane proteins. The prediction accuracy for helical residues is 79% for 5 integral membrane proteins and 74% for 11 α-class soluble proteins. Most residues found in transmembrane segments of membrane proteins with known α-helical structure are predicted to be indeed in the helical conformation because of very high middle helix preferences. Both extramembrane and transmembrane helices in the photosynthetic reaction center M and L subunits are correctly predicted. We point out in the discussion that our method of conformational preference functions can identify what physical properties of the amino acids are important in the formation of particular secondary structure elements. © 1993 John Wiley & Sons, Inc.  相似文献   

5.
Adamian L  Liang J 《Proteins》2002,47(2):209-218
Polar and ionizable amino acid residues are frequently found in the transmembrane (TM) regions of membrane proteins. In this study, we show that they help to form extensive hydrogen bond connections between TM helices. We find that almost all TM helices have interhelical hydrogen bonding. In addition, we find that a pair of contacting TM helices is packed tighter when there are interhelical hydrogen bonds between them. We further describe several spatial motifs in the TM regions, including "Polar Clamp" and "Serine Zipper," where conserved Ser residues coincide with tightly packed locations in the TM region. With the examples of halorhodopsin, calcium-transporting ATPase, and bovine cytochrome c oxidase, we discuss the roles of hydrogen bonds in stabilizing helical bundles in polytopic membrane proteins and in protein functions.  相似文献   

6.
7.
Wang J  Feng JA 《Protein engineering》2003,16(11):799-807
This paper reports an extensive sequence analysis of the alpha-helices of proteins. alpha-Helices were extracted from the Protein Data Bank (PDB) and were divided into groups according to their sizes. It was found that some amino acids had differential propensity values for adopting helical conformation in short, medium and long alpha-helices. Pro and Trp had a significantly higher propensity for helical conformation in short helices than in medium and long helices. Trp was the strongest helix conformer in short helices. Sequence patterns favoring helical conformation were derived from a neighbor-dependent sequence analysis of proteins, which calculated the effect of neighboring amino acid type on the propensity of residues for adopting a particular secondary structure in proteins. This method produced an enhanced statistical significance scale that allowed us to explore the positional preference of amino acids for alpha-helical conformations. It was shown that the amino acid pair preference for alpha-helix had a unique pattern and this pattern was not always predictable by assuming proportional contributions from the individual propensity values of the amino acids. Our analysis also yielded a series of amino acid dyads that showed preference for alpha-helix conformation. The data presented in this study, along with our previous study on loop sequences of proteins, should prove useful for developing potential 'codes' for recognizing sequence patterns that are favorable for specific secondary structural elements in proteins.  相似文献   

8.
A novel helix-coil transition theory has been developed. This new theory contains more types of interactions than similar theories developed earlier. The parameters of the models were obtained from a database of 351 nonhomologous proteins. No manual adjustment of the parameters was performed. The interaction parameters obtained in this manner were found to be physically meaningful, consistent with current understanding of helix stabilizing/destabilizing interactions. Novel insights into helix stabilizing/destabilizing interactions have also emerged from this analysis. The theory developed here worked well in sorting out helical residues from amino acid sequences. If the theory was forced to make prediction on every residue of a given amino acid sequence, its performance was the best among ten other secondary structural prediction algorithms in distinguishing helical residues from nonhelical ones. The theory worked even better if one only required it to make prediction on residues that were “predictable” (identifiable by the theory); >90% predictive reliability could be achieved. The helical residues or segments identified by the helix-coil transition theory can be used as secondary structural contraints to speed up the prediction of the three-dimensional structure of a protein by reducing the dimension of a computational protein folding problem. Possible further improvements of this helix-coil transition theory are also discussed. Proteins 28:344–359, 1997. © 1997 Wiley-Liss, Inc.  相似文献   

9.
10.
Hanson RM  Kohler D  Braun SG 《Proteins》2011,79(7):2172-2180
We describe here definitions of "local helical axis" and "straightness" that are developed using a simple quaternion-based analysis of protein structure without resort to least-squares fitting. As part of this analysis, it is shown how quaternion differences can be visualized to depict accurately the local helical axis relating any two adjacent amino acid residues in standard, nonidealized proteins. Three different options for the definition of amino acid residue orientation in terms of quaternion frames are described. Two of these, the "C(α) frame" and the "P frame," are shown to be correlated strongly with a simple approximate measure derived solely from Ramachandran angles. The relationship between quaternion-based straightness and recognized DSSP-derived secondary structure motifs is discussed.  相似文献   

11.
Secondary structure of tobacco mosaic virus protein   总被引:1,自引:0,他引:1  
A set of rules is proposed for the prediction of α-helices in proteins. These rules lead to the correct assignment to either helical or non-helical regions of over 80% of the amino acid residues in the proteins from which they are derived. Applied to tobacco mosaic virus protein these rules lead to the prediction of five α-helical regions which may be consistent with other data.  相似文献   

12.
An amino acid motif was identified that consists of the sequence HisHydrHisHydrHydrHydr (Hydr--bulky hydrophobic residue) and is conserved in two vast classes of proteins, one of which is involved in initiation and termination of rolling circle DNA replication, or RCR (Rep proteins), and the other in mobilization (conjugal transfer) of plasmid DNA (Mob proteins). Based on analogies with metalloenzymes, it is hypothesized that the two conserved His residues in this motif may be involved in metal ion coordination required for the activity of the Rep and Mob proteins. Rep proteins contained two additional conserved motifs, one of which was located upstream, and the other downstream from the 'two His' motif. The C-terminal motif encompassed the Tyr residue(s) forming the covalent link with nicked DNA. Mob proteins were characterized by the opposite orientation of the conserved motifs, with the (putative) DNA-linking Tyr being located near their N-termini. Both Rep and Mob protein classes further split into several distinct families. Although it was not possible to find a motif or pattern that would be unique for the entire Rep or Mob class, unique patterns were derived for large subsets of the proteins of each class. These observations allowed the prediction of the amino acid residues involved in DNA nicking, which is required for the initiation of RCR or conjugal transfer of single-stranded (ss) DNA, in Rep and Mob proteins encoded by a number of replicons of highly diverse size, structure and origin. It is conjectured that recombination has played a major part in the dissemination of genes encoding related Rep or Mob proteins among the replicons exploiting RCR. It is speculated that the eucaryotic small ssDNA replicons encoding proteins with the conserved RCR motifs and replicating via RCR-related mechanisms, such as geminiviruses and parvoviruses, may have evolved from eubacterial replicons.  相似文献   

13.
Membrane proteins span a large variety of different functions such as cell-surface receptors, redox proteins, ion channels, and transporters. Proteins with functional pores show different characteristics of helix-helix packing as other helical membrane proteins. We found that the helix-helix contacts of 13 nonhomologous high-resolution structures of membrane channels and transporters are mainly accomplished by weakly polar amino acids (G > S > T > F) that preferably create contacts every fourth residue, typical for right-handed helix crossings. There is a strong correlation between the now available biological hydrophobicity scale and the propensities of the weakly polar and hydrophobic residues to be buried at helix-helix interfaces or to be exposed to the lipids in membrane channels and transporters. The polar residues, however, make no major contribution towards the packing of their transmembrane helices, and are therefore subsumed to be primarily exposed to the polar milieu during the folding process. The contact formation of membrane channels and transporters is therefore ruled by the solubility of the residues, which we suppose to be the driving force for the assembly of their transmembrane helices. By contrast, in 14 nonhomologous high-resolution structures of other membrane protein coils, also large and polar amino acids (D > S > M > Q) create characteristic contacts every 3.5th residues, which is a signature for left-handed helix crossings. Accordingly, it seems that dependent on the function, different concepts of folding and stabilization are realized for helical membrane proteins. Using a sequence-based matrix prediction method these differences are exploited to improve the prediction of buried and exposed residues of transmembrane helices significantly. When the sequence motifs typical for membrane channels and transporters were applied for the prediction of helix-helix contacts the quality of prediction rises by 16% to an average value of 76%, compared to the same approach when only single amino acid positions are taken into account.  相似文献   

14.
15.
Secondary structure prediction from amino acid sequence is a key component of protein structure prediction, with current accuracy at approximately 75%. We analysed two state-of-the-art secondary structure prediction methods, PHD and JPRED, comparing predictions with secondary structure assigned by the algorithms DSSP and STRIDE. The specific focus of our study was alpha-helix N-termini, as empirical free energy scales are available for residue preferences at N-terminal positions. Although these prediction methods perform well in general at predicting the alpha-helical locations and length distributions in proteins, they perform less well at predicting the correct helical termini. For example, although most predicted alpha-helices overlap a real alpha-helix (with relatively few completely missed or extra predicted helices), only one-third of JPRED and PHD predictions correctly identify the N-terminus. Analysis of neighbouring N-terminal sequences to predicted helical N-termini shows that the correct N-terminus is often within one or two residues. More importantly, the true N-terminal motif is, on average, more favourable as judged by our experimentally measured free energies. This suggests a simple, but powerful, strategy to improve secondary structure prediction using empirically derived energies to adjust the predicted output to a more favourable N-terminal sequence.  相似文献   

16.
We have identified four repeats and five domains that are novel in proteins encoded by the Pyrobaculum aerophilum str. IM2 proteome using automated in silico methods. A "repeat" corresponds to a region comprising less than 55 amino acid residues that occurs more than once in the protein sequence and sometimes present in tandem. A "domain" corresponds to a conserved region comprising greater than 55 amino acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 85 amino acid residues AAG domain, (2) 72 amino acid residues GFGN domain, (3) 43 amino acid residues KGG repeat, (4) 25 amino acid residues RWE repeat, (5) 25 amino acid residues RID repeat, (6) 108 amino acid residues NDFA domain, (7) 140 amino acid residues VxY domain, (8) 35 amino acid residues LLPN repeat and (9) 98 amino acid residues GxY domain. A repeat or domain is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure.  相似文献   

17.
18.
An algorithm was derived to relate the amino acid sequence of a collagen triple helix to its thermal stability. This calculation is based on the triple helical stabilization propensities of individual residues and their intermolecular and intramolecular interactions, as quantitated by melting temperature values of host-guest peptides. Experimental melting temperature values of a number of triple helical peptides of varying length and sequence were successfully predicted by this algorithm. However, predicted T(m) values are significantly higher than experimental values when there are strings of oppositely charged residues or concentrations of like charges near the terminus. Application of the algorithm to collagen sequences highlights regions of unusually high or low stability, and these regions often correlate with biologically significant features. The prediction of stability from sequence indicates an understanding of the major forces maintaining this protein motif. The use of highly favorable KGE and KGD sequences is seen to complement the stabilizing effects of imino acids in modulating stability and may become dominant in the collagenous domains of bacterial proteins that lack hydroxyproline. The effect of single amino acid mutations in the X and Y positions can be evaluated with this algorithm. An interactive collagen stability calculator based on this algorithm is available online.  相似文献   

19.
Structural genomics projects as well as ab initio protein structure prediction methods provide structures of proteins with no sequence or fold similarity to proteins with known functions. These are often low-resolution structures that may only include the positions of C alpha atoms. We present a fast and efficient method to predict DNA-binding proteins from just the amino acid sequences and low-resolution, C alpha-only protein models. The method uses the relative proportions of certain amino acids in the protein sequence, the asymmetry of the spatial distribution of certain other amino acids as well as the dipole moment of the molecule. These quantities are used in a linear formula, with coefficients derived from logistic regression performed on a training set, and DNA-binding is predicted based on whether the result is above a certain threshold. We show that the method is insensitive to errors in the atomic coordinates and provides correct predictions even on inaccurate protein models. We demonstrate that the method is capable of predicting proteins with novel binding site motifs and structures solved in an unbound state. The accuracy of our method is close to another, published method that uses all-atom structures, time-consuming calculations and information on conserved residues.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号