首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Dihydrofolate reductase (DHFR) is of significant recent interest as a target for drugs against parasitic and opportunistic infections. Understanding factors which influence DHFR homolog inhibitor specificity is critical for the design of compounds that selectively target DHFRs from pathogenic organisms over the human homolog. This paper presents a novel approach for predicting residues involved in ligand discrimination in a protein family using DHFR as a model system. In this approach, the relationship between inhibitor specificity and amino acid composition for sets of protein homolog pairs is examined. Similar inhibitor specificity profiles correlate with increased sequence homology at specific alignment positions. Residue positions that exhibit the strongest correlations are predicted as specificity determinants. Correlation analysis requires a quantitative measure of similarity in inhibitor specificity (S(lig)) for a pair of homologs. To this end, a method of calculating S(lig) values using K(I) values for the two homologs against a set of inhibitors as input was developed. Correlation analysis of S(lig) values to amino acid sequence similarity scores - obtained via multiple sequence alignments - was performed for individual residue alignment positions and sets of residues on 13 DHFRs. Eighteen alignment positions were identified with a strong correlation of S(lig) to sequence similarity. Of these, three lie in the active site; four are located proximal to the active site, four are clustered together in the adenosine binding domain and five on the βFβG loop. The validity of the method is supported by agreement between experimental findings and current predictions involving active site residues.  相似文献   

2.
The information required to generate a protein structure is contained in its amino acid sequence, but how three-dimensional information is mapped onto a linear sequence is still incompletely understood. Multiple structure alignments of similar protein structures have been used to investigate conserved sequence features but contradictory results have been obtained, due, in large part, to the absence of subjective criteria to be used in the construction of sequence profiles and in the quantitative comparison of alignment results. Here, we report a new procedure for multiple structure alignment and use it to construct structure-based sequence profiles for similar proteins. The definition of "similar" is based on the structural alignment procedure and on the protein structural distance (PSD) described in paper I of this series, which offers an objective measure for protein structure relationships. Our approach is tested in two well-studied groups of proteins; serine proteases and Ig-like proteins. It is demonstrated that the quality of a sequence profile generated by a multiple structure alignment is quite sensitive to the PSD used as a threshold for the inclusion of proteins in the alignment. Specifically, if the proteins included in the aligned set are too distant in structure from one another, there will be a dilution of information and patterns that are relevant to a subset of the proteins are likely to be lost.In order to understand better how the same three-dimensional information can be encoded in seemingly unrelated sequences, structure-based sequence profiles are constructed for subsets of proteins belonging to nine superfolds. We identify patterns of relatively conserved residues in each subset of proteins. It is demonstrated that the most conserved residues are generally located in the regions where tertiary interactions occur and that are relatively conserved in structure. Nevertheless, the conservation patterns are relatively weak in all cases studied, indicating that structure-determining factors that do not require a particular sequential arrangement of amino acids, such as secondary structure propensities and hydrophobic interactions, are important in encoding protein fold information. In general, we find that similar structures can fold without having a set of highly conserved residue clusters or a well-conserved sequence profile; indeed, in some cases there is no apparent conservation pattern common to structures with the same fold. Thus, when a group of proteins exhibits a common and well-defined sequence pattern, it is more likely that these sequences have a close evolutionary relationship rather than the similarities having arisen from the structural requirements of a given fold.  相似文献   

3.
4.
Recent reports of the slow-tight binding inhibition of bovine liver dihydrofolate reductase (bDHFR) in the presence of polyphenols isolated from green tea leaves has spurred renewed interest in the biochemical properties of bDHFR. Earlier studies were done with native bDHFR but in order to validate models of polyphenol binding to bDHFR, larger quantities of bDHFR are necessary to support structural studies. Bovine DHFR differs from its closest sequence homologue, murine DHFR, by 19 amino acids. To obtain the bDHFR cDNA, murineDHFR cDNA was transformed by a series of nested PCRs to reproduce the amino acid coding sequence for bovine DHFR. The bovine liver DHFR cDNA has an open reading frame of 561 base pairs encoding a protein of 187 amino acids that has a high level of conservation at the primary sequence level with other DHFR enzymes, and more so for the amino acid residues in the active site of the mammalian DHFR enzymes. Expression of the bovine DHFR cDNA in bacterial cells produced a stable recombinant protein with high enzymatic activity and kinetic properties similar to those previously reported for the native protein.  相似文献   

5.
Tai N  Ding Y  Schmitz JC  Chu E 《Nucleic acids research》2002,30(20):4481-4488
Previous studies have shown that human dihydrofolate reductase (DHFR) acts as an RNA-binding protein, in which it binds to its own mRNA and, in so doing, results in translational repression. In this study, we used RNA gel mobility shift and nitrocellulose filter-binding assays to further investigate the specificity of the interaction between human DHFR protein and human DHFR mRNA. Site-directed mutagenesis was used to identify the critical amino acid residues on DHFR protein required for RNA recognition. Human His-Tag DHFR protein specifically binds to human DHFR mRNA, while unrelated proteins including thymidylate synthase, p53 and glutathione-S-transferase were unable to form a ribonucleoprotein complex with DHFR mRNA. The Cys6 residue is essential for RNA recognition, as mutation at this amino acid with either an alanine (C6A) or serine (C6S) residue almost completely abrogated RNA-binding activity. Neither one of the cysteine mutant proteins was able to repress the in vitro translation of human DHFR mRNA. Mutations at amino acids Ile7, Arg28 and Phe34, significantly reduced RNA-binding activity. An RNA footprinting analysis identified three different RNA sequences, bound to DHFR protein, ranging in size from 16 to 45 nt, while a UV cross-linking analysis isolated an ~16 nt RNA sequence bound to DHFR. These studies begin to identify the critical amino acid residues on human DHFR that mediate RNA binding either through forming direct contact points with RNA or through maintaining the protein in an optimal structure that allows for the critical RNA-binding domain to be accessible.  相似文献   

6.
Prediction of protein catalytic residues provides useful information for the studies of protein functions. Most of the existing methods combine both structure and sequence information but heavily rely on sequence conservation from multiple sequence alignments. The contribution of structure information is usually less than that of sequence conservation in existing methods. We found a novel structure feature, residue side chain orientation, which is the first structure-based feature that achieves prediction results comparable to that of evolutionary sequence conservation. We developed a structure-based method, Enzyme Catalytic residue SIde-chain Arrangement (EXIA), which is based on residue side chain orientations and backbone flexibility of protein structure. The prediction that uses EXIA outperforms existing structure-based features. The prediction quality of combing EXIA and sequence conservation exceeds that of the state-of-the-art prediction methods. EXIA is designed to predict catalytic residues from single protein structure without needing sequence or structure alignments. It provides invaluable information when there is no sufficient or reliable homology information for target protein. We found that catalytic residues have very special side chain orientation and designed the EXIA method based on the newly discovered feature. It was also found that EXIA performs well for a dataset of enzymes without any bounded ligand in their crystallographic structures.  相似文献   

7.
Wrabl JO  Grishin NV 《Proteins》2004,54(1):71-87
An algorithm was developed to locally optimize gaps from the FSSP database. Over 2 million gaps were identified from all versus all FSSP structure comparisons, and datasets of non-identical gaps and flanking regions comprising between 90,000 and 135,000 sequence fragments were extracted for statistical analysis. Relative to background frequencies, gaps were enriched in residue types with small side chains and high turn propensity (D, G, N, P, S), and were depleted in residue types with hydrophobic side chains (C, F, I, L, V, W, Y). In contrast, regions flanking a gap exhibited opposite trends in amino acid frequencies, i.e., enrichment in hydrophobic residues and a high degree of secondary structure. Log-odds scores of residue type as a function of position in or around a gap were derived from the statistics. Three simple experiments demonstrated that these scores contained significant predictive information. First, regions where gaps were observed in single sequences taken from HOMSTRAD structure-based multiple sequence alignments generally scored higher than regions where gaps were not observed. Second, given the correct pairwise-aligned cores, the actual positions of gaps could be reproduced from sequence more accurately using the structurally-derived statistics than by using random pairwise alignments. Finally, revision of the Clustal-W residue-specific gap opening parameters with this new information improved the agreement of Clustal-W alignments with the structure-based alignments. At least three applications for these results are envisioned: improvement of gap penalties in pairwise (or multiple) sequence alignment, prediction of regions of single sequences likely (or unlikely) to contain indels, and more accurate placement of gaps in automated pairwise structure alignment.  相似文献   

8.
An attempt has been made to build a model of human rhinovirus 2 (HRV2) based on the known human rhinovirus 14 (HRV14) structure. HRV2 was selected because its amino acid sequence is known and because it belongs to the minor rhinovirus receptor class as compared to HRV14, which belongs to the major class. Initial alignment of HRV2 with HRV14 based on the primary sequence and the knowledge of the three-dimensional structure of HRV14 showed that the most probable position of the majority of insertions and deletions occurred in the vicinity of the neutralizing immunogenic sites (NIm). Out of a total of 855 amino acids present in one copy of each of the capsid proteins VP1 through VP4 of HRV14, 411 are different between the two viruses. There are also 6 amino acid residues inserted and 14 residues deleted in HRV2 relative to HRV14. Examination of amino acid interactions showed several cases of conservation of function, e.g., salt bridges or the filling of restricted space. The largest variation amongst the residues lining the canyon, the putative receptor binding site, was in the carboxy-terminal residues of VP1.  相似文献   

9.
MOTIVATION: The prediction of ligand-binding residues or catalytically active residues of a protein may give important hints that can guide further genetic or biochemical studies. Existing sequence-based prediction methods mostly rank residue positions by evolutionary conservation calculated from a multiple sequence alignment of homologs. A problem hampering more wide-spread application of these methods is the low per-residue precision, which at 20% sensitivity is around 35% for ligand-binding residues and 20% for catalytic residues. RESULTS: We combine information from the conservation at each site, its amino acid distribution, as well as its predicted secondary structure (ss) and relative solvent accessibility (rsa). First, we measure conservation by how much the amino acid distribution at each site differs from the distribution expected for the predicted ss and rsa states. Second, we include the conservation of neighboring residues in a weighted linear score by analytically optimizing the signal-to-noise ratio of the total score. Third, we use conditional probability density estimation to calculate the probability of each site to be functional given its conservation, the observed amino acid distribution, and the predicted ss and rsa states. We have constructed two large data sets, one based on the Catalytic Site Atlas and the other on PDB SITE records, to benchmark methods for predicting functional residues. The new method FRcons predicts ligand-binding and catalytic residues with higher precision than alternative methods over the entire sensitivity range, reaching 50% and 40% precision at 20% sensitivity, respectively. AVAILABILITY: Server: http://frpred.tuebingen.mpg.de. Data sets: ftp://ftp.tuebingen.mpg.de/pub/protevo/FRpred/.  相似文献   

10.
The partial amino acid sequence of dihydrofolate reductase (DHFR, EC 1.5.1.3) from human KB/6b cells has been determined by using 3.5 mg of protein. Peptides covering the entire polypeptide chain were recovered from preparative peptide maps generated by the combination of paper chromatography and electrophoresis at pH 4.4 Peptide maps from mouse L1210 DHFR were also generated for comparison. Amino acid sequence of 75% of the 186 amino acid residues in the polypeptide chain of human KB/6b DHFR was obtained from Edman degradations and the remaining sequence was deduced from the amino acid compositions, from electrophoretic mobilities of related peptides and from the sequence homologies with other known mammalian DHFR sequences. A comparison of the proposed human DHFR sequence with the previously known sequences of mouse enzyme [Stone, et al. (1979) J. Biol. Chem. 245, 480-488] indicates that 18 differences are located in the established sequence of 139 residues and that 5 additional differences are in the tentative sequence of the remaining 47 amino acids. Kinetic properties of human KB/6b and mouse L1210 DHFR, which were determined in parallel experiments, are also compared. The possible structural-functional relationships between human and mouse DHFR are discussed.  相似文献   

11.
Cofactor-independent phosphoglycerate mutase (iPGM) has been previously identified as a member of the alkaline phosphatase (AlkP) superfamily of enzymes, based on the conservation of the predicted metal-binding residues. Structural alignment of iPGM with AlkP and cerebroside sulfatase confirmed that all these enzymes have a common core structure and revealed similarly located conserved Ser (in iPGM and AlkP) or Cys (in sulfatases) residues in their active sites. In AlkP, this Ser residue is phosphorylated during catalysis, whereas in sulfatases the active site Cys residues are modified to formylglycine and sulfatated. Similarly located Thr residue forms a phosphoenzyme intermediate in one more enzyme of the AlkP superfamily, alkaline phosphodiesterase/nucleotide pyrophosphatase PC-1 (autotaxin). Using structure-based sequence alignment, we identified homologous Ser, Thr, or Cys residues in other enzymes of the AlkP superfamily, such as phosphopentomutase, phosphoglycerol transferase, phosphonoacetate hydrolase, and GPI-anchoring enzymes (glycosylphosphatidylinositol phosphoethanolamine transferases) MCD4, GPI7, and GPI13. We predict that catalytical cycles of all the enzymes of AlkP superfamily include phosphoenzyme (or sulfoenzyme) intermediates.  相似文献   

12.
Analysis of catalytic residues in enzyme active sites   总被引:13,自引:0,他引:13  
We present an analysis of the residues directly involved in catalysis in 178 enzyme active sites. Specific criteria were derived to define a catalytic residue, and used to create a catalytic residue dataset, which was then analysed in terms of properties including secondary structure, solvent accessibility, flexibility, conservation, quaternary structure and function. The results indicate the dominance of a small set of amino acid residues in catalysis and give a picture of a general active site environment. It is hoped that this information will provide a better understanding of the molecular mechanisms involved in catalysis and a heuristic basis for predicting catalytic residues in enzymes of unknown function.  相似文献   

13.
R B Russell  G J Barton 《Proteins》1992,14(2):309-323
An algorithm is presented for the accurate and rapid generation of multiple protein sequence alignments from tertiary structure comparisons. A preliminary multiple sequence alignment is performed using sequence information, which then determines an initial superposition of the structures. A structure comparison algorithm is applied to all pairs of proteins in the superimposed set and a similarity tree calculated. Multiple sequence alignments are then generated by following the tree from the branches to the root. At each branchpoint of the tree, a structure-based sequence alignment and coordinate transformations are output, with the multiple alignment of all structures output at the root. The algorithm encoded in STAMP (STructural Alignment of Multiple Proteins) is shown to give alignments in good agreement with published structural accounts within the dehydrogenase fold domains, globins, and serine proteinases. In order to reduce the need for visual verification, two similarity indices are introduced to determine the quality of each generated structural alignment. Sc quantifies the global structural similarity between pairs or groups of proteins, whereas Pij' provides a normalized measure of the confidence in the alignment of each residue. STAMP alignments have the quality of each alignment characterized by Sc and Pij' values and thus provide a reproducible resource for studies of residue conservation within structural motifs.  相似文献   

14.
15.
16.
Suggestions for "safe" residue substitutions in site-directed mutagenesis   总被引:25,自引:0,他引:25  
The conserved topological structure observed in various molecular families such as globins or cytochromes c allows structural equivalencing of residues in every homologous structure and defines in a coherent way a global alignment in each sequence family. A search was performed for equivalent residue pairs in various topological families that were buried in protein cores or exposed at the protein surface and that had mutated but maintained similar unmutated environments. Amino acid residues with atoms in contact with the mutated residue pairs defined the environment. Matrices of preferred amino acid exchanges were then constructed and preferred or avoided amino acid substitutions deduced. Given the conserved atomic neighborhoods, such natural in vivo substitutions are subject to similar constrains as point mutations performed in site-directed mutagenesis experiments. The exchange matrices should provide guidelines for "safe" amino acid substitutions least likely to disturb the protein structure, either locally or in its overall folding pathway, and most likely to allow probing the structural and functional significance of the substituted site.  相似文献   

17.
The pattern of amino acid substitutions and sequence conservation over many structure-based alignments of protein sequences was analyzed as a function of percentage sequence identity. The statistics of the amino acid substitutions were converted into the form of log-odds amino acid substitution matrices to which eigenvalue decomposition was applied. It was found that the most important component of the substitution matrices exhibited a sharp transition at the sequence identity of 30-35%, which coincides with the twilight zone. Above the transition point, the most dominant component is related to the mutability of amino acids and it acts to disfavor any substitutions, whereas below the transition point, the most dominant component is related to the hydrophobicity of amino acids and substitutions between residues of similar hydrophobic character are positively favored. Implications for protein evolution and sequence analysis are discussed.  相似文献   

18.
For applications such as comparative modelling one major issue is the reliability of sequence alignments. Reliable regions in alignments can be predicted using sub-optimal alignments of the same pair of sequences. Here we show that reliable regions in alignments can also be predicted from multiple sequence profile information alone.Alignments were created for a set of remotely related pairs of proteins using five different test methods. Structural alignments were used to assess the quality of the alignments and the aligned positions were scored using information from the observed frequencies of amino acid residues in sequence profiles pre-generated for each template structure. High-scoring regions of these profile-derived alignment scores were a good predictor of reliably aligned regions.These profile-derived alignment scores are easy to obtain and are applicable to any alignment method. They can be used to detect those regions of alignments that are reliably aligned and to help predict the quality of an alignment. For those residues within secondary structure elements, the regions predicted as reliably aligned agreed with the structural alignments for between 92% and 97.4% of the residues. In loop regions just under 92% of the residues predicted to be reliable agreed with the structural alignments. The percentage of residues predicted as reliable ranged from 32.1% for helix residues to 52.8% for strand residues.This information could also be used to help predict conserved binding sites from sequence alignments. Residues in the template that were identified as binding sites, that aligned to an identical amino acid residue and where the sequence alignment agreed with the structural alignment were in highly conserved, high scoring regions over 80% of the time. This suggests that many binding sites that are present in both target and template sequences are in sequence-conserved regions and that there is the possibility of translating reliability to binding site prediction.  相似文献   

19.
A Taurog 《Biochimie》1999,81(5):557-562
Thyroid peroxidase is a member of a family of mammalian peroxidases that includes myeloperoxidase, lactoperoxidase, eosinophil peroxidase, and salivary peroxidase. Protein sequences showing a high degree of sequence similarity with mammalian peroxidases have recently been observed in several invertebrate species. A multiple sequence alignment prepared with five mammalian and six invertebrate peroxidases shows complete conservation of amino acid residues considered to be important in the formation of peroxidase compound 1. These include the distal and proximal histidines, a catalytic arginine residue, and an asparagine residue hydrogen bonded to the proximal histidine. TPO-2, an alternatively spliced form of TPO, lacks the essential asparagine (Asn 579). It is now possible to speak more broadly of the family of animal peroxidases, rather than mammalian peroxidases. The animal peroxidases comprise a group of homologous proteins that differ markedly from the plant/fungal/bacterial peroxidases in primary, secondary and tertiary structure, but which share with them a common function. Animal peroxidases probably arose independently of the plant/fungal/bacterial peroxidase superfamily and most likely belong to a different gene family. The relationship between animal and non-animal peroxidases probably represents an example of convergent evolution to a common enzymatic mechanism.  相似文献   

20.
Invariant sites are a common feature of amino acid sequence evolution. The presence of invariant sites is frequently attributed to the need to preserve function through site-specific conservation of amino acid residues. Amino acid substitution models without a provision for invariant sites often fit the data significantly worse than those that allow for an excess of invariant sites beyond those predicted by models that only incorporate rate variation among sites (e.g., a Gamma distribution). An alternative is epistasis between sites to preserve residue interactions that can create invariant sites. Through computer-simulated sequence evolution, we evaluated the relative effects of site-specific preferences and site-site couplings in the generation of invariant sites and the modulation of the rate of molecular evolution. In an analysis of ten major families of protein domains with diverse sequence and functional properties, we find that the negative selection imposed by epistasis creates many more invariant sites than site-specific residue preferences alone. Further, epistasis plays an increasingly larger role in creating invariant sites over longer evolutionary periods. Epistasis also dictates rates of domain evolution over time by exerting significant additional purifying selection to preserve site couplings. These patterns illuminate the mechanistic role of epistasis in the processes underlying observed site invariance and evolutionary rates.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号