首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
The influence of sixteen base triplet changes at a single position within a pur.pur.pyr triple helix was examined by affinity cleaving. For the 15 base pair target site studied here, G.GC, A.AT and T.AT triplets stabilize a triple helix to a greater extent than the other 13 natural triplets (pH = 7.4, 25 degrees C). Weaker interactions were detected for the C.AT, A.GC and T.CG triplets. The absence of specific, highly stabilizing interactions between third strand bases and the CG or TA base pairs demonstrates a current sequence limitation to formation of this structure. Models for the two dimensional base triplet interactions for all possible 16 natural triplets are presented.  相似文献   

3.
For applications such as comparative modelling one major issue is the reliability of sequence alignments. Reliable regions in alignments can be predicted using sub-optimal alignments of the same pair of sequences. Here we show that reliable regions in alignments can also be predicted from multiple sequence profile information alone.Alignments were created for a set of remotely related pairs of proteins using five different test methods. Structural alignments were used to assess the quality of the alignments and the aligned positions were scored using information from the observed frequencies of amino acid residues in sequence profiles pre-generated for each template structure. High-scoring regions of these profile-derived alignment scores were a good predictor of reliably aligned regions.These profile-derived alignment scores are easy to obtain and are applicable to any alignment method. They can be used to detect those regions of alignments that are reliably aligned and to help predict the quality of an alignment. For those residues within secondary structure elements, the regions predicted as reliably aligned agreed with the structural alignments for between 92% and 97.4% of the residues. In loop regions just under 92% of the residues predicted to be reliable agreed with the structural alignments. The percentage of residues predicted as reliable ranged from 32.1% for helix residues to 52.8% for strand residues.This information could also be used to help predict conserved binding sites from sequence alignments. Residues in the template that were identified as binding sites, that aligned to an identical amino acid residue and where the sequence alignment agreed with the structural alignment were in highly conserved, high scoring regions over 80% of the time. This suggests that many binding sites that are present in both target and template sequences are in sequence-conserved regions and that there is the possibility of translating reliability to binding site prediction.  相似文献   

4.
5.
To help elucidate the role of secondary structure packing preferences in protein folding, here we present an analysis of the packing geometry observed between alpha-helices and between alpha-helices and beta-sheets in 1316 diverse, nonredundant protein structures. Finite-length vectors were fit to the alpha-carbon atoms in each of the helices and strands, and the packing angle between the vectors, Omega, was determined at the closest point of approach within each helix-helix or helix-sheet pair. Helix-sheet interactions were found in 391 of the proteins, and the distributions of Omega values were calculated for all the helix-sheet and helix-helix interactions. The packing angle preferences for helix-helix interactions are similar to those previously observed. However, analysis of helix-strand packing preferences uncovered a remarkable tendency for helices to align antiparallel to parallel regions of beta-sheets, independent of the topological constraints or prevalence of beta-alpha-beta motifs in the proteins. This packing angle preference is significantly diminished in helix interactions involving mixed and antiparallel beta-sheets, suggesting a role for helix-sheet dipole alignment in guiding supersecondary structure formation in protein folding. This knowledge of preferred packing angles can be used to guide the engineering of stable protein modules.  相似文献   

6.
Triple-helical DNA shows increasing potential for applications in the control of gene expression (including therapeutics) and the development of sequence-specific DNA-cleaving agents. The major limitation in this technology has been the requirement of homopurine sequences for triplex formation. We describe a simple approach that relaxes this requirement, by utilizing both Pu.PuPy and Py.PuPy base triplets to form a continuous DNA triple helix at tandem oligopurine and oligopyrimidine tracts. [Triplex formation at such a sequence has been previously demonstrated only with the use of a special 3'-3' linkage in the third strand [Horne, D. A., & Dervan, P. B. (1990) J. Am. Chem. Soc. 112, 2435-2437].] Supporting evidence is from chemical probing experiments performed on several oligonucleotides designed to form 3-stranded fold-back structures. The third strand, consisting of both purine and pyrimidine blocks, pairs with purines in the Watson-Crick duplex, switching strands at the junction between the oligopurine and oligopyrimidine blocks but maintaining the required strand polarity without any special linkage. Although Mg2+ ions are not required for the formation of Pu.PuPy base triplets, they show enhanced stability in the presence of Mg2+. In the sequences observed. A.AT triplets appear to be more stable than G.GC triplets. As expected, triplex formation is largely independent of pH unless C+.GC base triplets are required.  相似文献   

7.
McGuffin LJ  Jones DT 《Proteins》2003,52(2):166-175
If secondary structure predictions are to be incorporated into fold recognition methods, an assessment of the effect of specific types of errors in predicted secondary structures on the sensitivity of fold recognition should be carried out. Here, we present a systematic comparison of different secondary structure prediction methods by measuring frequencies of specific types of error. We carry out an evaluation of the effect of specific types of error on secondary structure element alignment (SSEA), a baseline fold recognition method. The results of this evaluation indicate that missing out whole helix or strand elements, or predicting the wrong type of element, is more detrimental than predicting the wrong lengths of elements or overpredicting helix or strand. We also suggest that SSEA scoring is an effective method for assessing accuracy of secondary structure prediction and perhaps may also provide a more appropriate assessment of the "usefulness" and quality of predicted secondary structure, if secondary structure alignments are to be used in fold recognition.  相似文献   

8.
Two triple helix structures (15-mers containing only T.A-T triplets or containing mixed T.A-T and C.G-C triplets) have been studied by uranyl mediated DNA photocleavage to probe the accessibility of the phosphates of the DNA backbone. Whereas the phosphates of the pyrimidine strand are at least as accessible as in double stranded DNA, in the phosphates of the purine strand are partly shielded and more so at the 5'-end of the strand. With the homo A/T target increased cleavage is observed towards the 3'-end on the pyrimidine strand. These results show that the third strand is asymmetrically positioned along the groove with the tightest triple strand double strand interactions at the 5'-end of the third strand. The results also indicate that homo-A versus mixed A/G 'Hoogsteen-triple helices' have different structures.  相似文献   

9.
Exclusion of RNA strands from a purine motif triple helix.   总被引:5,自引:5,他引:0       下载免费PDF全文
Research concerning oligonucleotide-directed triple helix formation has mainly focused on the binding of DNA oligonucleotides to duplex DNA. The participation of RNA strands in triple helices is also of interest. For the pyrimidine motif (pyrimidine.purine.pyrimidine triplets), systematic substitution of RNA for DNA in one, two, or all three triplex strands has previously been reported. For the purine motif (purine.purine.pyrimidine triplets), studies have shown only that RNA cannot bind to duplex DNA. To extend this result, we created a DNA triple helix in the purine motif and systematically replaced one, two, or all three strands with RNA. In dramatic contrast to the general accommodation of RNA strands in the pyrimidine triple helix motif, a stable triplex forms in the purine motif only when all three of the substituent strands are DNA. The lack of triplex formation among any of the other seven possible strand combinations involving RNA suggests that: (i) duplex structures containing RNA cannot be targeted by DNA oligonucleotides in the purine motif; (ii) RNA strands cannot be employed to recognize duplex DNA in the purine motif; and (iii) RNA tertiary structures are likely to contain only isolated base triplets in the purine motif.  相似文献   

10.
SUMMARY: TargetFinder is a PC/Windows program for interactive effective antisense oligonucleotide (AO) selection based on mRNA accessible site tagging (MAST) and secondary structures of target mRNA. To make MAST result intuitive, both the alignment result and tag frequency profile is illustrated. As theoretical reference, secondary structure and single strand probability profile of target mRNA is also represented. All of these sequences and profiles are displayed in aligned mode, which facilitates identification of the accessible sites in target mRNA. Graphical, user-friendly interface makes TargetFinder a useful tool in AO target site selection. AVAILABILITY: The software is freely available at http://www.bioit.org.cn/ao/targetfinder.htm CONTACT: sqwang@nic.bmi.ac.cn.  相似文献   

11.

Background  

A number of methods are now available to perform automatic assignment of periodic secondary structures from atomic coordinates, based on different characteristics of the secondary structures. In general these methods exhibit a broad consensus as to the location of most helix and strand core segments in protein structures. However the termini of the segments are often ill-defined and it is difficult to decide unambiguously which residues at the edge of the segments have to be included. In addition, there is a "twilight zone" where secondary structure segments depart significantly from the idealized models of Pauling and Corey. For these segments, one has to decide whether the observed structural variations are merely distorsions or whether they constitute a break in the secondary structure.  相似文献   

12.
Conformational changes in proteins often involve secondary structure transitions. Such transitions can be divided into two types: disorder‐to‐order changes, in which a disordered segment acquires an ordered secondary structure (e.g., disorder to α‐helix, disorder to β‐strand), and order‐to‐order changes, where a segment switches from one ordered secondary structure to another (e.g., α‐helix to β‐strand, α‐helix to turn). In this study, we explore the distribution of these transitions in the proteome. Using a comprehensive, yet highly conservative method, we compared solved three‐dimensional structures of identical protein sequences, looking for differences in the secondary structures with which they were assigned. Protein chains in which such secondary structure transitions were detected, were classified into two sets according to the type of transition that is involved (disorder‐to‐order or order‐to‐order), allowing us to characterize each set by examining enrichment of gene ontology terms. The results reveal that the disorder‐to‐order set is significantly enriched with nucleotide binding proteins, whereas the order‐to‐order set is more diverse. Remarkably, further examination reveals that >22% of the purine nucleotide binding proteins include segments which undergo disorder‐to‐order transitions, suggesting that such transitions play an important role in this process. Proteins 2010. © 2009 Wiley‐Liss, Inc.  相似文献   

13.
SRP-RNA sequence alignment and secondary structure.   总被引:35,自引:21,他引:14       下载免费PDF全文
The secondary structures of the RNAs from the signal recognition particle, termed SRP-RNA, were derived buy comparative analyses of an alignment of 39 sequences. The models are minimal in that only base pairs are included for which there is comparative evidence. The structures represent refinements of earlier versions and include a new short helix.  相似文献   

14.
We examine how effectively simple potential functions previously developed can identify compatibilities between sequences and structures of proteins for database searches. The potential function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures, all of which were estimated from statistical preferences observed in known protein structures. Each potential energy term was modified to represent compatibilities between sequences and structures for globular proteins. Pairwise contact interactions in a sequence-structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. The results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold, and yield mostly similar alignments for these two types of sequence and structure pairs. Probability alignments consisting of most reliable site pairs only can yield extremely small root mean square deviations, and including less reliable pairs increases the deviations. Also, it is observed that secondary structure potentials are usefully complementary to yield improved alignments with this method. Remarkably, by this method some individual sequence-structure pairs are detected having only 5-20% sequence identity.  相似文献   

15.
The conformational propensities of unfolded states of apomyoglobin have been investigated by measurement of residual dipolar couplings between (15)N and (1)H in backbone amide groups. Weak alignment of apomyoglobin in acid and urea-unfolded states was induced with both stretched and compressed polyacrylamide gels. In 8 M urea solution at pH 2.3, conditions under which apomyoglobin contains no detectable secondary or tertiary structure, significant residual dipolar couplings of uniform sign were observed for all residues. At pH 2.3 in the absence of urea, a change in the magnitude and/or sign of the residual dipolar couplings occurs in local regions of the polypeptide where there is a high propensity for helical secondary structure. These results are interpreted on the basis of the statistical properties of the unfolded polypeptide chain, viewed as a polymer of statistical segments. For a folded protein, the magnitude and sign of the residual dipolar couplings depend on the orientation of each bond vector relative to the alignment tensor of the entire molecule, which reorients as a single entity. For unfolded proteins, there is no global alignment tensor; instead, residual dipolar couplings are attributed to alignment of the statistical segments or of transient elements of secondary structure. For apomyoglobin in 8 M urea, the backbone is highly extended, with phi and psi dihedral angles favoring the beta or P(II) regions. Each statistical segment has a highly anisotropic shape, with the N-H bond vectors approximately perpendicular to the long axis, and becomes weakly aligned in the anisotropic environment of the strained acrylamide gels. Local regions of enhanced flexibility or chain compaction are characterized by a decrease in the magnitude of the residual dipolar couplings. The formation of a small population of helical structure in the acid-denatured state of apomyoglobin leads to a change in sign of the residual dipolar couplings in local regions of the polypeptide; the population of helix estimated from the residual dipolar couplings is in excellent agreement with that determined from chemical shifts. The alignment model described here for apomyoglobin can also explain the pattern of residual dipolar couplings reported previously for denatured states of staphylococcal nuclease and other proteins. In conjunction with other NMR experiments, residual dipolar couplings can provide valuable insights into the dynamic conformational propensities of unfolded and partly folded states of proteins and thereby help to chart the upper reaches of the folding landscape.  相似文献   

16.
Homaeian L  Kurgan LA  Ruan J  Cios KJ  Chen K 《Proteins》2007,69(3):486-498
Secondary protein structure carries information about local structural arrangements, which include three major conformations: alpha-helices, beta-strands, and coils. Significant majority of successful methods for prediction of the secondary structure is based on multiple sequence alignment. However, multiple alignment fails to provide accurate results when a sequence comes from the twilight zone, that is, it is characterized by low (<30%) homology. To this end, we propose a novel method for prediction of secondary structure content through comprehensive sequence representation, called PSSC-core. The method uses a multiple linear regression model and introduces a comprehensive feature-based sequence representation to predict amount of helices and strands for sequences from the twilight zone. The PSSC-core method was tested and compared with two other state-of-the-art prediction methods on a set of 2187 twilight zone sequences. The results indicate that our method provides better predictions for both helix and strand content. The PSSC-core is shown to provide statistically significantly better results when compared with the competing methods, reducing the prediction error by 5-7% for helix and 7-9% for strand content predictions. The proposed feature-based sequence representation uses a comprehensive set of physicochemical properties that are custom-designed for each of the helix and strand content predictions. It includes composition and composition moment vectors, frequency of tetra-peptides associated with helical and strand conformations, various property-based groups like exchange groups, chemical groups of the side chains and hydrophobic group, auto-correlations based on hydrophobicity, side-chain masses, hydropathy, and conformational patterns for beta-sheets. The PSSC-core method provides an alternative for predicting the secondary structure content that can be used to validate and constrain results of other structure prediction methods. At the same time, it also provides useful insight into design of successful protein sequence representations that can be used in developing new methods related to prediction of different aspects of the secondary protein structure.  相似文献   

17.
In computational structural biology, structure comparison is fundamental for our understanding of proteins. Structure comparison is, e.g., algorithmically the starting point for computational studies of structural evolution and it guides our efforts to predict protein structures from their amino acid sequences. Most methods for structural alignment of protein structures optimize the distances between aligned and superimposed residue pairs, i.e., the distances traveled by the aligned and superimposed residues during linear interpolation. Considering such a linear interpolation, these methods do not differentiate if there is room for the interpolation, if it causes steric clashes, or more severely, if it changes the topology of the compared protein backbone curves. To distinguish such cases, we analyze the linear interpolation between two aligned and superimposed backbones. We quantify the amount of steric clashes and find all self-intersections in a linear backbone interpolation. To determine if the self-intersections alter the protein’s backbone curve significantly or not, we present a path-finding algorithm that checks if there exists a self-avoiding path in a neighborhood of the linear interpolation. A new path is constructed by altering the linear interpolation using a novel interpretation of Reidemeister moves from knot theory working on three-dimensional curves rather than on knot diagrams. Either the algorithm finds a self-avoiding path or it returns a smallest set of essential self-intersections. Each of these indicates a significant difference between the folds of the aligned protein structures. As expected, we find at least one essential self-intersection separating most unknotted structures from a knotted structure, and we find even larger motions in proteins connected by obstruction free linear interpolations. We also find examples of homologous proteins that are differently threaded, and we find many distinct folds connected by longer but simple deformations. TM-align is one of the most restrictive alignment programs. With standard parameters, it only aligns residues superimposed within 5 Ångström distance. We find 42165 topological obstructions between aligned parts in 142068 TM-alignments. Thus, this restrictive alignment procedure still allows topological dissimilarity of the aligned parts. Based on the data we conclude that our program ProteinAlignmentObstruction provides significant additional information to alignment scores based solely on distances between aligned and superimposed residue pairs.  相似文献   

18.
Rai BK  Fiser A 《Proteins》2006,63(3):644-661
A major bottleneck in comparative protein structure modeling is the quality of input alignment between the target sequence and the template structure. A number of alignment methods are available, but none of these techniques produce consistently good solutions for all cases. Alignments produced by alternative methods may be superior in certain segments but inferior in others when compared to each other; therefore, an accurate solution often requires an optimal combination of them. To address this problem, we have developed a new approach, Multiple Mapping Method (MMM). The algorithm first identifies the alternatively aligned regions from a set of input alignments. These alternatively aligned segments are scored using a composite scoring function, which determines their fitness within the structural environment of the template. The best scoring regions from a set of alternative segments are combined with the core part of the alignments to produce the final MMM alignment. The algorithm was tested on a dataset of 1400 protein pairs using 11 combinations of two to four alignment methods. In all cases MMM showed statistically significant improvement by reducing alignment errors in the range of 3 to 17%. MMM also compared favorably over two alignment meta-servers. The algorithm is computationally efficient; therefore, it is a suitable tool for genome scale modeling studies.  相似文献   

19.
Zhang W  Dunker AK  Zhou Y 《Proteins》2008,71(1):61-67
How to make an objective assignment of secondary structures based on a protein structure is an unsolved problem. Defining the boundaries between helix, sheet, and coil structures is arbitrary, and commonly accepted standard assignments do not exist. Here, we propose a criterion that assesses secondary structure assignment based on the similarity of the secondary structures assigned to pairwise sequence-alignment benchmarks, where these benchmarks are determined by prior structural alignments of the protein pairs. This criterion is used to rank six secondary structure assignment methods: STRIDE, DSSP, SECSTR, KAKSI, P-SEA, and SEGNO with three established sequence-alignment benchmarks (PREFAB, SABmark, and SALIGN). STRIDE and KAKSI achieve comparable success rates in assigning the same secondary structure elements to structurally aligned residues in the three benchmarks. Their success rates are between 1-4% higher than those of the other four methods. The consensus of STRIDE, KAKSI, SECSTR, and P-SEA, called SKSP, improves assignments over the best single method in each benchmark by an additional 1%. These results support the usefulness of the sequence-alignment benchmarks as a means to evaluate secondary structure assignment. The SKSP server and the benchmarks can be accessed at http://sparks.informatics.iupui.edu  相似文献   

20.
The secondary structure of some protein segments may vary between α‐helix and β‐strand. To predict these switchable segments, we have developed an algorithm, Switch‐P, based solely on the protein sequence. This algorithm was used on the extracellular parts of FGF receptors. For FGFR2, it predicted that β4 and β5 strands of the third Ig‐like domain were highly switchable. These two strands possess a high number of somatic mutations associated with cancer. Analysis of PDB structures of FGF receptors confirmed the switchability prediction for β5. We thus evaluated if compound‐driven α‐helix/β‐strand switching of β5 could modulate FGFR2 signaling. We performed the virtual screening of a library containing 1.4 million of chemical compounds with two models of the third Ig‐like domain of FGFR2 showing different secondary structures for β5, and we selected 32 compounds. Experimental testing using proliferation assays with FGF7‐stimulated SNU‐16 cells and a FGFR2‐dependent Erk1/2 phosphorylation assay with FGFR2‐transfected L6 cells, revealed activators and inhibitors of FGFR2. Our method for the identification of switchable proteinic regions, associated with our virtual screening approach, provides an opportunity to discover new generation of drugs with under‐explored mechanism of action. Proteins 2014; 82:2982–2997. © 2014 Wiley Periodicals, Inc.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号