Similar literature
 20 similar documents found
1.
In previous papers, a method of protein tertiary structure recognition was described based on the construction of an associative memory Hamiltonian, which encoded the amino acid sequence and the C alpha co-ordinates of a set of database proteins. Using molecular dynamics with simulated annealing, the ability of the Hamiltonian to recall the structure of a protein in the memory database was demonstrated, as long as the total number of database proteins did not exceed a characteristic value, called the capacity of the Hamiltonian, equal to 0.5N to 0.7N, where N is the number of amino acid residues in the protein to be recalled. In this paper, we describe the development of additional methods to increase the capacity of the Hamiltonian, including use of a more complete representation of the protein backbone and the incorporation of contextual information into the Hamiltonian through the use of secondary structure prediction. In addition, we further extend the ability of associative memory models to predict the tertiary structures of proteins not present in the protein data set, by making the Hamiltonian invariant with respect to biological symmetries that represent site mutations and insertions and deletions. The ability of the Hamiltonian to generalize from homologous proteins to an unknown protein in the presence of other unrelated proteins in the data set is demonstrated.

2.
Markovian models of protein evolution that relax the assumption of independent change among codons are considered. With this comparatively realistic framework, an evolutionary rate at a site can depend both on the state of the site and on the states of surrounding sites. By allowing a relatively general dependence structure among sites, models of evolution can reflect attributes of tertiary structure. To quantify the impact of protein structure on protein evolution, we analyze protein-coding DNA sequence pairs with an evolutionary model that incorporates effects of solvent accessibility and pairwise interactions among amino acid residues. By explicitly considering the relationship between nonsynonymous substitution rates and protein structure, this approach can lead to refined detection and characterization of positive selection. Analyses of simulated sequence pairs indicate that parameters in this evolutionary model can be well estimated. Analyses of lysozyme c and annexin V sequence pairs yield the biologically reasonable result that amino acid replacement rates are higher when the replacements lead to energetically favorable proteins than when they destabilize the proteins. Although the focus here is evolutionary dependence among codons that is associated with protein structure, the statistical approach is quite general and could be applied to diverse cases of evolutionary dependence where surrogates for sequence fitness can be measured or modeled.

3.
The library described here is a collection of phages with six degenerate codons in gene VIII, specifying amino acids 12, 13, 15-17 and 19 of the major coat protein. The randomized positions are surface exposed in the wild-type protein and thus might be expected to tolerate a great diversity of side chains without compromising phage viability. In agreement with this supposition, the new library showed great diversity of amino acids at the randomized positions and diversity did not diminish noticeably during repeated subculture. Despite their diversity, however, the randomized positions should be strongly constrained conformationally because they lie in an extended alpha-helical portion of the protein, stabilized by numerous inter- and intra-subunit contacts--a presupposition corroborated by circular dichroism spectroscopy of many library members. To reflect this conformational homogeneity and the fact that random amino acids subtend a major fraction of the surface 'landscape' of the particle, we call the new construct an alpha landscape library. It can be used as a source of alpha-helical ligands and substitute antibodies.

4.
5.
We suggest that tRNA actively participates in the transfer of 3D information from mRNA to peptides, in addition to its well-known, "classical" role of translating the three-letter RNA code into the one-letter protein code. The tRNA molecule displays a series of thermodynamically favored configurations during translation, a movement which places the codon and the coded amino acid in proximity to each other and makes physical contact between some amino acids and their codons possible. This specific codon-amino acid interaction of some selected amino acids is necessary for the transfer of spatial information from mRNA to coded proteins, and is known as RNA-assisted protein folding.

6.
Prokaryotic sequences are responsible for more than just protein coding. There are two 10- to 11-base periodical patterns superimposed on the protein coding message within the same sequence. Positional auto- and cross-correlation analysis of the sequences shows that these two patterns are a short-range counter-phase oscillation of AA and TT dinucleotides and a medium-range in-phase oscillation of the same dinucleotides, spanning distances of up to ∼30 and ∼100 bases, respectively. The short-range oscillation is encoded by the amino acid sequences themselves, apparently, due to the presence of amphipathic α-helices in the proteins. The medium-range oscillation, related to DNA folding in the cell, is created largely by a special choice of the bases in the third positions of the codons. Interestingly, the amino acid sequences do contribute to that signal as well. That is, the very amino acid sequences are, to some extent, degenerate to serve the same oscillating pattern that is associated with the degenerate third codon positions. [Reviewing Editor: Dr. Richard Kliman]

7.
8.
A computer program (PINCERS) is described for use in the design of synthetic genes and mixed-probe DNA sequences. A protein sequence is reverse translated with generation of synonymous codons at each position, producing a degenerate sequence. In order to locate potential restriction enzyme sites, the degenerate sequence is searched with a library of restriction enzymes for sites that utilize any combination of synonymous codons. These sites are indicated in a map so that they may be incorporated into the synthetic gene sequence. The program allows the user to select the appropriate codon usage table for the organism of interest and then to set a threshold usage frequency below which codons are not generated. PINCERS may also be used to assist in planning the synthesis of mixed-probe DNA sequences for cross-hybridization experiments. It can identify regions of specified length within the protein sequence that have the least overall degeneracy, thereby minimizing the number of probes to be synthesized and, therefore, maximizing the concentration of a given probe sequence.
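The reverse-translation search described in this abstract can be illustrated with a minimal sketch. It is not the PINCERS implementation: the function name `site_positions`, the example peptide, and the use of the standard genetic code without any codon usage threshold are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import product

# Standard genetic code (NCBI translation table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Reverse translation: every amino acid maps to its list of synonymous codons.
SYNONYMS = defaultdict(list)
for codon, aa in CODON_TABLE.items():
    SYNONYMS[aa].append(codon)


def site_positions(protein, site):
    """Return (nucleotide offset, codon choice) pairs where `site` can be
    created by some combination of synonymous codons (hypothetical helper)."""
    hits = []
    for off in range(3 * len(protein) - len(site) + 1):
        first = off // 3                      # first codon touched by the site
        last = (off + len(site) - 1) // 3     # last codon touched by the site
        options = [SYNONYMS[aa] for aa in protein[first:last + 1]]
        for choice in product(*options):
            window = "".join(choice)
            start = off - 3 * first
            if window[start:start + len(site)] == site:
                hits.append((off, choice))
                break                         # one witness per offset is enough
    return hits


# EcoRI recognizes GAATTC; the peptide MEFK is an illustrative input.
print(site_positions("MEFK", "GAATTC"))
```

Here the site fits only at nucleotide offset 3, using GAA for Glu and TTC for Phe. A usage-frequency threshold like the one the abstract mentions could be approximated by filtering `SYNONYMS` before the search.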

9.
In living organisms 20 amino acids along with the terminator value(s) are encoded by 64 codons giving a degeneracy of the codons as described by the genetic code. A basic theoretical problem of genetic codes is to explain the particular distribution of degeneracies of partitions involved in the codes. In this work the degeneracy problem is considered in the framework of information theory. It is shown by direct numerical evaluation of a certain degeneracy information function associated with the genetic code that the degeneracy of the codes is observed to be related to the optimization of this function.

10.
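Molecular structure of amino acids, degeneracy of the genetic code, and two-dimensional set classification   Total citations: 13 (self-citations: 2; citations by others: 11)
According to the degree of degeneracy of their codons, the 64 genetic codons can be divided into two major classes: a high-degeneracy class (the 3-, 4- and 6-fold degenerate groups) and a low-degeneracy class (the 1- and 2-fold degenerate groups). The high-degeneracy class contains 9 amino acids, whose molecular weights are relatively small and whose isoelectric points are relatively closely clustered. The low-degeneracy class contains 11 amino acids, whose molecular structures are relatively complex. With reference to Taylor's classification diagram of amino acid properties, this paper proposes using molecular weight (M) and isoelectric point (P) as chemical property coordinates of the amino acids and constructs a two-dimensional set MP classification diagram. The MP diagram can reflect various attributes of the amino acids, such as molecular weight, degree of degeneracy, polar versus nonpolar, charged versus uncharged, hydrophobic versus hydrophilic, and the type of amino acid residue. From this classification analysis it can be concluded that most high-degeneracy amino acids are aliphatic or hydroxy-aliphatic amino acids with relatively small molecular weights and relatively simple molecular structures; most are hydrophobic, mainly form transmembrane structures or protein structural domains, and may be the amino acids that appeared earlier. The low-degeneracy amino acids, in contrast, have more complex molecular structures and larger molecular weights, mostly carry groups closely related to protein function, and may be structures that appeared later in evolution.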
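The split into 9 high-degeneracy and 11 low-degeneracy amino acids can be checked directly against the standard genetic code. The sketch below assumes the standard code (NCBI translation table 1) and one-letter amino acid symbols; the variable names are illustrative and not taken from the paper.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI translation table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Degeneracy = number of codons that encode an amino acid (stop codons excluded).
degeneracy = Counter(aa for aa in CODON_TABLE.values() if aa != "*")

high = sorted(aa for aa, d in degeneracy.items() if d >= 3)  # 3-, 4-, 6-fold
low = sorted(aa for aa, d in degeneracy.items() if d <= 2)   # 1-, 2-fold

print(len(high), "".join(high))
print(len(low), "".join(low))
```

Under these assumptions the high-degeneracy class comes out as Ala, Gly, Ile, Leu, Pro, Arg, Ser, Thr and Val, and the low-degeneracy class as the remaining 11 amino acids, in agreement with the counts stated in the abstract.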

11.
The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit from the information that is implicit in empirical substitution matrices such as BLOSUM-62. Taken together with the generally higher rate of synonymous mutations over non-synonymous ones, this means that the phylogenetic signal disappears much more rapidly from DNA sequences than from the encoded proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA alignment by 'reverse translation' of the aligned protein sequences. In the resulting DNA alignment, gaps occur in groups of three corresponding to entire codons, and analogous codon positions are therefore always lined up. These features are useful when constructing multiple DNA alignments for phylogenetic analysis. RevTrans also accepts user-provided protein alignments for greater control of the alignment process. The RevTrans web server is freely available at http://www.cbs.dtu.dk/services/RevTrans/.
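Steps (i) and (iii) of the procedure in this abstract can be sketched in a few lines. This is an illustration of the idea only, not RevTrans code: the function names and toy sequences are assumptions, and step (ii), the protein alignment itself, is supplied by hand rather than computed.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Step (i): translate an in-frame coding sequence codon by codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def thread_codons(aligned_protein, dna):
    """Step (iii): put the original codons back under the aligned residues,
    writing each protein gap as a three-base gap."""
    if translate(dna) != aligned_protein.replace("-", ""):
        raise ValueError("aligned protein does not match the DNA translation")
    codons = iter(dna[i:i + 3] for i in range(0, len(dna), 3))
    return "".join("---" if ch == "-" else next(codons) for ch in aligned_protein)


# Toy input: two coding sequences and a hand-made protein alignment (step ii).
dna_seqs = ["ATGAAAGGG", "ATGGGG"]
protein_alignment = ["MKG", "M-G"]

for aligned, dna in zip(protein_alignment, dna_seqs):
    print(thread_codons(aligned, dna))
```

Because gaps are only ever inserted as whole codons, equivalent codon positions stay in the same columns, which is the property the abstract highlights for phylogenetic analysis.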

12.
9-fluorenylmethoxycarbonyl (Fmoc) and 4,4′-dimethoxytrityl (DMTr) are orthogonal hydroxyl protecting groups that have been used in conjunction to assemble oligonucleotide libraries whose variants contain wild-type and mutant codons randomly interspersed throughout a focused DNA region. Fmoc is labile to organic bases and stable to weak acids, whereas DMTr behaves oppositely. Based on these chemical characteristics, we have now devised TrimerDimer, a novel codon-based saturation mutagenesis approach that removes redundant and stop codons during the assembly of degenerate oligonucleotides. In this approach, five DMTr-protected trinucleotide phosphoramidites (dTGG, dATG, dTTT, dTAT and dTGC) and five Fmoc-protected dinucleotide phosphoramidites (dAA, dTT, dAT, dGC and dCG) react simultaneously with a starting oligonucleotide growing on a solid support. The Fmoc group is then removed and the incorporated dimers react with a mixture of three DMTr-protected monomer phosphoramidites (dC, dA and dG) to produce 15 trinucleotides: dCAA, dAAA, dGAA, dCTT, dATT, dGTT, dCAT, dAAT, dGAT, dCGC, dAGC, dGGC, dCCG, dACG and dGCG. After one mutagenic cycle, 20 codons are generated encoding the 20 natural amino acids. TrimerDimer was tested by randomizing the four contiguous codons that encode amino acids L64–G67 of an engineered, nonfluorescent GFP protein. Sequencing of 89 nonfluorescent mutant clones and isolation of two fluorescent mutants confirmed the principle.
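The codon arithmetic in this abstract (5 trimers plus 5 dimers extended by 3 monomers gives 20 codons for 20 amino acids) can be verified with a short sketch. It assumes the standard genetic code and that each monomer is added at the 5' end of a dimer, which is what the 15 trinucleotides listed in the abstract imply; the variable names are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

trimers = ["TGG", "ATG", "TTT", "TAT", "TGC"]   # DMTr-protected trinucleotides
dimers = ["AA", "TT", "AT", "GC", "CG"]         # Fmoc-protected dinucleotides
monomers = ["C", "A", "G"]                      # added after Fmoc removal

# Each dimer is extended at its 5' end by each monomer, giving 15 codons.
extended = [m + d for d in dimers for m in monomers]
codons = trimers + extended

encoded = {CODON_TABLE[c] for c in codons}
print(len(codons), len(encoded), "*" in encoded)
```

With these inputs the mixture contains 20 distinct codons that translate to 20 distinct amino acids and no stop codon, matching the claim that redundant and stop codons are excluded.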

13.
Techniques for position-specific incorporation of non-natural amino acids in an in vitro protein synthesizing system are described. First, a PNA-assisted non-enzymatic tRNA aminoacylation with a variety of natural and non-natural amino acids is described. With this technique, one can aminoacylate a specific tRNA simply by adding a preformed amino acid activated ester-PNA conjugate into an in vitro protein biosynthesizing system. Second, the genetic code is expanded by introducing 4-base codons that can be exclusively translated to non-natural amino acids. The most advantageous point of the 4-base codon strategy is to introduce multiple amino acids into specific positions in single proteins by using mutually orthogonal 4-base codons and orthogonal tRNAs. An easy and quick method for preparation of tRNAs possessing 4-base anticodons is also described. Combination of the non-enzymatic aminoacylation and the 4-base codon/anticodon strategy gives an easy and widely applicable technique for incorporating a variety of non-natural amino acids into proteins in vitro.

14.
Ren Zhang M.D. 《Amino acids》1997,12(2):167-177
Summary: Based on the genetic code and a simple theorem on the geometrical properties of the regular tetrahedron, each amino acid is mapped onto a unique point in a 3-dimensional tetrahedral space. The distribution of the 20 mapping points for the 20 amino acids is studied in detail. It is found that the mapping points for the hydrophobic and hydrophilic amino acids are distributed in distinct regions of the 3-dimensional space. A plane that satisfactorily separates the two kinds of points has been calculated using Fisher's algorithm. It is shown that the codons coding for the hydrophobic amino acids are composed predominantly of the keto bases, i.e., G and T, whereas the codons coding for the hydrophilic amino acids are composed predominantly of the amino bases, i.e., A and C. The biological implications of the mapping points and the separating plane are discussed in some detail.

15.
The molecular mechanisms underlying pressure-induced protein denaturation can be analyzed based on the pressure-dependent differences in the apparent volume occupied by amino acids inside the protein and when they are exposed to water in an unfolded conformation. We present here an analysis for the peptide group and the 20 naturally occurring amino acid side chains based on volumetric parameters for the amino acids in the interior of the native state, the micelle-like interior of the pressure-induced denatured state, and the unfolded conformation modeled by N-acetyl amino acid amides. The transfer of peptide groups from the protein interior to water becomes increasingly favorable as pressure increases. Thus, solvation of peptide groups represents a major driving force in pressure-induced protein denaturation. Polar side chains do not appear to exhibit significant pressure-dependent changes in their preference for the protein interior or solvent. The transfer of nonpolar side chains from the protein interior to water becomes more unfavorable as pressure increases. We conclude that a sizeable population of nonpolar side chains remains buried inside a solvent-inaccessible core of the pressure-induced denatured state. At elevated pressures, this core may become packed almost as tightly as the interior of the native state. The presence and partial disappearance of large intraglobular voids is another driving force facilitating pressure-induced denaturation of individual proteins. Our data also have implications for the kinetics of protein folding and shed light on the nature of the folding transition state ensemble.

16.
Gray MW 《IUBMB life》2003,55(4-5):227-233
'RNA editing' describes the programmed alteration of the nucleotide sequence of an RNA species, relative to the sequence of the encoding DNA. The phenomenon encompasses two generic patterns of nucleotide change, 'insertion/deletion' and 'substitution', defined on the basis of whether the sequence of the edited RNA is colinear with the DNA sequence that encodes it. RNA editing is mediated by a variety of pathways that are mechanistically and evolutionarily unrelated. Messenger, ribosomal, transfer and viral RNAs all undergo editing in different systems, but well-documented cases of this phenomenon have so far been described only in eukaryotes, and most often in mitochondria. Editing of mRNA changes the identity of encoded amino acids and may create translation initiation and termination codons. The existence of RNA editing violates one of the long-accepted tenets of genetic information flow, namely, that the amino acid sequence of a protein can be directly predicted from the corresponding gene sequence. Particular RNA editing systems display a narrow phylogenetic distribution, which argues that such systems are derived within specific eukaryotic lineages, rather than representing traits that ultimately trace to a common ancestor of eukaryotes, or even further back in evolution. The derived nature of RNA editing raises intriguing questions about how and why RNA editing systems arise, and how they become fixed as additional, essential steps in genetic information transfer.

17.
T Palzkill  D Botstein 《Proteins》1992,14(1):29-44
A new analytical mutagenesis technique is described that involves randomizing the DNA sequence of a short stretch of a gene (3-6 codons) and determining the percentage of all possible random sequences that produce a functional protein. A low percentage of functional random sequences in a complete library of random substitutions indicates that the region mutagenized is important for the structure and/or function of the protein. Repeating the mutagenesis over many regions throughout a protein gives a global perspective of which amino acid sequences in a protein are critical. We applied this method to 66 codons of the gene encoding TEM-1 beta-lactamase in 19 separate experiments. We found that TEM-1 beta-lactamase is extremely tolerant of amino acid substitutions: on average, 44% of all mutants with random substitutions function and 20% of the substitutions are expressed, secreted, and fold well enough to function at levels similar to those for the wild-type enzyme. We also found a few exceptional regions where only a few random sequences function. Examination of the X-ray structures of homologous beta-lactamases indicates that the regions most sensitive to substitution are in the vicinity of the active site pocket or buried in the hydrophobic core of the protein. DNA sequence analysis of functional random sequences has been used to obtain more detailed information about the amino acid sequence requirements for several regions and this information has been compared to sequence conservation among several related beta-lactamases.

18.
Tang L  Gao H  Zhu X  Wang X  Zhou M  Jiang R 《BioTechniques》2012,52(3):149-158
Site-saturation mutagenesis is a powerful tool for protein optimization due to its efficiency and simplicity. A degenerate codon NNN or NNS (K) is often used to encode the 20 standard amino acids, but this will produce redundant codons and cause uneven distribution of amino acids in the constructed library. Here we present a novel "small-intelligent" strategy to construct mutagenesis libraries that have a minimal gene library size without inherent amino acid biases, stop codons, or rare codons of Escherichia coli by coupling well-designed combinatorial degenerate primers with suitable PCR-based mutagenesis methods. The designed primer mixture contains exactly one codon per amino acid and thus allows the construction of small-intelligent mutagenesis libraries with one gene per protein. In addition, the software tool DC-Analyzer was developed to assist in primer design according to the user-defined randomization scheme for library construction. This small-intelligent strategy was successfully applied to the randomization of halohydrin dehalogenases with one or two randomized sites. With the help of DC-Analyzer, the strategy was proven to be as simple as NNS randomization and could serve as a general tool to efficiently randomize target genes at positions of interest.
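The redundancy and bias of NNN, NNS and NNK codons mentioned in this abstract can be made concrete by expanding each degenerate codon and counting what it encodes. The sketch below assumes the standard genetic code and the usual IUPAC ambiguity codes (N = A/C/G/T, S = C/G, K = G/T); it is unrelated to DC-Analyzer, and the function names are illustrative.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI translation table 1), bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Subset of the IUPAC nucleotide ambiguity codes needed for NNN, NNS and NNK.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "ACGT", "S": "CG", "K": "GT"}


def expand(degenerate):
    """List every concrete codon covered by a degenerate codon."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in degenerate))]


def summarize(degenerate):
    """Return (number of codons, number of stop codons, codons per amino acid)."""
    codons = expand(degenerate)
    counts = Counter(CODON_TABLE[c] for c in codons)
    stops = counts.pop("*", 0)
    return len(codons), stops, counts


for scheme in ("NNN", "NNS", "NNK"):
    total, stops, counts = summarize(scheme)
    print(scheme, total, stops, len(counts), max(counts.values()), min(counts.values()))
```

Each scheme still covers all 20 amino acids, but some are encoded by several codons and others by only one, and at least one stop codon remains. That unevenness is what a one-codon-per-amino-acid primer mixture is meant to avoid.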

19.
The extremely complicated nature of many biological problems gives them the features of fuzzy sets, with information that is vague, imprecise, noisy, ambiguous, or incomplete. For instance, the current data used in classifying protein structural classes typically form a fuzzy set. To deal with this kind of problem, the AAPCA (Amino Acid Principal Component Analysis) approach was introduced. In the AAPCA approach the 20-dimensional amino acid composition space is reduced to an orthogonal space with fewer dimensions, and the original base functions are converted into a set of orthogonal and normalized base functions. The advantage of such an approach is that it can minimize the random errors and redundant information in a protein dataset through principal component selection, remarkably improving the success rates in predicting protein structural classes. It is anticipated that the AAPCA approach can be used to deal with many other classification problems in proteins as well.

20.
The dynamics of directionally tuned linear multi-input single-output systems varies generally as a function of the spatial orientation of the inputs. A linear system receiving directionally specific inputs is represented by a linear combination of the respective input transfer functions. The input-output behaviour of such systems can be described by a vector transfer function which specifies the polarization directions of the system in real space. These directions, which can be either one (unidirectional vector transfer function) or two (bidirectional vector transfer function) but never three, are obtained by computing the eigenvectors and eigenvalues of the system matrix that is defined by the gain and phase values of the system's response to harmonic stimulation directed along three orthogonal directions in space. The spatial tuning behaviour is determined by the quadratic form associated with the system matrix. Neuronal systems with bidirectional vector transfer functions process input information in a plane-specific way and exhibit novel characteristics, very much different from those of systems with unidirectional vector transfer functions.
