首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Summary A method of estimating the number of nucleotide substitutions from amino acid sequence data is developed by using Dayhoff's mutation probability matrix. This method takes into account the effect of nonrandom amino acid substitutions and gives an estimate which is similar to the value obtained by Fitch's counting method, but larger than the estimate obtained under the assumption of random substitutions (Jukes and Cantor's formula). Computer simulations based on Dayhoff's mutation probability matrix have suggested that Jukes and Holmquist's method of estimating the number of nucleotide substitutions gives an overestimate when amino acid substitution is not random and the variance of the estimate is generally very large. It is also shown that when the number of nucleotide substitutions is small, this method tends to give an overestimate even when amino acid substitution is purely at random.  相似文献   

2.
By means of reverse-phase HPLC, 2 different proteins were obtained from apparently purified pig eosinophil major basic protein (MBP) and these proteins were named GMPB1 and GMBP2. It was revealed that these 2 components of MBP have similar molecular weights and pI values, although the amino acid compositions were slightly different. In the previous study, we cloned and sequenced GMPB1 cDNA. Here we obtained another clone by plaque hybridization using a screening probe synthesized by means of polymerase chain reaction. After sequencing, it became apparent that this clone corresponded to GMBP2. As in the case of GMBP1, the cDNA of GMBP2 encoded pre-proGMBP2 with 3 domains; signal peptide, acidic pro-portion, and mature GMBP2. By comparing the sequences of GMBP1 and GMBP2, it was revealed that the proteins were quite similar to each other. In addition, their sequences also resembled those of human MBP, especially in the basic domain of mature protein; but no such similarity existed in the pro-portion. Although the molecular weights determined by SDS-PAGE of guinea pig and human MBPs were 11,000 and 9,300, respectively, the calculated molecular weights of these 3 MBPs were all 13.8 kDa. The calculated pI values of GMBP1, GMBP2 and human MBP were 11.7, 11.3 and 11.6, respectively. By means of Harr plot analysis, it was revealed that the amino acid sequences, not only in signal peptides but also in the basic domains of mature proteins, were well conserved between guinea pig and human MBPs.  相似文献   

3.
The nucleotide sequence of the Escherichia coli colicin I receptor gene (cir) has been determined. The predicted mature protein consists of 599 amino acids and has a molecular weight of 67,169. Several previously noted characteristics of other E. coli outer membrane protein sequences were also identified in the sequence of Cir. These include an overall acidic nature, the absence of long hydrophobic stretches of amino acids, and a lack of predicted alpha-helical secondary structure. Because two classes of outer membrane proteins (the TonB-dependent transport proteins and the porins) share some structural features, protein sequences from both of these groups were aligned pairwise and scored for sequence similarity. Statistical evidence suggested that the porins were not related to the proteins in the TonB-dependent group; however, there was a significant relationship between the proteins in the TonB-dependent group. On the basis of the multiple progressive sequence alignment and the similarity scores derived from it, a tree representing evolutionary distance between five TonB-dependent outer membrane transport proteins was generated.  相似文献   

4.
Summary It is known that globin genes contain three exons with the middle exon coding for a four-helical supersecondary structure responsible for heme binding. Since this portion of the globin peptide chain can be structurally superimposed onto the cytochromec and cytochromeb 5 chains (Argos and Rossmann 1979), it can be inferred that the cytochromec gene will contain only one coding sequence while the cytochromeb 5 gene will be composed of three exons as found in the globin gene.  相似文献   

5.
When two strings of symbols are aligned it is important to know whether the observed number of matches is better than that expected between two independent sequences with the same frequency of symbols. When strings are of different lengths, nulls need to be inserted in order to align the sequences. One approach is to use simple approximations of sampling for replacement. We describe an algorithm for exactly determining the frequencies of given numbers of matches, sampling without replacement. This does not lead to a simple closed form expression. However we show examples where sampling with, or without, replacement give very similar results and the simple approach may be adequate for all but the smallest cases.  相似文献   

6.
A set of aligned homologous protein sequences is divided into two groups consisting of m and n most related sequences. The value of position variability for homologous protein sequences is defined as a number of failures to coincide in the intergroup comparison of all possible m*n pairs of amino acid residues in that position divided by m*n. The position variability value plotted versus the sequence position number with a window of 10 positions gives the intergroup local variability profile. Area S of the figure included between the local variability profile and the straight line corresponding to the mean local variability value is compared with the average area Sr for 1000 random homologous protein families. If S is greater than Sr by more than 2 standard deviation units sigma r, the local variability profile is assumed to contain peaks and hollows corresponding to significant variable and conservative regions of the sequences. The profile extrema containing the area surplus delta S = S-(Sr+ 2 sigma r) are cut off by two straight lines to locate significant regions. The difference (S-Sr) given in standard deviation units sigma r is believed to be the amino acid substitution overall irregularity along the homologous protein sequences OI = (S-Sr)/sigma r. The significant conservative and variable regions of six homologous sequence families (phospholipase A2, cytochromes b, alpha-subunits of Na,K-ATPase, L- and M-subunits of photosynthetic bacteria photoreaction centre and human rhodopsins) were identified. It was shown that for artificial homologous protein sequences derived by k-fold lengthening of natural protein sequences, the OI value rises as square root of k. To compare the degree of substitution irregularity in homologous protein sequence families of different lengths L the value of standard substitution overall irregularity for L = 250 is proposed.  相似文献   

7.
Replacement substitutions of mitochondrial cytochrome c and α- and β-chains of haemoglobin have been studied by considering the structural similarity among amino acid residues at the secondary and tertiary structural levels. Secondary structural similarity explains ~70% while tertiary structural similarity explains ~50% of observed replacements for most of the cases. These structural similarities could not account for all the replacement substitutions. The study was extended to consider the composition of codons, and the chemical nature and polarity of the replacing and replaced residues. These also could not individually account for all the affected replacements. In general, no property of amino acid residues is conserved for substitutions occurring at any single position during evolution of proteins.  相似文献   

8.
The homologous genomic region that contains two paralogous genes,Adh andAdh-dup, was compared in severalDrosophila species. Sequences were analyzed as follows: a) At the nucleotide level, Ka and Ks values were determined for each pair of species. Ka-Adh and Ka-Adh-dup are not significantly different. However, Ks-Adh values are significantly lower than Ks-Adh-dup, which are more variable. In agreement with other reports, lower Ks values forAdh correlate with a high level of gene expression and relatively high percentage of G+C content in the third codon position, while the opposite applies toAdh-dup. b) At the protein level, amino acid comparisons reveal conserved regions shared by ADH and ADH-DUP, which have been assigned to known functional domains. Key residues for dehydrogenasic function are also found in ADH-DUP, thus pointing to a dehydrogenase activity for ADH-DUP, albeit very different from that of ADH.  相似文献   

9.
A method for optimally locating gaps in the amino acid sequences of homologous proteins is presented. The method involves three steps: (1) demonstration that the sequences are indeed homologous, (2) location of regions where the homologous pairing is reasonably certain, and (3) location of gaps between these regions so as to minimize the total number of mutations required to account for the differences between the two sequences. The major virtues of this procedure are that the assertion of homology does not depend upon the prior introduction of gaps and that a genetic rather than a chemical test is the basis for asserting a genetic relationship.This project received support from grants from NSF (GB-7486) and NIH (NB 04545-06).  相似文献   

10.
A cDNA clone for porcine liver proline-beta-naphthylamidase was isolated and sequenced. The deduced amino acid sequence of 567 residues was highly homologous with those of carboxylesterases (EC 3.1.1.1) previously reported for other species. In addition, proline-beta-naphthylamidase purified from porcine liver was shown to have strong activity towards p-nitrophenylacetate, a representative substrate for carboxylesterases. These results suggest that proline-beta-naphthylamidase is identical with carboxylesterase.  相似文献   

11.
The proteolytic action of trypsin, chymotrypsin, submaxillary gland proteinases, Lys-C, Staphylococcus aureus st. V8, Armilarria mellea, Mixobacter AL-2 proteinase II, thermolysin and alpha-lytic proteinase is elucidated from the analysis of te data available on the amino acid sequence studies for above 70 proteins. Properties of a series of commercial enzymic preparations and the way of preferential application of proteinases for studying the amino acid sequence are discussed.  相似文献   

12.
L-Arginine is a source of nitrogen oxide and plays a great role in a number of other biochemical processes. Functions and prospects for practical application of five groups of arginine-containing amino acid sequences and synthetic polyarginine sequences are considered. The physiological characteristics of well-known arginine-containing peptides, such as RGD containing, kyotorphin, and tuftsin, are described in detail.  相似文献   

13.
L-arginine is a source of nitrogen oxide and plays a great role in a number of other biochemical processes. Functions and prospects for practical application of five groups of arginine-containing amino acid sequences and synthetic polyarginine sequences are considered. The physiological characteristics of well-known arginine-containing peptides, such as RGD peptides, kyotorphin, and tuftsin, are described in detail. The English version of the paper: Russian Journal of Bioorganic Chemistry, 2008, vol. 34, no. 2; see also http://www.maik.ru  相似文献   

14.
15.
Summary A discriminant analysis on the basis of the physicochemical properties of amino acid residues is developed to investigate the accumulation pattern of amino acid substitutions in a family of proteins. The application of this analysis to vertebrate hemoglobins reveals the following new results. (1) The major components of teleost fish and amphibian hemoglobins showing the Root effect are sharply discriminated from mammalian hemoglobins in several regions of the and chains, whereas shark, minor components of teleost fish and amphibian, reptile, and bird hemoglobins showing no Root effect exhibit a gradual change to mammalian hemoglobin in a straightforward way. This result suggests at least two lines of molecular evolution in vertebrate hemoglobins. (2) The nonadult hemoglobin chains are allocated to the latter line, i.e., tadpole, , and chains are similar to shark and trout I chains, and and chains are similar to some of the reptile chains. (3) In any case, most of the amino acid residues causing the discrimination are located near the sites that carry the amino acid residues conserved well throughout all classes of vertebrates, suggesting that modifications adapting to the respective living conditions or respiratory organs have taken place effectively near the amino acid residues essential for the manifestation of cooperative oxygen binding. (4) The amino acid residues at other sites are changed from one to another species even within the same class, showing a constant substitution rate as a whole. These amino acid substitutions may be nearly neutral, being under a weak functional constraint. The number of sites allowing such neutral substitutions is rather small, less than one-half of all the sites in the adult hemoglobins of bird and mammal, whereas it amounts to two-thirds in teleost fish hemoglobins.  相似文献   

16.

Background  

The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries.  相似文献   

17.
Given two sequences, a pattern of length m, a text of lengthn and a positive integer k, we give two algorithms. The firstfinds all occurrences of the pattern in the text as long asthese do not differ from each other by more than k differences.It runs in O(nk) time. The second algorithm finds all subsequencealignments between the pattern and the test with at most k differences.This algorithm runs in O(nmk) time, is very simple and easyto program. Received on August 12, 1987; accepted on December 31, 1987  相似文献   

18.
19.
Y C Lone  M P Simon  A Kahn  J Marie 《FEBS letters》1986,195(1-2):97-100
Four overlapping cDNA clones for L-type pyruvate kinase (PK-L) were isolated from carbohydrate-induced rat liver cDNA libraries. They contained all the coding sequence of the enzyme from the 7th codon and the entire 3'-untranslated extension up to the poly(A) tail. The sequence of the first 7 codons and that of the 5'-untranslated region were determined by primer extension. The analyzed PK-L mRNA has 19 5'-untranslated bases, 1629 coding bases and 1281 3'-untranslated bases without the poly(A) tail; it corresponds to the heavier, 3.2 kb species of the L-type mRNAs. The codons for the phosphorylatable site are located at the 5'-end of the messenger. The unusually long 3'-untranslated extension contains a repetitive element complementary to the 'brain-specific' identifier sequence described by Sutcliffe et al. [(1982) Proc. Natl. Acad. Sci. USA 79, 4942-4946].  相似文献   

20.
Prediction of the effect of amino acid substitutions on the thermodynamic stability of proteins is of great importance for studies into the molecular mechanisms underlying the abnormal function of mutant proteins, interpretation of genotyping results, and purposeful design of modified proteins with improved biomedical and biotechnological properties. A set of methods was developed for predicting the changes in free energy (ΔΔG) of mutant proteins containing single substitutions using the information only about protein primary structure or also about the spatial structure. A modified KRAB algorithm was used; its higher accuracy in predicting the changes in the thermodynamic stability of mutant proteins compared with the other known methods designed for solving this problem is demonstrated. Distribution of the positions in the sequence of Malayan pit viper venom protein (kistrin) where the substitutions decrease or increase kistrin stability is analyzed. The substitutions at most positions conserved in the disintegrin family decrease the stability of this protein, except for several positions whose conservation can be determined by functional significance.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号