Similar literature
 Found 20 similar articles (search time: 15 ms)
1.
The amino acid sequence of human connective tissue growth factor was analysed as two-, three- and four-amino-acid sequences. The measured frequency and probability were compared with the predicted frequency and probability. In human connective tissue growth factor, 81 (23.276%) and 21 (6.034%) of 348 two-amino-acid sequences can be explained by the predicted frequency and probability, respectively, according to a purely random mechanism; 113 (55.122%) and 50 (24.390%) of 205 non-appearing two-amino-acid sequences can likewise be explained by the predicted frequency and probability according to a purely random mechanism; no measured Markov transition probability for the second amino acid in two-amino-acid sequences matches the predicted conditional probability.

2.
The repeated amino-acid sequences in Citrobacter freundii beta-lactamase may be indispensable for its function, because such repetitions cannot simply be attributed to chance. To fully explore the functional units in Citrobacter freundii beta-lactamase, it may be necessary to analyse all the amino acid pairs, triplets, etc. along the protein from one terminus to the other, counting their frequencies and calculating their probabilities. The amino-acid sequence of Citrobacter freundii beta-lactamase was counted as two-, three- and four-amino-acid sequences. The counted frequency and probability were compared with the predicted frequency and probability. The amino acid sequences that appear in Citrobacter freundii beta-lactamase and can be predicted from its amino acid composition according to a purely random mechanism should not have been deliberately evolved and conserved. By contrast, the amino acid sequences that appear in Citrobacter freundii beta-lactamase but cannot be predicted from its amino acid composition according to a purely random mechanism should have been deliberately evolved and conserved. Accordingly, 99 (26.053%) and 33 (8.684%) of 380 two-amino-acid sequences can be predicted by the frequency and probability, respectively, according to a purely random mechanism. Some kinds of amino acid sequences that are absent from Citrobacter freundii beta-lactamase and can be predicted from its amino acid composition according to a purely random mechanism should not have been deliberately excluded. By contrast, some kinds of amino acid sequences that are absent from Citrobacter freundii beta-lactamase and cannot be predicted from its amino acid composition according to a purely random mechanism should have been deliberately excluded. Accordingly, 89 (48.370%) and 41 (22.283%) of 184 kinds of absent two-amino-acid sequences can be predicted by the frequency and probability, respectively, according to a purely random mechanism, and 7236 (99.848%) of 7247 kinds of absent three-amino-acid sequences can be predicted by the frequency according to a purely random mechanism. The amino acids whose probabilities of following certain preceding amino acids can be predicted from the amino acid composition of Citrobacter freundii beta-lactamase according to a purely random mechanism should not have been deliberately evolved and conserved; accordingly, 2 (0.526%) of 380 counted first-order Markov transition probabilities for the second amino acid in two-amino-acid sequences match the predicted conditional probabilities.
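The comparison of counted and predicted two-amino-acid frequencies can be illustrated with a minimal sketch. The abstract does not give the prediction formula, so the null model here is an assumption: residues are drawn independently from the sequence's own composition, giving an expected count of (n - 1) · p(X) · p(Y) for the pair XY. The function names and the toy sequence are hypothetical.

```python
from collections import Counter


def pair_counts(seq):
    """Count overlapping two-residue sequences along seq."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def predicted_pair_count(seq, pair):
    """Expected count of `pair` under the ASSUMED null model:
    residues drawn independently from the sequence's own composition."""
    n = len(seq)
    composition = Counter(seq)
    p_first = composition[pair[0]] / n
    p_second = composition[pair[1]] / n
    return (n - 1) * p_first * p_second


# Hypothetical toy sequence, not a real protein.
toy = "MKKLAKKA"
observed = pair_counts(toy)
for pair in sorted(observed):
    print(pair, observed[pair], round(predicted_pair_count(toy, pair), 3))
```

A pair whose counted value is close to the predicted value would, in the abstract's terms, be "explained by a purely random mechanism"; how close counts as a match is not specified in the abstract and is left open here.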

3.
4.
The gene encoding trypanothione reductase, the redox disulfide-containing flavoenzyme that is unique to the parasitic trypanosomatids (Shames et al., 1986), has been isolated from the cattle pathogen Trypanosoma congolense. Library screening was carried out with inosine-containing oligonucleotide probes encoding sequences determined from two active site peptides isolated from the purified Crithidia fasciculata enzyme. The nucleotide sequence of the gene was determined according to the dideoxy chain termination method of Sanger. The structural gene is 1476 nucleotides long and encodes 492 amino acids. We have identified the active site peptide containing the redox-active disulfide, a peptide corresponding to the histidine-467 region of human erythrocyte glutathione reductase, as well as the flavin binding domain that is highly conserved in all disulfide-containing flavoprotein reductase enzymes. Alignment of five tryptic peptides (80 residues) isolated from the C. fasciculata trypanothione reductase with the primary sequence of the T. congolense enzyme showed 88% homology with 76% identity. Additionally, a sequence comparison of the glutathione reductase from Escherichia coli or human erythrocytes to T. congolense trypanothione reductase reveals greater than 50% homology. A search for the amino acid residues in the primary sequence of trypanothione reductase functionally active in binding/catalysis in human erythrocyte glutathione reductase shows that only the two arginine residues (Arg-37 and Arg-347), shown by X-ray crystallographic data to hydrogen bond to the GS1 glutathione glycyl carboxylate, are absent.

5.
In this article, we introduce drifting Markov models (DMMs), which are inhomogeneous Markov models designed to model the heterogeneities of sequences (in our case DNA or protein sequences) more flexibly than homogeneous Markov chains or even hidden Markov models (HMMs). We focus here on the polynomial drift: the transition matrix varies polynomially along the sequence. To show the reliability of our models on DNA, we exhibit high similarities between the probability distributions of nucleotides obtained by our models and the frequencies of these nucleotides computed using a sliding window. In a further step, these DMMs can be used as the states of an HMM: on each of its segments, the observed process can be modeled by a drifting Markov model. Searching for rare words in DNA sequences remains possible with DMMs and, according to the fits provided, DMMs turn out to be a powerful tool for this purpose. The software is available on request from the author. It will soon be integrated into the seq++ library (http://stat.genopole.cnrs.fr/seqpp/).
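To make the idea of a drifting transition matrix concrete, here is a minimal sketch of the simplest polynomial case, a degree-1 (linear) drift. It assumes the matrix at position t of a sequence of length n is the convex combination (1 - t/n) · P_start + (t/n) · P_end; the abstract does not spell out this parameterisation, and the function name and the two example matrices are hypothetical.

```python
def drifting_matrix(p_start, p_end, t, n):
    """Transition matrix at position t (0 <= t <= n) under an ASSUMED
    linear drift between a start matrix and an end matrix."""
    w = t / n
    return [
        [(1 - w) * a + w * b for a, b in zip(row_start, row_end)]
        for row_start, row_end in zip(p_start, p_end)
    ]


# Hypothetical two-state example; each row sums to 1.
P_START = [[0.9, 0.1], [0.2, 0.8]]
P_END = [[0.5, 0.5], [0.6, 0.4]]

midpoint = drifting_matrix(P_START, P_END, 50, 100)
for row in midpoint:
    print([round(x, 3) for x in row])
```

Because each row is a convex combination of two probability vectors, every intermediate matrix is still a valid transition matrix, which is one reason a linear drift is a convenient starting point.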

6.
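The meadow moth, Loxostege sticticalis L., is an important migratory, outbreak-prone pest of agriculture and animal husbandry in northern China; an outbreak causes severe damage to local agricultural and pastoral production. Using time-series data on the occurrence level of first-generation meadow moth larvae in Kangbao County from 1977 to 2008, transition probability matrices of orders 1 to 3 were constructed with the Markov chain transition-probability forecasting method, and a model was built to predict the occurrence level of the first generation in the county for 2009-2011. The predictions agreed completely with the actual field occurrence, an accuracy of 100%. Back-testing against the historical data for 1980-2011 gave a historical agreement rate of 89.9%. The method provides an accurate and effective approach to long-term forecasting of meadow moth occurrence.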
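The transition-probability forecasting step can be sketched as follows. This is an illustration under stated assumptions, not the authors' model: "order k" is read here as a k-step transition (from year t to year t + k), the forecast is simply the most probable target state, and the occurrence levels in the toy series are invented.

```python
from collections import Counter, defaultdict


def step_transition_probs(states, k=1):
    """Estimate k-step transition probabilities from an observed series."""
    counts = defaultdict(Counter)
    for source, target in zip(states, states[k:]):
        counts[source][target] += 1
    return {
        source: {target: c / sum(row.values()) for target, c in row.items()}
        for source, row in counts.items()
    }


def most_likely_next(states, k=1):
    """Forecast the next value from the state observed k steps before it."""
    row = step_transition_probs(states, k).get(states[-k], {})
    return max(row, key=row.get) if row else None


# Hypothetical occurrence levels (1 = light, 2 = moderate, 3 = severe).
series = [1, 2, 3, 1, 2, 3, 1, 2]
for k in (1, 2):
    print(k, most_likely_next(series, k))
```

How forecasts from the different step lengths are combined into a single prediction is not described in the abstract, so the sketch reports them separately.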

7.
Many proteins need to recognize specific DNA sequences in order to function. The number of recognition sites and their distribution along the DNA might be of biological importance. For example, the number of restriction sites is often reduced in prokaryotic and phage genomes to decrease the probability of DNA cleavage by restriction endonucleases. We call a sequence exceptional if its frequency in a genome differs significantly from the frequency predicted by some mathematical model. An exceptional sequence can be either under- or over-represented, depending on its frequency in comparison with the predicted one. Exceptional sequences could be considered biologically meaningful, for example, as targets of DNA-binding proteins or as parts of abundant repetitive elements. Several methods are used to predict the frequency of a short sequence in a genome from the actual frequencies of certain of its subsequences. The most popular are methods based on Markov chain models. However, no rigorous comparison of these methods has previously been performed. We compared three methods for the prediction of short sequence frequencies: the maximum-order Markov chain model-based method, the method that uses the geometric mean of extended Markovian estimates, and the method that utilizes frequencies of all subsequences, including discontiguous ones. We applied them to restriction sites in complete genomes of 2500 prokaryotic species and demonstrated that the results depend greatly on the method used: lists of the 5% most under-represented sites differed by up to 50%. The method designed by Burge and coauthors in 1992, which utilizes all subsequences of the sequence, showed a higher precision than the other two methods both on prokaryotic genomes and on randomly generated sequences after computational imitation of selective pressure. We propose this method as the first choice for detection of exceptional sequences in prokaryotic genomes.
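Of the three methods compared, the maximum-order Markov estimate is the simplest to write down. The sketch below uses the standard form of that estimate: for a word w = w1…wk, the expected count is N(w1…wk-1) · N(w2…wk) / N(w2…wk-1). The abstract itself does not state the formula, so treat this as a common textbook version rather than the authors' exact implementation; the function names and the toy sequence are hypothetical.

```python
def count_overlapping(text, word):
    """Number of (possibly overlapping) occurrences of word in text."""
    last_start = len(text) - len(word)
    return sum(text.startswith(word, i) for i in range(last_start + 1))


def max_order_markov_expected(text, word):
    """Expected count of word from its two longest proper subwords
    (prefix and suffix) and their shared middle part."""
    if len(word) < 3:
        raise ValueError("word must have at least 3 letters")
    middle = count_overlapping(text, word[1:-1])
    if middle == 0:
        return 0.0
    prefix = count_overlapping(text, word[:-1])
    suffix = count_overlapping(text, word[1:])
    return prefix * suffix / middle


# Hypothetical toy "genome".
genome = "ACGTACGTACGA"
site = "ACGT"
observed = count_overlapping(genome, site)
expected = max_order_markov_expected(genome, site)
print(observed, expected)
```

A site would be flagged as under-represented when its observed count falls well below the expected one; the significance threshold is a separate choice that this sketch does not make.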

8.
The amino acid sequences of the cysteinyl peptides of Spirulina sp. glutathione reductase were determined. Spirulina glutathione reductase was covalently bound to Thiopropyl-Sepharose 6B in the presence of 8 M urea through thiol-disulfide exchange. After tryptic digestion, four distinct cysteinyl peptides were isolated from NADPH-reduced glutathione reductase and two from oxidized glutathione reductase. The amino acid sequences of the two cysteinyl peptides that could not be isolated from the oxidized glutathione reductase were very similar to those around the active site disulfide of the other flavoprotein disulfide oxidoreductases, and a unique replacement of asparagine and valine by isoleucine and arginine between the two cysteine residues was found. The other two peptides, isolated from both oxidized and reduced glutathione reductase, also show considerable homology to the corresponding parts of human and Escherichia coli glutathione reductases.

9.
Methylglyoxal is mainly catabolized by two major enzymatic pathways. The first is the ubiquitous detoxification pathway, the glyoxalase pathway. In addition to the glyoxalase pathway, the aldose reductase pathway also plays a crucial role in lowering the levels of methylglyoxal. The gene encoding aldose reductase (ALR) has been cloned from Leishmania donovani, a protozoan parasite causing visceral leishmaniasis. DNA sequence analysis revealed an open reading frame (ORF) of approximately 855 bp encoding a putative protein of 284 amino acids with a calculated molecular mass of 31.7 kDa and a predicted isoelectric point of 5.85. The sequence identity between L. donovani ALR (LdALR) and the ALRs of mammals and plants is only 36-44%. The ORF is a single copy gene. Heterologous expression of LdALR in Escherichia coli induced a protein whose molecular mass matched the approximately 74 kDa estimated from the amino acid composition of LdALR with a maltose binding tag at its N-terminal end. In the presence of glutathione, recombinant LdALR reduced methylglyoxal with a K(m) of approximately 112 microM. Comparative structural analysis of the human ALR structure with the LdALR model suggests that the active site anchoring the N-terminal end of the glutathione is highly conserved. However, the C-terminal end of the glutathione backbone is expected to be exposed in LdALR, as the residues anchoring the C-terminal end of the glutathione backbone come from three loop regions in the human enzyme that are apparently shortened in the LdALR structure. Thus, the computational analysis provides clues about the expected mode of glutathione binding and its interactions with the protein. This is the first report of the role of an ALR in the metabolic disposal of methylglyoxal in L. donovani and of thiol binding to a kinetoplastid aldose reductase.

10.
The degree of similarity of DNA sequences can be determined by comparing them, which helps in inferring their relationships with respect to structure, function and evolution. In this paper, we introduce the fundamentals of a weighted relative entropy based on a 2-step Markov model for comparing DNA sequences. A DNA sequence, consisting of the four characters A, T, C and G, can be considered as a Markov chain. By taking the state space I = {A, T, C, G} and describing the DNA sequences with the 2-step transition probability matrix, we can obtain the eigenvalue of the DNA sequence and use it to define the similarity metric. This gives a new method for comparing DNA sequences, which is used to classify chromosomal DNA sequences obtained from 30 species. The phylogenetic tree built by this alignment-free method from the distance matrix derived from the weighted relative entropy shows a clearer and more accurate division.
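The 2-step transition probability matrix mentioned above can be sketched in a few lines. The sketch assumes the usual definition: estimate the one-step matrix P from adjacent base pairs, then take the 2-step matrix as P · P. The abstract does not say how the authors estimate it, and the weighted relative entropy and eigenvalue steps are not reproduced here; the function names and the toy sequence are hypothetical.

```python
from collections import Counter

BASES = "ATCG"  # state space I = {A, T, C, G}


def one_step_matrix(seq):
    """Row-normalised counts of adjacent base pairs (rows: from, cols: to)."""
    counts = Counter(zip(seq, seq[1:]))
    matrix = []
    for a in BASES:
        total = sum(counts[(a, b)] for b in BASES)
        matrix.append([counts[(a, b)] / total if total else 0.0 for b in BASES])
    return matrix


def two_step_matrix(seq):
    """ASSUMED 2-step matrix: the one-step matrix multiplied by itself."""
    p = one_step_matrix(seq)
    n = len(BASES)
    return [
        [sum(p[i][k] * p[k][j] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]


# Hypothetical toy sequence that cycles A -> T -> C -> G -> A.
toy = "ATCGATCGA"
for base, row in zip(BASES, two_step_matrix(toy)):
    print(base, row)
```

In the toy sequence every base is followed by exactly one other base, so two steps from A always land on C; real sequences give dense matrices instead.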

11.
Insertions and deletions in a profile hidden Markov model (HMM) are modeled by transition probabilities between insert, delete and match states. These are estimated by combining observed data and prior probabilities. The transition prior probabilities can be defined either ad hoc or by maximum likelihood (ML) estimation. We show that the choice of transition prior greatly affects the HMM's ability to discriminate between true and false hits. HMM discrimination was measured using the HMMER 2.2 package applied to 373 families from Pfam. We measured the discrimination between true members and noise sequences employing various ML transition priors and also systematically scanned the parameter space of ad hoc transition priors. Our results indicate that ML priors produce far from optimal discrimination, and we present an empirically derived prior that considerably decreases the number of misclassifications compared to ML. Most of the difference stems from the probabilities for exiting a delete state. The ML prior, which is unaware of noise sequences, estimates a delete-to-delete probability that is relatively high and does not penalize noise sequences enough for optimal discrimination.

12.
Phylogenetic inference: how much evolutionary history is knowable?   Total citations: 5, self-citations: 2, citations by others: 3
In order to reconstruct phylogenetic trees from extremely dissimilar sequences it is necessary to estimate accurately the extent of sequence divergence. In this paper a new method of sequence analysis, Markov triple analysis, is developed for determining the relative frequencies of nucleotide substitutions within the three branches of a three-taxon dendrogram. Assuming that nucleotide sites are independently and identically distributed and assuming a Markov model for nucleotide (or protein) evolution, it is shown that the unique Markov matrices can be reconstructed given only the joint probability distribution relating three taxa. (In the much simpler case involving only two taxa and two character states, Markov matrices can also be reconstructed, provided symmetry assumptions are placed on the elements of the matrices.) The method is illustrated using sequence data from the combined first and second codon positions derived from complete human, mouse, and cow mitochondrial sequences.

13.
Glutathione peroxidase, glutathione reductase, and catalase activities were measured up to 48 h after death in mouse brains held at temperatures replicating the cooling that occurs in human cadaver brain. Glutathione peroxidase was stable for 48 h; catalase was stable for 24 h and then declined 20% in activity. Glutathione reductase was stable for 4 h and then decreased to 55% of its initial activity by 48 h. Perfusion of mouse brain with 0.9% (wt/vol) NaCl did not decrease enzyme activities, indicating that erythrocyte contamination has little effect on measured brain activities. The results suggest that glutathione peroxidase would not be affected by moderate time delays in obtaining human postmortem brains, but catalase activity may be affected if brains are not promptly removed. Glutathione reductase is not stable, and measurements would require controls carefully matched for postmortem conditions.

14.
The human mitochondrial outer membrane protein mitoNEET is a newly discovered target of the type 2 diabetes drug pioglitazone. Structurally, mitoNEET is a homodimer with each monomer containing an N-terminal transmembrane α helix tethered to the mitochondrial outer membrane and a C-terminal cytosolic domain hosting a redox-active [2Fe–2S] cluster. Genetic studies have shown that mitoNEET has a central role in regulating energy metabolism in mitochondria. However, the specific function of mitoNEET remains largely elusive. Here we find that the mitoNEET [2Fe–2S] clusters can be efficiently reduced by Escherichia coli thioredoxin reductase and glutathione reductase in an NADPH-dependent reaction. Purified human glutathione reductase has the same activity as E. coli thioredoxin reductase and glutathione reductase to reduce the mitoNEET [2Fe–2S] clusters. However, rat thioredoxin reductase, a human thioredoxin reductase homolog that contains selenocysteine in the catalytic center, has very little or no activity to reduce the mitoNEET [2Fe–2S] clusters. N-ethylmaleimide, a potent thiol modifier, completely inhibits human glutathione reductase from reducing the mitoNEET [2Fe–2S] clusters, indicating that the redox-active disulfide in the catalytic center of human glutathione reductase may be directly involved in reducing the mitoNEET [2Fe–2S] clusters. Additional studies reveal that the reduced mitoNEET [2Fe–2S] clusters in mouse heart cell extracts can be reversibly oxidized by hydrogen peroxide without disruption of the clusters, suggesting that the mitoNEET [2Fe–2S] clusters may undergo redox transition to regulate energy metabolism in mitochondria in response to oxidative signals.

15.
16.
Complex biological dynamics often generate sequences of discrete events that can be described as a Markov process. The order of the underlying Markovian stochastic process is fundamental for characterizing statistical dependencies within sequences. As an example of this class of biological systems, we investigate the Markov order of sequences of microsaccadic eye movements from human observers. We calculate the integrated likelihood of a given sequence for various orders of the Markov process and use this in a Bayesian framework for statistical inference on the Markov order. Our analysis shows that data from most participants are best explained by a first-order Markov process. This is compatible with recent findings of a statistical coupling of subsequent microsaccade orientations. Our method might prove useful for a broad class of biological systems.
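A minimal sketch of the integrated (marginal) likelihood for a given Markov order follows. It assumes a symmetric Dirichlet prior with parameter alpha on each row of the transition table, which gives a closed form in terms of log-gamma functions; the abstract does not state which prior the authors use. The first max_order symbols are treated as fixed context so that every order is scored on the same observations. The function name and the toy sequence are hypothetical.

```python
from collections import Counter, defaultdict
from math import lgamma


def log_evidence(seq, order, alphabet, max_order, alpha=1.0):
    """Log integrated likelihood of seq[max_order:] under a Markov chain
    of the given order, with an ASSUMED symmetric Dirichlet(alpha) prior."""
    counts = defaultdict(Counter)
    for i in range(max_order, len(seq)):
        context = tuple(seq[i - order:i])  # empty tuple when order == 0
        counts[context][seq[i]] += 1
    size = len(alphabet)
    total = 0.0
    for row in counts.values():
        n = sum(row.values())
        total += lgamma(size * alpha) - lgamma(size * alpha + n)
        total += sum(lgamma(alpha + row[s]) - lgamma(alpha) for s in alphabet)
    return total


# Hypothetical toy sequence: strictly alternating symbols.
toy = "AB" * 20
for k in range(3):
    print(k, round(log_evidence(toy, k, "AB", max_order=2), 3))
```

With equal prior probability on each candidate order, the posterior over orders is proportional to the exponentiated evidences, so the order with the largest value is the most probable one under these assumptions.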

17.
Glutathione peroxidase and glutathione reductase activities were measured in whole rat brains at selected ages from birth to adulthood. On a wet weight basis glutathione peroxidase activity increased 70% during development and glutathione reductase activity increased 160%. On a protein basis glutathione peroxidase declined slightly in activity during the first two weeks of life and then maintained the 14-day activity into adulthood while glutathione reductase showed a 30% increase in activity. While less than the developmental changes in many enzymes involved in aerobic glycolysis or catecholamine metabolism, these increases do suggest a role in CNS metabolism.

18.
19.
Erythrocyte glutathione reductase is responsible for generating reduced glutathione, which has been implicated in maintaining the integrity of the red blood cell. Erythrocytes from peripheral blood were separated into fractions of increasing age and the activity of glutathione reductase and aspartate amino transferase determined in each fraction. The age-related decline in activity of both enzymes was confirmed, but with detailed resolution of the cells by age a significant secondary rise in only glutathione reductase activity was found in very old cells. As red blood cells from the same cohort survive in the circulation for varying periods they must vary in some way from one another. It is postulated that glutathione reductase is a critical enzyme which limits erythrocyte survival and that the rate of decline in activity varies from cell to cell. A simple mathematical model based on this postulate accounted quantitatively for both the pattern of glutathione reductase activity and the erythrocyte survival curve. In addition, a simplified model of the passage of erythrocytes through the circulation was designed and run. The predicted erythrocyte survival curve and pattern of glutathione reductase activity were very similar to observed patterns. This model may be useful in other situations where a finite resource is degraded at different rates by random passages through different pathways.

20.
Prevalence of quadruplexes in the human genome   Total citations: 28, self-citations: 17, citations by others: 11
Guanine-rich DNA sequences of a particular form have the ability to fold into four-stranded structures called G-quadruplexes. In this paper, we present a working rule to predict which primary sequences can form this structure, and describe a search algorithm to identify such sequences in genomic DNA. We count the number of quadruplexes found in the human genome and compare that with the figure predicted by modelling DNA as a Bernoulli stream or as a Markov chain, using windows of various sizes. We demonstrate that the distribution of loop lengths is significantly different from what would be expected in a random case, providing an indication of the number of potentially relevant quadruplex-forming sequences. In particular, we show that there is a significant repression of quadruplexes in the coding strand of exonic regions, which suggests that quadruplex-forming patterns are disfavoured in sequences that will form RNA.
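A pattern search of the kind described can be sketched with a regular expression. The abstract does not state the working rule, so the pattern below is an assumption: four runs of at least three guanines separated by loops of one to seven bases, a form widely used for putative quadruplex sequences. The function name and the test sequences are hypothetical, and the sketch scans one strand only and reports non-overlapping matches.

```python
import re

# ASSUMED rule: G{3+} loop{1-7} G{3+} loop{1-7} G{3+} loop{1-7} G{3+}
QUADRUPLEX_PATTERN = re.compile(
    r"G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}[ACGT]{1,7}G{3,}"
)


def find_candidates(seq):
    """Return (start, matched_text) for each non-overlapping candidate."""
    return [
        (match.start(), match.group())
        for match in QUADRUPLEX_PATTERN.finditer(seq.upper())
    ]


# Hypothetical sequences: the first has four G-runs, the second only three.
print(find_candidates("ttGGGAGGGTGGGAGGGtt"))
print(find_candidates("GGGAGGGTGGG"))
```

To cover both strands, the same function could be run on the reverse complement; counting overlapping candidates would need a different scanning strategy than `finditer`.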
