首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
一种用于蛋白质相似性分析的新的相对距离   总被引:1,自引:0,他引:1  
本文论述了一种新的相对距离,用于分析不同蛋白质序列的相似性分析和构造进化树.此种距离基于Lempel-Zip复杂度,不需要进行序列比对和复杂性算法.为了说明这种距离的合理性,本文对8个物种进行了相似性分析并构造了其进化树.  相似文献   

2.
In this article, we propose a relatively similar measure to compare protein secondary structures. We first transform a protein secondary structure into a special sequence representation (angle sequence) based on a partition of the backbone φ,ψ-space. Then, pairwise sequence distance is evaluated on the basis of a symbolic sequence complexity. To illustrate our approach, we construct the similarity tree of 24 proteins from PDB.  相似文献   

3.
4.
The main work of this paper is to propose a new theory and method, which is based on the idea of the pseudo-amino acid composition, for phylogenetic analysis of DNA primary sequences. In our method, we revise the part of the occurrence frequency of 20 amino acids in the method of the pseudo-amino acid composition by replacing the frequency of 16 dinucleotides. And we select eight LZ complexity factors of eight (0,1) sequences of a DNA primary sequence as PseAA components. Finally, we characterize a DNA sequence with a 24-dimensional vector. We reconstruct the phylogenetic trees of two datasets. The results show that our method is efficient and significant.  相似文献   

5.
Liu N  Wang T 《FEBS letters》2006,580(22):5321-5327
So far, various approaches for phylogenetic analysis have been developed. Almost all of them put stress on analyzing nucleic acid sequences or protein primary structures. In this paper, we take the physicochemical properties of amino acids into account and introduce the hydropathy profile of amino acids into phylogenetic analysis. We find that this introduction is effectual and our method may be used to complement phylogenetic analysis.  相似文献   

6.
In this, Part III of a general theory, the large-scale features of evolution of structure, order, and complexity are considered as characteristic features of the biological state of matter. This starts with a rigorous formal definition of structure, classes of structural order, complexity, measures of complexity, and how these arise through evolution by a cumulative process of storing information in memory systems. Three such memory systems have evolved: the genetic memory, the immune memory, and the memories of the nervous system. The evolution, characteristic parameters and the limitations of these memory systems are explored. From these considerations emerge the large-scale features of the evolutionary pathways of biological structure, function, and complexity.  相似文献   

7.
8.
A previously formulated procedure for the quantitative evaluation of the complexities of molecules and biostructures is applied to assess the complexities of selected genomic DNA sequences. These include: (1) Several E. coli genes, including lacI, as examples of DNA sequences which are nearly as complex as possible (relative complexity=∼1). This is verified by the Lempel-Ziv (LZ) complexity analysis. (2) The telomere of a yeast chromosome, which has a considerable number of regular features that reduce complexity; the telomere shows indeed a lower structural complexity value. (3) A segment of human DNA, gene p53, which has a certain number of regular features such as 29 interspersed alu elements; these features cause a certain reduction in the complexity of the p53 gene, but do not invalidate the (previous) overall conclusion that template complexity is very high. The close to maximal complexity of the transcribed regions of p53 is validated by the LZ compression analysis. The general conclusion is that DNA base sequence composition is the dominant factor determining cellular complexity. The high complexity of DNA arrived at is a direct consequence of the template character of DNA and reflects the role of genomic DNA as a principal regulating element of a cell. It will be a challenge to find systems of lower complexity with the ability to respond to challenges from the environment to the extent that DNA templated systems do. Cellular complexity and template directed activity are thus highly intertwined properties, at the heart of many developmental, behavioral and evolutionary processes.  相似文献   

9.
Bioassays of different complexity were compared with respect to their capability to predict the environmental impact of the herbicide atrazine in aquatic systems. Acute toxicity tests with Daphnia did not yield meaningful results. Sublethal tests with Daphnia (feeding inhibition, reduction of growth and reproduction) were more sensitive, but effective concentrations of atrazine were still rather high (2 mg/L). A relatively complicated artificial food chain system that incorporated direct and indirect effects on Daphnia yielded significant reduction of daphnid population growth at 0.1 mg/L. Enclosure experiments with natural communities were by far the most sensitive tools. Community responses could be measured at concentrations as low as 1 µg/L and 0.1 µg atrazine/L. At the lowest concentration, however, communities recovered after three weeks. We conclude that in complex systems indirect effects can be more important than direct effects, so that, contrary to the conditions in simple tests, non-target organisms may be the better indicators of herbicide stress to natural communities.  相似文献   

10.
Analysis of vegetation response to environmental gradients should take into account the spatial complexity of the environmental property itself. Whether a gradient exists on the landscape or in abstract space, the spatial variability of environmental factors often invalidates the implicit assumption that the gradient is continuous. There is a need to know how variable the spatial pattern of a gradient is and how much deviation from the general trend may be expected. Geostatistics is shown to provide a useful method for analyzing spatial variability. If the assumptions for its use can be met, the fractal dimension can be used in combination with geostatistics to provide a quantitative index of gradient complexity. An example is given, showing that an hypothesized gradient of shoreline erosion disturbance along Delaware Bay either does not exist or is so complicated by short-range, local factors that any longer-range gradient is relatively unimportant. Such complex environmental patterns are thought to be common in nature. Geostatistics, fractals, or similar spatial methods can be utilized to detect and measure such complexity.This work was conducted while the author was a research assistant at the Center for Coastal and Environmental Studies, Rutgers University, New Brunswick, N.J. The support of the Center is gratefully acknowledged.  相似文献   

11.
12.
Here we propose a weighted measure for the similarity analysis of DNA sequences. It is based on LZ complexity and (0,1) characteristic sequences of DNA sequences. This weighted measure enables biologists to extract similarity information from biological sequences according to their requirements. For example, by this weighted measure, one can obtain either the full similarity information or a similarity analysis from a given biological aspect. Moreover, the length of DNA sequence is not problematic. The application of the weighted measure to the similarity analysis of β-globin genes from nine species shows its flexibility.  相似文献   

13.
14.
Phylogenetic analyses have identified positive selection as an important driver of protein evolution, both structural and functional. However, the lack of appropriate combined functional and structural assays has generally hindered attempts to elucidate patterns of positively selected sites and their effects on enzyme activity and substrate specificity. In this study we investigated the evolutionary divergence of the glutathione S-transferase (GST) family in Pinus tabuliformis, a pine that is widely distributed from northern to central China, including cold temperate and drought-stressed regions. GSTs play important roles in plant stress tolerance and detoxification. We cloned 44 GST genes from P. tabuliformis and found that 26 of the 44 belong to the largest (Tau) class of GSTs and are differentially expressed across tissues and developmental stages. Substitution models identified five positively selected sites in the Tau GSTs. To examine the functional significance of these positively selected sites, we applied protein structural modeling and site-directed mutagenesis. We found that four of the five positively selected sites significantly affect the enzyme activity and specificity; thus their variation broadens the GST family substrate spectrum. In addition, positive selection has mainly acted on secondary substrate binding sites or sites close to (but not directly at) the primary substrate binding site; thus their variation enables the acquisition of new catalytic functions without compromising the protein primary biochemical properties. Our study sheds light on selective aspects of the functional and structural divergence of the GST family in pine and other organisms.  相似文献   

15.
张恩涛  张积家 《生物磁学》2009,(14):2728-2730
复杂语言结构和简单语言结构相比,哪一种更能促进语言能力提高,是语言心理研究的核心问题。近年来,国外出现大量语言复杂性的研究。本文对语言复杂性的理论、评估、原理及应用作了介绍,并对语言复杂性理论与教学理论的联系以及在应用中需要考虑的问题做了简单评述,旨在对语言学习和语言缺陷治疗提供帮助。  相似文献   

16.
In the postgenomic era, bioinformatic analysis of sequence similarity is an immensely powerful tool to gain insight into evolution and protein function. Over long evolutionary distances, however, sequence-based methods fail as the similarities become too low for phylogenetic analysis. Macromolecular structure generally appears better conserved than sequence, but clear models for how structure evolves over time are lacking. The exponential growth of three-dimensional structural information may allow novel structure-based methods to drastically extend the evolutionary time scales amenable to phylogenetics and functional classification of proteins. To this end, we analyzed 80 structures from the functionally diverse ferritin-like superfamily. Using evolutionary networks, we demonstrate that structural comparisons can delineate and discover groups of proteins beyond the "twilight zone" where sequence similarity does not allow evolutionary analysis, suggesting that considerable and useful evolutionary signal is preserved in three-dimensional structures.  相似文献   

17.
In the context of the study into elementary modes of metabolic networks, we prove two complexity results. Enumerating elementary modes containing a specific reaction is hard in an enumeration complexity sense. The decision problem if there exists an elementary mode containing two specific reactions is NP-complete. The complexity of enumerating all elementary modes remains open.  相似文献   

18.
Complementary DNAs encoding immunoglobulin light chains were isolated from two monotreme species, Ornithorhynchus anatinus (duckbill platypus) and Tachyglossus aculeatus (echidna). The sequences of both the variable and constant regions of these clones had greater similarity to IGK than to other light chain classes and phylogenetic analyses place them squarely within the mammalian IGK group, establishing them as monotreme IGK homologues. The constant region sequences of all clones were essentially identical within each species and, along with Southern blot results, the data are consistent with a single IGKC in each species. The expressed IGKV repertoires from both platypus and echidna were randomly sampled and there appear to be at least four platypus and at least nine echidna IGKV subgroups. The IGKV subgroups are highly divergent within species, in some cases sharing as little as 57% nucleotide identity. Two of the IGKV subgroups are present in both species, so there is some degree of overlap in the germline repertoires of these two monotremes. Overall the complexity seen in platypus and echidna IGK light chains is comparable with that of other mammals considered to have high levels of germline diversity and is in contrast to what has been found so far for monotreme IGL.Electronic Supplementary Material Supplementary material is available for this article at .  相似文献   

19.
With large amounts of experimental data, modern molecular biology needs appropriate methods to deal with biological sequences. In this work, we apply a statistical method (Pearson's chi-square test) to recognize the signals appear in the whole genome of the Escherichia coli. To show the effectiveness of the method, we compare the Pearson's chi-square test with linguistic complexity on the complete genome of E. coli. The results suggest that Pearson's chi-square test is an efficient method for distinguishing genes (coding regions) form pseudogenes (noncoding regions). On the other hand, the performance of the linguistic complexity is much lower than the chi-square test method. We also use the Pearson's chi-square test method to determine which parts of the Open Reading Frame (ORF) have significant effect on discriminating genes form pseudogenes. Moreover, different complexity measures and Pearson's chi-square test applied on the genes with high value of Pearson's chi-square statistic. We also compute the measures on homologous of these genes. The results illustrate that there is a region near the start codon with high value of chi-square statistic and low complexity that is conserve between homologous genes.  相似文献   

20.
We consider the general properties of developing systems, the approaches to their modeling, and the question of their complexity. The notion “complex system” is vague; somewhat more distinct is the complexity of the model describing a phenomenon. We propose to discuss two pertinent issues. (i) The complexity of basic models is minimal; in other words, complicated basic models are needless. (ii) Living systems are simpler than inanimate ones. Though developing systems are seen in abiotic as well as in biotic nature, the fundamental difference is that living beings are capable of goal-setting and purposeful development; hence they can be described with simpler basic models.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号