Similar articles
 Found 20 similar articles (search time: 15 ms)
1.
2.
3.
4.
5.
6.
7.
On the complexity measures of genetic sequences   Cited by: 7 (self-citations: 0, citations by others: 7)
MOTIVATION: It is well known that the regulatory regions of genomes are highly repetitive. They are rich in direct, symmetric and complemented repeats, and there is no doubt about the functional significance of these repeats. Among known measures of complexity, the Ziv-Lempel complexity measure most adequately reflects the repeats occurring in a text. However, this measure does not take isomorphic repeats into account. By isomorphic repeats we mean fragments that are identical (or symmetric) modulo some permutation of the alphabet letters. RESULTS: In this paper, two complexity measures of symbolic sequences are proposed that generalize the Ziv-Lempel complexity measure by taking into account any isomorphic repeats in the text (rather than just direct repeats as in Ziv-Lempel). The first of them, the complexity vector, is designed for small alphabets such as the alphabet of nucleotides. The second is based on a search for the longest isomorphic fragment in the history of sequence synthesis and can be used for alphabets of arbitrary cardinality. These measures have been used for recognition of structural regularities in DNA sequences. Some interesting structures related to the regulatory region of the human growth hormone are reported.
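For orientation, the plain Ziv-Lempel measure that this abstract takes as its starting point can be sketched as below. This is a minimal illustration that counts direct repeats only. It does not implement the complexity vector or the isomorphic-repeat search proposed in the paper. The function name `lz76_complexity` is hypothetical, and the parsing rule is the standard "exhaustive history" one, which is assumed here rather than taken from the abstract.

```python
def lz76_complexity(seq: str) -> int:
    """Count the components of the Ziv-Lempel (1976) exhaustive history.

    Each component is the shortest substring starting at the current
    position that cannot be copied from the text seen so far
    (overlapping copies are allowed). Direct repeats only.
    """
    n = len(seq)
    pos = 0          # start of the component being built
    components = 0
    while pos < n:
        length = 1
        # Extend while seq[pos:pos+length] already occurs in the prefix
        # that ends one symbol before the candidate's last symbol.
        while pos + length <= n and seq[pos:pos + length] in seq[:pos + length - 1]:
            length += 1
        components += 1
        pos += length
    return components


if __name__ == "__main__":
    for s in ("AAAA", "ACGT", "0001101001000101"):
        print(s, lz76_complexity(s))
```

A highly repetitive string such as `AAAA` parses into 2 components, while `ACGT` needs 4, so lower counts indicate more repetition. The substring search makes this sketch quadratic or worse, which is acceptable only for short illustrative sequences.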

8.
9.
10.
Progress in understanding the patterns of evolution of ontogeny is hindered by the fact that many features of ontogeny are counterintuitive (as are the features of other processes related to self-organization, self-assembly, and spontaneous increase in complexity). The basic principle of ontogeny of multicellular organisms is that it is a process of self-assembly of ordered multicellular structures by means of the coordinated behavior of many individual modules (cells), each of which follows the same set of "rules" encoded in the genome. These rules are based on genetic regulatory networks. We hypothesize that many specific features of ontogeny that seem nontrivial or enigmatic are, in fact, the inevitable consequences of this basic principle. If so, they do not need special explanations. In order to verify this hypothesis, we developed the computer program "Evo-Devo" based on the above principle. The program is designed to model the self-assembly of ordered multicellular structures from an aggregation of dividing cells that originate from a single original cell (zygote). Each cell follows a set of rules of behavior ("genotype") that can be specified arbitrarily by the experimenter and is the same for all cells in the embryo (each cell is programmed in exactly the same way as all other cells). Rules cannot be specified for groups of cells or for the whole embryo: only local rules that are followed at the level of a single cell are possible. The analysis of the phenotypic implementation of different genotypes revealed several features which are present in the ontogeny of real organisms and are regularly reproduced in the model. These include: inherent stochasticity; the inescapable necessity of developing stabilizing adaptations based on negative feedback in order to decrease this stochasticity; equifinality (noise resistance) resulting from these adaptations; the ability of ontogeny to respond to major perturbations by generating new morphological structures that differ from the "normal" ones but have a similar level of complexity; the similarity of phenotypic manifestations of different mutations; channeling of possible evolutionary transformations of ontogeny; Waddington's creodes; a high probability of destabilization of ontogeny (e.g., because of mutations); the possibility of a new morphological character appearing initially as a rare anomaly (low penetrance of many mutations); pleiotropy of mutations affecting ontogeny; spontaneous emergence of morphogenetic correlations; and integrity of the developing organism. The fact that these features are regularly reproduced in the model implies that they are probably the inevitable consequences of the basic principle of ontogeny of multicellular organisms formulated above.

11.
Despite its potential role in the evolution of complex phenotypes, the detection of negative (purifying) and positive selection on noncoding regulatory sequence has been elusive because of the inherent difficulty in predicting the functional consequences of mutations in noncoding sequence. Because the functioning of regulatory sequence depends upon both chromatin configuration and cis-regulatory factor binding, we investigate the idea that the functional conservation of regulatory regions should be associated with the conservation of sequence-dependent bending properties of DNA that determine its affinity for the nucleosome. Recent advances in the computational prediction of sequence-dependent affinity to nucleosomes provide an opportunity to distinguish between neutral and nonneutral evolution of fine-scale chromatin organization. Here, a statistical test is presented for detecting evolutionary conservation and/or adaptive evolution of nucleosome affinity from interspecies comparisons of DNA sequences. Local nucleosome affinities of homologous sequences were calculated using 2 recently published methods. A randomization test was applied to sites of mutation to evaluate the similarity of DNA-nucleosome affinity between several closely related species of Saccharomyces yeast. For most of the genes we analyzed, the conservation of local nucleosome affinity was detected at a few distinct locations in the upstream noncoding region. Our results also demonstrate that different patterns of chromatin evolution have shaped DNA-nucleosome interaction at the core promoters of TATA-containing and TATA-less genes and that elevated purifying selection has maintained low affinity for the nucleosome in the core promoters of the latter group. Across the entire yeast genome, DNA-nucleosome interaction was also found to be significantly more conserved in TATA-less genes than in TATA-containing genes.
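The general shape of a randomization test over mutation sites can be sketched as follows. This is only an illustration of the idea, not the procedure used in the study. The scoring function `toy_affinity` is a hypothetical placeholder (GC fraction) standing in for a real nucleosome-affinity predictor, and the function names, the test statistic and the one-sided direction are all assumptions.

```python
import random


def toy_affinity(seq: str) -> float:
    """Placeholder score (GC fraction). NOT a real nucleosome-affinity model."""
    return sum(base in "GC" for base in seq) / len(seq)


def affinity_shift(ref: str, alt: str) -> float:
    """Absolute change in the placeholder score between two aligned sequences."""
    return abs(toy_affinity(ref) - toy_affinity(alt))


def randomization_p_value(ref: str, alt: str, n_iter: int = 1000, seed: int = 0) -> float:
    """Fraction of random substitution sets that shift the score no more
    than the observed substitutions do (small value: score is conserved)."""
    if len(ref) != len(alt) or not ref:
        raise ValueError("sequences must be aligned, equal-length and non-empty")
    rng = random.Random(seed)
    n_sites = sum(a != b for a, b in zip(ref, alt))
    observed = affinity_shift(ref, alt)
    hits = 0
    for _ in range(n_iter):
        simulated = list(ref)
        # Place the same number of substitutions at random positions.
        for pos in rng.sample(range(len(ref)), n_sites):
            simulated[pos] = rng.choice([b for b in "ACGT" if b != ref[pos]])
        if affinity_shift(ref, "".join(simulated)) <= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_iter + 1)


if __name__ == "__main__":
    print(randomization_p_value("ACGTACGTACGT", "ACGAACGTACTT"))
```

The key design point is that the null distribution is built by relocating the observed number of substitutions, so the comparison controls for how many changes occurred and asks only whether their placement preserved the score.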

12.
13.
14.
Neurofibromatosis type 1 (NF1), a neuroectodermal disorder, is caused by germline mutations in the NF1 gene. NF1 affects approximately 1/3,000 individuals worldwide, with about 50% of cases representing de novo mutations. Although the NF1 gene was identified in 1990, the underlying gene mutations still remain undetected in a small but obdurate minority of NF1 patients. We postulated that in these patients, hitherto undetected pathogenic mutations might occur in regulatory elements far upstream of the NF1 gene. In an attempt to identify such remotely acting regulatory elements, we reasoned that some of them might reside within DNA sequences that (1) have the potential to interact at a distance with the NF1 gene and (2) lie within a histone H3K27ac-enriched region, a characteristic of active enhancers. Combining Hi-C data, obtained by means of the chromosome conformation capture technique, with data on the location and level of histone H3K27ac enrichment upstream of the NF1 gene, we predicted in silico the presence of two remotely acting regulatory regions, located approximately 600 kb and approximately 42 kb upstream of the NF1 gene, respectively. These regions were then sequenced in 47 NF1 patients in whom no mutations had been found in either the NF1 or SPRED1 gene regions. Five patients were found to harbour DNA sequence variants in the distal H3K27ac-enriched region. Although these variants are of uncertain pathological significance and remain to be functionally characterized, this approach promises to be of general utility for the detection of mutations underlying other inherited disorders that may be caused by mutations in remotely acting regulatory elements.

15.
16.
17.
Summary: Pelham has shown that the Drosophila hsp 70 gene is not transcribed under heat shock conditions unless a given upstream region is present. Davidson et al. have recently compiled a list of sequences homologous to this region in other Drosophila heat shock genes. They proposed that a set of unlinked genes, such as the heat shock genes, could be coordinately induced through an interaction in cis with a common regulatory molecule. That this interaction involves structural elements is suggested by the fact that these upstream regions share inverted repeats as well as areas of Z-DNA potential. Furthermore, using the Calladine-Dickerson rules for local helical parameters, we show that these regions share structural homology. This is significant because the presence of regions homologous to a derived consensus sequence does not necessarily imply structural similarity. Therefore, we suggest that these structural features are at least as important as the sequence homologies in enabling the heat shock response.

18.
D Harrich, J Garcia, R Mitsuyasu, R Gaynor. The EMBO Journal, 1990, 9(13): 4417-4423
Multiple regulatory elements in the human immunodeficiency virus long terminal repeat (HIV LTR) are required for activation of HIV gene expression. Previous transfection studies of HIV LTR constructs linked to the chloramphenicol acetyltransferase gene indicated that multiple regulatory regions including the enhancer, SP1, TATA and TAR regions were important for HIV gene expression. To characterize these regulatory elements further, mutations in these regions were inserted into both the 5' and 3' HIV LTRs and infectious proviral constructs were assembled. These constructs were transfected into HeLa, Jurkat or U937 cells in both the presence and absence of phorbol esters, which have previously been demonstrated to activate HIV gene expression. Viral gene expression was assayed by the level of p24 gag protein released from cultures transfected with the proviral constructs. Results in all cell lines indicated that mutations of the SP1, TATA and the TAR loop and stem secondary structure resulted in marked decreases in gene expression, while mutations of the enhancer motif or TAR primary sequence resulted in only slight decreases. However, viruses containing mutations in either the TAR loop sequences or stem secondary structure, which were very defective for gene expression in untreated Jurkat cells, gave nearly wild-type levels of gene expression in phorbol ester-treated Jurkat cells but not in phorbol ester-treated HeLa or U937 cells. High level gene expression of these TAR mutant constructs in phorbol ester-treated Jurkat cells was eliminated by second site mutations in the enhancer region or by disruption of the tat gene. (ABSTRACT TRUNCATED AT 250 WORDS)

19.
20.
The DNA sequence of the secA gene, essential for protein export in Escherichia coli, was determined and found to encode a hydrophilic protein of 901 amino acid residues with a predicted molecular weight of 101,902, consistent with its previously determined size and subcellular location. Sequence analysis of 9 secA(Ts) mutations conferring general protein export and secA regulatory defects revealed that these mutations were clustered in three specific regions within the first 170 amino acid residues of the SecA protein and were the result of single amino acid changes predicted to be severely disruptive of protein structure and function. The DNA sequence immediately upstream of secA was shown to encode a previously inferred gene, gene X. Sequence analysis of a conditionally lethal amber mutation, am109, previously inferred to be located proximally in the secA gene, revealed that it was located distally in gene X and was conditionally lethal due to its polar effect on secA expression. This and additional evidence are presented indicating that gene X and secA are cotranscribed.
