期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Complexities of human promoter sequences

Zhao F Yang H Wang B 《Journal of theoretical biology》2007,247(4):645-649

By means of the diffusion entropy approach, we detect the scale-invariance characteristics embedded in the 4737 human promoter sequences. The exponent for the scale-invariance is in a wide range of [0.3,0.9], which centered at delta(c)=0.66. The distribution of the exponent can be separated into left and right branches with respect to the maximum. The left and right branches are asymmetric and can be fitted exactly with Gaussian form with different widths, respectively. 相似文献

2.

An approach to identify over-represented cis-elements in related sequences 总被引：4，自引：0，他引：4

下载免费PDF全文

Zheng J Wu J Sun Z 《Nucleic acids research》2003,31(7):1995-2005

相似文献

3.

Statistical analysis of nucleotide sequences. 总被引：1，自引：4，他引：1

下载免费PDF全文

E E Stückle C Emmrich U Grob P J Nielsen 《Nucleic acids research》1990,18(22):6641-6647

In order to scan nucleic acid databases for potentially relevant but as yet unknown signals, we have developed an improved statistical model for pattern analysis of nucleic acid sequences by modifying previous methods based on Markov chains. We demonstrate the importance of selecting the appropriate parameters in order for the method to function at all. The model allows the simultaneous analysis of several short sequences with unequal base frequencies and Markov order k not equal to 0 as is usually the case in databases. As a test of these modifications, we show that in E. coli sequences there is a bias against palindromic hexamers which correspond to known restriction enzyme recognition sites. 相似文献

4.

Nonlinear modelling approach to human promoter sequences

Yang H Zhao F Gu J Wang B 《Journal of theoretical biology》2006,241(4):765-773

相似文献

5.

Statistical analysis of DNA sequences. I

M. Y. Azbel Y. Kantor L. Verkh A. Vilenkin 《Biopolymers》1982,21(8):1687-1690

相似文献

6.

Statistical analysis of DNA sequences. II

Alexander Vilenkin Lev Verkh 《Biopolymers》1982,21(8):1691-1693

相似文献

7.

Predicting promoter activities of primary human DNA sequences

Irie T Park SJ Yamashita R Seki M Yada T Sugano S Nakai K Suzuki Y 《Nucleic acids research》2011,39(11):e75

相似文献

8.

Discriminant analysis of promoter regions in Escherichia coli sequences 总被引：2，自引：0，他引：2

Nakata Kotoko; Kanehisa Minoru; Maizel Jacob V. Jr 《Bioinformatics (Oxford, England)》1988,4(3):367-371

We have previously developed a general method based on the statisticaltechnique of discriminant analysis to predict splice junctionsin eukaryotic mRNA sequences [Nakata, K., Kanehisa, M. and DeLisi,C. (1985) Nucleic Acids Res., 13, 5327–5340]. In orderto evaluate further applicability of this method, we now analyzethe promoter region of Escherichia coli sequences. The attributesused for discrimination include the accuracy of consensus sequencepatterns measured by the perceptron algorithm, the thermal stabilitymap, the base composition and the Calladine-Dickerson rulesfor helical twist angle, roll angle, torsion angle and propellertwist angle. When applied to selected E. coli sequences in theGenBank database, the method correctly identifies 75 % of thetrue promoter regions. Received on May 15, 1987; accepted on April 17, 1988 相似文献

9.

Comparative analysis of human, bovine, and murine Oct-4 upstream promoter sequences 总被引：4，自引：0，他引：4

Verena Nordhoff Karin Hübner Andrea Bauer Irina Orlova Areti Malapetsa Hans R. Schöler 《Mammalian genome》2001,12(4):309-317

相似文献

10.

Statistical analysis of DNA sequences nearby splicing sites

Korzinov OM Astakhova TV Vlasov PK Roĭtberg MA 《Molekuliarnaia biologiia》2008,42(1):150-162

Recognition of coding regions within eukaryotic genomes is one of oldest but yet not solved problems of bioinformatics. New high-accuracy methods of splicing sites recognition are needed to solve this problem. A question of current interest is to identify specific features of nucleotide sequences nearby splicing sites and recognize sites in sequence context. We performed a statistical analysis of human genes fragment database and revealed some characteristics of nucleotide sequences in splicing sites neighborhood. Frequencies of all nucleotides and dinucleotides in splicing sites environment were computed and nucleotides and dinucleotides with extremely high\low occurrences were identified. Statistical information obtained in this work can be used in further development of the methods of splicing sites annotation and exon-intron structure recognition. 相似文献

11.

Statistical analysis of DNA sequences containing nucleosome positioning sites

Yu. L. Orlov V. G. Levitskii O. G. Smirnova O. A. Podkolodnaya T. M. Khlebodarova N. A. Kolchanov 《Biophysics》2006,51(4):541-546

相似文献

12.

Steroid-responsive sequences in the human glucocorticoid receptor gene 1A promoter 总被引：3，自引：0，他引：3

Geng CD Vedeckis WV 《Molecular endocrinology (Baltimore, Md.)》2004,18(4):912-924

相似文献

13.

Statistical analysis of DNA sequences in the neighborhood of splice sites

O. M. Korzinov T. V. Astakhova P. K. Vlasov M. A. Roytberg 《Molecular Biology》2008,42(1):133-145

Prediction of gene sequences and their exon-intron structure in large eukaryotic genomic sequences is one of the central problems of mathematical biology. Solving this problem involves, in particular, high-accuracy splice site recognition. Using statistical analysis of a splice site-containing human gene fragment database, some characteristic features were described for nucleotide sequences in the splicing site neighborhood, the frequencies of all nucleotides and dinucleotides were determined, and those with frequencies increased or decreased in comparison to a random sequence were identified. The results can be used in sequence annotation, splicing site prediction, and the recognition of the gene exon-intron structure. 相似文献

14.

Statistical analysis of nucleotide runs in coding and noncoding DNA sequences

A A Sprizhitsky YuANechipurenko YuDAlexandrov M V Volkenstein 《Journal of biomolecular structure & dynamics》1988,6(2):345-358

A statistical analysis of the occurrence of particular nucleotide runs in DNA sequences of different species has been carried out. There are considerable differences of run distributions in DNA sequences of procaryotes, invertebrates and vertebrates. There is an abundance of short runs (1-2 nucleotides long) in the coding sequences and there is a deficiency of such runs in the noncoding regions. However, some interesting exceptions from this rule exist for the run distribution of adenine in procaryotes and for the arrangement of purine-pyrimidine runs in eucaryotes. The similarity in the distributions of such runs in the coding and noncoding regions may be due to some structural features of the DNA molecule as a whole. Runs of guanine (or cytosine) of three to six nucleotides occur predominantly in noncoding DNA regions in eucaryotes, especially in vertebrates. 相似文献

15.

Curved DNA in promoter sequences

Gabrielian AE Landsman D Bolshoy A 《In silico biology》1999,1(4):183-196

相似文献

16.

Sequence-dependent flexibility in promoter sequences 总被引：7，自引：0，他引：7

Tsai L Luo L Sun Z 《Journal of biomolecular structure & dynamics》2002,20(1):127-134

The non-neighbor interactions between base-pairs were taken into account to calculate the angular parameters (Omega, rho and tau) describing the orientation of successive base-pair planes and the translation parameters (D(y)) along the long axis of base-pair steps for 36 independent tetramers. A statistical mechanical model was proposed to predict the DNA flexibility that is mainly related to the thermal fluctuations at individual base-pair steps. The DNA flexibility can be described by the root-mean-square deviation of the end-to-end distance of DNA helical structure. The present model was then used to investigate the extreme flexible pattern in prokaryotic and eukaryotic promoter sequences. The results demonstrated several extreme flexible regions related to functionally important elements exist both in prokaryotic promoters and in eukaryotic promoters, DNA flexibility and AT content are highly correlated. The probabilities finding flexibility pattern in promoter sequences were also estimated statistically. The biological implications were discussed briefly. 相似文献

17.

Intrinsic promoter activities of primary DNA sequences in the human genome.

Yuta Sakakibara Takuma Irie Yutaka Suzuki Riu Yamashita Hiroyuki Wakaguri Akinori Kanai Joe Chiba Toshihisa Takagi Junko Mizushima-Sugano Shin-ichi Hashimoto Kenta Nakai Sumio Sugano 《DNA research》2007,14(2):71-77

相似文献

18.

Compilation and analysis of eukaryotic POL II promoter sequences. 总被引：52，自引：20，他引：32

下载免费PDF全文

P Bucher E N Trifonov 《Nucleic acids research》1986,14(24):10009-10026

相似文献

19.

Compilation and analysis of Escherichia coli promoter DNA sequences. 总被引：472，自引：130，他引：472

下载免费PDF全文

D K Hawley W R McClure 《Nucleic acids research》1983,11(8):2237-2255

The DNA sequence of 168 promoter regions (-50 to +10) for Escherichia coli RNA polymerase were compiled. The complete listing was divided into two groups depending upon whether or not the promoter had been defined by genetic (promoter mutations) or biochemical (5' end determination) criteria. A consensus promoter sequence based on homologies among 112 well-defined promoters was determined that was in substantial agreement with previous compilations. In addition, we have tabulated 98 promoter mutations. Nearly all of the altered base pairs in the mutants conform to the following general rule: down-mutations decrease homology and up-mutations increase homology to the consensus sequence. 相似文献

20.

Efficient computation of absent words in genomic sequences

Julia Herold Stefan Kurtz Robert Giegerich 《BMC bioinformatics》2008,9(1):167

Background

Analysis of sequence composition is a routine task in genome research. Organisms are characterized by their base composition, dinucleotide relative abundance, codon usage, and so on. Unique subsequences are markers of special interest in genome comparison, expression profiling, and genetic engineering. Relative to a random sequence of the same length, unique subsequences are overrepresented in real genomes. Shortest words absent from a genome have been addressed in two recent studies. 相似文献