期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Conservative extraction of over-represented extensible motifs

Apostolico A Comin M Parida L 《Bioinformatics (Oxford, England)》2005,21(Z1):i9-18

相似文献

2.

Methods for discovering novel motifs in nucleic acid sequences

Staden Rodger 《Bioinformatics (Oxford, England)》1989,5(4):293-298

We describe a computer tool to aid the discovery of new motifsin nucleic acid sequences. A typical use would be to analysea set of upstream regions from a family of related genes inorder to find possible control sequences. The heart of the methodis the creation of dictionaries of related subsequences. Thesedictionaries can then be analysed to look for the commonestor best-defined subsequences, those that occur in the highestnumber of different sequences, or for those in equivalent positionswithin the family. We show the application of the method toa set of E. coli promoter sequences. Received on May 9, 1989; accepted on July 27, 1989 相似文献

3.

CABIOS REVIEW: The GenBank nucleic acid sequence database

Burks Christian; Fickett James W.; Goad Walter B.; Kanehisa Minoru; Lewitter Frances I.; Rindone Wayne P.; Swindell C. David; Tung Chang-Shung; Bilofsky Howard S. 《Bioinformatics (Oxford, England)》1985,1(4):225-233

‘The GenBank’^* nucleic acid sequence database isa computer-based collection of all published DNA and RNA sequences;it contains over five million bases in close to six thousandsequence entries drawn from four thousand five hundred publishedarticles. Each sequence is accompanied by relevant biologicalannotation. The database is available either on magnetic tape,on floppy diskettes, on-line or in hardcopy form. We discussthe structure of the database, the extent of the data and theimplications of the database for research on nucleic acids. 相似文献

4.

Visualization of nucleic acid sequence structural information 总被引：3，自引：0，他引：3

Stuber Kurt 《Bioinformatics (Oxford, England)》1985,1(1):35-42

Several interactive Pascal programs have been written for theanalysis and display of structural information in nucleic acidsequences. Layout procedures were developed to display the homologyand repeat matrices of a sequence and to predict and displaythe secondary structure of RNA/DNA molecules free of overlapand to predict and display internal repeats. No special plottingdevices are required because the output is adapted to line printers.Sequences from several DNA database systems can be used as input.These programs are part of a general nucleic acid sequence analysispackage. Received on December 9, 1984; accepted on January 11, 1985 相似文献

5.

An extra dimension in nucleic acid sequence recognition

Fox KR Brown T 《Quarterly reviews of biophysics》2005,38(4):311-320

Watson-Crick base pairing is a natural molecular recognition process that has been exploited in molecular biology and universally adopted in many fields. An additional mode of nucleic acid sequence recognition that could be used in combination with normal base pairing would add an exta dimension to nucleic acid interactions and open up many new applications. In principle the triplex approach could provide this if developed to recognize any DNA sequence. To this end modified nucleosides have been incorporated into triple-helix-forming oligonucleotides (TFOs) and used to recognize mixed sequence DNA with high selectivity and affinity at neutral pH. Continuing developments are directed towards improving TFO affinity at high pH and increasing triplex association kinetics. A number of applications of triplexes are currently being explored. 相似文献

6.

Human apolipoprotein A-II: complete nucleic acid sequence of preproapoA-II

Karl J. Lackner Simon W. Law H.Bryan Brewer 《FEBS letters》1984,175(1):159-164

The complete cDNA nucleic acid sequence of preproapolipoprotein (apo) A-II, a major protein constituent of high density lipoproteins, has been determined on clones from a human liver ds-cDNA library. Clones containing ds-cDNA for apoA-II were identified in the human liver ds-cDNA library using synthetic oligonucleotides as probes. Of 3200 clones screened, 4 reacted with the oligonucleotide probes. The DNA sequence coding for amino acids ?17 to +17 of apoA-II were determined by Maxam-Gilbert sequence analysis of restriction fragments isolated from one of these clones, pMDB2049. The remainder of the cDNA sequence was established by sequence analysis of a primer extension product synthesized utilizing a restriction fragment near the 5'-end of clone pMDB2049 as primer with total liver mRNA. The apoA-II mRNA encodes for a 100 amino acid protein, preproapoA-II that has an 18 amino acid prepeptide and a 5 amino acid propeptide terminating with a basic dipeptide (Arg-Arg) at the cleavage site to mature apoA-II. 相似文献

7.

Non-parametric statistics for nucleic acid sequence study 总被引：2，自引：0，他引：2

C Gautier M Gouy S Louail 《Biochimie》1985,67(5):449-453

The use of non-parametric statistics for nucleic acid sequence studies is illustrated by some examples. This method is highly flexible and allows design of specific tests for detecting sequence structure. Tests devoted to local repetitivity, codon nearest neighbors, and dinucleotide avoidance are discussed in detail. An appendix indicates all computations required to use these tests. 相似文献

8.

System analysis and nucleic acid sequence banks 总被引：2，自引：0，他引：2

M Gouy C Gautier F Milleret 《Biochimie》1985,67(5):433-436

The mass of published nucleic acid sequence data has required the design of several computerized data bases. We show that this activity is related to the methodology of System Analysis and that data bases are a means of modeling biological knowledge. As an example, the ACNUC data base we have created is presented. 相似文献

9.

Statistical characterization of nucleic acid sequence functional domains 总被引：6，自引：14，他引：6

T F Smith M S Waterman J R Sadler 《Nucleic acids research》1983,11(7):2205-2220

It has long been recognized that various genome classes were distinguishable on the basis of base composition and nearest neighbor frequencies. In addition Grantham et al. (8) have recently presented evidence that these distinctions are preserved at the level of codon usage. As discussed in this report it is now clear that these and related statistics can uniquely characterize the various functional domains of the genome. In particular peptide coding, intervening segments, structural RNA coding and mitochondrial domains of the vertebrate genome are uniquely characterizable. The statistical measures not only reflect understood functional differences among these domains but suggest others. The ability of these simple statistics of nucleic acid sequences to reflect so much of the encoded complex pattern information and/or effects of selective constraints is somewhat surprising. Here, we investigated the statistical measures most distinctive of the various domains and then linked them to our current understandings in so far as possible. 相似文献

10.

Improved assay-dependent searching of nucleic acid sequence databases

Gans JD Wolinsky M 《Nucleic acids research》2008,36(12):e74

Nucleic acid-based biochemical assays are crucial to modern biology. Key applications, such as detection of bacterial, viral and fungal pathogens, require detailed knowledge of assay sensitivity and specificity to obtain reliable results. Improved methods to predict assay performance are needed for exploiting the exponentially growing amount of DNA sequence data and for reducing the experimental effort required to develop robust detection assays. Toward this goal, we present an algorithm for the calculation of sequence similarity based on DNA thermodynamics. In our approach, search queries consist of one to three oligonucleotide sequences representing either a hybridization probe, a pair of Padlock probes or a pair of PCR primers with an optional TaqMantrade mark probe (i.e. in silico or 'virtual' PCR). Matches are reported if the query and target satisfy both the thermodynamics of the assay (binding at a specified hybridization temperature and/or change in free energy) and the relevant biological constraints (assay sequences binding to the correct target duplex strands in the required orientations). The sensitivity and specificity of our method is evaluated by comparing predicted to known sequence tagged sites in the human genome. Free energy is shown to be a more sensitive and specific match criterion than hybridization temperature. 相似文献

11.

Compression of nucleic acid and protein sequence data

Walker J. Richard; Willett Peter 《Bioinformatics (Oxford, England)》1986,2(2):89-93

This paper describes the application of text compression methodsto machine-readable files of nucleic acid and protein sequencedata. Two main methods are used to reduce the storage requirementsof such files, these being n-gram coding and run-length coding.A Pascal program combining both of these techniques resultedin a compression figure of 74.6% for the GenBank database anda program that used only n-gram coding gave a compression figureof 42.8% for the Protein Identification Resource database. Received on November 29, 1985; accepted on February 24, 1986 相似文献

12.

Efficient search on energy minima for structure prediction of nucleic acid motifs

Villescas-Diaz G Zacharias M 《Journal of biomolecular structure & dynamics》2004,22(3):355-364

Structure prediction of non-canonical motifs such as mismatches, extra unmatched nucleotides or internal and hairpin loop structures in nucleic acids is of great importance for understanding the function and design of nucleic acid structures. Systematic conformational analysis of such motifs typically involves the generation of many possible combinations of backbone dihedral torsion angles for a given motif and subsequent energy minimization (EM) and evaluation. Such approach is limited due to the number of dihedral angle combinations that grows very rapidly with the size of the motif. Two conformational search approaches have been developed that allow both an effective crossing of barriers during conformational searches and the computational demand grows much less with system size then search methods that explore all combinations of backbone dihedral torsion angles. In the first search protocol single torsion angles are flipped into favorable states using constraint EM and subsequent relaxation without constraints. The approach is repeated in an iterative manner along the backbone of the structural motif until no further energy improvement is obtained. In case of two test systems, a DNA-trinucleotide loop (sequence: GCA) and a RNA tetraloop (sequence: UUCG), the approach successfully identified low energy states close to experiment for two out of five start structures. In the second method randomly selected combinations of up to six backbone torsion angles are simultaneously flipped into preset ranges by a short constraint EM followed by unconstraint EM and acceptance according to a Metropolis acceptance criterion. This combined stochastic/EM search was even more effective than the single torsion flip approach and selected low energy states for the two test cases in between two and four cases out of five start structures. 相似文献

13.

Searching for amino acid sequence motifs among enzymes: the Enzyme-Reaction Database

Suyama Mikita; Ogiwara Atsushi; Nishioka Takaaki; Oda Jun'ichi 《Bioinformatics (Oxford, England)》1993,9(1):9-15

Recently we have constructed a database—the Enzyme–ReactionDatabase–which links a chemical structure to amino acidsequences of enzymes that recognize the chemical structure astheir ligand. The total number of enzymes registered in thedatabase is 1103 with 6668 NBRF–PIR entry codes and 1756chemical compounds. The chemical structures and chemical namesfor 842 compounds are registered in the Chemical–StructureDatabase on the MACCS system. For each enzyme, the sequenceswere divided into clusters, and multiply aligned in each clusterto extract a conserved sequence. A total of 158 781 five–residue–longfragments were constructed from 433 conserved sequences andcompared among different clusters of different enzymes. Oneof these motifs shared by different enzymes S–G–G–L–D.The motif was conserved in both argininosuccinate synthase (EC6.3.4.5[EC]) and asparagine synthase (glutamine–hydrolysing)(EC 6.3.5.4[EC]). This result showed that the database was usefulfor the analysis of the relationship between chemical structuresand amino acid sequence motifs. 相似文献

14.

Evolutionary relationship between luteoviruses and other RNA plant viruses based on sequence motifs in their putative RNA polymerases and nucleic acid helicases 总被引：13，自引：2，他引：13

下载免费PDF全文

N Habili R H Symons 《Nucleic acids research》1989,17(23):9543-9555

Comparative studies of sequence motifs in the RNA polymerases and nucleic acid helicases of positive-sense RNA plant viruses have provided a new scheme for the classification of these pathogens. We propose a new luteovirus supergroup which should be added to the already described Sindbisvirus-like and picornavirus-like supergroups. Sequence motifs of nucleic acid helicases and RNA polymerases which previously were considered to be specific for each of the two supergroups now occur together within this new supergroup. We propose that this new viral supergroup provides an evolutionary link between the other two supergroups. 相似文献

15.

All motifs are NOT created equal: structural properties of transcription factor-DNA interactions and the inference of sequence specificity

Michael B Eisen 《Genome biology》2005,6(5):P7

相似文献

16.

Retrieval and analysis of nucleic acid and protein sequence

《Journal of molecular biology》1985,183(4):639

相似文献

17.

iTriplet, a rule-based nucleic acid sequence motif finder

Eric S Ho Christopher D Jakubowski Samuel I Gunderson 《Algorithms for molecular biology : AMB》2009,4(1):14-14

Background

With the advent of high throughput sequencing techniques, large amounts of sequencing data are readily available for analysis. Natural biological signals are intrinsically highly variable making their complete identification a computationally challenging problem. Many attempts in using statistical or combinatorial approaches have been made with great success in the past. However, identifying highly degenerate and long (>20 nucleotides) motifs still remains an unmet challenge as high degeneracy will diminish statistical significance of biological signals and increasing motif size will cause combinatorial explosion. In this report, we present a novel rule-based method that is focused on finding degenerate and long motifs. Our proposed method, named iTriplet, avoids costly enumeration present in existing combinatorial methods and is amenable to parallel processing. 相似文献

18.

98 Improving nucleic acid design using biased sequence initialization

Mohammad Kayedkhordeh Stanislav Bellaousov 《Journal of biomolecular structure & dynamics》2015,33(5):62-63

相似文献

19.

Clustal W—蛋白质与核酸序列分析软件 总被引：2，自引：1，他引：2

郭崇志孙曼霁《生物技术通讯》2000,11(2):146-149

蛋白质与核酸的序列分析在现代生物学和生物信息学中发挥着重要作用,新的算法和软件层出不穷,本文介绍一个可运行在ＰＣ机上的完全免费的多序列比较软件－ＣｌｕｓｔａｌＷ,它不但可以进行蛋白质与核酸的多序列比较,分析不同序列之间的相似性关系,还可以绘制进化树。由于其灵活的输入输出格式、方便的参数设定和选择、详尽的在线帮助以及良好的可移植性,使得ＣｌｕｓｔａｌＷ在蛋白质与核酸的序列分析中得到了广泛应用。相似文献

20.

MouseV k gene classification by nucleic acid sequence similarity 总被引：3，自引：0，他引：3

Robert Strohal Arno Helmberg Guido Kroemer Reinhard Kofler 《Immunogenetics》1989,30(6):475-493

Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates ofV gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i. e., sequence similarity) and their organization in gene families. While mouseIgh heavy chainV region (V _H) gene families are relatively well-established, a corresponding systematic classification ofIgk light chainV region (V _k) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of theV _k germline gene repertoire andV _k gene usage in a variety of responses to foreign and self antigens, provides a classification of mouseV _k genes in gene families composed of members with >80% overall nucleic acid sequence similarity. This classification differed in several aspects from that ofV _H genes: only someV _k gene families were as clearly separated (by >25% sequence dissimilarity) as typicalV _H gene families; mostV _k gene families were closely related and, in several instances, members from different families were very similar (>80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications forV _k gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization betweenV _H andV _k genes. 相似文献