1.

AutoFACT: An Automatic Functional Annotation and Classification Tool

Liisa B Koski Michael W Gray B Franz Lang Gertraud Burger 《BMC bioinformatics》2005,6(1):151

Background

Assignment of function to new molecular sequence data is an essential step in genomics projects. The usual process involves similarity searches of a given sequence against one or more databases, an arduous process for large datasets.

2.

Fast index based algorithms and software for matching position specific scoring matrices

Michael Beckstette Robert Homann Robert Giegerich Stefan Kurtz 《BMC bioinformatics》2006,7(1):389-25

Background

In biological sequence analysis, position specific scoring matrices (PSSMs) are widely used to represent sequence motifs in nucleotide as well as amino acid sequences. Searching with PSSMs in complete genomes or large sequence databases is a common, but computationally expensive task.
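As a rough illustration of why PSSM searching is expensive, the sketch below builds a log-odds matrix from a handful of aligned sites and slides it over a sequence, scoring every window. This is the naive scan, not the index-based method of the paper; the function names `build_pssm` and `scan`, the uniform background, and the pseudocount of 1 are all assumptions made for this example.

```python
import math


def build_pssm(sites, alphabet="ACGT", pseudocount=1.0):
    """Build a log-odds PSSM (one dict per motif position) from aligned sites.

    Assumes equal-length sites and a uniform background distribution.
    """
    background = 1.0 / len(alphabet)
    total = len(sites) + pseudocount * len(alphabet)
    pssm = []
    for pos in range(len(sites[0])):
        column = [site[pos] for site in sites]
        pssm.append({
            c: math.log2(((column.count(c) + pseudocount) / total) / background)
            for c in alphabet
        })
    return pssm


def scan(pssm, sequence, threshold):
    """Score every window of the sequence; return (start, score) for hits.

    Runs in O(len(sequence) * len(pssm)) time, which is the cost that
    faster search methods try to avoid. The sequence must contain only
    characters from the PSSM alphabet.
    """
    m = len(pssm)
    hits = []
    for start in range(len(sequence) - m + 1):
        score = sum(pssm[k][sequence[start + k]] for k in range(m))
        if score >= threshold:
            hits.append((start, score))
    return hits


# Hypothetical toy motif and sequence, for illustration only.
pssm = build_pssm(["TATA", "TATA", "TATT"])
hits = scan(pssm, "GGTATAGG", threshold=3.0)
```

Every window is scored in full here, so the running time grows with the product of sequence length and motif length.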

3.

High throughput profile-profile based fold recognition for the entire human proteome

Liam J McGuffin Richard T Smith Kevin Bryson Søren-Aksel Sørensen David T Jones 《BMC bioinformatics》2006,7(1):288

Background

In order to maintain the most comprehensive structural annotation databases we must carry out regular updates for each proteome using the latest profile-profile fold recognition methods. The ability to carry out these updates on demand is necessary to keep pace with the regular updates of sequence and structure databases. Providing the highest quality structural models requires the most intensive profile-profile fold recognition methods running with the very latest available sequence databases and fold libraries. However, running these methods on such a regular basis for every sequenced proteome requires large amounts of processing power.

4.

ProFAT: a web-based tool for the functional annotation of protein sequences

Charles Richard Bradshaw Vineeth Surendranath Bianca Habermann 《BMC bioinformatics》2006,7(1):466-16

Background

The functional annotation of proteins relies on published information concerning their close and remote homologues in sequence databases. Evidence for remote sequence similarity can be further strengthened by a similar biological background of the query sequence and identified database sequences. However, few tools exist so far that provide a means to include functional information in sequence database searches.

5.

Hidden Markov model speed heuristic and iterative HMM search procedure (total citations: 1; self-citations: 0; citations by others: 1)

L Steven Johnson Sean R Eddy Elon Portugaly 《BMC bioinformatics》2010,11(1):431

Background

Profile hidden Markov models (profile-HMMs) are sensitive tools for remote protein homology detection, but the main scoring algorithms, Viterbi or Forward, require considerable time to search large sequence databases.

6.

CoreGenes: A computational tool for identifying and cataloging "core" genes in a set of small genomes

Nikhat Zafar Raja Mazumder Donald Seto 《BMC bioinformatics》2002,3(1):12-8

Background

Improvements in DNA sequencing technology and methodology have led to the rapid expansion of databases comprising DNA sequence, gene and genome data. Lower operational costs and heightened interest resulting from initial intriguing novel discoveries from genomics are also contributing to the accumulation of these data sets. A major challenge is to analyze and to mine data from these databases, especially whole genomes. There is a need for computational tools that look globally at genomes for data mining.

7.

SciDBMaker: new software for computer-aided design of specialized biological databases

Riadh Hammami Abdelmajid Zouhir Karim Naghmouchi Jeannette Ben Hamida Ismail Fliss 《BMC bioinformatics》2008,9(1):121

Background

The exponential growth of research in molecular biology has brought concomitant proliferation of databases for stocking its findings. A variety of protein sequence databases exist. While all of these strive for completeness, the range of user interests is often beyond their scope. Large databases covering a broad range of domains tend to offer less detailed information than smaller, more specialized resources, often creating a need to combine data from many sources in order to obtain a complete picture. Scientific researchers are continually developing new specific databases to enhance their understanding of biological processes.

8.

CoSMoS: Conserved Sequence Motif Search in the proteome

Xiao I Liu Neeraj Korde Ursula Jakob Lars I Leichert 《BMC bioinformatics》2006,7(1):37-6

Background

With the ever-increasing number of gene sequences in the public databases, generating and analyzing multiple sequence alignments becomes increasingly time consuming. Nevertheless it is a task performed on a regular basis by researchers in many labs.

9.

SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters

Chunlin Wang Elliot J Lefkowitz 《BMC bioinformatics》2004,5(1):171

Background

Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST) or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary.

10.

FastGroupII: A web-based bioinformatics platform for analyses of large 16S rDNA libraries

Yanan Yu Mya Breitbart Pat McNairnie Forest Rohwer 《BMC bioinformatics》2006,7(1):57

Background

High-throughput sequencing makes it possible to rapidly obtain thousands of 16S rDNA sequences from environmental samples. Bioinformatic tools for the analyses of large 16S rDNA sequence databases are needed to comprehensively describe and compare these datasets.

11.

160-fold acceleration of the Smith-Waterman algorithm using a field programmable gate array (FPGA)

Isaac TS Li Warren Shum Kevin Truong 《BMC bioinformatics》2007,8(1):185

Background

To infer homology and subsequently gene function, the Smith-Waterman (SW) algorithm is used to find the optimal local alignment between two sequences. When searching sequence databases that may contain hundreds of millions of sequences, this algorithm becomes computationally expensive.
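For readers unfamiliar with the algorithm, a minimal software sketch of Smith-Waterman local alignment follows. It uses a linear gap penalty and illustrative scoring values (match 2, mismatch -1, gap -2), all of which are assumptions for this example; it is a plain reference version, not the FPGA implementation the article describes.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Return (best_score, aligned_a, aligned_b) for one optimal local alignment.

    Fills an (len(a)+1) x (len(b)+1) score matrix, so time and memory are
    both O(len(a) * len(b)); this quadratic cost per sequence pair is what
    makes large database searches expensive.
    """
    rows, cols = len(a) + 1, len(b) + 1
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    # Fill step: each cell is the best of restarting (0), a diagonal
    # match/mismatch, or a gap in either sequence.
    for i in range(1, rows):
        for j in range(1, cols):
            diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            H[i][j] = max(0, diag, H[i - 1][j] + gap, H[i][j - 1] + gap)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Traceback step: walk back from the highest-scoring cell until a 0.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and H[i][j] > 0:
        diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
        if H[i][j] == diag:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


# Toy input: the shared segment "ACG" is the best local alignment.
result = smith_waterman("TTACGTTT", "GGACGGG")
```

Because every query-database pair needs its own score matrix, the total work scales with the combined length of the database, which motivates hardware acceleration.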

12.

Identification and expression analysis of microRNAs and targets in the biofuel crop sugarcane

Almir S Zanca Renato Vicentini Fausto A Ortiz-Morea Luiz EV Del Bem Marcio J da Silva Michel Vincentz Fabio TS Nogueira 《BMC plant biology》2010,10(1):260

Background

MicroRNAs (miRNAs) are small regulatory RNAs, some of which are conserved in diverse plant genomes. Therefore, computational identification and further experimental validation of miRNAs from non-model organisms is both feasible and instrumental for addressing miRNA-based gene regulation and evolution. Sugarcane (Saccharum spp.) is an important biofuel crop with publicly available expressed sequence tag and genomic survey sequence databases, but little is known about miRNAs and their targets in this highly polyploid species.

13.

mlstdbNet – distributed multi-locus sequence typing (MLST) databases

Keith A Jolley Man-Suen Chan Martin CJ Maiden 《BMC bioinformatics》2004,5(1):86

Background

Multi-locus sequence typing (MLST) is a method of typing that facilitates the discrimination of microbial isolates by comparing the sequences of housekeeping gene fragments. The mlstdbNet software enables the implementation of distributed web-accessible MLST databases that can be linked widely over the Internet.

14.

Compressing DNA sequence databases with coil

W Timothy J White Michael D Hendy 《BMC bioinformatics》2008,9(1):242

Background

Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip) compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil.

15.

The Hawaiian Algal Database: a laboratory LIMS and online resource for biodiversity data

Norman Wang Alison R Sherwood Akira Kurihara Kimberly Y Conklin Thomas Sauvage Gernot G Presting 《BMC plant biology》2009,9(1):117-5

Background

Organization and presentation of biodiversity data is greatly facilitated by databases that are specially designed to allow easy data entry and organized data display. Such databases also have the capacity to serve as Laboratory Information Management Systems (LIMS). The Hawaiian Algal Database was designed to showcase specimens collected from the Hawaiian Archipelago, enabling users around the world to compare their specimens with our photographs and DNA sequence data, and to provide lab personnel with an organizational tool for storing various biodiversity data types.

16.

Virus variation resources at the National Center for Biotechnology Information: dengue virus

Wolfgang Resch Leonid Zaslavsky Boris Kiryutin Michael Rozanov Yiming Bao Tatiana A Tatusova 《BMC microbiology》2009,9(1):65-7

Background

There is an increasing number of complete and incomplete virus genome sequences available in public databases. This large body of sequence data harbors information about epidemiology, phylogeny, and virulence. Several specialized databases, such as the NCBI Influenza Virus Resource or the Los Alamos HIV database, offer sophisticated query interfaces along with integrated exploratory data analysis tools for individual virus species to facilitate extracting this information. Thus far, there has not been a comprehensive database for dengue virus, a significant public health threat.

17.

G-InforBIO: integrated system for microbial genomics

Naoto Tanaka Takashi Abe Satoru Miyazaki Hideaki Sugawara 《BMC bioinformatics》2006,7(1):368-7

Background

Genome databases contain diverse kinds of information, including gene annotations and nucleotide and amino acid sequences. It is not easy to integrate such information for genomic study. There are few tools for integrated analyses of genomic data; therefore, we developed software that enables users to handle, manipulate, and analyze genome data with a variety of sequence analysis programs.

18.

The Subviral RNA Database: a toolbox for viroids, the hepatitis delta virus and satellite RNAs research

Lynda Rocheleau Martin Pelchat 《BMC microbiology》2006,6(1):24-5

Background

Viroids, satellite RNAs, satellite viruses and the human hepatitis delta virus form the 'brotherhood' of the smallest known infectious RNA agents, known as the subviral RNAs. For most of these species, it is generally accepted that characteristics such as cell movement, replication, host specificity and pathogenicity are encoded in their RNA sequences and their resulting RNA structures. Although many sequences are indexed in publicly available databases, these sequence annotation databases do not provide the advanced searches and data manipulation capability for identifying and characterizing subviral RNA motifs.

19.

Recent Hits Acquired by BLAST (ReHAB): A tool to identify new hits in sequence similarity searches

Joe Whitney David J Esteban Chris Upton 《BMC bioinformatics》2005,6(1):23

Background

Sequence similarity searching is a powerful tool to help develop hypotheses in the quest to assign functional, structural and evolutionary information to DNA and protein sequences. As sequence databases continue to grow exponentially, it becomes increasingly important to repeat searches at frequent intervals, and similarity searches retrieve larger and larger sets of results. New and potentially significant results may be buried in a long list of previously obtained sequence hits from past searches.

20.

Exploring inconsistencies in genome-wide protein function annotations: a machine learning approach

Carson Andorf Drena Dobbs Vasant Honavar 《BMC bioinformatics》2007,8(1):284

Background

Incorrectly annotated sequence data are becoming more commonplace as databases increasingly rely on automated techniques for annotation. Hence, there is an urgent need for computational methods for checking consistency of such annotations against independent sources of evidence and detecting potential annotation errors. We show how a machine learning approach designed to automatically predict a protein's Gene Ontology (GO) functional class can be employed to identify potential gene annotation errors.