期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Applying negative rule mining to improve genome annotation

Irena I Artamonova Goar Frishman Dmitrij Frishman 《BMC bioinformatics》2007,8(1):261

Background

Unsupervised annotation of proteins by software pipelines suffers from very high error rates. Spurious functional assignments are usually caused by unwarranted homology-based transfer of information from existing database entries to the new target sequences. We have previously demonstrated that data mining in large sequence annotation databanks can help identify annotation items that are strongly associated with each other, and that exceptions from strong positive association rules often point to potential annotation errors. Here we investigate the applicability of negative association rule mining to revealing erroneously assigned annotation items. 相似文献

2.

Automatic detection of false annotations via binary property clustering

Noam?Kaplan Email author Michal?Linial 《BMC bioinformatics》2005,6(1):46

Background

Computational protein annotation methods occasionally introduce errors. False-positive (FP) errors are annotations that are mistakenly associated with a protein. Such false annotations introduce errors that may spread into databases through similarity with other proteins. Generally, methods used to minimize the chance for FPs result in decreased sensitivity or low throughput. We present a novel protein-clustering method that enables automatic separation of FP from true hits. The method quantifies the biological similarity between pairs of proteins by examining each protein's annotations, and then proceeds by clustering sets of proteins that received similar annotation into biological groups. 相似文献

3.

Splign: algorithms for computing spliced alignments with identification of paralogs

Yuri Kapustin Alexander Souvorov Tatiana Tatusova David Lipman 《Biology direct》2008,3(1):20

Background

The computation of accurate alignments of cDNA sequences against a genome is at the foundation of modern genome annotation pipelines. Several factors such as presence of paralogs, small exons, non-consensus splice signals, sequencing errors and polymorphic sites pose recognized difficulties to existing spliced alignment algorithms. 相似文献

4.

BibGlimpse: The case for a light-weight reprint manager in distributed literature research

Thomas Tüchler Golda Velez Alexandra Graf David P Kreil 《BMC bioinformatics》2008,9(1):406

Background

While text-mining and distributed annotation systems both aim at capturing knowledge and presenting it in a standardized form, there have been few attempts to investigate potential synergies between these two fields. For instance, distributed annotation would be very well suited for providing topic focussed, expert knowledge enriched text corpora. A key limitation for this approach is the availability of literature annotation systems that can be routinely used by groups of collaborating researchers on a day to day basis, not distracting from the main focus of their work. 相似文献

5.

prot4EST: Translating Expressed Sequence Tags from neglected genomes

James?D?Wasmuth Email author Mark?L?Blaxter 《BMC bioinformatics》2004,5(1):187

相似文献

6.

Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression 总被引：1，自引：0，他引：1

Rodrigo Malig Cristian Varela Eduardo Agosin Francisco Melo 《BMC bioinformatics》2006,7(1):487

Background

In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. 相似文献

7.

A phylogenomic gene cluster resource: the Phylogenetically Inferred Groups (PhIGs) database

Paramvir S Dehal Jeffrey L Boore 《BMC bioinformatics》2006,7(1):201-9

Background

We present here the PhIGs database, a phylogenomic resource for sequenced genomes. Although many methods exist for clustering gene families, very few attempt to create truly orthologous clusters sharing descent from a single ancestral gene across a range of evolutionary depths. Although these non-phylogenetic gene family clusters have been used broadly for gene annotation, errors are known to be introduced by the artifactual association of slowly evolving paralogs and lack of annotation for those more rapidly evolving. A full phylogenetic framework is necessary for accurate inference of function and for many studies that address pattern and mechanism of the evolution of the genome. The automated generation of evolutionary gene clusters, creation of gene trees, determination of orthology and paralogy relationships, and the correlation of this information with gene annotations, expression information, and genomic context is an important resource to the scientific community. 相似文献

8.

cDNA2Genome: A tool for mapping and annotating cDNAs

Coral?del Val Email author Karl-Heinz?Glatting Sandor?Suhai 《BMC bioinformatics》2003,4(1):39

相似文献

9.

Formalization of taxon-based constraints to detect inconsistencies in annotation and ontology development

Jennifer?I?Deegan Emily?C?Dimmer Christopher?J?Mungall 《BMC bioinformatics》2010,11(1):530

Background

The Gene Ontology project supports categorization of gene products according to their location of action, the molecular functions that they carry out, and the processes that they are involved in. Although the ontologies are intentionally developed to be taxon neutral, and to cover all species, there are inherent taxon specificities in some branches. For example, the process 'lactation' is specific to mammals and the location 'mitochondrion' is specific to eukaryotes. The lack of an explicit formalization of these constraints can lead to errors and inconsistencies in automated and manual annotation. 相似文献

10.

Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: High-resolution annotation for microarrays

Jun Lu Joseph C Lee Marc L Salit Margaret C Cam 《BMC bioinformatics》2007,8(1):108

Background

Extracting biological information from high-density Affymetrix arrays is a multi-step process that begins with the accurate annotation of microarray probes. Shortfalls in the original Affymetrix probe annotation have been described; however, few studies have provided rigorous solutions for routine data analysis. 相似文献

11.

Computational evaluation of TIS annotation for prokaryotic genomes

Gang-Qing Hu Xiaobin Zheng Li-Ning Ju Huaiqiu Zhu Zhen-Su She 《BMC bioinformatics》2008,9(1):160

Background

Accurate annotation of translation initiation sites (TISs) is essential for understanding the translation initiation mechanism. However, the reliability of TIS annotation in widely used databases such as RefSeq is uncertain due to the lack of experimental benchmarks. 相似文献

12.

Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser

Judith?E?Stenger Email author Hong?Xu Carol?Haynes Elizabeth?R?Hauser Margaret?Pericak-Vance Pascal?J?Goldschmidt-Clermont Jeffery?M?Vance 《BMC bioinformatics》2005,6(1):95

Background

To facilitate efficient selection and the prioritization of candidate complex disease susceptibility genes for association analysis, increasingly comprehensive annotation tools are essential to integrate, visualize and analyze vast quantities of disparate data generated by genomic screens, public human genome sequence annotation and ancillary biological databases. We have developed a plug-in package for Ensembl called "Statistical Viewer" that facilitates the analysis of genomic features and annotation in the regions of interest defined by linkage analysis. 相似文献

13.

Domain-based small molecule binding site annotation

Kevin A Snyder Howard J Feldman Michel Dumontier John J Salama Christopher WV Hogue 《BMC bioinformatics》2006,7(1):152-19

Background

Accurate small molecule binding site information for a protein can facilitate studies in drug docking, drug discovery and function prediction, but small molecule binding site protein sequence annotation is sparse. The Small Molecule Interaction Database (SMID), a database of protein domain-small molecule interactions, was created using structural data from the Protein Data Bank (PDB). More importantly it provides a means to predict small molecule binding sites on proteins with a known or unknown structure and unlike prior approaches, removes large numbers of false positive hits arising from transitive alignment errors, non-biologically significant small molecules and crystallographic conditions that overpredict ion binding sites. 相似文献

14.

The comparative analysis of statistics,based on the likelihood ratio criterion,in the automated annotation problem

Andrey M Leontovich Konstantin Y Tokmachev Hans C van Houwelingen 《BMC bioinformatics》2008,9(1):31

Background

This paper discusses the problem of automated annotation. It is a continuation of the previous work on the A⁴-algorithm (Adaptive algorithm of automated annotation) developed by Leontovich and others. 相似文献

15.

Expression profiles of switch-like genes accurately classify tissue and infectious disease phenotypes in model-based classification

Michael Gormley Aydin Tozeren 《BMC bioinformatics》2008,9(1):486

Background

Large-scale compilation of gene expression microarray datasets across diverse biological phenotypes provided a means of gathering a priori knowledge in the form of identification and annotation of bimodal genes in the human and mouse genomes. These switch-like genes consist of 15% of known human genes, and are enriched with genes coding for extracellular and membrane proteins. It is of interest to determine the prediction potential of bimodal genes for class discovery in large-scale datasets. 相似文献

16.

GOtcha: a new method for prediction of protein function assessed by the annotation of seven genomes

David?MA?Martin Email author Matthew?Berriman Geoffrey?J?Barton 《BMC bioinformatics》2004,5(1):178

Background

The function of a novel gene product is typically predicted by transitive assignment of annotation from similar sequences. We describe a novel method, GOtcha, for predicting gene product function by annotation with Gene Ontology (GO) terms. GOtcha predicts GO term associations with term-specific probability (P-score) measures of confidence. Term-specific probabilities are a novel feature of GOtcha and allow the identification of conflicts or uncertainty in annotation. 相似文献

17.

Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data

Hui Yu Feng Wang Kang Tu Lu Xie Yuan-Yuan Li Yi-Xue Li 《BMC bioinformatics》2007,8(1):194

相似文献

18.

Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'

John Draper David P Enot David Parker Manfred Beckmann Stuart Snowdon Wanchang Lin Hassan Zubair 《BMC bioinformatics》2009,10(1):227

Background

Metabolomics experiments using Mass Spectrometry (MS) technology measure the mass to charge ratio (m/z) and intensity of ionised molecules in crude extracts of complex biological samples to generate high dimensional metabolite 'fingerprint' or metabolite 'profile' data. High resolution MS instruments perform routinely with a mass accuracy of < 5 ppm (parts per million) thus providing potentially a direct method for signal putative annotation using databases containing metabolite mass information. Most database interfaces support only simple queries with the default assumption that molecules either gain or lose a single proton when ionised. In reality the annotation process is confounded by the fact that many ionisation products will be not only molecular isotopes but also salt/solvent adducts and neutral loss fragments of original metabolites. This report describes an annotation strategy that will allow searching based on all potential ionisation products predicted to form during electrospray ionisation (ESI). 相似文献

19.

Corpus annotation for mining biomedical events from literature

Jin-Dong Kim Tomoko Ohta Jun'ichi Tsujii 《BMC bioinformatics》2008,9(1):10

Background

Advanced Text Mining (TM) such as semantic enrichment of papers, event or relation extraction, and intelligent Question Answering have increasingly attracted attention in the bio-medical domain. For such attempts to succeed, text annotation from the biological point of view is indispensable. However, due to the complexity of the task, semantic annotation has never been tried on a large scale, apart from relatively simple term annotation. 相似文献

20.

PCAS – a precomputed proteome annotation database resource

Zhang Y Yin Y Chen Y Gao G Yu P Luo J Jiang Y 《BMC genomics》2003,4(1):42

Background

Many model proteomes or "complete" sets of proteins of given organisms are now publicly available. Much effort has been invested in computational annotation of those "draft" proteomes. Motif or domain based algorithms play a pivotal role in functional classification of proteins. Employing most available computational algorithms, mainly motif or domain recognition algorithms, we set up to develop an online proteome annotation system with integrated proteome annotation data to complement existing resources. 相似文献