期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

<Emphasis Type="SmallCaps">COM</Emphasis> e: the ontology of bioinorganic proteins

Kirill?Degtyarenko Email author Sergio?Contrino 《BMC structural biology》2004,4(1):3

Background

Many characterised proteins contain metal ions, small organic molecules or modified residues. In contrast, the huge amount of data generated by genome projects consists exclusively of sequences with almost no annotation. One of the goals of the structural genomics initiative is to provide representative three-dimensional (3-D) structures for as many protein/domain folds as possible to allow successful homology modelling. However, important functional features such as metal co-ordination or a type of prosthetic group are not always conserved in homologous proteins. So far, the problem of correct annotation of bioinorganic proteins has been largely ignored by the bioinformatics community and information on bioinorganic centres obtained by methods other than crystallography or NMR is only available in literature databases. 相似文献

2.

IdentiCS – Identification of coding sequence and <Emphasis Type="Italic">in silico</Emphasis> reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence

Jibin?Sun An-Ping?Zeng Email author 《BMC bioinformatics》2004,5(1):112

Background

A necessary step for a genome level analysis of the cellular metabolism is the in silico reconstruction of the metabolic network from genome sequences. The available methods are mainly based on the annotation of genome sequences including two successive steps, the prediction of coding sequences (CDS) and their function assignment. The annotation process takes time. The available methods often encounter difficulties when dealing with unfinished error-containing genomic sequence. 相似文献

3.

Novel definition files for human GeneChips based on GeneAnnot

Francesco Ferrari Stefania Bortoluzzi Alessandro Coppe Alexandra Sirota Marilyn Safran Michael Shmoish Sergio Ferrari Doron Lancet Gian Antonio Danieli Silvio Bicciato 《BMC bioinformatics》2007,8(1):446

相似文献

4.

OryzaPG-DB: Rice Proteome Database based on Shotgun Proteogenomics

Mohamed Helmy Masaru Tomita Yasushi Ishihama 《BMC plant biology》2011,11(1):63

Background

Proteogenomics aims to utilize experimental proteome information for refinement of genome annotation. Since mass spectrometry-based shotgun proteomics approaches provide large-scale peptide sequencing data with high throughput, a data repository for shotgun proteogenomics would represent a valuable source of gene expression evidence at the translational level for genome re-annotation. 相似文献

5.

SolEST database: a "one-stop shop" approach to the study of Solanaceae transcriptomes

Nunzio D'Agostino Alessandra Traini Luigi Frusciante Maria Luisa Chiusano 《BMC plant biology》2009,9(1):142-16

相似文献

6.

<Emphasis Type="Italic">μ</Emphasis>-CS: An extension of the TM4 platform to manage Affymetrix binary data

Pietro H Guzzi Mario Cannataro 《BMC bioinformatics》2010,11(1):315

Background

A main goal in understanding cell mechanisms is to explain the relationship among genes and related molecular processes through the combined use of technological platforms and bioinformatics analysis. High throughput platforms, such as microarrays, enable the investigation of the whole genome in a single experiment. There exist different kind of microarray platforms, that produce different types of binary data (images and raw data). Moreover, also considering a single vendor, different chips are available. The analysis of microarray data requires an initial preprocessing phase (i.e. normalization and summarization) of raw data that makes them suitable for use on existing platforms, such as the TIGR M4 Suite. Nevertheless, the annotations of data with additional information such as gene function, is needed to perform more powerful analysis. Raw data preprocessing and annotation is often performed in a manual and error prone way. Moreover, many available preprocessing tools do not support annotation. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of microarray data are needed. 相似文献

7.

Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression 总被引：1，自引：0，他引：1

Rodrigo Malig Cristian Varela Eduardo Agosin Francisco Melo 《BMC bioinformatics》2006,7(1):487

Background

In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. 相似文献

8.

The Distributed Annotation System 总被引：1，自引：0，他引：1

Robin D Dowell Rodney M Jokerst Allen Day Sean R Eddy Lincoln Stein 《BMC bioinformatics》2001,2(1):7-7

Background

Currently, most genome annotation is curated by centralized groups with limited resources. Efforts to share annotations transparently among multiple groups have not yet been satisfactory. 相似文献

9.

Statistical Viewer: a tool to upload and integrate linkage and association data as plots displayed within the Ensembl genome browser

Judith?E?Stenger Email author Hong?Xu Carol?Haynes Elizabeth?R?Hauser Margaret?Pericak-Vance Pascal?J?Goldschmidt-Clermont Jeffery?M?Vance 《BMC bioinformatics》2005,6(1):95

Background

To facilitate efficient selection and the prioritization of candidate complex disease susceptibility genes for association analysis, increasingly comprehensive annotation tools are essential to integrate, visualize and analyze vast quantities of disparate data generated by genomic screens, public human genome sequence annotation and ancillary biological databases. We have developed a plug-in package for Ensembl called "Statistical Viewer" that facilitates the analysis of genomic features and annotation in the regions of interest defined by linkage analysis. 相似文献

10.

Accessing the SEED Genome Databases via Web Services API: Tools for Programmers

Terry Disz Sajia Akhter Daniel Cuevas Robert Olson Ross Overbeek Veronika Vonstein Rick Stevens Robert A Edwards 《BMC bioinformatics》2010,11(1):319

Background

The SEED integrates many publicly available genome sequences into a single resource. The database contains accurate and up-to-date annotations based on the subsystems concept that leverages clustering between genomes and other clues to accurately and efficiently annotate microbial genomes. The backend is used as the foundation for many genome annotation tools, such as the Rapid Annotation using Subsystems Technology (RAST) server for whole genome annotation, the metagenomics RAST server for random community genome annotations, and the annotation clearinghouse for exchanging annotations from different resources. In addition to a web user interface, the SEED also provides Web services based API for programmatic access to the data in the SEED, allowing the development of third-party tools and mash-ups. 相似文献

11.

Toward the automated generation of genome-scale metabolic networks in the SEED

Matthew DeJongh Kevin Formsma Paul Boillot John Gould Matthew Rycenga Aaron Best 《BMC bioinformatics》2007,8(1):139

Background

Current methods for the automated generation of genome-scale metabolic networks focus on genome annotation and preliminary biochemical reaction network assembly, but do not adequately address the process of identifying and filling gaps in the reaction network, and verifying that the network is suitable for systems level analysis. Thus, current methods are only sufficient for generating draft-quality networks, and refinement of the reaction network is still largely a manual, labor-intensive process. 相似文献

12.

Functional enrichment analyses and construction of functional similarity networks with high confidence function prediction by PFP

Troy Hawkins Meghana Chitale Daisuke Kihara 《BMC bioinformatics》2010,11(1):265

Background

A new paradigm of biological investigation takes advantage of technologies that produce large high throughput datasets, including genome sequences, interactions of proteins, and gene expression. The ability of biologists to analyze and interpret such data relies on functional annotation of the included proteins, but even in highly characterized organisms many proteins can lack the functional evidence necessary to infer their biological relevance. 相似文献

13.

Towards a comprehensive structural variation map of an individual human genome

Andy W Pang Jeffrey R MacDonald Dalila Pinto John Wei Muhammad A Rafiq Donald F Conrad Hansoo Park Matthew E Hurles Charles Lee J Craig Venter Ewen F Kirkness Samuel Levy Lars Feuk Stephen W Scherer 《Genome biology》2010,11(5):R52

Background

Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) insertion/deletions (indels), the annotation of larger structural variants has been less comprehensive. It is still unclear to what extent a typical genome differs from the reference assembly, and the analysis of the genomes sequenced to date have shown varying results for copy number variation (CNV) and inversions.

Results

We have combined computational re-analysis of existing whole genome sequence data with novel microarray-based analysis, and detect 12,178 structural variants covering 40.6 Mb that were not reported in the initial sequencing of the first published personal genome. We estimate a total non-SNP variation content of 48.8 Mb in a single genome. Our results indicate that this genome differs from the consensus reference sequence by approximately 1.2% when considering indels/CNVs, 0.1% by SNPs and approximately 0.3% by inversions. The structural variants impact 4,867 genes, and >24% of structural variants would not be imputed by SNP-association.

Conclusions

Our results indicate that a large number of structural variants have been unreported in the individual genomes published to date. This significant extent and complexity of structural variants, as well as the growing recognition of their medical relevance, necessitate they be actively studied in health-related analyses of personal genomes. The new catalogue of structural variants generated for this genome provides a crucial resource for future comparison studies. 相似文献

14.

ArrayIDer: automated structural re-annotation pipeline for DNA microarrays

Bart HJ van den Berg Jay H Konieczka Fiona M McCarthy Shane C Burgess 《BMC bioinformatics》2009,10(1):30

Background

Systems biology modeling from microarray data requires the most contemporary structural and functional array annotation. However, microarray annotations, especially for non-commercial, non-traditional biomedical model organisms, are often dated. In addition, most microarray analysis tools do not readily accept EST clone names, which are abundantly represented on arrays. Manual re-annotation of microarrays is impracticable and so we developed a computational re-annotation tool (ArrayIDer) to retrieve the most recent accession mapping files from public databases based on EST clone names or accessions and rapidly generate database accessions for entire microarrays. 相似文献

15.

GOPET: A tool for automated predictions of Gene Ontology terms

Arunachalam Vinayagam Coral del Val Falk Schubert Roland Eils Karl-Heinz Glatting Sándor Suhai Rainer König 《BMC bioinformatics》2006,7(1):161-7

Background

Vast progress in sequencing projects has called for annotation on a large scale. A Number of methods have been developed to address this challenging task. These methods, however, either apply to specific subsets, or their predictions are not formalised, or they do not provide precise confidence values for their predictions. 相似文献

16.

Complete reannotation of the Arabidopsis genome: methods, tools, protocols and the final release

Brian J Haas Jennifer R Wortman Catherine M Ronning Linda I Hannick Roger K Smith Jr Rama Maiti Agnes P Chan Chunhui Yu Maryam Farzad Dongying Wu Owen White Christopher D Town 《BMC biology》2005,3(1):1-19

Background

Since the initial publication of its complete genome sequence, Arabidopsis thaliana has become more important than ever as a model for plant research. However, the initial genome annotation was submitted by multiple centers using inconsistent methods, making the data difficult to use for many applications.

Results

Over the course of three years, TIGR has completed its effort to standardize the structural and functional annotation of the Arabidopsis genome. Using both manual and automated methods, Arabidopsis gene structures were refined and gene products were renamed and assigned to Gene Ontology categories. We present an overview of the methods employed, tools developed, and protocols followed, summarizing the contents of each data release with special emphasis on our final annotation release (version 5).

Conclusion

Over the entire period, several thousand new genes and pseudogenes were added to the annotation. Approximately one third of the originally annotated gene models were significantly refined yielding improved gene structure annotations, and every protein-coding gene was manually inspected and classified using Gene Ontology terms. 相似文献

17.

Apollo2Go: a web service adapter for the Apollo genome viewer to enable distributed genome annotation

Kathrin Klee Rebecca Ernst Manuel Spannagl Klaus FX Mayer 《BMC bioinformatics》2007,8(1):1-5

Background

Apollo, a genome annotation viewer and editor, has become a widely used genome annotation and visualization tool for distributed genome annotation projects. When using Apollo for annotation, database updates are carried out by uploading intermediate annotation files into the respective database. This non-direct database upload is laborious and evokes problems of data synchronicity.

Results

To overcome these limitations we extended the Apollo data adapter with a generic, configurable web service client that is able to retrieve annotation data in a GAME-XML-formatted string and pass it on to Apollo's internal input routine.

Conclusion

This Apollo web service adapter, Apollo2Go, simplifies the data exchange in distributed projects and aims to render the annotation process more comfortable. The Apollo2Go software is freely available from ftp://ftpmips.gsf.de/plants/apollo_webservice. 相似文献

18.

Multiple RNAs from the mouse carboxypeptidase M locus: functional RNAs or transcription noise?

Alessander O Guimarães Fabiana L Motta Viviane S Alves Beatriz A Castilho João B Pesquero 《BMC molecular biology》2009,10(1):7-14

相似文献

19.

Applying Support Vector Machines for Gene ontology based gene function prediction

Arunachalam?Vinayagam Email author Rainer?K?nig Jutta?Moormann Falk?Schubert Roland?Eils Karl-Heinz?Glatting Sándor?Suhai 《BMC bioinformatics》2004,5(1):116

Background

The current progress in sequencing projects calls for rapid, reliable and accurate function assignments of gene products. A variety of methods has been designed to annotate sequences on a large scale. However, these methods can either only be applied for specific subsets, or their results are not formalised, or they do not provide precise confidence estimates for their predictions.

Results

We have developed a large-scale annotation system that tackles all of these shortcomings. In our approach, annotation was provided through Gene Ontology terms by applying multiple Support Vector Machines (SVM) for the classification of correct and false predictions. The general performance of the system was benchmarked with a large dataset. An organism-wise cross-validation was performed to define confidence estimates, resulting in an average precision of 80% for 74% of all test sequences. The validation results show that the prediction performance was organism-independent and could reproduce the annotation of other automated systems as well as high-quality manual annotations. We applied our trained classification system to Xenopus laevis sequences, yielding functional annotation for more than half of the known expressed genome. Compared to the currently available annotation, we provided more than twice the number of contigs with good quality annotation, and additionally we assigned a confidence value to each predicted GO term.

Conclusions

We present a complete automated annotation system that overcomes many of the usual problems by applying a controlled vocabulary of Gene Ontology and an established classification method on large and well-described sequence data sets. In a case study, the function for Xenopus laevis contig sequences was predicted and the results are publicly available at ftp://genome.dkfz-heidelberg.de/pub/agd/gene_association.agd_Xenopus.

相似文献

20.

Gene discovery in the hamster: a comparative genomics approach for gene annotation by sequencing of hamster testis cDNAs

Oduru S Campbell JL Karri S Hendry WJ Khan SA Williams SC 《BMC genomics》2003,4(1):22

Background

Complete genome annotation will likely be achieved through a combination of computer-based analysis of available genome sequences combined with direct experimental characterization of expressed regions of individual genomes. We have utilized a comparative genomics approach involving the sequencing of randomly selected hamster testis cDNAs to begin to identify genes not previously annotated on the human, mouse, rat and Fugu (pufferfish) genomes. 相似文献