期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Applying the Gene Ontology in microbial annotation

Michelle G. Giglio Candace W. Collmer Jane Lomax Amelia Ireland 《Trends in microbiology》2009,17(7):262-268

相似文献

2.

Gene Ontology annotation quality analysis in model eukaryotes 总被引：1，自引：0，他引：1

Buza TJ McCarthy FM Wang N Bridges SM Burgess SC 《Nucleic acids research》2008,36(2):e12

Functional analysis using the Gene Ontology (GO) is crucial for array analysis, but it is often difficult for researchers to assess the amount and quality of GO annotations associated with different sets of gene products. In many cases the source of the GO annotations and the date the GO annotations were last updated is not apparent, further complicating a researchers’ ability to assess the quality of the GO data provided. Moreover, GO biocurators need to ensure that the GO quality is maintained and optimal for the functional processes that are most relevant for their research community. We report the GO Annotation Quality (GAQ) score, a quantitative measure of GO quality that includes breadth of GO annotation, the level of detail of annotation and the type of evidence used to make the annotation. As a case study, we apply the GAQ scoring method to a set of diverse eukaryotes and demonstrate how the GAQ score can be used to track changes in GO annotations over time and to assess the quality of GO annotations available for specific biological processes. The GAQ score also allows researchers to quantitatively assess the functional data available for their experimental systems (arrays or databases). 相似文献

3.

Automated Gene Ontology annotation for anonymous sequence data 总被引：9，自引：1，他引：9

下载免费PDF全文

Hennig S Groth D Lehrach H 《Nucleic acids research》2003,31(13):3712-3715

相似文献

4.

Amplification of the Gene Ontology annotation of Affymetrix probe sets

Enrique M Muro Carolina Perez-Iratxeta Miguel A Andrade-Navarro 《BMC bioinformatics》2006,7(1):159-6

Background

The annotations of Affymetrix DNA microarray probe sets with Gene Ontology terms are carefully selected for correctness. This results in very accurate but incomplete annotations which is not always desirable for microarray experiment evaluation. 相似文献

5.

Automatic annotation of protein motif function with Gene Ontology terms

Xinghua?Lu Email author Chengxiang?Zhai Vanathi?Gopalakrishnan Bruce?G?Buchanan 《BMC bioinformatics》2004,5(1):122

Background

Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO) project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. 相似文献

6.

GOPET: A tool for automated predictions of Gene Ontology terms

Arunachalam Vinayagam Coral del Val Falk Schubert Roland Eils Karl-Heinz Glatting Sándor Suhai Rainer König 《BMC bioinformatics》2006,7(1):161-7

Background

Vast progress in sequencing projects has called for annotation on a large scale. A Number of methods have been developed to address this challenging task. These methods, however, either apply to specific subsets, or their predictions are not formalised, or they do not provide precise confidence values for their predictions. 相似文献

7.

Gene Ontology and the annotation of pathogen genomes: the case of Candida albicans

Martha B. Arnaud Maria C. Costanzo Prachi Shah Marek S. Skrzypek Gavin Sherlock 《Trends in microbiology》2009,17(7):295-303

相似文献

8.

Cluster analysis of protein array results via similarity of Gene Ontology annotation

Cheryl Wolting C Jane McGlade David Tritchler 《BMC bioinformatics》2006,7(1):338-13

Background

With the advent of high-throughput proteomic experiments such as arrays of purified proteins comes the need to analyse sets of proteins as an ensemble, as opposed to the traditional one-protein-at-a-time approach. Although there are several publicly available tools that facilitate the analysis of protein sets, they do not display integrated results in an easily-interpreted image or do not allow the user to specify the proteins to be analysed. 相似文献

9.

Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation 总被引：12，自引：0，他引：12

Lord PW Stevens RD Brass A Goble CA 《Bioinformatics (Oxford, England)》2003,19(10):1275-1283

MOTIVATION: Many bioinformatics data resources not only hold data in the form of sequences, but also as annotation. In the majority of cases, annotation is written as scientific natural language: this is suitable for humans, but not particularly useful for machine processing. Ontologies offer a mechanism by which knowledge can be represented in a form capable of such processing. In this paper we investigate the use of ontological annotation to measure the similarities in knowledge content or 'semantic similarity' between entries in a data resource. These allow a bioinformatician to perform a similarity measure over annotation in an analogous manner to those performed over sequences. A measure of semantic similarity for the knowledge component of bioinformatics resources should afford a biologist a new tool in their repertoire of analyses. RESULTS: We present the results from experiments that investigate the validity of using semantic similarity by comparison with sequence similarity. We show a simple extension that enables a semantic search of the knowledge held within sequence databases. AVAILABILITY: Software available from http://www.russet.org.uk. 相似文献

10.

Ontology annotation: mapping genomic regions to biological function

Thomas PD Mi H Lewis S 《Current opinion in chemical biology》2007,11(1):4-11

With numerous whole genomes now in hand, and experimental data about genes and biological pathways on the increase, a systems approach to biological research is becoming essential. Ontologies provide a formal representation of knowledge that is amenable to computational as well as human analysis, an obvious underpinning of systems biology. Mapping function to gene products in the genome consists of two, somewhat intertwined enterprises: ontology building and ontology annotation. Ontology building is the formal representation of a domain of knowledge; ontology annotation is association of specific genomic regions (which we refer to simply as 'genes', including genes and their regulatory elements and products such as proteins and functional RNAs) to parts of the ontology. We consider two complementary representations of gene function: the Gene Ontology (GO) and pathway ontologies. GO represents function from the gene's eye view, in relation to a large and growing context of biological knowledge at all levels. Pathway ontologies represent function from the point of view of biochemical reactions and interactions, which are ordered into networks and causal cascades. The more mature GO provides an example of ontology annotation: how conclusions from the scientific literature and from evolutionary relationships are converted into formal statements about gene function. Annotations are made using a variety of different types of evidence, which can be used to estimate the relative reliability of different annotations. 相似文献

11.

Complexity of automated gene annotation

Nikoloski Z Grimbs S Klie S Selbig J 《Bio Systems》2011,104(1):1-8

Describing the determinants of robustness of biological systems has become one of the central questions in systems biology. Despite the increasing research efforts, it has proven difficult to arrive at a unifying definition for this important concept. We argue that this is due to the multifaceted nature of the concept of robustness and the possibility to formally capture it at different levels of systemic formalisms (e.g., topology and dynamic behavior). Here we provide a comprehensive review of the existing definitions of robustness pertaining to metabolic networks. As kinetic approaches have been excellently reviewed elsewhere, we focus on definitions of robustness proposed within graph-theoretic and constraint-based formalisms. 相似文献

12.

Statistically rigorous automated protein annotation

Krebs WG Bourne PE 《Bioinformatics (Oxford, England)》2004,20(7):1066-1073

MOTIVATION: Assignment of putative protein functional annotation by comparative analysis using pre-defined experimental annotations is performed routinely by molecular biologists. The number and statistical significance of these assignments remains a challenge in this era of high-throughput proteomics. A combined statistical method that enables robust, automated protein annotation by reliably expanding existing annotation sets is described. An existing clustering scheme, based on relevant experimental information (e.g. sequence identity, keywords or gene expression data) is required. The method assigns new proteins to these clusters with a measure of reliability. It can also provide human reviewers with a reliability score for both new and previously classified proteins. RESULTS: A dataset of 27 000 annotated Protein Data Bank (PDB) polypeptide chains (of 36 000 chains currently in the PDB) was generated from 23 000 chains classified a priori. AVAILABILITY: PDB annotations and sample software implementation are freely accessible on the Web at http://pmr.sdsc.edu/go 相似文献

13.

Adaptive algorithm of automated annotation

Leontovich AM Brodsky LI Drachev VA Nikolaev VK 《Bioinformatics (Oxford, England)》2002,18(6):838-844

相似文献

14.

Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO) 总被引：20，自引：2，他引：20

下载免费PDF全文

Selina S. Dwight Midori A. Harris Kara Dolinski Catherine A. Ball Gail Binkley Karen R. Christie Dianna G. Fisk Laurie Issel-Tarver Mark Schroeder Gavin Sherlock Anand Sethuraman Shuai Weng David Botstein J. Michael Cherry 《Nucleic acids research》2002,30(1):69-72

相似文献

15.

The Neural/Immune Gene Ontology: clipping the Gene Ontology for neurological and immunological systems

Nophar Geifman Alon Monsonego Eitan Rubin 《BMC bioinformatics》2010,11(1):458

Background

The Gene Ontology (GO) is used to describe genes and gene products from many organisms. When used for functional annotation of microarray data, GO is often slimmed by editing so that only higher level terms remain. This practice is designed to improve the summarizing of experimental results by grouping high level terms and the statistical power of GO term enrichment analysis. 相似文献

16.

EDITtoTrEMBL: a distributed approach to high-quality automated protein sequence annotation

Möller S Leser U Fleischmann W Apweiler R 《Bioinformatics (Oxford, England)》1999,15(3):219-227

相似文献

17.

Gene Ontology: looking backwards and forwards 总被引：3，自引：0，他引：3

下载免费PDF全文

Lewis SE 《Genome biology》2005,6(1):103

The Gene Ontology consortium began six years ago with a group of scientists who decided to connect our data by sharing the same language for describing it. Its most significant achievement lies in uniting many independent biological database efforts into a cooperative force. 相似文献

18.

Measuring the Evolution of Ontology Complexity: The Gene Ontology Case Study

Olivier Dameron Charles Bettembourg Nolwenn Le Meur 《PloS one》2013,8(10)

Ontologies support automatic sharing, combination and analysis of life sciences data. They undergo regular curation and enrichment. We studied the impact of an ontology evolution on its structural complexity. As a case study we used the sixty monthly releases between January 2008 and December 2012 of the Gene Ontology and its three independent branches, i.e. biological processes (BP), cellular components (CC) and molecular functions (MF). For each case, we measured complexity by computing metrics related to the size, the nodes connectivity and the hierarchical structure.The number of classes and relations increased monotonously for each branch, with different growth rates. BP and CC had similar connectivity, superior to that of MF. Connectivity increased monotonously for BP, decreased for CC and remained stable for MF, with a marked increase for the three branches in November and December 2012. Hierarchy-related measures showed that CC and MF had similar proportions of leaves, average depths and average heights. BP had a lower proportion of leaves, and a higher average depth and average height. For BP and MF, the late 2012 increase of connectivity resulted in an increase of the average depth and average height and a decrease of the proportion of leaves, indicating that a major enrichment effort of the intermediate-level hierarchy occurred.The variation of the number of classes and relations in an ontology does not provide enough information about the evolution of its complexity. However, connectivity and hierarchy-related metrics revealed different patterns of values as well as of evolution for the three branches of the Gene Ontology. CC was similar to BP in terms of connectivity, and similar to MF in terms of hierarchy. Overall, BP complexity increased, CC was refined with the addition of leaves providing a finer level of annotations but decreasing slightly its complexity, and MF complexity remained stable. 相似文献

19.

TEnest: automated chronological annotation and visualization of nested plant transposable elements 总被引：2，自引：0，他引：2

Kronmiller BA Wise RP 《Plant physiology》2008,146(1):45-59

Organisms with a high density of transposable elements (TEs) exhibit nesting, with subsequent repeats found inside previously inserted elements. Nesting splits the sequence structure of TEs and makes annotation of repetitive areas challenging. We present TEnest, a repeat identification and display tool made specifically for highly repetitive genomes. TEnest identifies repetitive sequences and reconstructs separated sections to provide full-length repeats and, for long-terminal repeat (LTR) retrotransposons, calculates age since insertion based on LTR divergence. TEnest provides a chronological insertion display to give an accurate visual representation of TE integration history showing timeline, location, and families of each TE identified, thus creating a framework from which evolutionary comparisons can be made among various regions of the genome. A database of repeats has been developed for maize (Zea mays), rice (Oryza sativa), wheat (Triticum aestivum), and barley (Hordeum vulgare) to illustrate the potential of TEnest software. All currently finished maize bacterial artificial chromosomes totaling 29.3 Mb were analyzed with TEnest to provide a characterization of the repeat insertions. Sixty-seven percent of the maize genome was found to be made up of TEs; of these, 95% are LTR retrotransposons. The rate of solo LTR formation is shown to be dissimilar across retrotransposon families. Phylogenetic analysis of TE families reveals specific events of extreme TE proliferation, which may explain the high quantities of certain TE families found throughout the maize genome. The TEnest software package is available for use on PlantGDB under the tools section (http://www.plantgdb.org/prj/TE_nest/TE_nest.html); the source code is available from (http://wiselab.org). 相似文献

20.

TRAP: automated classification, quantification and annotation of tandemly repeated sequences

Sobreira TJ Durham AM Gruber A 《Bioinformatics (Oxford, England)》2006,22(3):361-362

TRAP, the Tandem Repeats Analysis Program, is a Perl program that provides a unified set of analyses for the selection, classification, quantification and automated annotation of tandemly repeated sequences. TRAP uses the results of the Tandem Repeats Finder program to perform a global analysis of the satellite content of DNA sequences, permitting researchers to easily assess the tandem repeat content for both individual sequences and whole genomes. The results can be generated in convenient formats such as HTML and comma-separated values. TRAP can also be used to automatically generate annotation data in the format of feature table and GFF files. 相似文献