期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Nikoloski Z Grimbs S Klie S Selbig J 《Bio Systems》2011,104(1):1-8

Describing the determinants of robustness of biological systems has become one of the central questions in systems biology. Despite the increasing research efforts, it has proven difficult to arrive at a unifying definition for this important concept. We argue that this is due to the multifaceted nature of the concept of robustness and the possibility to formally capture it at different levels of systemic formalisms (e.g., topology and dynamic behavior). Here we provide a comprehensive review of the existing definitions of robustness pertaining to metabolic networks. As kinetic approaches have been excellently reviewed elsewhere, we focus on definitions of robustness proposed within graph-theoretic and constraint-based formalisms. 相似文献

2.

Statistically rigorous automated protein annotation

Krebs WG Bourne PE 《Bioinformatics (Oxford, England)》2004,20(7):1066-1073

MOTIVATION: Assignment of putative protein functional annotation by comparative analysis using pre-defined experimental annotations is performed routinely by molecular biologists. The number and statistical significance of these assignments remains a challenge in this era of high-throughput proteomics. A combined statistical method that enables robust, automated protein annotation by reliably expanding existing annotation sets is described. An existing clustering scheme, based on relevant experimental information (e.g. sequence identity, keywords or gene expression data) is required. The method assigns new proteins to these clusters with a measure of reliability. It can also provide human reviewers with a reliability score for both new and previously classified proteins. RESULTS: A dataset of 27 000 annotated Protein Data Bank (PDB) polypeptide chains (of 36 000 chains currently in the PDB) was generated from 23 000 chains classified a priori. AVAILABILITY: PDB annotations and sample software implementation are freely accessible on the Web at http://pmr.sdsc.edu/go 相似文献

3.

GoFigure: automated Gene Ontology annotation 总被引：4，自引：0，他引：4

Khan S Situ G Decker K Schmidt CJ 《Bioinformatics (Oxford, England)》2003,19(18):2484-2485

SUMMARY: We have developed a web tool to predict Gene Ontology (GO) terms. The tool accepts an input DNA or protein sequence, and uses BLAST to identify homologous sequences in GO annotated databases. A graph is returned to the user via email. AVAILABILITY: The tool is freely available at: http://udgenome.ags.udel.edu/frm_go.html/ 相似文献

4.

Ways to improve the prediction quality in the adaptive algorithm of automated annotation (A 4)

A. M. Leontovich K. Yu. Tokmachev 《Biophysics》2006,51(4):527-534

Modifications for improving the prediction quality in a previously described adaptive algorithm of automated annotation (A ⁴) were considered. First, the direct use of the basis statistic η ensures a higher prediction quality than the use of a previously proposed statistic γ. Second, the quality is improved when only some of the found similar sequences, rather than all of them, are used for prediction, since this reduces the data noise. 相似文献

5.

PhyloGena--a user-friendly system for automated phylogenetic annotation of unknown sequences 总被引：1，自引：0，他引：1

Hanekamp K Bohnebeck U Beszteri B Valentin K 《Bioinformatics (Oxford, England)》2007,23(7):793-801

MOTIVATION: Phylogenomic approaches towards functional and evolutionary annotation of unknown sequences have been suggested to be superior to those based only on pairwise local alignments. User-friendly software tools making the advantages of phylogenetic annotation available for the ever widening range of bioinformatically uninitiated biologists involved in genome/EST annotation projects are, however, not available. We were particularly confronted with this issue in the annotation of sequences from different groups of complex algae originating from secondary endosymbioses, where the identification of the phylogenetic origin of genes is often more problematic than in taxa well represented in the databases (e.g. animals, plants or fungi). RESULTS: We present a flexible pipeline with a user-friendly, interactive graphical user interface running on desktop computers that automatically performs a basic local alignment search tool (BLAST) search of query sequences, selects a representative subset of them, then creates a multiple alignment from the selected sequences, and finally computes a phylogenetic tree. The pipeline, named PhyloGena, uses public domain software for all standard bioinformatics tasks (similarity search, multiple alignment, and phylogenetic reconstruction). As the major technological innovation, selection of a meaningful subset of BLAST hits was implemented using logic programming, mimicing the selection procedure (BLAST tables, multiple alignments and phylogenetic trees) are displayed graphically, allowing the user to interact with the pipeline and deduce the function and phylogenetic origin of the query. PhyloGena thus makes phylogenomic annotation available also for those biologists without access to large computing facilities and with little informatics background. Although phylogenetic annotation is particularly useful when working with composite genomes (e.g. from complex algae), PhyloGena can be helpful in expressed sequence tag and genome annotation also in other organisms. AVAILABILITY: PhyloGena (executables for LINUX and Windows 2000/XP as well as source code) is available by anonymous ftp from http://www.awi.de/en/phylogena. 相似文献

6.

Evolutionarily conserved substrate substructures for automated annotation of enzyme superfamilies

Chiang RA Sali A Babbitt PC 《PLoS computational biology》2008,4(8):e1000142

The evolution of enzymes affects how well a species can adapt to new environmental conditions. During enzyme evolution, certain aspects of molecular function are conserved while other aspects can vary. Aspects of function that are more difficult to change or that need to be reused in multiple contexts are often conserved, while those that vary may indicate functions that are more easily changed or that are no longer required. In analogy to the study of conservation patterns in enzyme sequences and structures, we have examined the patterns of conservation and variation in enzyme function by analyzing graph isomorphisms among enzyme substrates of a large number of enzyme superfamilies. This systematic analysis of substrate substructures establishes the conservation patterns that typify individual superfamilies. Specifically, we determined the chemical substructures that are conserved among all known substrates of a superfamily and the substructures that are reacting in these substrates and then examined the relationship between the two. Across the 42 superfamilies that were analyzed, substantial variation was found in how much of the conserved substructure is reacting, suggesting that superfamilies may not be easily grouped into discrete and separable categories. Instead, our results suggest that many superfamilies may need to be treated individually for analyses of evolution, function prediction, and guiding enzyme engineering strategies. Annotating superfamilies with these conserved and reacting substructure patterns provides information that is orthogonal to information provided by studies of conservation in superfamily sequences and structures, thereby improving the precision with which we can predict the functions of enzymes of unknown function and direct studies in enzyme engineering. Because the method is automated, it is suitable for large-scale characterization and comparison of fundamental functional capabilities of both characterized and uncharacterized enzyme superfamilies. 相似文献

7.

Sequence annotation of nuclear receptor ligand-binding domains by automated homology modeling

Françoijs CJ Klomp JP Knegtel RM 《Protein engineering》2000,13(6):391-394

The quality of three-dimensional homology models derived from protein sequences provides an independent measure of the suitability of a protein sequence for a certain fold. We have used automated homology modeling and model assessment tools to identify putative nuclear hormone receptor ligand-binding domains in the genome of Caenorhabditis elegans. Our results indicate that the availability of multiple crystal structures is crucial to obtaining useful models in this receptor family. The majority of annotated mammalian nuclear hormone receptors could be assigned to a ligand-binding domain fold by using the best model derived from any of four template structures. This strategy also assigned the ligand-binding domain fold to a number of C.elegans. sequences without prior annotation. Interestingly, the retinoic acid receptor crystal structure contributed most to the number of sequences that could be assigned to a ligand-binding domain fold. Several causes for this can be suggested, including the high quality of this protein structure in terms of our assessment tools, similarity between the biological function or ligand of this receptor and the modeled genes and gene duplication in C.elegans. 相似文献

8.

EDITtoTrEMBL: a distributed approach to high-quality automated protein sequence annotation

Möller S Leser U Fleischmann W Apweiler R 《Bioinformatics (Oxford, England)》1999,15(3):219-227

相似文献

9.

Steady progress and recent breakthroughs in the accuracy of automated genome annotation

Brent MR 《Nature reviews. Genetics》2008,9(1):62-73

The sequencing of large, complex genomes has become routine, but understanding how sequences relate to biological function is less straightforward. Although much attention is focused on how to annotate genomic features such as developmental enhancers and non-coding RNAs, there is still no higher eukaryote for which we know the correct exon-intron structure of at least one ORF for each gene. Despite this uncomfortable truth, genome annotation has made remarkable progress since the first drafts of the human genome were analysed. By combining several computational and experimental methods, we are now closer to producing complete and accurate gene catalogues than ever before. 相似文献

10.

TRAP: automated classification, quantification and annotation of tandemly repeated sequences

Sobreira TJ Durham AM Gruber A 《Bioinformatics (Oxford, England)》2006,22(3):361-362

TRAP, the Tandem Repeats Analysis Program, is a Perl program that provides a unified set of analyses for the selection, classification, quantification and automated annotation of tandemly repeated sequences. TRAP uses the results of the Tandem Repeats Finder program to perform a global analysis of the satellite content of DNA sequences, permitting researchers to easily assess the tandem repeat content for both individual sequences and whole genomes. The results can be generated in convenient formats such as HTML and comma-separated values. TRAP can also be used to automatically generate annotation data in the format of feature table and GFF files. 相似文献

11.

TEnest: automated chronological annotation and visualization of nested plant transposable elements 总被引：2，自引：0，他引：2

Kronmiller BA Wise RP 《Plant physiology》2008,146(1):45-59

Organisms with a high density of transposable elements (TEs) exhibit nesting, with subsequent repeats found inside previously inserted elements. Nesting splits the sequence structure of TEs and makes annotation of repetitive areas challenging. We present TEnest, a repeat identification and display tool made specifically for highly repetitive genomes. TEnest identifies repetitive sequences and reconstructs separated sections to provide full-length repeats and, for long-terminal repeat (LTR) retrotransposons, calculates age since insertion based on LTR divergence. TEnest provides a chronological insertion display to give an accurate visual representation of TE integration history showing timeline, location, and families of each TE identified, thus creating a framework from which evolutionary comparisons can be made among various regions of the genome. A database of repeats has been developed for maize (Zea mays), rice (Oryza sativa), wheat (Triticum aestivum), and barley (Hordeum vulgare) to illustrate the potential of TEnest software. All currently finished maize bacterial artificial chromosomes totaling 29.3 Mb were analyzed with TEnest to provide a characterization of the repeat insertions. Sixty-seven percent of the maize genome was found to be made up of TEs; of these, 95% are LTR retrotransposons. The rate of solo LTR formation is shown to be dissimilar across retrotransposon families. Phylogenetic analysis of TE families reveals specific events of extreme TE proliferation, which may explain the high quantities of certain TE families found throughout the maize genome. The TEnest software package is available for use on PlantGDB under the tools section (http://www.plantgdb.org/prj/TE_nest/TE_nest.html); the source code is available from (http://wiselab.org). 相似文献

12.

An automated annotation tool for genomic DNA sequences using GeneScan and BLAST 总被引：1，自引：0，他引：1

Lynn AM Jain CK Kosalai K Barman P Thakur N Batra H Bhattacharya A 《Journal of genetics》2001,80(1):9-16

Genomic sequence data are often available well before the annotated sequence is published. We present a method for analysis of genomic DNA to identify coding sequences using the GeneScan algorithm and characterize these resultant sequences by BLAST. The routines are used to develop a system for automated annotation of genome DNA sequences. 相似文献

13.

Novel algorithm for automated genotyping of microsatellites 总被引：1，自引：0，他引：1

Matsumoto T Yukawa W Nozaki Y Nakashige R Shinya M Makino S Yagura M Ikuta T Imanishi T Inoko H Tamiya G Gojobori T 《Nucleic acids research》2004,32(20):6069-6077

Microsatellites or short tandem repeats (STRs) are abundant in the human genome with easily assayed polymorphisms, providing powerful genetic tools for mapping both Mendelian and complex traits. Microsatellite genotyping requires detection of the products of polymerase chain reaction (PCR) amplification by electrophoresis, and analysis of the peak data for discrimination of the true allele. A high-throughput genotyping approach requires computer-based automation at both the detection and analysis phases. In order to achieve this, complicated peak patterns from individual alleles must be interpreted in order to assign alleles. Previous methods consider limited types of noise peaks and cannot provide enough accuracy. By pattern recognition of various types of noise peaks, such as stutter peaks and additional peaks, we have achieved an overall average accuracy of 94% for allele calling in our actual data. Our algorithm is crucial for a high-throughput genotyping system for microsatellite markers by reducing manual editing and human errors. 相似文献

14.

EST Express: PHP/MySQL based automated annotation of ESTs from expression libraries

Robin P Smith William J Buchser Marcus B Lemmon Jose R Pardinas John L Bixby Vance P Lemmon 《BMC bioinformatics》2008,9(1):186

Background

Several biological techniques result in the acquisition of functional sets of cDNAs that must be sequenced and analyzed. The emergence of redundant databases such as UniGene and centralized annotation engines such as Entrez Gene has allowed the development of software that can analyze a great number of sequences in a matter of seconds. 相似文献

15.

An algorithm for automated closure during assembly

Sergey Koren Jason R Miller Brian P Walenz Granger Sutton 《BMC bioinformatics》2010,11(1):457

Background

Finishing is the process of improving the quality and utility of draft genome sequences generated by shotgun sequencing and computational assembly. Finishing can involve targeted sequencing. Finishing reads may be incorporated by manual or automated means. One automated method uses targeted addition by local re-assembly of gap regions. An obvious alternative uses de novo assembly of all the reads. 相似文献

16.

Quimp3, an automated pseudopod-tracking algorithm

《Cell Adhesion & Migration》2013,7(1):46-55

To understand movement of amoeboid cells we have developed an information tool that automatically detects protrusions of moving cells. The algorithm uses digitized cell recordings at a speed of ~1 image per second that are analyzed in three steps. In the first part, the outline of a cell is defined as a polygon of ~150 nodes, using the previously published Quimp2 program. By comparing the position of the nodes in place and time, each node contains information on position, local curvature and speed of movement. The second part uses rules for curvature and movement to define the position and time of start and end of a growing pseudopod. This part of the algorithm produces quantitative data on size, surface area, lifetime, frequency, and direction of pseudopod extension. The third part of the algorithm assigns qualitative properties to each pseudopod. It decides on the origin of a pseudopod as splitting of an existing pseudopod or as extension de novo. It also decides on the fate of each pseudopod as merged with the cell body or retracted. Here we describe the pseudopod tool and present the first data based on the analysis of ~1000 pseudopodia extended by Dictyostelium cells in the absence of external cues. 相似文献

17.

An extensible automated protein annotation tool: standardizing input and output using validated XML

Deevi SV Martin AC 《Bioinformatics (Oxford, England)》2006,22(3):291-296

相似文献

18.

Comparative analysis of the PCOLCE region in Fugu rubripes using a new automated annotation tool

Giorgia Romina Riboldi Tunnicliffe Gernot Gloeckner Greg S. Elgar Sydney Brenner André Rosenthal 《Mammalian genome》2000,11(3):213-219

The Japanese pufferfish Fugu rubripes with a genome of about 400 Mb is becoming increasingly recognized as a vertebrate model organism for comparative gene analysis (see Elgar 1996 for review). We have isolated and sequenced two Fugu cosmids spanning a genomic region of 66 kb containing the Fugu homolog to the human PCOLCE-I (Gl?ckner et al. 1998). We then examined if RUMMAGE-DP, a newly developed analysis tool for gene discovery which was designed for human and mouse genomic DNA, can be used for automatic annotation of Fugu genomic sequence. The exon prediction programs contained in RUMMAGE-DP performed better overall for the human sequence than for the Fugu contig. The GENSCAN program was the only exon prediction programme that performed equally well for both organisms. We show that RUMMAGE-DP is very useful in automatic analysis of Fugu sequences. Comparative analysis of the genomic structure of the PCOLCE-I genes in Fugu and human reveals that the exon/intron structure throughout the protein coding region is almost identical. We defined an additional domain based on the high degree of similarity of 26 aa between mammals and Fugu. The PCOLCE-I protein in both organisms contains two highly conserved CUB domains. Exons 6 and 7 are the only coding exons that differ in length between the two species. We assume that these exons do not code for any catalytic domain of the protein. Analysis of the remaining five Fugu genes within the 66 kb interval revealed no conserved synteny with the corresponding human 7q22 region. Received: 13 October 1998 / Accepted: 25 July 1999 相似文献

19.

Proteomic 2DE database for spot selection,automated annotation,and data analysis

Malmström L Malmström J Marko-Varga G Westergren-Thorsson G 《Journal of proteome research》2002,1(2):135-138

We present a software solution that enables faster and more accurate data analysis of 2DE/MALDI TOF MS data. The software supports data analysis through a number of automated data selection functions and advanced graphical tools. Once protein identities are determined using MALDI TOF MS, automated data retrieval from online databases provides biological information. The software, called 2DDB, reduces analysis time to a fraction without losing any quality compared to more manual data analysis. The database contains over 100,000 data entries, and selected parts can be reached at http://2ddb.org. 相似文献

20.

ARC: automated resource classifier for agglomerative functional classification of prokaryotic proteins using annotation texts

Gnanamani M Kumar N Ramachandran S 《Journal of biosciences》2007,32(5):937-945

Functional classification of proteins is central to comparative genomics. The need for algorithms tuned to enable integrative interpretation of analytical data is felt globally. The availability of a general,automated software with built-in flexibility will significantly aid this activity. We have prepared ARC (Automated Resource Classifier), which is an open source software meeting the user requirements of flexibility. The default classification scheme based on keyword match is agglomerative and directs entries into any of the 7 basic non-overlapping functional classes: Cell wall, Cell membrane and Transporters (C), Cell division (D), Information (I), Translocation (L), Metabolism (M), Stress(R), Signal and communication (S) and 2 ancillary classes: Others (O) and Hypothetical (H).The keyword library of ARC was built serially by first drawing keywords from Bacillus subtilis and Escherichia coli K12. In subsequent steps,this library was further enriched by collecting terms from archaeal representative Archaeoglobus fulgidus, Gene Ontology, and Gene Symbols. ARC is 94.04% successful on 6,75,663 annotated proteins from 348 prokaryotes. Three examples are provided to illuminate the current perspectives on mycobacterial physiology and costs of proteins in 333 prokaryotes. ARC is available at http://arc.igib.res.in. 相似文献