共查询到20条相似文献,搜索用时 15 毫秒
1.
About five years ago, ontology was almost unknown in bioinformatics, even more so in molecular biology. Nowadays, many bioinformatics articles mention it in connection with text mining, data integration or as a metaphysical cure for problems in standardisation of nomenclature and other applications. This article attempts to give an account of what concept ontologies in the domain of biology and bioinformatics are; what they are not; how they can be constructed; how they can be used; and some fallacies and pitfalls creators and users should be aware of. 相似文献
5.
Modern technologies have rapidly transformed biology into a data-intensive discipline. In addition to the enormous amounts of existing experimental data in the literature, every new study can produce a large amount of new data, resulting in novel ideas and more publications. In order to understand a biological process as completely as possible, scientists should be able to combine and analyze all such information. Not only molecular biology and bioinformatics, but all the other domains of biology including plant biology, require tools and technologies that enable experts to capture knowledge within distributed and heterogeneous sources of information. Ontologies have proven to be one of the most-useful means of constructing and formalizing expert knowledge. The key feature of an ontology is that it represents a computer-interpretable model of a particular subject area. This article outlines the importance of ontologies for systems biology, data integration and information analyses, as illustrated through the example of reactive oxygen species (ROS) signaling networks in plants. 相似文献
6.
SUMMARY: Although dozens of biological ontologies have been created and deployed, relatively little attention has been given to using ontologies to represent behavior. Ontologies for two different behavior systems are described here. One ontology was a translation of a published ethogram, and the second was coded from video clips in a comparative study of jumping spider courtship. AVAILABILITY: http://mesquiteproject.org/ontology/. 相似文献
7.
Background SeqHound has been developed as an integrated biological sequence, taxonomy, annotation and 3-D structure database system. It provides a high-performance server platform for bioinformatics research in a locally-hosted environment. 相似文献
8.
MOTIVATION: The output of a bioinformatic tool such as BLAST must usually be interpreted by an expert before reliable conclusions can be drawn. This may be based upon the expert's experience, additional data and statistical analysis. Often the process is laborious, goes unrecorded and may be biased. Argumentation is an established technique for reasoning about situations where absolute truth or precise probability is impossible to determine. RESULTS: We demonstrate the application of argumentation to 3D-PSSM, a protein structure prediction tool. The expert's interpretation of results is represented as an argumentation framework. Given a 3D-PSSM result, an automated procedure constructs arguments for and against the conclusion that the result is a good predictor of protein structure. In addition to capturing the unique expertise of the author of 3D-PSSM for distribution to users, an improvement in recall of 5-10 percentage points is achieved. This technique can be applied to a wide range of bioinformatic tools. AVAILABILITY: Example public server and benchmarking data are available at http://www.sbg.bio.ic.ac.uk/~brj03/argumentation/paper/. Source code available on request. 相似文献
9.
The contemporary research and development context in multidisciplinary biology has a serious requirement for integrating knowledge from disparate sources, and facilitating much-needed inter- and intra-disciplinary dialogue. A multiplicity of models arises when pluralistic approaches to modelling are followed, and also when there is not only a requirement to model systems and data, but also knowledge of systems and data. The challenges of addressing this multiplicity do not only include articulating the structure of complex systems, but also placing modelling within the framework of a process as well as a product. The graph representations presented here facilitate dialogue, modelling, clarification and specification of concepts, and the sharing of terms. This paper explores relationships between collections of graph representations. It is hoped that in future, when readers look at a node or a process in a graph, they will have a much deeper appreciation of relationships and context. 相似文献
10.
BackgroundBioinformatics has gained much attention as a fast growing interdisciplinary field. Several attempts have been conducted to explore the field of bioinformatics by bibliometric analysis, however, such works did not elucidate the role of visualization in analysis, nor focus on the relationship between sub-topics of bioinformatics. ResultsFirst, the hotspot of bioinformatics has moderately shifted from traditional molecular biology to omics research, and the computational method has also shifted from mathematical model to data mining and machine learning. Second, DNA-related topics are bridge topics in bioinformatics research. These topics gradually connect various sub-topics that are relatively independent at first. Third, only a small part of topics we have obtained involves a number of computational methods, and the other topics focus more on biological aspects. Fourth, the proportion of computing-related topics hit a trough in the 1980s. During this period, the use of traditional calculation methods such as mathematical model declined in a large proportion while the new calculation methods such as machine learning have not been applied in a large scale. This proportion began to increase gradually after the 1990s. Fifth, although the proportion of computing-related topics is only slightly higher than the original, the connection between other topics and computing-related topics has become closer, which means the support of computational methods is becoming increasingly important for the research of bioinformatics. ConclusionsThe results of our analysis imply that research on bioinformatics is becoming more diversified and the ranking of computational methods in bioinformatics research is also gradually improving. 相似文献
11.
Comprehensive understanding of biological systems requires efficient and systematic assimilation of high-throughput datasets in the context of the existing knowledge base. A major limitation in the field of proteomics is the lack of an appropriate software platform that can synthesize a large number of experimental datasets in the context of the existing knowledge base. Here, we describe a software platform, termed PROTEOME-3D, that utilizes three essential features for systematic analysis of proteomics data: creation of a scalable, queryable, customized database for identified proteins from published literature; graphical tools for displaying proteome landscapes and trends from multiple large-scale experiments; and interactive data analysis that facilitates identification of crucial networks and pathways. Thus, PROTEOME-3D offers a standardized platform to analyze high-throughput experimental datasets for the identification of crucial players in co-regulated pathways and cellular processes. 相似文献
13.
A significant part of our biological knowledge is centered on relationships between biological entities (bio-entities) such as proteins, genes, small molecules, pathways, gene ontology (GO) terms and diseases. Accumulated at an increasing speed, the information on bio-entity relationships is archived in different forms at scattered places. Most of such information is buried in scientific literature as unstructured text. Organizing heterogeneous information in a structured form not only facilitates study of biological systems using integrative approaches, but also allows discovery of new knowledge in an automatic and systematic way. In this study, we performed a large scale integration of bio-entity relationship information from both databases containing manually annotated, structured information and automatic information extraction of unstructured text in scientific literature. The relationship information we integrated in this study includes protein-protein interactions, protein/gene regulations, protein-small molecule interactions, protein-GO relationships, protein-pathway relationships, and pathway-disease relationships. The relationship information is organized in a graph data structure, named integrated bio-entity network (IBN), where the vertices are the bio-entities and edges represent their relationships. Under this framework, graph theoretic algorithms can be designed to perform various knowledge discovery tasks. We designed breadth-first search with pruning (BFSP) and most probable path (MPP) algorithms to automatically generate hypotheses--the indirect relationships with high probabilities in the network. We show that IBN can be used to generate plausible hypotheses, which not only help to better understand the complex interactions in biological systems, but also provide guidance for experimental designs. 相似文献
15.
In this paper it is argued that an expert system requires morethan factual knowledge before it can display expertise in agiven domain. The additional knowledge consists of the heuristicsor rules of thumb used by an expert to manipulateand interpret the factual knowledge. The knowledge acquisitionphase of an expert system project involves determining the factualknowledge (which may be obtained from published sources) andthe heuristics used by an expert to manipulate that knowledge-theseheuristics can only be obtained from an expert. In reviewingexisting biological expert systems it is apparent that manycontain only the factual knowledge relating to the domain, andlack the heuristics that enable such systems to show expertise.This paper reviews a number of knowledge acquisition techniqueswhich could be used for acquiring heuristic knowledge and discusseswhen their use is appropriate. The knowledge acquisition techniquesdiscussed are those suitable for the development of small-scaleexpert systems as these are most likely to be of interest tobiologists. The techniques include the use of questionnaires,interview techniques and protocol analysis; particular emphasisis placed on a mod cation to the twenty questionsinterview technique which was developed specifically to elicittaxonomic knowledge relating to water mite identification. 相似文献
16.
Recently, molecular biologists have sequenced about a dozen bacterial genomes and the first eukaryotic genome. We can now obtain answers to detailed questions about the complete set of genes of an organism. Bioinformatics methods are increasingly used for attaching biological knowledge to long lists of genes, assigning genes to biological pathways, comparing the gene sets of different species, identifying specificity factors, and describing sets of highly conserved proteins common to all domains of life. Substantial progress has recently been made in the availability of primary and added-value databases, in the development of algorithms and of network information services for genome analysis. The pharmaceutical industry has greatly benefited from the accumulation of sequence data through the identification of targets and candidates for the development of drugs, vaccines, diagnostic markers and therapeutic proteins. 相似文献
18.
BIOCHAM (the BIOCHemical Abstract Machine) is a software environment for modeling biochemical systems. It is based on two aspects: (1) the analysis and simulation of boolean, kinetic and stochastic models and (2) the formalization of biological properties in temporal logic. BIOCHAM provides tools and languages for describing protein networks with a simple and straightforward syntax, and for integrating biological properties into the model. It then becomes possible to analyze, query, verify and maintain the model with respect to those properties. For kinetic models, BIOCHAM can search for appropriate parameter values in order to reproduce a specific behavior observed in experiments and formalized in temporal logic. Coupled with other methods such as bifurcation diagrams, this search assists the modeler/biologist in the modeling process. AVAILABILITY: BIOCHAM (v. 2.5) is a free software available for download, with example models, at http://contraintes.inria.fr/BIOCHAM/. 相似文献
20.
A response to Life sentences: Ontology recapitulates philology by Sydney Brenner, Genome Biology 2002, 3:comment1006.1-1006.2. 相似文献
|