共查询到20条相似文献,搜索用时 0 毫秒
1.
2.
White adipose tissue reference network: a knowledge resource for exploring health-relevant relations
Thomas Kelder Georg Summer Martien Caspers Evert M. van Schothorst Jaap Keijer Loes Duivenvoorde Susanne Klaus Anja Voigt Laura Bohnert Catalina Pico Andreu Palou M. Luisa Bonet Aldona Dembinska-Kiec Malgorzata Malczewska-Malec Beata Kie?-Wilk Josep M. del Bas Antoni Caimari Lluis Arola Marjan van Erk Ben van Ommen Marijana Radonjic 《Genes & nutrition》2015,10(1)
Optimal health is maintained by interaction of multiple intrinsic and environmental factors at different levels of complexity—from molecular, to physiological, to social. Understanding and quantification of these interactions will aid design of successful health interventions. We introduce the reference network concept as a platform for multi-level exploration of biological relations relevant for metabolic health, by integration and mining of biological interactions derived from public resources and context-specific experimental data. A White Adipose Tissue Health Reference Network (WATRefNet) was constructed as a resource for discovery and prioritization of mechanism-based biomarkers for white adipose tissue (WAT) health status and the effect of food and drug compounds on WAT health status. The WATRefNet (6,797 nodes and 32,171 edges) is based on (1) experimental data obtained from 10 studies addressing different adiposity states, (2) seven public knowledge bases of molecular interactions, (3) expert’s definitions of five physiologically relevant processes key to WAT health, namely WAT expandability, Oxidative capacity, Metabolic state, Oxidative stress and Tissue inflammation, and (4) a collection of relevant biomarkers of these processes identified by BIOCLAIMS (http://bioclaims.uib.es). The WATRefNet comprehends multiple layers of biological complexity as it contains various types of nodes and edges that represent different biological levels and interactions. We have validated the reference network by showing overrepresentation with anti-obesity drug targets, pathology-associated genes and differentially expressed genes from an external disease model dataset. The resulting network has been used to extract subnetworks specific to the above-mentioned expert-defined physiological processes. Each of these process-specific signatures represents a mechanistically supported composite biomarker for assessing and quantifying the effect of interventions on a physiological aspect that determines WAT health status. Following this principle, five anti-diabetic drug interventions and one diet intervention were scored for the match of their expression signature to the five biomarker signatures derived from the WATRefNet. This confirmed previous observations of successful intervention by dietary lifestyle and revealed WAT-specific effects of drug interventions. The WATRefNet represents a sustainable knowledge resource for extraction of relevant relationships such as mechanisms of action, nutrient intervention targets and biomarkers and for assessment of health effects for support of health claims made on food products.
Electronic supplementary material
The online version of this article (doi:10.1007/s12263-014-0439-x) contains supplementary material, which is available to authorized users. 相似文献3.
Rose Oughtred Jennifer Rust Christie Chang Bobby‐Joe Breitkreutz Chris Stark Andrew Willems Lorrie Boucher Genie Leung Nadine Kolas Frederick Zhang Sonam Dolma Jasmin Coulombe‐Huntington Andrew Chatr‐aryamontri Kara Dolinski Mike Tyers 《Protein science : a publication of the Protein Society》2021,30(1):187-200
The BioGRID (Biological General Repository for Interaction Datasets, thebiogrid.org ) is an open‐access database resource that houses manually curated protein and genetic interactions from multiple species including yeast, worm, fly, mouse, and human. The ~1.93 million curated interactions in BioGRID can be used to build complex networks to facilitate biomedical discoveries, particularly as related to human health and disease. All BioGRID content is curated from primary experimental evidence in the biomedical literature, and includes both focused low‐throughput studies and large high‐throughput datasets. BioGRID also captures protein post‐translational modifications and protein or gene interactions with bioactive small molecules including many known drugs. A built‐in network visualization tool combines all annotations and allows users to generate network graphs of protein, genetic and chemical interactions. In addition to general curation across species, BioGRID undertakes themed curation projects in specific aspects of cellular regulation, for example the ubiquitin‐proteasome system, as well as specific disease areas, such as for the SARS‐CoV‐2 virus that causes COVID‐19 severe acute respiratory syndrome. A recent extension of BioGRID, named the Open Repository of CRISPR Screens (ORCS, orcs.thebiogrid.org ), captures single mutant phenotypes and genetic interactions from published high throughput genome‐wide CRISPR/Cas9‐based genetic screens. BioGRID‐ORCS contains datasets for over 1,042 CRISPR screens carried out to date in human, mouse and fly cell lines. The biomedical research community can freely access all BioGRID data through the web interface, standardized file downloads, or via model organism databases and partner meta‐databases. 相似文献
4.
Drug discovery is the process of new drug identification. This process is driven by the increasing data from existing chemical libraries and data banks. The knowledge graph is introduced to the domain of drug discovery for imposing an explicit structure to integrate heterogeneous biomedical data. The graph can provide structured relations among multiple entities and unstructured semantic relations associated with entities. In this review, we summarize knowledge graph-based works that implement drug repurposing and adverse drug reaction prediction for drug discovery. As knowledge representation learning is a common way to explore knowledge graphs for prediction problems, we introduce several representative embedding models to provide a comprehensive understanding of knowledge representation learning. 相似文献
5.
Ohta T Pyysalo S Kim JD Tsujii J 《Journal of bioinformatics and computational biology》2010,8(5):917-928
Text mining can support the interpretation of the enormous quantity of textual data produced in biomedical field. Recent developments in biomedical text mining include advances in the reliability of the recognition of named entities (NEs) such as specific genes and proteins, as well as movement toward richer representations of the associations of NEs. We argue that this shift in representation should be accompanied by the adoption of a more detailed model of the relations holding between NEs and other relevant domain terms. As a step toward this goal, we study NE-term relations with the aim of defining a detailed, broadly applicable set of relation types based on accepted domain standard concepts for use in corpus annotation and domain information extraction approaches. 相似文献
6.
《Biochimica et Biophysica Acta - Proteins and Proteomics》2020,1868(11):140504
MotivationProtein-protein interactions are important for many biological processes. Theoretical understanding of the structurally determining factors of interaction sites will help to understand the underlying mechanism of protein-protein interactions. Taking advantage of advanced mathematical methods to correctly predict interaction sites will be useful. Although some previous studies have been devoted to the interaction interface of protein monomer and the interface residues between chains of protein dimers, very few studies about the interface residues prediction of protein multimers, including trimers, tetramer and even more monomers in a large protein complex. As we all know, a large number of proteins function with the form of multibody protein complexes. And the complexity of the protein multimers structure causes the difficulty of interface residues prediction on them. So, we hope to build a method for the prediction of protein tetramer interface residue pairs.ResultsHere, we developed a new deep network based on LSTM network combining with graph to predict protein tetramers interaction interface residue pairs. On account of the protein structure data is not the same as the image or video data which is well-arranged matrices, namely the Euclidean Structure mentioned in many researches. Because the Non-Euclidean Structure data can't keep the translation invariance, and we hope to extract some spatial features from this kind of data applying on deep learning, an algorithm combining with graph was developed to predict the interface residue pairs of protein interactions based on a topological graph building a relationship between vertexes and edges in graph theory combining multilayer Long Short-Term Memory network. First, selecting the training and test samples from the Protein Data Bank, and then extracting the physicochemical property features and the geometric features of surface residue associated with interfacial properties. Subsequently, we transform the protein multimers data to topological graphs and predict protein interaction interface residue pairs using the model. In addition, different types of evaluation indicators verified its validity. 相似文献
7.
BioContrasts: extracting and exploiting protein-protein contrastive relations from biomedical literature 总被引:2,自引:0,他引:2
MOTIVATION: Contrasts are useful conceptual vehicles for learning processes and exploratory research of the unknown. For example, contrastive information between proteins can reveal what similarities, divergences and relations there are of the two proteins, leading to invaluable insights for better understanding about the proteins. Such contrastive information are found to be reported in the biomedical literature. However, there have been no reported attempts in current biomedical text mining work that systematically extract and present such useful contrastive information from the literature for exploitation. RESULTS: Our BioContrasts system extracts protein-protein contrastive information from MEDLINE abstracts and presents the information to biologists in a web-application for exploitation. Contrastive information are identified in the text abstracts with contrastive negation patterns such as 'A but not B'. A total of 799 169 pairs of contrastive expressions were successfully extracted from 2.5 million MEDLINE abstracts. Using grounding of contrastive protein names to Swiss-Prot entries, we were able to produce 41 471 pieces of contrasts between Swiss-Prot protein entries. These contrastive pieces of information are then presented via a user-friendly interactive web portal that can be exploited for applications such as the refinement of biological pathways. AVAILABILITY: BioContrasts can be accessed at http://biocontrasts.i2r.a-star.edu.sg. It is also mirrored at http://biocontrasts.biopathway.org. SUPPLEMENTARY INFORMATION: Supplementary materials are available at Bioinformatics online. 相似文献
8.
We present BioGraph, a data integration and data mining platform for the exploration and discovery of biomedical information.
The platform offers prioritizations of putative disease genes, supported by functional hypotheses. We show that BioGraph can
retrospectively confirm recently discovered disease genes and identify potential susceptibility genes, outperforming existing
technologies, without requiring prior domain knowledge. Additionally, BioGraph allows for generic biomedical applications
beyond gene discovery. BioGraph is accessible at . 相似文献
9.
Search engines running on MEDLINE abstracts have been widely used by biologists to find publications that are related to their research. The existing search engines such as PubMed, however, have limitations when applied for the task of seeking textual evidence of relations between given concepts. The limitations are mainly due to the problem that the search engines do not effectively deal with multi-term queries which may imply semantic relations between the terms. To address this problem, we present MedEvi, a novel search engine that imposes positional restriction on occurrences matching multi-term queries, based on the observation that terms with semantic relations which are explicitly stated in text are not found too far from each other. MedEvi further identifies additional keywords of biological and statistical significance from local context of matching occurrences in order to help users reformulate their queries for better results. AVAILABILITY: http://www.ebi.ac.uk/tc-test/textmining/medevi/ 相似文献
10.
Paul Thompson John McNaught Simonetta Montemagni Nicoletta Calzolari Riccardo del Gratta Vivian Lee Simone Marchi Monica Monachini Piotr Pezik Valeria Quochi CJ Rupp Yutaka Sasaki Giulia Venturi Dietrich Rebholz-Schuhmann Sophia Ananiadou 《BMC bioinformatics》2011,12(1):1-29
Background
Due to the rapidly expanding body of biomedical literature, biologists require increasingly sophisticated and efficient systems to help them to search for relevant information. Such systems should account for the multiple written variants used to represent biomedical concepts, and allow the user to search for specific pieces of knowledge (or events) involving these concepts, e.g., protein-protein interactions. Such functionality requires access to detailed information about words used in the biomedical literature. Existing databases and ontologies often have a specific focus and are oriented towards human use. Consequently, biological knowledge is dispersed amongst many resources, which often do not attempt to account for the large and frequently changing set of variants that appear in the literature. Additionally, such resources typically do not provide information about how terms relate to each other in texts to describe events.Results
This article provides an overview of the design, construction and evaluation of a large-scale lexical and conceptual resource for the biomedical domain, the BioLexicon. The resource can be exploited by text mining tools at several levels, e.g., part-of-speech tagging, recognition of biomedical entities, and the extraction of events in which they are involved. As such, the BioLexicon must account for real usage of words in biomedical texts. In particular, the BioLexicon gathers together different types of terms from several existing data resources into a single, unified repository, and augments them with new term variants automatically extracted from biomedical literature. Extraction of events is facilitated through the inclusion of biologically pertinent verbs (around which events are typically organized) together with information about typical patterns of grammatical and semantic behaviour, which are acquired from domain-specific texts. In order to foster interoperability, the BioLexicon is modelled using the Lexical Markup Framework, an ISO standard.Conclusions
The BioLexicon contains over 2.2 M lexical entries and over 1.8 M terminological variants, as well as over 3.3 M semantic relations, including over 2 M synonymy relations. Its exploitation can benefit both application developers and users. We demonstrate some such benefits by describing integration of the resource into a number of different tools, and evaluating improvements in performance that this can bring. 相似文献11.
High-quality protein knowledge resource: SWISS-PROT and TrEMBL 总被引:8,自引:0,他引:8
O'Donovan C Martin MJ Gattiker A Gasteiger E Bairoch A Apweiler R 《Briefings in bioinformatics》2002,3(3):275-284
12.
Silverstein KA Shoop E Johnson JE Kilian A Freeman JL Kunau TM Awad IA Mayer M Retzel EF 《Nucleic acids research》2001,29(1):49-51
MetaFam is a comprehensive relational database of protein family information. This web-accessible resource integrates data from several primary sequence and secondary protein family databases. By pooling together the information from these disparate sources, MetaFam is able to provide the most complete protein family sets available. Users are able to explore the interrelationships among these primary and secondary databases using a powerful graphical visualization tool, MetaFamView. Additionally, users can identify corresponding sequence entries among the sequence databases, obtain a quick summary of corresponding families (and their sequence members) among the family databases, and even attempt to classify their own unassigned sequences. Hypertext links to the appropriate source databases are provided at every level of navigation. Global family database statistics and information are also provided. Public access to the data is available at http://metafam.ahc.umn.edu/. 相似文献
13.
14.
Wesselingh RA 《The New phytologist》2007,174(1):26-37
The standard method of measuring pollen limitation is to add pollen to a number of flowers, preferably to a whole plant, and to compare fruit and seed set with that of naturally pollinated flowers on other plants. In 25 yr of research, this method has yielded valuable data, but it is difficult to use in large plants. This has caused a bias in the available data towards smaller, herbaceous plants with relatively few flowers. I argue that, in order to widen our knowledge of how pollen limitation affects plants, we should go beyond whole-plant pollen addition and change our concept of how a flowering plant functions. The traditional method does not take into account the variation in and dynamics of resource allocation and pollen availability. The concept of integrated physiological units (IPUs) does, but, although it has been applied to pollination biology, it has not received the attention it deserves. I use this article to present its merits again, to propose a step-by-step methodology for studying pollen limitation, and to examine factors influencing possible plant strategies. 相似文献
15.
An organism''s survival strategy under the constantly changing environment depends on its ability to sense and respond to
changes in its environment. Archaea, being capable to grow under various extreme environmental conditions, provide valuable
model for exploring how single-celled organisms respond to environmental stresses. However, no such approach has ever been
made to make an integrated classification of various archaeal stress responses.Archaeal Stress Response Database (ASRDb) is a web accessible (http://121.241.218.70/ASRDb) database that represents the first
online available resource providing a comprehensive overview of stress response genes of 66 archaeal genomes. This database
currently contains almost 6000 stress specific genes of 66 archaeal genomes. All the stress specific genes are grouped into 17
different stress categories. A user-friendly interface has been designed to examine data using query tools. This database provides
an efficient search engine for random and advanced database search operations. We have incorporated BLAST search options to
the resulting sequences retrieved from database search operations. A site map page representing the schematic diagram will enable
user to understand the logic behind the construction of the database. We have also provided a very rich and informative help page
to make user familiar with the database. We sincerely believe that ASRDb will be of particular interest to the life science
community and facilitates the biologists to unravel the role of stress specific genes in the adaptation of microorganisms under
various extreme environmental conditions. 相似文献
16.
MOTIVATION: The formal representation of mereological aspects of canonical anatomy (parthood relations) is relatively well understood. The formal representation of other aspects of canonical anatomy, such as connectedness and adjacency relations between anatomical parts, their shape and size as well as the spatial arrangement of anatomical parts within larger anatomical structures are, however, much less well understood and represented in existing computational anatomical and bio-medical ontologies only insufficiently. RESULTS: In this article, we provide a methodology of how to incorporate this kind of information into anatomical and bio-medical ontologies by applying techniques of representing qualitative spatial information from Artificial Intelligence. In particular, we focus on how to explicitly take into account the qualitative and time-dependent character of these relations. As a running example, we use the human temporomandibular joint (TMJ). AVAILABILITY: Using the presented methodology, a formal ontology was developed which is accessible on http://www.ifomis.org/bfo/fol. This ontology may help to improve the logical and ontological rigor of bio-medical ontologies such as the OBO relation ontology. 相似文献
17.
Markus Bundschus Mathaeus Dejori Martin Stetter Volker Tresp Hans-Peter Kriegel 《BMC bioinformatics》2008,9(1):207
Background
The increasing amount of published literature in biomedicine represents an immense source of knowledge, which can only efficiently be accessed by a new generation of automated information extraction tools. Named entity recognition of well-defined objects, such as genes or proteins, has achieved a sufficient level of maturity such that it can form the basis for the next step: the extraction of relations that exist between the recognized entities. Whereas most early work focused on the mere detection of relations, the classification of the type of relation is also of great importance and this is the focus of this work. In this paper we describe an approach that extracts both the existence of a relation and its type. Our work is based on Conditional Random Fields, which have been applied with much success to the task of named entity recognition. 相似文献18.
We live in an age of access to more information than ever before. This can be a double-edged sword. Increased access to information allows for more informed and empowered researchers, while information overload becomes an increasingly serious risk. Thus, there is a need for intelligent information retrieval systems that can summarize relevant and reliable textual sources to satisfy a user's query. Question answering is a specialized type of information retrieval with the aim of returning precise short answers to queries posed as natural language questions. We present a review and comparison of three biomedical question answering systems: askHERMES (http://www.askhermes.org/), EAGLi (http://eagl.unige.ch/EAGLi/), and HONQA (http://services.hon.ch/cgi-bin/QA10/qa.pl). 相似文献
19.
Jun Huan Deepak Bandyopadhyay Wei Wang Jack Snoeyink Jan Prins Alexander Tropsha 《Journal of computational biology》2005,12(6):657-671
We find recurring amino-acid residue packing patterns, or spatial motifs, that are characteristic of protein structural families, by applying a novel frequent subgraph mining algorithm to graph representations of protein three-dimensional structure. Graph nodes represent amino acids, and edges are chosen in one of three ways: first, using a threshold for contact distance between residues; second, using Delaunay tessellation; and third, using the recently developed almost-Delaunay edges. For a set of graphs representing a protein family from the Structural Classification of Proteins (SCOP) database, subgraph mining typically identifies several hundred common subgraphs corresponding to spatial motifs that are frequently found in proteins in the family but rarely found outside of it. We find that some of the large motifs map onto known functional regions in two protein families explored in this study, i.e., serine proteases and kinases. We find that graphs based on almost-Delaunay edges significantly reduce the number of edges in the graph representation and hence present computational advantage, yet the patterns extracted from such graphs have a biological interpretation approximately equivalent to that of those extracted from distance based graphs. 相似文献
20.
Sinding C 《Journal of the history of biology》1989,22(3):461-495
Two essential periods may be identified in the early stages of the history of vitamin D-resistant rickets. The first was the period during which a very well known deficiency disease, rickets, acquired a scientific status: this required the development of unifying principles to confer upon the newly developing science of pathology a doctrine without which it would have been condemned to remain a collection of unrelated facts with very little practical application. One first such unifying principle was provided by the notion of hygiene; while the blanket explanations provided by this notion alone were much too general to enable rickets to achieve scientific status, it did point out the need to look for specific external causes for the disease and to examine the life-style and dietary habits of the patients. The second phase of the conceptualization of the disease was the assumption of a specific cause — that is, using concepts developed in the area of infectious diseases. This was made possible by fundamental similarities between deficiency and infectious diseases, in spite of their apparent differences. Both types of illness have some of the characteristics of what Georges Canguilhem calls the ontological representation of disease77 as opposed to dysharmonic representations, which primarily concern the endocrine diseases. It is precisely this shift from one to the other manner of perceiving ill health that enabled the identification of the vitamin D-resistant rickets. Conceptualization of the notion of vitamin D resistancy required that the conditions causing the disease be looked for within the organism rather than outside it; this was thus the first time that endocrine concepts were applied to studying the physiology of vitamin D.The history of resistant rickets therefore represents an interesting model for understanding the growth of biomedical knowledge. It allows the development of a number of more general ideas on the question of the relationship between biology and medicine and on the thorny problem of specificity in medical thinking. As far as this topic is concerned it can be seen that there was an ongoing exchange between medical and biological thinking during the initial period up until 1937, and that one could deny any such specificity in medical thinking during this same period. Albright, for example, used human diseases as spontaneously produced models for experimental biology. It was also more feasible for an experimental biologist to administer toxic doses than for a clinician to do so. 相似文献