共查询到20条相似文献,搜索用时 15 毫秒
1.
Martijn P van Iersel Alexander R Pico Thomas Kelder Jianjiong Gao Isaac Ho Kristina Hanspers Bruce R Conklin Chris T Evelo 《BMC bioinformatics》2010,11(1):5
Background
Many complementary solutions are available for the identifier mapping problem. This creates an opportunity for bioinformatics tool developers. Tools can be made to flexibly support multiple mapping services or mapping services could be combined to get broader coverage. This approach requires an interface layer between tools and mapping services. 相似文献2.
Mark Wilkinson 《Source code for biology and medicine》2006,1(1):1-8
Background
The BioMoby project aims to identify and deploy standards and conventions that aid in the discovery, execution, and pipelining of distributed bioinformatics Web Services. As of August, 2006, approximately 680 bioinformatics resources were available through the BioMoby interoperability platform. There are a variety of clients that can interact with BioMoby-style services. Here we describe a Web-based browser-style client – Gbrowse Moby – that allows users to discover and "surf" from one bioinformatics service to the next using a semantically-aided browsing interface. 相似文献3.
Background
The prediction of the secondary structure of proteins is one of the most studied problems in bioinformatics. Despite their success in many problems of biological sequence analysis, Hidden Markov Models (HMMs) have not been used much for this problem, as the complexity of the task makes manual design of HMMs difficult. Therefore, we have developed a method for evolving the structure of HMMs automatically, using Genetic Algorithms (GAs). 相似文献4.
Thomas Kuhn Egon L Willighagen Achim Zielesny Christoph Steinbeck 《BMC bioinformatics》2010,11(1):159
Background
Small molecules are of increasing interest for bioinformatics in areas such as metabolomics and drug discovery. The recent release of large open access chemistry databases generates a demand for flexible tools to process them and discover new knowledge. To freely support open science based on these data resources, it is desirable for the processing tools to be open source and available for everyone. 相似文献5.
Background
In mouse the cytokine interleukin-7 (IL-7) is required for generation of B lymphocytes, but human IL-7 does not appear to have this function. A bioinformatics approach was therefore used to identify IL-7 receptor related genes in the hope of identifying the elusive human cytokine. 相似文献6.
XMPP for cloud computing in bioinformatics supporting discovery and invocation of asynchronous web services 总被引:1,自引:0,他引:1
Background
Life sciences make heavily use of the web for both data provision and analysis. However, the increasing amount of available data and the diversity of analysis tools call for machine accessible interfaces in order to be effective. HTTP-based Web service technologies, like the Simple Object Access Protocol (SOAP) and REpresentational State Transfer (REST) services, are today the most common technologies for this in bioinformatics. However, these methods have severe drawbacks, including lack of discoverability, and the inability for services to send status notifications. Several complementary workarounds have been proposed, but the results are ad-hoc solutions of varying quality that can be difficult to use. 相似文献7.
David W Kane Moses M Hohman Ethan G Cerami Michael W McCormick Karl F Kuhlmman Jeff A Byrd 《BMC bioinformatics》2006,7(1):273
Background
Agile is an iterative approach to software development that relies on strong collaboration and automation to keep pace with dynamic environments. We have successfully used agile development approaches to create and maintain biomedical software, including software for bioinformatics. This paper reports on a qualitative study of our experiences using these methods. 相似文献8.
Background
Very often genome-wide data analysis requires the interoperation of multiple databases and analytic tools. A large number of genome databases and bioinformatics applications are available through the web, but it is difficult to automate interoperation because: 1) the platforms on which the applications run are heterogeneous, 2) their web interface is not machine-friendly, 3) they use a non-standard format for data input and output, 4) they do not exploit standards to define application interface and message exchange, and 5) existing protocols for remote messaging are often not firewall-friendly. To overcome these issues, web services have emerged as a standard XML-based model for message exchange between heterogeneous applications. Web services engines have been developed to manage the configuration and execution of a web services workflow. 相似文献9.
Background
Benchmarking algorithms in structural bioinformatics often involves the construction of datasets of proteins with given sequence and structural properties. The SCOP database is a manually curated structural classification which groups together proteins on the basis of structural similarity. The ASTRAL compendium provides non redundant subsets of SCOP domains on the basis of sequence similarity such that no two domains in a given subset share more than a defined degree of sequence similarity. Taken together these two resources provide a 'ground truth' for assessing structural bioinformatics algorithms. We present a small and easy to use API written in python to enable construction of datasets from these resources. 相似文献10.
Background
Many online resources for the life sciences have been developed and introduced in peer-reviewed papers recently, ranging from databases and web applications to data-analysis software. Some have been introduced in special journal issues or websites with a search function, but others remain scattered throughout the Internet and in the published literature. The searchable resources on these sites are collected and maintained manually and are therefore of higher quality than automatically updated sites, but also require more time and effort. 相似文献11.
Bj?rn P. Pedersen Georgiana Ifrim Poul Liboriussen Kristian B. Axelsen Michael G. Palmgren Poul Nissen Carsten Wiuf Christian N. S. Pedersen 《PloS one》2014,9(1)
Background
Structured Logistic Regression (SLR) is a newly developed machine learning tool first proposed in the context of text categorization. Current availability of extensive protein sequence databases calls for an automated method to reliably classify sequences and SLR seems well-suited for this task. The classification of P-type ATPases, a large family of ATP-driven membrane pumps transporting essential cations, was selected as a test-case that would generate important biological information as well as provide a proof-of-concept for the application of SLR to a large scale bioinformatics problem.Results
Using SLR, we have built classifiers to identify and automatically categorize P-type ATPases into one of 11 pre-defined classes. The SLR-classifiers are compared to a Hidden Markov Model approach and shown to be highly accurate and scalable. Representing the bulk of currently known sequences, we analysed 9.3 million sequences in the UniProtKB and attempted to classify a large number of P-type ATPases. To examine the distribution of pumps on organisms, we also applied SLR to 1,123 complete genomes from the Entrez genome database. Finally, we analysed the predicted membrane topology of the identified P-type ATPases.Conclusions
Using the SLR-based classification tool we are able to run a large scale study of P-type ATPases. This study provides proof-of-concept for the application of SLR to a bioinformatics problem and the analysis of P-type ATPases pinpoints new and interesting targets for further biochemical characterization and structural analysis. 相似文献12.
Background
Dynamic programming is a widely used programming technique in bioinformatics. In sharp contrast to the simplicity of textbook examples, implementing a dynamic programming algorithm for a novel and non-trivial application is a tedious and error prone task. The algebraic dynamic programming approach seeks to alleviate this situation by clearly separating the dynamic programming recurrences and scoring schemes. 相似文献13.
Daniel Rios William M McLaren Yuan Chen Ewan Birney Arne Stabenau Paul Flicek Fiona Cunningham 《BMC bioinformatics》2010,11(1):238
Background
Advances in sequencing and genotyping technologies are leading to the widespread availability of multi-species variation data, dense genotype data and large-scale resequencing projects. The 1000 Genomes Project and similar efforts in other species are challenging the methods previously used for storage and manipulation of such data necessitating the redesign of existing genome-wide bioinformatics resources. 相似文献14.
Background
Supervised classification is fundamental in bioinformatics. Machine learning models, such as neural networks, have been applied to discover genes and expression patterns. This process is achieved by implementing training and test phases. In the training phase, a set of cases and their respective labels are used to build a classifier. During testing, the classifier is used to predict new cases. One approach to assessing its predictive quality is to estimate its accuracy during the test phase. Key limitations appear when dealing with small-data samples. This paper investigates the effect of data sampling techniques on the assessment of neural network classifiers. 相似文献15.
Jacob Köhler Katherine Munn Alexander Rüegg Andre Skusa Barry Smith 《BMC bioinformatics》2006,7(1):212-12
Background
Ontologies and taxonomies are among the most important computational resources for molecular biology and bioinformatics. A series of recent papers has shown that the Gene Ontology (GO), the most prominent taxonomic resource in these fields, is marked by flaws of certain characteristic types, which flow from a failure to address basic ontological principles. As yet, no methods have been proposed which would allow ontology curators to pinpoint flawed terms or definitions in ontologies in a systematic way. 相似文献16.
17.
18.
Background
The reconstruction of genetic regulatory networks from microarray gene expression data has been a challenging task in bioinformatics. Various approaches to this problem have been proposed, however, they do not take into account the topological characteristics of the targeted networks while reconstructing them. 相似文献19.
Saskia Preissner Mathias Dunkel Michael F. Hoffmann Sarah C. Preissner Nikolai Genov Wen Wei Rong Robert Preissner Karlheinz Seeger 《PloS one》2012,7(12)
Background
In general, drug metabolism has to be considered to avoid adverse effects and ineffective therapy. In particular, chemotherapeutic drug cocktails strain drug metabolizing enzymes especially the cytochrome P450 family (CYP). Furthermore, a number of important chemotherapeutic drugs such as cyclophosphamide, ifosfamide, tamoxifen or procarbazine are administered as prodrugs and have to be activated by CYP. Therefore, the genetic variability of these enzymes should be taken into account to design appropriate therapeutic regimens to avoid inadequate drug administration, toxicity and inefficiency.Objective
The aim of this work was to find drug interactions and to avoid side effects or ineffective therapy in chemotherapy.Data sources and methods
Information on drug administration in the therapy of leukemia and their drug metabolism was collected from scientific literature and various web resources. We carried out an automated textmining approach. Abstracts of PubMed were filtered for relevant articles using specific keywords. Abstracts were automatically screened for antineoplastic drugs and their synonyms in combination with a set of human CYPs in title or abstract.Results
We present a comprehensive analysis of over 100 common cancer treatment regimens regarding drug-drug interactions and present alternatives avoiding CYP overload. Typical concomitant medication, e.g. antiemetics or antibiotics is a preferred subject to improvement. A webtool, which allows drug cocktail optimization was developed and is publicly available on http://bioinformatics.charite.de/chemotherapy. 相似文献20.
Andrew C Doxey Zhenyu Cheng Barbara A Moffatt Brendan J McConkey 《BMC structural biology》2010,10(1):23