Similar Documents
1.
Data mining in bioinformatics using Weka
The Weka machine learning workbench provides a general-purpose environment for automatic classification, regression, clustering and feature selection, which are common data mining problems in bioinformatics research. It contains an extensive collection of machine learning algorithms and data pre-processing methods, complemented by graphical user interfaces for data exploration and for the experimental comparison of different machine learning techniques on the same problem. Weka can process data given in the form of a single relational table. Its main objectives are to (a) assist users in extracting useful information from data and (b) enable them to easily identify a suitable algorithm for generating an accurate predictive model from it. AVAILABILITY: http://www.cs.waikato.ac.nz/ml/weka
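The "single relational table" Weka processes is typically an ARFF file. As a minimal sketch of what that looks like in practice, here is a simplified pure-Python reader for the basic @relation/@attribute/@data layout (a real ARFF parser also handles quoting, sparse data and comments; the gene-expression example data are invented for illustration):

```python
def parse_arff(text):
    """Parse a basic ARFF document into (relation, attribute names, data rows)."""
    relation, attributes, rows = None, [], []
    in_data = False
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith('%'):      # skip blanks and comments
            continue
        lower = line.lower()
        if lower.startswith('@relation'):
            relation = line.split(None, 1)[1]
        elif lower.startswith('@attribute'):
            attributes.append(line.split()[1])    # attribute name only
        elif lower.startswith('@data'):
            in_data = True
        elif in_data:
            rows.append(line.split(','))
    return relation, attributes, rows

example = """@relation gene_expression
@attribute gene string
@attribute expr numeric
@attribute class {up,down}
@data
BRCA1,2.3,up
TP53,0.4,down
"""
relation, attrs, rows = parse_arff(example)
```

Once data are in this tabular form, every Weka learner (classification, regression, clustering, feature selection) can be applied to the same table.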

2.
sMOL Explorer is a 2D ligand-based computational tool that provides three major functionalities through a Web interface: data management; information retrieval and extraction; and statistical analysis and data mining. With sMOL Explorer, users can create personal databases by adding each small molecule via a drawing interface or by uploading data files from internal and external projects into the sMOL database. The database can then be browsed and queried with textual and structural similarity searches. A molecule can also be submitted for search against external public databases, including PubChem, KEGG, DrugBank and eMolecules. Moreover, users can easily access a variety of data mining tools from the Weka and R packages to perform analyses including (1) finding frequent substructures, (2) clustering molecular fingerprints, (3) identifying and removing irrelevant attributes from the data and (4) building classification models of biological activity. AVAILABILITY: sMOL Explorer is an Open Source project and is freely available to all interested users at http://www.biotec.or.th/ISL/SMOL/
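The structural similarity search mentioned above is conventionally done by comparing molecular fingerprints with the Tanimoto coefficient. The sketch below illustrates that general technique (not sMOL Explorer's actual implementation); the bit-set fingerprints and molecule names are invented:

```python
def tanimoto(fp_a, fp_b):
    """Tanimoto coefficient between two fingerprints given as sets of on-bits."""
    if not fp_a and not fp_b:
        return 1.0
    inter = len(fp_a & fp_b)
    return inter / (len(fp_a) + len(fp_b) - inter)

def similarity_search(query_fp, database, threshold=0.5):
    """Return (name, score) pairs at or above threshold, best match first."""
    hits = [(name, tanimoto(query_fp, fp)) for name, fp in database.items()]
    return sorted([h for h in hits if h[1] >= threshold],
                  key=lambda h: h[1], reverse=True)

db = {
    "mol_a": {1, 2, 3, 4},   # shares 3 of the query's bits
    "mol_b": {1, 2, 9},      # shares 2
    "mol_c": {7, 8},         # shares none
}
hits = similarity_search({1, 2, 3}, db)
```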

3.
Recent technological advances have made it possible to identify and quantify thousands of proteins in a single proteomics experiment. As a result of these developments, data analysis has become the bottleneck of the proteomics experiment. To provide the proteomics community with a user-friendly platform for comprehensive analysis, inspection and visualization of quantitative proteomics data, we developed the Graphical Proteomics Data Explorer (GProX). The program requires no special bioinformatics training, as all functions of GProX are accessible within its user-friendly graphical interface, which will be intuitive to most users. Basic features facilitate the uncomplicated management and organization of large data sets and complex experimental setups, as well as the inspection and graphical plotting of quantitative data. These are complemented by readily available high-level analysis options such as database querying, clustering based on abundance ratios, feature enrichment tests (e.g. for GO terms) and pathway analysis tools. A number of plotting options for the visualization of quantitative proteomics data are available, and most analysis functions in GProX create customizable, high-quality graphical displays in both vector and bitmap formats. The generic import requirements allow data originating from essentially all mass spectrometry platforms, quantitation strategies and software to be analyzed in the program. GProX represents a powerful approach to proteomics data analysis, providing proteomics experimenters with a toolbox for bioinformatics analysis of quantitative proteomics data. The program is released as open source and can be freely downloaded from the project webpage at http://gprox.sourceforge.net
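To make "clustering based on abundance ratios" concrete: quantitative proteomics data arrive as per-protein ratios between conditions, and a first-pass grouping can be done on the log2 of those ratios. This toy sketch (not GProX's actual algorithm, which offers proper clustering) simply bins proteins as up-, down- or unregulated; the protein names and ratios are invented:

```python
import math

def mean_log2_ratio(ratios):
    """Mean log2 of a protein's abundance ratios across replicates."""
    return sum(math.log2(r) for r in ratios) / len(ratios)

def bin_by_regulation(proteins, cutoff=1.0):
    """Group proteins into up / down / unchanged by mean log2 ratio (|cutoff| = 2-fold)."""
    clusters = {"up": [], "down": [], "unchanged": []}
    for name, ratios in proteins.items():
        m = mean_log2_ratio(ratios)
        if m >= cutoff:
            clusters["up"].append(name)
        elif m <= -cutoff:
            clusters["down"].append(name)
        else:
            clusters["unchanged"].append(name)
    return clusters

data = {
    "P1": [4.0, 4.5],   # strongly increased in both replicates
    "P2": [0.25, 0.2],  # strongly decreased
    "P3": [1.1, 0.9],   # essentially unchanged
}
clusters = bin_by_regulation(data)
```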

4.
MOTIVATION: Effective use of proteomics data, specifically mass spectrometry data, relies on the ability to read and write the many mass spectrometer file formats. Even with mass spectrometer vendor-specific libraries and vendor-neutral file formats such as mzXML and mzData, it can be difficult to extract raw data files in a form suitable for batch processing and basic research. Introduced here is the ProteomeCommons.org Input and Output (IO) Framework, which is designed to abstractly represent mass spectrometry data. This project is a public, open-source, free-to-use framework that supports most mass spectrometry data formats, including current formats, legacy formats and proprietary formats that require a vendor-specific library in order to operate. The IO Framework includes an on-line tool for non-programmers and a set of libraries that developers may use to convert between various proteomics file formats. AVAILABILITY: The current source code and documentation for the ProteomeCommons.org IO Framework are freely available at http://www.proteomecommons.org/current/531/
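The core idea of such an IO layer is one abstract in-memory representation with per-format readers and writers that normalise into and out of it. This sketch illustrates the pattern only; the `Spectrum` class and both tiny text formats are hypothetical, not the actual ProteomeCommons.org API:

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class Spectrum:
    """Format-neutral representation of one mass spectrum."""
    precursor_mz: float
    peaks: List[Tuple[float, float]]   # (m/z, intensity) pairs

def read_simple_text(text):
    """Reader for a made-up input format: first line precursor m/z, then peak lines."""
    lines = [l for l in text.strip().splitlines() if l]
    precursor = float(lines[0])
    peaks = [tuple(map(float, l.split())) for l in lines[1:]]
    return Spectrum(precursor, peaks)

def write_csv(spectrum):
    """Writer targeting a different (also made-up) comma-separated format."""
    header = f"precursor,{spectrum.precursor_mz}"
    body = "\n".join(f"{mz},{i}" for mz, i in spectrum.peaks)
    return header + "\n" + body

# Convert between the two formats via the shared abstraction.
s = read_simple_text("445.12\n100.1 250\n101.2 300\n")
converted = write_csv(s)
```

Adding a new format then means writing one reader or writer against `Spectrum`, rather than one converter per format pair.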

5.
BioJava: an open-source framework for bioinformatics
SUMMARY: BioJava is a mature open-source project that provides a framework for processing biological data. BioJava contains powerful analysis and statistical routines, tools for parsing common file formats and packages for manipulating sequences and 3D structures. It enables rapid bioinformatics application development in the Java programming language. AVAILABILITY: BioJava is an open-source project distributed under the Lesser GPL (LGPL). BioJava can be downloaded from the BioJava website (http://www.biojava.org). BioJava requires Java 1.5 or higher. All queries should be directed to the BioJava mailing lists. Details are available at http://biojava.org/wiki/BioJava:MailingLists
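BioJava itself is a Java library; as a language-neutral illustration of the kind of sequence manipulation such frameworks provide, here is a minimal pure-Python sketch of two classic operations (reverse complement and GC content), not BioJava's actual API:

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Reverse-complement a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def gc_content(seq):
    """Fraction of G/C bases in a DNA sequence."""
    s = seq.upper()
    return (s.count("G") + s.count("C")) / len(s)

rc = reverse_complement("ATGC")
gc = gc_content("ATGC")
```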

6.
SUMMARY: The Helmholtz Network for Bioinformatics (HNB) is a joint venture of eleven German bioinformatics research groups that offers convenient access to numerous bioinformatics resources through a single web portal. The 'Guided Solution Finder' which is available through the HNB portal helps users to locate the appropriate resources to answer their queries by employing a detailed, tree-like questionnaire. Furthermore, automated complex tool cascades ('tasks'), involving resources located on different servers, have been implemented, allowing users to perform comprehensive data analyses without the requirement of further manual intervention for data transfer and re-formatting. Currently, automated cascades for the analysis of regulatory DNA segments as well as for the prediction of protein functional properties are provided. AVAILABILITY: The HNB portal is available at http://www.hnbioinfo.de

7.
About five years ago, ontology was almost unknown in bioinformatics, even more so in molecular biology. Nowadays, many bioinformatics articles mention it in connection with text mining, data integration or as a metaphysical cure for problems in standardisation of nomenclature and other applications. This article attempts to give an account of what concept ontologies in the domain of biology and bioinformatics are; what they are not; how they can be constructed; how they can be used; and some fallacies and pitfalls creators and users should be aware of.
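Computationally, a concept ontology of the kind the article describes is essentially a directed acyclic graph of is-a relations over which transitive queries are run (the pattern used by, e.g., the Gene Ontology). The sketch below uses an invented toy hierarchy to show the structure and an ancestor query:

```python
# Toy is-a hierarchy: each term maps to its direct parent terms.
IS_A = {
    "serine protease": ["protease"],
    "protease": ["enzyme"],
    "enzyme": ["protein"],
    "protein": [],
}

def ancestors(term, graph=IS_A):
    """All terms reachable from `term` via is-a edges (transitive closure)."""
    seen = set()
    stack = list(graph.get(term, []))
    while stack:
        parent = stack.pop()
        if parent not in seen:
            seen.add(parent)
            stack.extend(graph.get(parent, []))
    return seen

anc = ancestors("serine protease")
```

Queries like this are what make ontologies useful for annotation: anything annotated as a "serine protease" is automatically retrievable as an "enzyme" or a "protein".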

8.
Most bioinformatics tools require specialized input formats for sequence comparison and analysis. This is particularly true of molecular phylogeny programs, which accept only certain formats. In addition, it is often necessary to eliminate highly similar sequences from the input, especially when the dataset is large. Moreover, most programs place restrictions on the sequence name. Here we introduce SeqMaT, a Sequence Manipulation Tool. It has the following functions: (1) data format conversion, (2) sequence name coding and decoding, (3) redundant and highly similar sequence removal, and (4) data mining utilities. SeqMaT was developed in Java in two versions, web-based and standalone. The standalone program is convenient for manipulating large numbers of sequences, while the web version guarantees wide availability of the tool to researchers and practitioners across the Internet. AVAILABILITY: The tool is freely available at http://glee.ist.unomaha.edu/seqmat
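The "highly similar sequence removal" step can be sketched as a greedy filter: keep a sequence only if it is below an identity threshold against everything already kept. Real tools compute identity from proper alignments; this toy version (not SeqMaT's actual algorithm) scores equal-length sequences position-by-position on invented data:

```python
def identity(a, b):
    """Fraction of matching positions between two equal-length sequences."""
    return sum(x == y for x, y in zip(a, b)) / max(len(a), len(b))

def remove_redundant(seqs, threshold=0.9):
    """Greedily keep a sequence only if it is < threshold identical to all kept ones."""
    kept = []
    for s in seqs:
        if all(identity(s, k) < threshold for k in kept):
            kept.append(s)
    return kept

# The second sequence is 90% identical to the first and is dropped.
kept = remove_redundant(["ACGTACGTAC", "ACGTACGTAA", "TTTTGGGGCC"])
```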

9.
MOTIVATION: As more whole genome sequences become available, comparing multiple genomes at the sequence level can provide insights leading to new biological discoveries. However, genome comparison poses significant challenges. These include the computational resources required to handle the large volume of genome data. More importantly, since the choice of genomes to be compared is entirely subjective, there are too many possible choices for genome comparison. For these reasons, there is a pressing need for bioinformatics systems for comparing multiple genomes in which users can freely choose the genomes to be compared. RESULTS: PLATCOM (Platform for Computational Comparative Genomics) is an integrated system for the comparative analysis of multiple genomes. The system is built on several public databases, and a suite of genome analysis applications is provided as exemplary genome data mining tools over these internal databases. Researchers are able to visually investigate genomic sequence similarities, conserved gene neighborhoods, conserved metabolic pathways and putative gene fusion events among a set of selected genomes. AVAILABILITY: http://platcom.informatics.indiana.edu/platcom

10.
BACKGROUND: Ongoing innovation in phylogenetics and evolutionary biology has been accompanied by a proliferation of software tools, data formats, analytical techniques and web servers. This brings with it the challenge of integrating phylogenetic and other related biological data found in a wide variety of formats, and underlines the need for reusable software that can read, manipulate and transform this information into the various forms required to build computational pipelines. RESULTS: We built a Python software library for working with phylogenetic data that is tightly integrated with Biopython, a broad-ranging toolkit for computational biology. Our library, Bio.Phylo, is highly interoperable with existing libraries, tools and standards, and is capable of parsing common file formats for phylogenetic trees, performing basic transformations and manipulations, attaching rich annotations, and visualizing trees. We unified the modules for working with the standard file formats Newick, NEXUS and phyloXML behind a consistent and simple API, providing a common set of functionality independent of the data source. CONCLUSIONS: Bio.Phylo meets a growing need in bioinformatics for working with heterogeneous types of phylogenetic data. By supporting interoperability with multiple file formats and leveraging existing Biopython features, this library simplifies the construction of phylogenetic workflows. We also provide examples of the benefits of building a community around a shared open-source project. Bio.Phylo is included with Biopython, available through the Biopython website, http://biopython.org
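Of the formats named above, Newick is the simplest: a nested parenthesis notation for trees. The following stripped-down parser (an illustration, not Bio.Phylo's implementation) handles names and nesting only, ignoring branch lengths, quoting and comments, and returns nested `(name, children)` tuples:

```python
def parse_newick(s):
    """Parse a bare Newick string like '((A,B)ab,C)root;' into nested tuples."""
    pos = 0

    def parse_clade():
        nonlocal pos
        children = []
        if s[pos] == '(':
            pos += 1                       # consume '('
            children.append(parse_clade())
            while s[pos] == ',':
                pos += 1                   # consume ',' between siblings
                children.append(parse_clade())
            pos += 1                       # consume ')'
        start = pos                        # clade label follows the group
        while pos < len(s) and s[pos] not in '(),;':
            pos += 1
        return (s[start:pos], children)

    return parse_clade()

tree = parse_newick("((A,B)ab,C)root;")
```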

12.
Background: Metagenomic sequencing is a complex sampling procedure applied to unknown mixtures of many genomes. Metagenome data with known genome compositions are essential both for benchmarking bioinformatics software and for investigating the influence of various factors on the data. Compared to data from real microbiome samples or from a defined microbial mock community, data simulated with proper computational models are better suited for this purpose, as they provide more flexibility for controlling multiple factors. Methods: We developed a non-uniform metagenomic sequencing simulation system (nuMetaSim) that is capable of mimicking various factors in real metagenomic sequencing, reflecting multiple properties of real data with customizable parameter settings. Results: We generated 9 comprehensive metagenomic datasets of different composition complexity from 203 bacterial genomes and 2 archaeal genomes related to the human intestinal system. Conclusion: The data can serve as benchmarks for comparing the performance of different methods in different situations, and the software package allows users to generate simulation data that better reflect the specific properties of their scenarios.
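The core of any such simulator is drawing reads from member genomes with non-uniform, abundance-weighted probability. This minimal sketch shows only that sampling step (real simulators like nuMetaSim additionally model sequencing errors and coverage bias); the genomes and weights are invented:

```python
import random

def simulate_reads(genomes, weights, n_reads, read_len, seed=0):
    """Sample fixed-length reads from genomes, choosing genomes proportionally to weights."""
    rng = random.Random(seed)              # seeded for reproducible datasets
    names = list(genomes)
    reads = []
    for _ in range(n_reads):
        name = rng.choices(names, weights=[weights[n] for n in names])[0]
        g = genomes[name]
        start = rng.randrange(len(g) - read_len + 1)
        reads.append((name, g[start:start + read_len]))
    return reads

genomes = {"gA": "ACGT" * 50, "gB": "TTAA" * 50}
# gA is 9x more abundant than gB in this simulated community.
reads = simulate_reads(genomes, {"gA": 0.9, "gB": 0.1}, n_reads=20, read_len=10)
```

Because the true genome of origin is recorded with each read, the output is directly usable as a benchmark with known ground truth.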

13.
Since the publication of the human genome, two key points have emerged. First, it is still not certain which regions of the genome code for proteins. Second, the number of discrete protein-coding genes is far fewer than the number of different proteins. Proteomics has the potential to address some of these postgenomic issues if the obstacles that we face can be overcome in our efforts to combine proteomic and genomic data. There are many challenges associated with high-throughput and high-output proteomic technologies. Consequently, for proteomics to continue at its current growth rate, new approaches must be developed to ease data management and data mining. Initiatives have been launched to develop standard data formats for exchanging mass spectrometry proteomic data, including the Proteomics Standards Initiative formed by the Human Proteome Organization. Databases such as SwissProt and Uniprot are publicly available repositories for protein sequences annotated for function, subcellular location and known potential post-translational modifications. The availability of bioinformatics solutions is crucial for proteomics technologies to fulfil their promise of adding further definition to the functional output of the human genome. The aim of the Oxford Genome Anatomy Project is to provide a framework for integrating molecular, cellular, phenotypic and clinical information with experimental genetic and proteomics data. This perspective also discusses models to make the Oxford Genome Anatomy Project accessible and beneficial for academic and commercial research and development.

16.
We describe PerlMAT, a Perl microarray toolkit providing easy-to-use object-oriented methods for the simplified manipulation, management and analysis of microarray data. The toolkit provides objects for the encapsulation of microarray spots and reporters, several common microarray data file formats and GAL files. In addition, an analysis object provides methods for data processing, and an image object enables the visualisation of microarray data. This important addition to the Perl developer's library will facilitate more widespread use of Perl for microarray application development within the bioinformatics community. The coherent interface and well-documented code enable rapid analysis by even inexperienced Perl developers. AVAILABILITY: Software is available at http://sourceforge.net/projects/perlmat

18.
The Protein Circular Dichroism Data Bank (PCDDB) [https://pcddb.cryst.bbk.ac.uk] is an established resource for the biological, biophysical, chemical, bioinformatics, and molecular biology communities. It is a freely accessible repository of validated protein circular dichroism (CD) spectra and associated sample and metadata, with entries having links to other bioinformatics resources including, amongst others, structure (PDB), AlphaFold, and sequence (UniProt) databases, as well as to published papers which produced the data and cite the database entries. It includes primary (unprocessed) and final (processed) spectral data, which are available in both text and pictorial formats, as well as detailed sample and validation information produced for each of the entries. Recently the metadata content associated with each of the entries, as well as the number and structural breadth of the protein components included, have been expanded. The PCDDB includes data on both wild-type and mutant proteins, and because CD studies primarily examine proteins in solution, it also contains examples of the effects of different environments on their structures, plus thermal unfolding/folding series. Methods for both sequence and spectral comparisons are included. The data included in the PCDDB complement results from crystal, cryo-electron microscopy, NMR spectroscopy, bioinformatics characterisations and classifications, and other structural information available for the proteins via links to other databases. The entries in the PCDDB have been used for the development of new analytical methodologies, for interpreting spectral and other biophysical data, and for providing insight into structures and functions of individual soluble and membrane proteins and protein complexes.

19.

Background

The analysis of microbial communities through DNA sequencing brings many challenges: the integration of different types of data with methods from ecology, genetics, phylogenetics, multivariate statistics, visualization and testing. With the increased breadth of experimental designs now being pursued, project-specific statistical analyses are often needed, and these analyses are often difficult (or impossible) for peer researchers to independently reproduce. The vast majority of the requisite tools for performing these analyses reproducibly are already implemented in R and its extensions (packages), but with limited support for high throughput microbiome census data.

Results

Here we describe a software project, phyloseq, dedicated to the object-oriented representation and analysis of microbiome census data in R. It supports importing data from a variety of common formats, as well as many analysis techniques. These include calibration, filtering, subsetting, agglomeration, multi-table comparisons, diversity analysis, parallelized Fast UniFrac, ordination methods, and production of publication-quality graphics; all in a manner that is easy to document, share, and modify. We show how to apply functions from other R packages to phyloseq-represented data, illustrating the availability of a large number of open source analysis techniques. We discuss the use of phyloseq with tools for reproducible research, a practice common in other fields but still rare in the analysis of highly parallel microbiome census data. We have made available all of the materials necessary to completely reproduce the analysis and figures included in this article, an example of best practices for reproducible research.

Conclusions

The phyloseq project for R is a new open-source software package, freely available on the web from both GitHub and Bioconductor.
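Among the diversity analyses phyloseq offers (in R) is alpha-diversity from a vector of taxon counts. As a language-neutral illustration of one such metric, here is the standard Shannon index in pure Python; the count vectors are invented examples:

```python
import math

def shannon(counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over non-zero taxon counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

h_even = shannon([25, 25, 25, 25])   # maximally even community of 4 taxa: H' = ln(4)
h_skew = shannon([97, 1, 1, 1])      # community dominated by one taxon: lower H'
```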

20.
Current research in the biosciences depends heavily on the effective exploitation of huge amounts of data. These are in disparate formats, remotely dispersed, and based on the different vocabularies of various disciplines. Furthermore, data are often stored or distributed using formats that leave implicit many important features relating to the structure and semantics of the data. Conceptual data modelling involves the development of implementation-independent models that capture and make explicit the principal structural properties of data. Entities such as a biopolymer or a reaction, and their relations, e.g. catalyses, can be formalised using a conceptual data model. Conceptual models are implementation-independent and can be transformed in systematic ways for implementation on different platforms, e.g. traditional database management systems. This paper describes the basics of the most widely used conceptual modelling notations, the ER (entity-relationship) model and the class diagrams of the UML (unified modelling language), and illustrates their use through several examples from bioinformatics. In particular, models are presented for protein structures and motifs, and for genomic sequences.
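The transformation from conceptual model to implementation can be made concrete in code: each entity becomes a class and each relationship a typed reference between classes. This is a toy rendering of the biopolymer / reaction / catalyses example (the class layout and sample values are invented for illustration):

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Biopolymer:
    """Entity: a biopolymer, identified by name and sequence."""
    name: str
    sequence: str

@dataclass
class Reaction:
    """Entity: a reaction; `catalysed_by` realises the 'catalyses' relationship."""
    name: str
    catalysed_by: List[Biopolymer] = field(default_factory=list)

enzyme = Biopolymer("hexokinase", "MIASHLGAVK")   # toy sequence fragment
rxn = Reaction("glucose phosphorylation", catalysed_by=[enzyme])
```

The same ER diagram could equally be transformed into relational tables with a foreign key realising the relationship, which is what "implementation-independent" means in practice.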
