期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Discover protein sequence signatures from protein-protein interaction data

Jianwen Fang Ryan J Haasl Yinghua Dong Gerald H Lushington 《BMC bioinformatics》2005,6(1):277

Background

The development of high-throughput technologies such as yeast two-hybrid systems and mass spectrometry technologies has made it possible to generate large protein-protein interaction (PPI) datasets. Mining these datasets for underlying biological knowledge has, however, remained a challenge. 相似文献

2.

Preferred analysis methods for Affymetrix GeneChips. II. An expanded,balanced, wholly-defined spike-in dataset

Qianqian Zhu Jeffrey C Miecznikowski Marc S Halfon

《BMC bioinformatics》

Background

Concomitant with the rise in the popularity of DNA microarrays has been a surge of proposed methods for the analysis of microarray data. Fully controlled "spike-in" datasets are an invaluable but rare tool for assessing the performance of various methods. 相似文献

3.

Efficient analysis and extraction of MS/MS result data from Mascot™ result files

Florian?Grosse-Coosmann Andreas?M?Boehm Albert?Sickmann Email author 《BMC bioinformatics》2005,6(1):290

Background

Mascot™ is a commonly used protein identification program for MS as well as for tandem MS data. When analyzing huge shotgun proteomics datasets with Mascot™'s native tools, limits of computing resources are easily reached. Up to now no application has been available as open source that is capable of converting the full content of Mascot™ result files from the original MIME format into a database-compatible tabular format, allowing direct import into database management systems and efficient handling of huge datasets analyzed by Mascot™. 相似文献

4.

Missing value imputation for microarray gene expression data using histone acetylation information

Qian Xiang Xianhua Dai Yangyang Deng Caisheng He Jiang Wang Jihua Feng Zhiming Dai 《BMC bioinformatics》2008,9(1):252

Background

It is an important pre-processing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis in bioinformatics. Although several methods have been suggested, their performances are not satisfactory for datasets with high missing percentages. 相似文献

5.

MetaBar - a tool for consistent contextual data acquisition and standards compliant submission

Wolfgang Hankeln Pier Luigi Buttigieg Dennis Fink Renzo Kottmann Pelin Yilmaz Frank Oliver Glöckner 《BMC bioinformatics》2010,11(1):358

相似文献

6.

ITS as an environmental DNA barcode for fungi: an <Emphasis Type="Italic">in silico</Emphasis> approach reveals potential PCR biases

Eva Bellemain Tor Carlsen Christian Brochmann Eric Coissac Pierre Taberlet Håvard Kauserud 《BMC microbiology》2010,10(1):189

Background

During the last 15 years the internal transcribed spacer (ITS) of nuclear DNA has been used as a target for analyzing fungal diversity in environmental samples, and has recently been selected as the standard marker for fungal DNA barcoding. In this study we explored the potential amplification biases that various commonly utilized ITS primers might introduce during amplification of different parts of the ITS region in samples containing mixed templates ('environmental barcoding'). We performed in silico PCR analyses with commonly used primer combinations using various ITS datasets obtained from public databases as templates. 相似文献

7.

PubMatrix: a tool for multiplex literature mining

Kevin?G?Becker Email author Douglas?A?Hosack Glynn?DennisJr Richard?A?Lempicki Tiffani?J?Bright Chris?Cheadle Jim?Engel 《BMC bioinformatics》2003,4(1):61

Background

Molecular experiments using multiplex strategies such as cDNA microarrays or proteomic approaches generate large datasets requiring biological interpretation. Text based data mining tools have recently been developed to query large biological datasets of this type of data. PubMatrix is a web-based tool that allows simple text based mining of the NCBI literature search service PubMed using any two lists of keywords terms, resulting in a frequency matrix of term co-occurrence. 相似文献

8.

SplicerAV: a tool for mining microarray expression data for changes in RNA processing

Timothy J Robinson Michaela A Dinan Mark Dewhirst Mariano A Garcia-Blanco James L Pearson 《BMC bioinformatics》2010,11(1):108

Background

Over the past two decades more than fifty thousand unique clinical and biological samples have been assayed using the Affymetrix HG-U133 and HG-U95 GeneChip microarray platforms. This substantial repository has been used extensively to characterize changes in gene expression between biological samples, but has not been previously mined en masse for changes in mRNA processing. We explored the possibility of using HG-U133 microarray data to identify changes in alternative mRNA processing in several available archival datasets. 相似文献

9.

AutoSOME: a clustering method for identifying gene expression modules without prior knowledge of cluster number

Aaron M Newman James B Cooper 《BMC bioinformatics》2010,11(1):117

Background

Clustering the information content of large high-dimensional gene expression datasets has widespread application in "omics" biology. Unfortunately, the underlying structure of these natural datasets is often fuzzy, and the computational identification of data clusters generally requires knowledge about cluster number and geometry. 相似文献

10.

Data processing and classification analysis of proteomic changes: a case study of oil pollution in the mussel, Mytilus edulis

Monsinjon T Andersen OK Leboulenger F Knigge T 《Proteome science》2006,4(1):17-13

Background

Proteomics may help to detect subtle pollution-related changes, such as responses to mixture pollution at low concentrations, where clear signs of toxicity are absent. The challenges associated with the analysis of large-scale multivariate proteomic datasets have been widely discussed in medical research and biomarker discovery. This concept has been introduced to ecotoxicology only recently, so data processing and classification analysis need to be refined before they can be readily applied in biomarker discovery and monitoring studies. 相似文献

11.

Integrative missing value estimation for microarray data

Jianjun Hu Haifeng Li Michael S Waterman Xianghong Jasmine Zhou 《BMC bioinformatics》2006,7(1):449-14

Background

Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. 相似文献

12.

MAID : An effect size based model for microarray data integration across laboratories and platforms

Ivan Borozan Limin Chen Bryan Paeper Jenny E Heathcote Aled M Edwards Michael Katze Zhaolei Zhang Ian D McGilvray 《BMC bioinformatics》2008,9(1):305

Background

Gene expression profiling has the potential to unravel molecular mechanisms behind gene regulation and identify gene targets for therapeutic interventions. As microarray technology matures, the number of microarray studies has increased, resulting in many different datasets available for any given disease. The increase in sensitivity and reliability of measurements of gene expression changes can be improved through a systematic integration of different microarray datasets that address the same or similar biological questions. 相似文献

13.

Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified

Thomas M Keane Christopher J Creevey Melissa M Pentony Thomas J Naughton James O Mclnerney 《BMC evolutionary biology》2006,6(1):29-17

Background

In recent years, model based approaches such as maximum likelihood have become the methods of choice for constructing phylogenies. A number of authors have shown the importance of using adequate substitution models in order to produce accurate phylogenies. In the past, many empirical models of amino acid substitution have been derived using a variety of different methods and protein datasets. These matrices are normally used as surrogates, rather than deriving the maximum likelihood model from the dataset being examined. With few exceptions, selection between alternative matrices has been carried out in an ad hoc manner. 相似文献

14.

Combining sequence-based prediction methods and circular dichroism and infrared spectroscopic data to improve protein secondary structure determinations

Jonathan G Lees Robert W Janes 《BMC bioinformatics》2008,9(1):24

Background

A number of sequence-based methods exist for protein secondary structure prediction. Protein secondary structures can also be determined experimentally from circular dichroism, and infrared spectroscopic data using empirical analysis methods. It has been proposed that comparable accuracy can be obtained from sequence-based predictions as from these biophysical measurements. Here we have examined the secondary structure determination accuracies of sequence prediction methods with the empirically determined values from the spectroscopic data on datasets of proteins for which both crystal structures and spectroscopic data are available. 相似文献

15.

Performance of a genetic algorithm for mass spectrometry proteomics 总被引：1，自引：0，他引：1

Neal?O?Jeffries Email author 《BMC bioinformatics》2004,5(1):180

Background

Recently, mass spectrometry data have been mined using a genetic algorithm to produce discriminatory models that distinguish healthy individuals from those with cancer. This algorithm is the basis for claims of 100% sensitivity and specificity in two related publicly available datasets. To date, no detailed attempts have been made to explore the properties of this genetic algorithm within proteomic applications. Here the algorithm's performance on these datasets is evaluated relative to other methods. 相似文献

16.

A benchmark for statistical microarray data analysis that preserves actual biological and technical variance

Benoît De Hertogh Bertrand De Meulder Fabrice Berger Michael Pierre Eric Bareke Anthoula Gaigneaux Eric Depiereux 《BMC bioinformatics》2010,11(1):17

Background

Recent reanalysis of spike-in datasets underscored the need for new and more accurate benchmark datasets for statistical microarray analysis. We present here a fresh method using biologically-relevant data to evaluate the performance of statistical methods. 相似文献

17.

Methodology for systematic analysis and improvement of manufacturing unit process life-cycle inventory (UPLCI)—CO2PE! initiative (cooperative effort on process emissions in manufacturing). Part 1: Methodology description

Karel Kellens Wim Dewulf Michael Overcash Michael Z. Hauschild Joost R. Duflou 《The International Journal of Life Cycle Assessment》2012,17(1):69-78

Purpose

This report proposes a life-cycle analysis (LCA)-oriented methodology for systematic inventory analysis of the use phase of manufacturing unit processes providing unit process datasets to be used in life-cycle inventory (LCI) databases and libraries. The methodology has been developed in the framework of the CO₂PE! collaborative research programme (CO2PE! 2011a) and comprises two approaches with different levels of detail, respectively referred to as the screening approach and the in-depth approach. 相似文献

18.

Life cycle assessment of Australian sugarcane production with a focus on sugarcane growing 总被引：1，自引：0，他引：1

Marguerite Anne Renouf Malcolm K. Wegener Robert J. Pagan 《The International Journal of Life Cycle Assessment》2010,15(9):927-937

Purpose

Past life cycle assessments (LCA) of sugarcane (Saccharum officinarum) production have commonly been based on limited datasets, and variability has not been well described. In this work, Australian sugarcane production was assessed more comprehensively in order to generate a robust set of LCA results for use in subsequent assessments of sugarcane products and also to investigate: (1) variability due to regional differences, (2) factors influencing variability, and (3) significance of the impacts. 相似文献

19.

Examining the significance of fingerprint-based classifiers

Brian T Luke Jack R Collins 《BMC bioinformatics》2008,9(1):545

Background

Experimental examinations of biofluids to measure concentrations of proteins or their fragments or metabolites are being explored as a means of early disease detection, distinguishing diseases with similar symptoms, and drug treatment efficacy. Many studies have produced classifiers with a high sensitivity and specificity, and it has been argued that accurate results necessarily imply some underlying biology-based features in the classifier. The simplest test of this conjecture is to examine datasets designed to contain no information with classifiers used in many published studies. 相似文献

20.

False positive reduction in protein-protein interaction predictions using gene ontology annotations 总被引：1，自引：0，他引：1

Mahmoud A Mahdavi Yen-Han Lin 《BMC bioinformatics》2007,8(1):262

Background

Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. 相似文献