
1.

LIMPIC: a computational method for the separation of protein MALDI-TOF-MS signals from noise

Dante Mantini Francesca Petrucci Damiana Pieragostino Piero Del Boccio Marta Di Nicola Carmine Di Ilio Giorgio Federici Paolo Sacchetta Silvia Comani Andrea Urbani 《BMC bioinformatics》2007,8(1):101

Background

Mass spectrometry protein profiling is a promising tool for biomarker discovery in clinical proteomics. However, the development of a reliable approach for the separation of protein signals from noise is required. In this paper, LIMPIC, a computational method for the detection of protein peaks from linear-mode MALDI-TOF data is proposed. LIMPIC is based on novel techniques for background noise reduction and baseline removal. Peak detection is performed considering the presence of a non-homogeneous noise level in the mass spectrum. A comparison of the peaks collected from multiple spectra is used to classify them on the basis of a detection rate parameter, and hence to separate the protein signals from other disturbances.
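The two ideas named in this abstract, peak detection against a locally varying noise level and a detection rate computed across spectra, can be illustrated with a minimal sketch. This is not the LIMPIC algorithm: the window-based median absolute deviation (MAD) noise estimate, the function names, and the parameters `half_window`, `snr` and `tolerance` are all assumptions chosen for illustration.

```python
from statistics import median

def local_noise(signal, center, half_window):
    """Return (median, robust sigma) of the samples in a window around `center`."""
    lo = max(0, center - half_window)
    hi = min(len(signal), center + half_window + 1)
    window = signal[lo:hi]
    med = median(window)
    mad = median([abs(v - med) for v in window])
    return med, 1.4826 * mad  # 1.4826 scales the MAD to a Gaussian sigma

def detect_peaks(signal, half_window=10, snr=3.0):
    """Indices of local maxima that exceed `snr` times the local noise level."""
    peaks = []
    for i in range(1, len(signal) - 1):
        if signal[i] > signal[i - 1] and signal[i] >= signal[i + 1]:
            med, sigma = local_noise(signal, i, half_window)
            if sigma > 0 and signal[i] - med > snr * sigma:
                peaks.append(i)
    return peaks

def detection_rate(peak_lists, position, tolerance=1):
    """Fraction of spectra with a peak within `tolerance` samples of `position`."""
    hits = sum(
        1 for peaks in peak_lists
        if any(abs(p - position) <= tolerance for p in peaks)
    )
    return hits / len(peak_lists)
```

Because the noise is estimated separately around each candidate, a small peak in a quiet region can pass while an equally high bump in a noisy region is rejected. Peaks with a low detection rate across replicate spectra would then be treated as disturbances rather than protein signals.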

2.

Tiling array data analysis: a multiscale approach using wavelets

Alexander Karpikov Joel Rozowsky Mark Gerstein 《BMC bioinformatics》2011,12(1):57

Background

Tiling array data is hard to interpret due to noise. The wavelet transformation is a widely used technique in signal processing for elucidating the true signal from noisy data. Consequently, we attempted to denoise representative tiling array datasets for ChIP-chip experiments using wavelets. In doing this, we used specific wavelet basis functions, Coiflets, since their triangular shape closely resembles the expected profiles of true ChIP-chip peaks.
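The general idea of wavelet denoising (transform, shrink the small detail coefficients, invert) can be sketched as follows. For brevity the sketch uses a single level of the Haar wavelet rather than the Coiflets the authors chose, and the function names and the `threshold` parameter are assumptions for illustration only.

```python
import math

def haar_forward(x):
    """One level of the orthonormal Haar transform (len(x) must be even)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail

def haar_inverse(approx, detail):
    """Invert haar_forward, interleaving the reconstructed sample pairs."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.extend([(a + d) / s, (a - d) / s])
    return out

def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t; small ones become zero."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]

def denoise(x, threshold):
    """Suppress small detail coefficients, keep the coarse approximation."""
    approx, detail = haar_forward(x)
    return haar_inverse(approx, soft_threshold(detail, threshold))
```

With a threshold of zero the signal is reconstructed unchanged; with a threshold larger than every detail coefficient, each pair of neighbouring samples is replaced by its mean. A practical pipeline would apply several decomposition levels and a smoother basis.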

3.

Application of multiple statistical tests to enhance mass spectrometry-based biomarker discovery

Niclas C Tan Wayne G Fisher Kevin P Rosenblatt Harold R Garner 《BMC bioinformatics》2009,10(1):144

Background

Mass spectrometry-based biomarker discovery has long been hampered by the difficulty in reconciling lists of discriminatory peaks identified by different laboratories for the same diseases studied. We describe a multi-statistical analysis procedure that combines several independent computational methods. This approach capitalizes on the strengths of each to analyze the same high-resolution mass spectral data set to discover consensus differential mass peaks that should be robust biomarkers for distinguishing between disease states.

4.

Unsupervised reduction of random noise in complex data by a row-specific, sorted principal component-guided method

Joseph W Foley Fumiaki Katagiri 《BMC bioinformatics》2008,9(1):508

Background

Large biological data sets, such as expression profiles, benefit from reduction of random noise. Principal component (PC) analysis has been used for this purpose, but it tends to remove small features as well as random noise.

5.

WaveletQuant, an improved quantification software based on wavelet signal threshold de-noising for labeled quantitative proteomic analysis

Fan Mo Qun Mo Yuanyuan Chen David R Goodlett Leroy Hood Gilbert S Omenn Song Li Biaoyang Lin 《BMC bioinformatics》2010,11(1):219

Background

Quantitative proteomics technologies have been developed to comprehensively identify and quantify proteins in two or more complex samples. Quantitative proteomics based on differential stable isotope labeling is one of the proteomics quantification technologies. Mass spectrometric data generated for peptide quantification are often noisy, and peak detection and definition require various smoothing filters to remove noise in order to achieve accurate peptide quantification. Many traditional smoothing filters, such as the moving average filter, Savitzky-Golay filter and Gaussian filter, have been used to reduce noise in MS peaks. However, limitations of these filtering approaches often result in inaccurate peptide quantification. Here we present the WaveletQuant program, based on wavelet theory, for better or alternative MS-based proteomic quantification.
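Two of the traditional filters named above, the moving average and the Gaussian filter, can be sketched in a few lines. The sketch is illustrative only and is not taken from WaveletQuant; the function names, the edge handling (renormalizing the kernel where it overhangs the signal), and the defaults `width` and `sigma` are assumptions.

```python
import math

def convolve_same(signal, kernel):
    """Apply an odd-length kernel, renormalizing weights at the edges."""
    half = len(kernel) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        weight = 0.0
        for k, w in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(signal):
                acc += w * signal[j]
                weight += w
        out.append(acc / weight)
    return out

def moving_average(signal, width=5):
    """Replace each sample by the mean of its `width` nearest samples."""
    return convolve_same(signal, [1.0] * width)

def gaussian_smooth(signal, sigma=1.0):
    """Weight neighbours by a Gaussian truncated at about 3 sigma."""
    half = max(1, int(3 * sigma))
    kernel = [math.exp(-0.5 * (k / sigma) ** 2) for k in range(-half, half + 1)]
    return convolve_same(signal, kernel)
```

Both filters lower and broaden a narrow peak as well as the noise: for example, `moving_average([0, 0, 3, 0, 0], 3)` spreads the single spike of height 3 into three samples of height 1. Distortion of this kind is one way smoothing can affect quantification based on peak heights.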

6.

A normalization strategy applied to HiCEP (an AFLP-based expression profiling) analysis: Toward the strict alignment of valid fragments across electrophoretic patterns

Koji Kadota Ryutaro Fukumura Joseph J Rodrigue Ryoko Araki Masumi Abe 《BMC bioinformatics》2005,6(1):43

Background

Gene expression analysis based on comparison of electrophoretic patterns is strongly dependent on the accuracy of DNA fragment sizing. The current normalization strategy based on molecular weight markers has limited accuracy because marker peaks are often masked by intense peaks nearby. Cumulative errors in fragment lengths cause problems in the alignment of same-length fragments across different electropherograms, especially for small fragments (< 100 bp). For accurate comparison of electrophoretic patterns, further inspection and normalization of electrophoretic data after fragment sizing by conventional strategies is needed.

7.

PyMix - The Python mixture package - a tool for clustering of heterogeneous biological data

Benjamin Georgi Ivan Gesteira Costa Alexander Schliep 《BMC bioinformatics》2010,11(1):9

Background

Cluster analysis is an important technique for the exploratory analysis of biological data. Such data is often high-dimensional, inherently noisy and contains outliers. This makes clustering challenging. Mixtures are versatile and powerful statistical models which perform robustly for clustering in the presence of noise and have been successfully applied in a wide range of applications.

8.

Intensity dependent estimation of noise in microarrays improves detection of differentially expressed genes

Amit Zeisel Amnon Amir Wolfgang J Köstler Eytan Domany 《BMC bioinformatics》2010,11(1):400

Background

In many microarray experiments, analysis is severely hindered by a major difficulty: the small number of samples for which expression data has been measured. When one searches for differentially expressed genes, the small number of samples gives rise to an inaccurate estimation of the experimental noise. This, in turn, leads to loss of statistical power.

9.

Improving the analysis of designed studies by combining statistical modelling with study design information

Uwe Thissen Suzan Wopereis Sjoerd AA van den Berg Ivana Bobeldijk Robert Kleemann Teake Kooistra Ko Willems van Dijk Ben van Ommen Age K Smilde 《BMC bioinformatics》2009,10(1):52

Background

In the life sciences, so-called designed studies are used for studying complex biological systems. The data derived from these studies comply with a study design aimed at generating relevant information while diminishing unwanted variation (noise). Knowledge about the study design can be used to decompose the total data into data blocks that are associated with specific effects. Subsequent statistical analysis can be improved by this decomposition if the analysis is applied to selected combinations of effects.

10.

A Novel Preprocessing Method Using Hilbert Huang Transform for MALDI-TOF and SELDI-TOF Mass Spectrometry Data

Li-Ching Wu Hsin-Hao Chen Jorng-Tzong Horng Chen Lin Norden E. Huang Yu-Che Cheng Kuang-Fu Cheng 《PloS one》2010,5(8)

Motivation

Mass spectrometry is a high-throughput, fast, and accurate method of protein analysis. Using the peaks detected in spectra, we can compare a normal group with a disease group. However, the spectrum is complicated by scale shifting and is also full of noise. Such shifting makes the spectra non-stationary, so they need to be aligned before comparison. Consequently, preprocessing of the mass data plays an important role in the analysis. Noise in mass spectrometry data arises from many sources and spans many frequencies. A powerful preprocessing method is needed to remove the large amount of noise in mass spectrometry data.

Results

The Hilbert-Huang Transformation (HHT) is a signal-processing transformation designed for non-stationary data. We provide a novel preprocessing algorithm that can deal with MALDI and SELDI spectra. We use the HHT to decompose the spectrum and filter out the very high-frequency and very low-frequency components. We believe that the noise in mass spectrometry comes from many sources and that some of it can be removed by analyzing the signal in the frequency domain. Since a protein is expected to appear in the spectrum as a unique peak, its frequency content should lie in the middle of the frequency range and will not be removed. The results show that HHT, when used for preprocessing, is generally better than other preprocessing methods. The approach not only detects peaks successfully but also denoises spectra efficiently, especially when the data are complex. The drawback of HHT is that it takes much longer to process than the wavelet and traditional methods. However, the processing time is still manageable and is worth the wait to obtain high-quality data.

11.

Integrative missing value estimation for microarray data

Jianjun Hu Haifeng Li Michael S Waterman Xianghong Jasmine Zhou 《BMC bioinformatics》2006,7(1):449-14

Background

Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in the Stanford Microarray Database contain fewer than eight samples.

12.

ANMM4CBR: a case-based reasoning method for gene expression data classification

Bangpeng Yao Shao Li 《Algorithms for molecular biology : AMB》2010,5(1):14

Background

Accurate classification of microarray data is critical for successful clinical diagnosis and treatment. The "curse of dimensionality" problem and noise in the data, however, undermine the performance of many algorithms.

13.

Chemosensitivity assay in mice prostate tumor: Preliminary report of flow cytometry, DNA fragmentation, ion ratiometric methods of anti-neoplastic drug monitoring

Sharma R Kline R 《Cancer cell international》2004,4(1):3

Flow cytometry, DNA fragmentation, ion ratiometric analysis and NMR peaks characterized the chemosensitivity of antineoplastic drugs. The hypotheses were: 1. The chemosensitive effect of different cancer cell lines is characteristic; 2. DNA fragmentation and ion ratiometric analysis suggest the apoptosis status of tumor cells.

14.

Analytical model of peptide mass cluster centres with applications

Witold E Wolski Malcolm Farrow Anne-Katrin Emde Hans Lehrach Maciej Lalowski Knut Reinert 《Proteome science》2006,4(1):18-19

Background

The elemental composition of peptides results in formation of distinct, equidistantly spaced clusters across the mass range. The property of peptide mass clustering is used to calibrate peptide mass lists, to identify and remove non-peptide peaks and for data reduction.

15.

A strand specific high resolution normalization method for chip-sequencing data employing multiple experimental control measurements

Enroth S Andersson CR Andersson R Wadelius C Gustafsson MG Komorowski J 《Algorithms for molecular biology : AMB》2012,7(1):2

Background

High-throughput sequencing is becoming the standard tool for investigating protein-DNA interactions or epigenetic modifications. However, the data generated will always contain noise due to e.g. repetitive regions or non-specific antibody interactions. The noise will appear in the form of a background distribution of reads that must be taken into account in the downstream analysis, for example when detecting enriched regions (peak-calling). Several reported peak-callers can take experimental measurements of background tag distribution into account when analysing a data set. Unfortunately, the background is only used to adjust peak calling and not as a pre-processing step that aims at discerning the signal from the background noise. A normalization procedure that extracts the signal of interest would be of universal use when investigating genomic patterns.

16.

How high is the level of technical noise in microarray data?

Lev Klebanov Andrei Yakovlev 《Biology direct》2007,2(1):9-9

Background

Microarray gene expression data are commonly perceived as being extremely noisy because of many imperfections inherent in the current technology. A recent study conducted by the MicroArray Quality Control (MAQC) Consortium and published in Nature Biotechnology provides a unique opportunity to probe into the true level of technical noise in such data.

17.

Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data (total citations: 3; self-citations: 0; citations by others: 3)

Xuegong Zhang Xin Lu Qian Shi Xiu-qin Xu Hon-chiu E Leung Lyndsay N Harris James D Iglehart Alexander Miron Jun S Liu Wing H Wong 《BMC bioinformatics》2006,7(1):197-13

Background

Like microarray-based investigations, high-throughput proteomics techniques require machine learning algorithms to identify biomarkers that are informative for biological classification problems. Feature selection and classification algorithms need to be robust to noise and outliers in the data.

18.

Spectral estimation in unevenly sampled space of periodically expressed microarray time series data

Alan Wee-Chung Liew Jun Xian Shuanhu Wu David Smith Hong Yan 《BMC bioinformatics》2007,8(1):137

Background

Periodogram analysis of time series is widespread in biology. A new challenge in analyzing microarray time series data is to identify genes that are periodically expressed. This challenge arises because the observed time series usually exhibit non-idealities, such as noise, short length, and unevenly sampled time points. Most methods used in the literature operate on evenly sampled time series and are not suitable for unevenly sampled time series.
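To make the sampling problem concrete, the sketch below computes the classical Lomb-Scargle periodogram, a standard estimator that accepts unevenly spaced time points directly. It is shown only as a well-known baseline and is not the method proposed in this paper; the function name and arguments are assumptions for illustration.

```python
import math

def lomb_scargle(times, values, ang_freq):
    """Normalized Lomb-Scargle power at one angular frequency (rad per time unit)."""
    n = len(values)
    mean = sum(values) / n
    var = sum((v - mean) ** 2 for v in values) / (n - 1)
    # The time offset tau makes the sine and cosine terms orthogonal.
    s2 = sum(math.sin(2.0 * ang_freq * t) for t in times)
    c2 = sum(math.cos(2.0 * ang_freq * t) for t in times)
    tau = math.atan2(s2, c2) / (2.0 * ang_freq)
    cos_terms = [math.cos(ang_freq * (t - tau)) for t in times]
    sin_terms = [math.sin(ang_freq * (t - tau)) for t in times]
    cy = sum((v - mean) * c for v, c in zip(values, cos_terms))
    sy = sum((v - mean) * s for v, s in zip(values, sin_terms))
    cc = sum(c * c for c in cos_terms)
    ss = sum(s * s for s in sin_terms)
    # Assumes cc and ss are nonzero, which holds for typical uneven sampling.
    return (cy * cy / cc + sy * sy / ss) / (2.0 * var)
```

Evaluating the function over a grid of candidate periods `p` (with `ang_freq = 2 * math.pi / p`) and taking the largest power gives a simple estimate of the dominant period. No interpolation onto an even grid is needed, which is the step that evenly sampled methods would otherwise require.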

19.

PeakRanger: A cloud-enabled peak caller for ChIP-seq data

Xin Feng Robert Grossman Lincoln Stein 《BMC bioinformatics》2011,12(1):139


20.

Assessing and selecting gene expression signals based upon the quality of the measured dynamics

Eric Yang Ioannis P Androulakis 《BMC bioinformatics》2009,10(1):55

Background

One of the challenges with modeling the temporal progression of biological signals is dealing with the effect of noise and the limited number of replicates at each time point. Given the rising interest in utilizing predictive mathematical models to describe the biological response of an organism or analysis such as clustering and gene ontology enrichment, it is important to determine whether the dynamic progression of the data has been accurately captured despite the limited number of replicates, such that one can have confidence that the results of the analysis are capturing important salient dynamic features.