
1.

A database application for pre-processing, storage and comparison of mass spectra derived from patients and controls

Mark K Titulaer Ivar Siccama Lennard J Dekker Angelique LCT van Rijswijk Ron MA Heeren Peter A Sillevis Smitt Theo M Luider 《BMC bioinformatics》2006,7(1):403-16

Background

Statistical comparison of peptide profiles in biomarker discovery requires fast, user-friendly software for high-throughput data analysis. Important features are flexibility in changing input variables and statistical analysis of peptides that are differentially expressed between patient and control groups. In addition, integrating the mass spectrometry data with the results of other experiments, such as microarray analysis, and with information from other databases requires central storage of the profile matrix, where protein IDs can be added to peptide masses of interest.

2.

NITPICK: peak identification for mass spectrometry data

Bernhard Y Renard Marc Kirchner Hanno Steen Judith AJ Steen Fred A Hamprecht 《BMC bioinformatics》2008,9(1):355

Background

The reliable extraction of features from mass spectra is a fundamental step in the automated analysis of proteomic mass spectrometry (MS) experiments.

3.

Methods for peptide identification by spectral comparison

Jian Liu Alexander W Bell John JM Bergeron Corey M Yanofsky Brian Carrillo Christian EH Beaudrie Robert E Kearney 《Proteome science》2007,5(1):3-12

Background

Tandem mass spectrometry followed by database search is currently the predominant technology for peptide sequencing in shotgun proteomics experiments. Most methods compare experimentally observed spectra to the theoretical spectra predicted from the sequences in protein databases. There is a growing interest, however, in comparing unknown experimental spectra to a library of previously identified spectra. This approach has the advantage of taking into account instrument-dependent factors and peptide-specific differences in fragmentation probabilities. It is also computationally more efficient for high-throughput proteomics studies.
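A rough illustration of comparing an unknown spectrum to a library of previously identified spectra: the sketch below bins peaks by m/z and scores each library entry with a normalized dot product (cosine similarity). The bin width, the function names, and the toy spectra are assumptions made for illustration; this is not the scoring evaluated in the paper.

```python
import math

def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into m/z bins; peaks is a list of (mz, intensity)."""
    binned = {}
    for mz, intensity in peaks:
        key = int(mz // bin_width)
        binned[key] = binned.get(key, 0.0) + intensity
    return binned

def cosine_similarity(a, b):
    """Normalized dot product of two binned spectra (0.0 when no bins are shared)."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def best_library_match(query, library, bin_width=1.0):
    """Return (score, name) of the library spectrum most similar to the query."""
    q = bin_spectrum(query, bin_width)
    scored = [
        (cosine_similarity(q, bin_spectrum(peaks, bin_width)), name)
        for name, peaks in library.items()
    ]
    return max(scored)

# Hypothetical library of previously identified spectra (toy values).
library = {
    "PEPTIDEA": [(100.2, 10.0), (200.4, 50.0), (300.1, 20.0)],
    "PEPTIDEB": [(150.3, 30.0), (250.7, 40.0)],
}
query = [(100.3, 9.0), (200.5, 55.0), (300.2, 18.0)]
score, name = best_library_match(query, library)
```

Because the library spectra are measured rather than predicted, relative peak intensities carry information in this kind of score, which is one way instrument-dependent and peptide-specific fragmentation behaviour can enter the comparison.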

4.

msmsEval: tandem mass spectral quality assignment for high-throughput proteomics

Jason WH Wong Matthew J Sullivan Hugh M Cartwright Gerard Cagney 《BMC bioinformatics》2007,8(1):51

Background

In proteomics experiments, database-search programs are the method of choice for protein identification from tandem mass spectra. As amino acid sequence databases grow, however, the computing resources required for these programs have become prohibitive, particularly in searches for modified proteins. Recently, methods to limit the number of spectra to be searched based on spectral quality have been proposed by different research groups, but rankings of spectral quality have thus far been based on arbitrary cut-off values. In this work, we develop a more readily interpretable spectral quality statistic by providing probability values for the likelihood that spectra will be identifiable.

5.

Feature selection and nearest centroid classification for protein mass spectrometry

Ilya Levner 《BMC bioinformatics》2005,6(1):68

Background

The use of mass spectrometry as a proteomics tool is poised to revolutionize early disease diagnosis and biomarker identification. Unfortunately, before standard supervised classification algorithms can be employed, the "curse of dimensionality" needs to be solved. Due to the sheer amount of information contained within the mass spectra, most standard machine learning techniques cannot be directly applied. Instead, feature selection techniques are used to first reduce the dimensionality of the input space and thus enable the subsequent use of classification algorithms. This paper examines feature selection techniques for proteomic mass spectrometry.
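The two-stage idea described here (reduce the number of features, then classify) can be sketched as follows. The scoring rule (difference of class means divided by pooled spread), the choice of k, and the toy intensity matrix are all assumptions for illustration; the paper compares several feature selection techniques, and this sketch does not reproduce any particular one of them.

```python
import math

def mean(values):
    return sum(values) / len(values)

def feature_scores(X, y):
    """Score each feature for a two-class problem (labels 0 and 1):
    |difference of class means| divided by the pooled standard deviation."""
    scores = []
    for j in range(len(X[0])):
        a = [row[j] for row, label in zip(X, y) if label == 0]
        b = [row[j] for row, label in zip(X, y) if label == 1]
        ma, mb = mean(a), mean(b)
        pooled_var = (
            sum((v - ma) ** 2 for v in a) + sum((v - mb) ** 2 for v in b)
        ) / (len(a) + len(b))
        scores.append(abs(ma - mb) / (math.sqrt(pooled_var) + 1e-9))
    return scores

def select_top_k(scores, k):
    """Indices of the k highest-scoring features, in ascending index order."""
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])

def fit_centroids(X, y, features):
    """Per-class mean vector restricted to the selected features."""
    centroids = {}
    for label in set(y):
        rows = [row for row, l in zip(X, y) if l == label]
        centroids[label] = [mean([row[j] for row in rows]) for j in features]
    return centroids

def predict(row, centroids, features):
    """Assign the class whose centroid is nearest in Euclidean distance."""
    point = [row[j] for j in features]
    return min(centroids, key=lambda label: math.dist(point, centroids[label]))

# Toy data: rows are spectra, columns are intensities at four m/z positions.
X = [
    [5.0, 1.0, 3.0, 0.9],
    [4.8, 1.2, 3.1, 1.1],
    [5.1, 0.9, 2.9, 1.0],
    [5.0, 3.0, 3.0, 2.9],
    [4.9, 3.1, 3.1, 3.0],
    [5.2, 2.9, 2.9, 3.1],
]
y = [0, 0, 0, 1, 1, 1]
selected = select_top_k(feature_scores(X, y), k=2)
centroids = fit_centroids(X, y, selected)
```

In a real study the feature selection would need to be repeated inside each cross-validation fold; selecting features on the full data set first gives optimistic error estimates.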

6.

Quality control and quality assessment of data from surface-enhanced laser desorption/ionization (SELDI) time-of-flight (TOF) mass spectrometry (MS)

Hong H Dragan Y Epstein J Teitel C Chen B Xie Q Fang H Shi L Perkins R Tong W 《BMC bioinformatics》2005,6(Z2):S5

Background

Proteomic profiling of complex biological mixtures by the ProteinChip technology of surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) mass spectrometry (MS) is one of the most promising approaches in toxicological, biological, and clinical research. The reliable identification of protein expression patterns and associated protein biomarkers that differentiate disease from health or that distinguish different stages of a disease depends on developing methods for assessing the quality of SELDI-TOF mass spectra. The use of SELDI data for biomarker identification requires application of rigorous procedures to detect and discard low quality spectra prior to data analysis.

Results

The systematic variability from plates, chips, and spot positions in SELDI experiments was evaluated using biological and technical replicates. Systematic biases on plates, chips, and spots were not found. The reproducibility of SELDI experiments was demonstrated by the low coefficients of variation of five peaks present in all 144 spectra from quality control samples that were loaded randomly on different spots in the chips of six bioprocessor plates. We developed a method to detect and discard low quality spectra prior to proteomic profiling data analysis, which uses a correlation matrix to measure the similarities among SELDI mass spectra obtained from similar biological samples. Application of the correlation matrix to our SELDI data from a liver cancer and liver toxicity study and a myeloma-associated lytic bone disease study confirmed this approach as an efficient and reliable method for detecting low quality spectra.

Conclusion

This report provides evidence that there was no systematic variability between the plates, chips, and spots on which the samples were assayed using SELDI-based proteomic procedures. The reproducibility of experiments in our studies was demonstrated to be acceptable, and the profiling data for subsequent data analysis are reliable. A correlation matrix was developed as a quality control tool to detect and discard low quality spectra prior to data analysis. It proved to be a reliable method to measure the similarities among SELDI mass spectra and can be used for quality control to decrease noise in proteomic profiling data prior to data analysis.
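The correlation-matrix idea can be sketched as follows: compute pairwise Pearson correlations between spectra from similar samples and flag any spectrum that correlates poorly with the rest. The use of the median as the summary, the 0.8 threshold, and the toy intensity vectors are assumptions for illustration; the abstract does not state the exact decision rule the authors used.

```python
import math
import statistics

def pearson(x, y):
    """Pearson correlation of two equal-length intensity vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx > 0 and sy > 0 else 0.0

def correlation_matrix(spectra):
    """Square matrix of pairwise correlations between spectra."""
    n = len(spectra)
    return [[pearson(spectra[i], spectra[j]) for j in range(n)] for i in range(n)]

def flag_low_quality(spectra, threshold=0.8):
    """Indices of spectra whose median correlation with all other spectra
    falls below the (assumed) threshold. Requires at least two spectra."""
    corr = correlation_matrix(spectra)
    n = len(spectra)
    flagged = []
    for i in range(n):
        others = [corr[i][j] for j in range(n) if j != i]
        if statistics.median(others) < threshold:
            flagged.append(i)
    return flagged

# Toy spectra on a shared m/z grid; the last one has a different shape.
spectra = [
    [1.0, 5.0, 2.0, 8.0, 1.0],
    [1.2, 4.8, 2.1, 7.9, 0.9],
    [0.9, 5.1, 1.8, 8.2, 1.1],
    [4.0, 1.0, 6.0, 1.0, 5.0],
]
flagged = flag_low_quality(spectra)
```

The median is used here rather than the mean so that a single aberrant spectrum does not pull down the summary for the good spectra it is compared against.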


7.

SAMPI: Protein Identification with Mass Spectra Alignments

Hans-Michael Kaltenbach Andreas Wilke Sebastian Böcker 《BMC bioinformatics》2007,8(1):102

Background

Mass spectrometry-based peptide mass fingerprints (PMFs) offer a fast, efficient, and robust method for protein identification. A protein is digested (usually by trypsin) and its mass spectrum is compared to simulated spectra for protein sequences in a database. However, existing tools for analyzing PMFs often suffer from missing or heuristic analysis of the significance of search results and insufficient handling of missing and additional peaks.
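To make the fingerprinting step concrete, the sketch below simulates a tryptic digest (cleavage after K or R unless the next residue is P), computes neutral monoisotopic peptide masses, and counts how many observed masses fall within a tolerance of a theoretical one. The toy sequence, the tolerance, and the plain match count are assumptions for illustration; this is not SAMPI's alignment or significance model, and real PMF data would usually be compared as singly protonated masses.

```python
# Monoisotopic residue masses in Da for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056

def tryptic_digest(sequence, missed_cleavages=0):
    """Cleave after K or R unless followed by P; optionally allow
    peptides that span up to `missed_cleavages` uncleaved sites."""
    fragments, start = [], 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            fragments.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        fragments.append(sequence[start:])
    peptides = []
    for n in range(missed_cleavages + 1):
        for i in range(len(fragments) - n):
            peptides.append("".join(fragments[i:i + n + 1]))
    return peptides

def peptide_mass(peptide):
    """Neutral monoisotopic mass: residue masses plus one water."""
    return sum(RESIDUE_MASS[r] for r in peptide) + WATER

def count_matches(observed, theoretical, tolerance=0.01):
    """Number of observed masses within `tolerance` Da of any theoretical mass."""
    return sum(
        any(abs(obs - theo) <= tolerance for theo in theoretical)
        for obs in observed
    )

peptides = tryptic_digest("MKWVTFRPLLKAG")  # hypothetical toy sequence
theoretical = [peptide_mass(p) for p in peptides]
matches = count_matches([277.146, 146.069, 500.0], theoretical)
```

A raw match count like this says nothing about significance and ignores missing or additional peaks, which are exactly the weaknesses of existing tools that the abstract points out.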

8.

A compatible exon-exon junction database for the identification of exon skipping events using tandem mass spectrum data

Fan Mo Xu Hong Feng Gao Lin Du Jun Wang Gilbert S Omenn Biaoyang Lin 《BMC bioinformatics》2008,9(1):537

Background

Alternative splicing is an important gene regulation mechanism. It is estimated that about 74% of multi-exon human genes have alternative splicing. High throughput tandem (MS/MS) mass spectrometry provides valuable information for rapidly identifying potentially novel alternatively-spliced protein products from experimental datasets. However, the ability to identify alternative splicing events through tandem mass spectrometry depends on the database against which the spectra are searched.

9.

Computing H/D-Exchange rates of single residues from data of proteolytic fragments

Ernst Althaus Stefan Canzar Carsten Ehrler Mark R Emmett Andreas Karrenbauer Alan G Marshall Anke Meyer-Bäse Jeremiah D Tipton Hui-Min Zhang 《BMC bioinformatics》2010,11(1):424

Background

Protein conformation and protein/protein interaction can be elucidated by solution-phase Hydrogen/Deuterium exchange (sHDX) coupled to high-resolution mass analysis of the digested protein or protein complex. In sHDX experiments mutant proteins are compared to wild-type proteins or a ligand is added to the protein and compared to the wild-type protein (or mutant). The number of deuteriums incorporated into the polypeptides generated from the protease digest of the protein is related to the solvent accessibility of amide protons within the original protein construct.

10.

A machine learning approach to explore the spectra intensity pattern of peptides using tandem mass spectrometry data

Cong Zhou Lucas D Bowler Jianfeng Feng 《BMC bioinformatics》2008,9(1):325

Background

A better understanding of the mechanisms involved in gas-phase fragmentation of peptides is essential for the development of more reliable algorithms for high-throughput protein identification using mass spectrometry (MS). Current methodologies depend predominantly on the use of derived m/z values of fragment ions, and the knowledge provided by the intensity information present in MS/MS spectra has not been fully exploited. Indeed, spectrum intensity information is very rarely utilized in the algorithms currently in use for high-throughput protein identification.

11.

A novel scoring schema for peptide identification by searching protein sequence databases using tandem mass spectrometry data

Zhuo Zhang Shiwei Sun Xiaopeng Zhu Suhua Chang Xiaofei Liu Chungong Yu Dongbo Bu Runsheng Chen 《BMC bioinformatics》2006,7(1):222-8

Background

Tandem mass spectrometry (MS/MS) is a powerful tool for protein identification. Although great efforts have been made in scoring the correlation between tandem mass spectra and an amino acid sequence database, improvements could be made in three aspects, including characterization of peaks in spectra, adoption of effective scoring functions and access to the reliability of matching between peptides and spectra.

12.

multiplierz: an extensible API based desktop environment for proteomics data analysis

Jignesh R Parikh Manor Askenazi Scott B Ficarro Tanya Cashorali James T Webber Nathaniel C Blank Yi Zhang Jarrod A Marto 《BMC bioinformatics》2009,10(1):364

Background

Efficient analysis of results from mass spectrometry-based proteomics experiments requires access to disparate data types, including native mass spectrometry files, output from algorithms that assign peptide sequence to MS/MS spectra, and annotation for proteins and pathways from various database sources. Moreover, proteomics technologies and experimental methods are not yet standardized; hence a high degree of flexibility is necessary for efficient support of high- and low-throughput data analytic tasks. Development of a desktop environment that is sufficiently robust for deployment in data analytic pipelines, and simultaneously supports customization for programmers and non-programmers alike, has proven to be a significant challenge.

13.

Improved machine learning method for analysis of gas phase chemistry of peptides

Allison Gehrke Shaojun Sun Lukasz Kurgan Natalie Ahn Katheryn Resing Karen Kafadar Krzysztof Cios 《BMC bioinformatics》2008,9(1):515

Background

Accurate peptide identification is important to high-throughput proteomics analyses that use mass spectrometry. Search programs compare fragmentation spectra (MS/MS) of peptides from complex digests with theoretically derived spectra from a database of protein sequences. Improved discrimination is achieved with theoretical spectra that are based on simulating gas phase chemistry of the peptides, but the limited understanding of those processes affects the accuracy of predictions from theoretical spectra.

14.

Tandem mass spectrometry data quality assessment by self-convolution

Keng Wah Choo Wai Mun Tham 《BMC bioinformatics》2007,8(1):352

Background

Many algorithms have been developed for deciphering tandem mass spectrometry (MS) data sets. They can essentially be grouped into two classes. The first performs searches against a theoretical mass spectrum database, while the second is based on de novo sequencing from raw mass spectrometry data. It was noted that the quality of mass spectra significantly affects the protein identification process in both instances. This prompted the authors to explore ways to measure the quality of MS data sets before subjecting them to the protein identification algorithms, thus allowing for more meaningful searches and an increased confidence level in the proteins identified.

15.

X-ray sequence and crystal structure of luffaculin 1, a novel type 1 ribosome-inactivating protein

Xiaomin Hou Minghuang Chen Liqing Chen Edward J Meehan Jieming Xie Mingdong Huang 《BMC structural biology》2007,7(1):29

Background

Protein sequence can be obtained through Edman degradation, mass spectrometry, or cDNA sequencing. High resolution X-ray crystallography can also be used to derive protein sequence information, but has difficulty distinguishing the Asp/Asn, Glu/Gln, and Val/Thr pairs. Luffaculin 1 is a new type 1 ribosome-inactivating protein (RIP) isolated from the seeds of Luffa acutangula. Besides rRNA N-glycosidase activity, luffaculin 1 also inhibits tumor cell proliferation and induces tumor cell differentiation.

16.

Hydra: software for tailored processing of H/D exchange data from MS or tandem MS analyses

Gordon W Slysz Charles AH Baker Benjamin M Bozsa Anthony Dang Andrew J Percy Melissa Bennett David C Schriemer 《BMC bioinformatics》2009,10(1):162

Background

Hydrogen/deuterium exchange mass spectrometry (H/DX-MS) experiments implemented to characterize protein interaction and protein folding generate large quantities of data. Organizing, processing and visualizing data requires an automated solution, particularly when accommodating new tandem mass spectrometry modes for H/DX measurement. We sought to develop software that offers flexibility in defining workflows so as to support exploratory treatments of H/DX-MS data, with a particular focus on the analysis of very large protein systems and the mining of tandem mass spectrometry data.

17.

Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics

Nico Pfeifer Andreas Leinenbach Christian G Huber Oliver Kohlbacher 《BMC bioinformatics》2007,8(1):468

Background

High-throughput peptide and protein identification technologies have benefited tremendously from strategies based on tandem mass spectrometry (MS/MS) in combination with database searching algorithms. A major problem with existing methods lies in the significant number of false positive and false negative annotations. So far, standard algorithms for protein identification do not use the information gained from the separation processes usually involved in peptide analysis, such as retention time information, which is readily available from chromatographic separation of the sample. Identification can thus be improved by comparing measured retention times to predicted retention times. Current prediction models are derived from a set of measured test analytes, but they usually require large amounts of training data.
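The filtering step (comparing measured to predicted retention times) can be sketched as below. The retention time predictor itself is taken as given: the `predicted_rt` values, the calibration pairs, and the rule of two residual standard deviations are hypothetical placeholders, not the kernel-based model or the thresholds from the paper.

```python
import statistics

def residual_tolerance(calibration, k=2.0):
    """Tolerance as k times the standard deviation of (measured - predicted)
    retention times on peptides with trusted identifications."""
    residuals = [measured - predicted for measured, predicted in calibration]
    return k * statistics.pstdev(residuals)

def filter_by_retention_time(candidates, tolerance):
    """Keep candidate identifications whose measured retention time lies
    within `tolerance` of the predicted retention time."""
    return [
        c for c in candidates
        if abs(c["measured_rt"] - c["predicted_rt"]) <= tolerance
    ]

# Hypothetical calibration pairs: (measured, predicted) retention time in minutes.
calibration = [(10.0, 10.5), (20.0, 19.4), (30.0, 30.3), (40.0, 39.8)]
tolerance = residual_tolerance(calibration)

# Hypothetical candidate peptide-spectrum matches.
candidates = [
    {"peptide": "A", "measured_rt": 15.0, "predicted_rt": 15.4},
    {"peptide": "B", "measured_rt": 22.0, "predicted_rt": 27.5},
]
kept = filter_by_retention_time(candidates, tolerance)
```

A filter like this removes false positives whose retention time is implausible, but it is only as good as the predictor; a poorly trained model would discard correct identifications as well.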

18.

Harvest: an open-source tool for the validation and improvement of peptide identification metrics and fragmentation exploration

Leo C McHugh Jonathan W Arthur 《BMC bioinformatics》2010,11(1):448

Background

Protein identification using mass spectrometry is an important tool in many areas of the life sciences, and in proteomics research in particular. Increasing the number of proteins correctly identified depends on the ability to include new knowledge about the mass spectrometry fragmentation process in the computational algorithms designed to separate true matches of peptides to unidentified mass spectra from spurious matches. This discrimination is achieved by computing a function of the various features of the potential match between the observed and theoretical spectra to give a numerical approximation of their similarity. It is these underlying "metrics" that determine the ability of a protein identification package to maximise correct identifications while limiting false discovery rates. There is currently no software available specifically for the simple implementation and analysis of arbitrary novel metrics for peptide matching and for the exploration of fragmentation patterns for a given dataset.

19.

PatternLab for proteomics: a tool for differential shotgun proteomics

Paulo C Carvalho Juliana SG Fischer Emily I Chen John R Yates III Valmir C Barbosa 《BMC bioinformatics》2008,9(1):316

Background

A goal of proteomics is to distinguish between states of a biological system by identifying protein expression differences. Liu et al. demonstrated a method to perform semi-relative protein quantitation in shotgun proteomics data by correlating the number of tandem mass spectra obtained for each protein, or "spectral count", with its abundance in a mixture; however, two issues have remained open: how to normalize spectral counting data and how to efficiently pinpoint differences between profiles. Moreover, Chen et al. recently showed how to increase the number of identified proteins in shotgun proteomics by analyzing samples with different MS-compatible detergents while performing proteolytic digestion. The latter introduced new challenges as seen from the data analysis perspective, since replicate readings are not acquired.
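To illustrate why normalization matters for spectral counts, the sketch below applies one simple scheme (dividing each protein's count by the run total) and computes a log2 ratio between two runs. Total-count normalization, the pseudocount, and the toy counts are assumptions chosen for illustration; the abstract treats normalization as an open question, and this is not presented as PatternLab's approach.

```python
import math

def normalize_counts(counts):
    """Divide each protein's spectral count by the total count of the run."""
    total = sum(counts.values())
    return {protein: count / total for protein, count in counts.items()}

def log2_ratios(norm_a, norm_b, pseudocount=1e-6):
    """log2(b / a) for proteins seen in either run; the pseudocount
    avoids division by zero for proteins missing from one run."""
    proteins = sorted(set(norm_a) | set(norm_b))
    return {
        p: math.log2(
            (norm_b.get(p, 0.0) + pseudocount) / (norm_a.get(p, 0.0) + pseudocount)
        )
        for p in proteins
    }

# Toy runs: run B was acquired with twice as many spectra in total.
run_a = {"P1": 30, "P2": 10, "P3": 60}
run_b = {"P1": 60, "P2": 20, "P3": 120}
norm_a = normalize_counts(run_a)
norm_b = normalize_counts(run_b)
ratios = log2_ratios(norm_a, norm_b)
```

In this toy case the raw counts double for every protein, yet the normalized ratios are all close to zero, showing that the apparent difference came from acquisition depth rather than from abundance.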

20.

AVID: An integrative framework for discovering functional relationships among proteins

Taijiao Jiang Amy E Keating 《BMC bioinformatics》2005,6(1):136

Background

Determining the functions of uncharacterized proteins is one of the most pressing problems in the post-genomic era. Large scale protein-protein interaction assays, global mRNA expression analyses and systematic protein localization studies provide experimental information that can be used for this purpose. The data from such experiments contain many false positives and false negatives, but can be processed using computational methods to provide reliable information about protein-protein relationships and protein function. An outstanding and important goal is to predict detailed functional annotation for all uncharacterized proteins that is reliable enough to effectively guide experiments.