首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
    
Searching spectral libraries in MS/MS is an important new approach to improving the quality of peptide and protein identification. The idea relies on the observation that ion intensities in an MS/MS spectrum of a given peptide are generally reproducible across experiments, and thus, matching between spectra from an experiment and the spectra of previously identified peptides stored in a spectral library can lead to better peptide identification compared to the traditional database search. However, the use of libraries is greatly limited by their coverage of peptide sequences: even for well‐studied organisms a large fraction of peptides have not been previously identified. To address this issue, we propose to expand spectral libraries by predicting the MS/MS spectra of peptides based on the spectra of peptides with similar sequences. We first demonstrate that the intensity patterns of dominant fragment ions between similar peptides tend to be similar. In accordance with this observation, we develop a neighbor‐based approach that first selects peptides that are likely to have spectra similar to the target peptide and then combines their spectra using a weighted K‐nearest neighbor method to accurately predict fragment ion intensities corresponding to the target peptide. This approach has the potential to predict spectra for every peptide in the proteome. When rigorous quality criteria are applied, we estimate that the method increases the coverage of spectral libraries available from the National Institute of Standards and Technology by 20–60%, although the values vary with peptide length and charge state. We find that the overall best search performance is achieved when spectral libraries are supplemented by the high quality predicted spectra.  相似文献   

2.
    
Topology analysis of membrane proteins can be obtained by enzymatic shaving in combination with MS identification of peptides. Ideally, such analysis could provide quite detailed information about the membrane spanning regions. Here, we examine the ability of some shaving enzymes to provide large‐scale analysis of membrane proteome topologies. To compare different shaving enzymes, we first analyzed the detected peptides from two over‐expressed proteins. Second, we analyzed the peptides from non‐over‐expressed Escherichia coli membrane proteins with known structure to evaluate the shaving methods. Finally, the identified peptides were used to test the accuracy of a number of topology predictors. At the end we suggest that the usage of thermolysin, an enzyme working at the natural pH of the cell for membrane shaving, is superior because: (i) we detect a similar number of peptides and proteins using thermolysin and trypsin; (ii) thermolysin shaving can be run at a natural pH and (iii) the incubation time is quite short. (iv) Fewer detected peptides from thermolysin shaving originate from the transmembrane regions. Using thermolysin shaving we can also provide a clear separation between the best and the less accurate topology predictors, indicating that using data from shaving can provide valuable information when developing new topology predictors.  相似文献   

3.
    
Scherl A  Tsai YS  Shaffer SA  Goodlett DR 《Proteomics》2008,8(14):2791-2797
Although mass spectrometers are capable of providing high mass accuracy data, assignment of true monoisotopic precursor ion mass is complicated during data-dependent ion selection for LC-MS/MS analysis of complex mixtures. The complication arises when chromatographic peak widths for a given analyte exceed the time required to acquire a precursor ion mass spectrum. The result is that many measured monoisotopic masses are misassigned due to calculation from a single mass spectrum with poor ion statistics based on only a fraction of the total available ions for a given analyte. Such data in turn produces errors in automated database searches, where precursor m/z value is one search parameter. We propose here a postacquisition approach to correct misassigned monoisotopic m/z values that involves peak detection over the entire elution profile and correction of the precursor ion monoisotopic mass. As a result of using this approach to reprocess shotgun proteomic data we increased peptide sequence assignments by 10% while reducing the estimated false positive ratio from 1 to 0.2%. We also show that 4% of the salvaged identifications may be accounted for by correction of mixed tandem mass spectra resulting from fragmentation of multiple peptides simultaneously, a situation which we refer to as accidental CID.  相似文献   

4.
    
pyOpenMS is an open‐source, Python‐based interface to the C++ OpenMS library, providing facile access to a feature‐rich, open‐source algorithm library for MS‐based proteomics analysis. It contains Python bindings that allow raw access to the data structures and algorithms implemented in OpenMS, specifically those for file access (mzXML, mzML, TraML, mzIdentML among others), basic signal processing (smoothing, filtering, de‐isotoping, and peak‐picking) and complex data analysis (including label‐free, SILAC, iTRAQ, and SWATH analysis tools). pyOpenMS thus allows fast prototyping and efficient workflow development in a fully interactive manner (using the interactive Python interpreter) and is also ideally suited for researchers not proficient in C++. In addition, our code to wrap a complex C++ library is completely open‐source, allowing other projects to create similar bindings with ease. The pyOpenMS framework is freely available at https://pypi.python.org/pypi/pyopenms while the autowrap tool to create Cython code automatically is available at https://pypi.python.org/pypi/autowrap (both released under the 3‐clause BSD licence).  相似文献   

5.
McIntyre SF 《Proteomics》2005,5(15):3828-3830
This report describes the highlights of the second scientific meeting of the British Society for Proteome Research (BSPR), jointly organised with the European Bioinformatics Institute (EBI), and held at The Genome Centre, Cambridge UK in July 2005. The theme of the meeting was "From Proteins to Systems" covering many diverse aspects of proteomics, bioinformatics and systems biology.  相似文献   

6.
    
Mass spectrometry (MS) analysis of peptides carrying post‐translational modifications is challenging due to the instability of some modifications during MS analysis. However, glycopeptides as well as acetylated, methylated and other modified peptides release specific fragment ions during CID (collision‐induced dissociation) and HCD (higher energy collisional dissociation) fragmentation. These fragment ions can be used to validate the presence of the PTM on the peptide. Here, we present PTM MarkerFinder, a software tool that takes advantage of such marker ions. PTM MarkerFinder screens the MS/MS spectra in the output of a database search (i.e., Mascot) for marker ions specific for selected PTMs. Moreover, it reports and annotates the HCD and the corresponding electron transfer dissociation (ETD) spectrum (when present), and summarizes information on the type, number, and ratios of marker ions found in the data set. In the present work, a sample containing enriched N‐acetylhexosamine (HexNAc) glycopeptides from yeast has been analyzed by liquid chromatography‐mass spectrometry on an LTQ Orbitrap Velos using both HCD and ETD fragmentation techniques. The identification result (Mascot .dat file) was submitted as input to PTM MarkerFinder and screened for HexNAc oxonium ions. The software output has been used for high‐throughput validation of the identification results.  相似文献   

7.
Proteomics strategies for protein identification   总被引:13,自引:0,他引:13  
Resing KA  Ahn NG 《FEBS letters》2005,579(4):885-889
The information from genome sequencing provides new approaches for systems-wide understanding of protein networks and cellular function. DNA microarray technologies have advanced to the point where nearly complete monitoring of gene expression is feasible in several organisms. An equally important goal is to comprehensive survey cellular proteomes and profile protein changes under different cellular states. This presents a complex analytical problem, due to the chemical variability between proteins and peptides. Here, we discuss strategies to improve accuracy and sensitivity of peptide identification, distinguish represented protein isoforms, and quantify relative changes in protein abundance.  相似文献   

8.
9.
    
This paper reports on the 5th joint British Society for Proteome Research (BSPR) and European Bioinformatics Institute (EBI) meeting which took place at the Wellcome Trust Conference Centre, Cambridge, UK, from the 8th to 10th July, 2008. As in previous years, the meeting attracted leading experts in the field who presented the latest cutting edge in proteomics. The meeting was entitled “Proteomics: From Technology to New Biology” taking into account the major transition proteomics has undergone in the past few years. In particular, the use of multiple reaction monitoring (MRM)‐based targeted experiments for absolute quantification and validation of proteins was the hot topic of the meeting. Attended by some 250 delegates, the conference was extremely well organised and provided a great opportunity for discussion and initiation of new collaborations.  相似文献   

10.
    
Now in its 6(th) year, the East Midlands Proteomics workshop held in November 2007 brought together over 200 scientists with a common interest in proteomic techniques and their application to complex biological and biomedical problems. For the first time, this meeting was jointly supported by the British Society for Proteome Research (BSPR) and British Mass Spectrometry Society (BMSS).  相似文献   

11.
    
Numerous workflows exist for large‐scale bottom‐up proteomics, many of which achieve exceptional proteome depth. Herein, we evaluated the performance of several commonly used sample preparation techniques for proteomic characterization of HeLa lysates [unfractionated in‐solution digests, SDS‐PAGE coupled with in‐gel digestion, gel‐eluted liquid fraction entrapment electrophoresis (GELFrEE) technology, SCX StageTips and high‐/low‐pH reversed phase fractionation (HpH)]. HpH fractionation was found to be superior in terms of proteome depth (>8400 proteins detected) and fractionation efficiency compared to other techniques. SCX StageTip fractionation required minimal sample handling and was also a substantial improvement over SDS‐PAGE separation and GELFrEE technology. Sequence coverage of the HeLa proteome increased to 38% when combining all workflows, however, total proteins detected improved only slightly to 8710. In summary, HpH fractionation and SCX StageTips are robust techniques and highly suited for complex proteome analysis.  相似文献   

12.
    
This study assesses the ability of a novel family of machine learning algorithms to identify changes in relative protein expression levels, measured using 2-D DIGE data, which support accurate class prediction. The analysis was done using a training set of 36 total cellular lysates comprised of six normal and three cancer biological replicates (the remaining are technical replicates) and a validation set of four normal and two cancer samples. Protein samples were separated by 2-D DIGE and expression was quantified using DeCyder-2D Differential Analysis Software. The relative expression reversal (RER) classifier correctly classified 9/9 training biological samples (p<0.022) as estimated using a modified version of leave one out cross validation and 6/6 validation samples. The classification rule involved comparison of expression levels for a single pair of protein spots, tropomyosin isoforms and alpha-enolase, both of which have prior association as potential biomarkers in cancer. The data was also analyzed using algorithms similar to those found in the extended data analysis package of DeCyder software. We propose that by accounting for sources of within- and between-gel variation, RER classifiers applied to 2-D DIGE data provide a useful approach for identifying biomarkers that discriminate among protein samples of interest.  相似文献   

13.
    
  相似文献   

14.
    
High-throughput proteomics experiments typically generate large amounts of peptide fragmentation mass spectra during a single experiment. There is often a substantial amount of redundant fragmentation of the same precursors among these spectra, which is usually considered a nuisance. We here discuss the potential of clustering and merging redundant spectra to turn this redundancy into a useful property of the dataset. To this end, we have created the first general-purpose, freely available open-source software application for clustering and merging MS/MS spectra. The application also introduces a novel approach to calculating the similarity of fragmentation mass spectra that takes into account the increased precision of modern mass spectrometers, and we suggest a simple but effective improvement to single-linkage clustering. The application and the novel algorithms are applied to several real-life proteomic datasets and the results are discussed. An analysis of the influence of the different algorithms available and their parameters is given, as well as a number of important applications of the overall approach.  相似文献   

15.
    
The Proteomics Identifications Database (PRIDE, www.ebi.ac.uk/pride ) is one of the main repositories of MS derived proteomics data. Here, we point out the main functionalities of PRIDE both as a submission repository and as a source for proteomics data. We describe the main features for data retrieval and visualization available through the PRIDE web and BioMart interfaces. We also highlight the mechanism by which tailored queries in the BioMart can join PRIDE to other resources such as Reactome, Ensembl or UniProt to execute extremely powerful across‐domain queries. We then present the latest improvements in the PRIDE submission process, using the new easy‐to‐use, platform‐independent graphical user interface submission tool PRIDE Converter. Finally, we speak about future plans and the role of PRIDE in the ProteomExchange consortium.  相似文献   

16.
    
The identification of ubiquitin (Ub) and Ub‐like protein (Ubl) conjugation sites is important in understanding their roles in biological pathway regulations. However, unambiguously and sensitively identifying Ub/Ubl conjugation sites through high‐throughput MS remains challenging. We introduce an improved workflow for identifying Ub/Ubl conjugation sites based on the ChopNSpice and X!Tandem software. ChopNSpice is modified to generate Ub/Ubl conjugation peptides in the form of a cross‐link. A combinatorial FASTA database can be acquired using the modified ChopNSpice (MchopNSpice). The modified X!Tandem (UblSearch) introduces a new fragmentation model for the Ub/Ubl conjugation peptides to match unambiguously the MS/MS spectra with linear peptides or Ub/Ubl conjugation peptides using the combinatorial FASTA database. The novel workflow exhibited better performance in analyzing an Ub and Ubl spectral library and a large‐scale Trypanosoma cruzi small Ub‐related modifier dataset compared with the original ChopNSpice method. The proposed workflow is more suitable for processing large‐scale MS datasets of Ub/Ubl modification. MchopNSpice and UblSearch are freely available under the GNU General Public License v3.0 at http://sourceforge.net/projects/maublsearch .  相似文献   

17.
    
Current standardization initiatives have greatly contributed to share the information derived by proteomics experiments. One of these initiatives is the XML-based repository PRIDE (PRoteomics IDEntification database), although an XML-based document does not appear to present a user-friendly view at the first glance. PRIDEViewer is a novel Java-based application that presents the information available in a PRIDE XML file in a user-friendly manner, facilitating the interaction among end users as well as the understanding and evaluation of the compiled information. PRIDEViewer is freely available at: http://proteo.cnb.csic.es/prideviewer/.  相似文献   

18.
    
With the advent of a new generation of high-resolution mass spectrometers, the fields of proteomics and metabolomics have gained powerful new tools. In this paper, we demonstrate a novel computational method that improves the mass accuracy of the LTQ-Orbitrap mass spectrometer from an initial +/- 1-2 ppm, obtained by the standard software, to an absolute median of 0.21 ppm (SD 0.21 ppm). With the increased mass accuracy it becomes much easier to match mass chromatograms in replicates and different sample types, even if compounds are detected at very low intensities. The proposed method exploits the ubiquitous presence of background ions in LC-MS profiles for accurate alignment and internal mass calibration, making it applicable for all types of MS equipment. The accuracy of this approach will facilitate many downstream systems biology applications, including mass-based molecule identification, ab initio metabolic network reconstruction, and untargeted metabolomics in general.  相似文献   

19.
    
Growth factor receptor mediated signaling is meanwhile recognized as a complex signaling network, which is initiated by recruiting specific patterns of adaptor proteins to the intracellular domain of epidermal growth factor receptor (EGFR). Approaches to globally identify EGFR‐binding proteins are required to elucidate this network. We affinity‐purified EGFR with its interacting proteins by coprecipitation from lysates of A431 cells. A total of 183 proteins were repeatedly detected in high‐resolution MS measurements. For 15 of these, direct interactions with EGFR were listed in the iRefIndex interaction database, including Grb2, shc‐1, SOS1 and 2, STAT 1 and 3, AP2, UBS3B, and ERRFI. The newly developed Cytoscape plugin ModuleGraph allowed retrieving and visualizing 93 well‐described protein complexes that contained at least one of the proteins found to interact with EGFR in our experiments. Abundances of 14 proteins were modulated more than twofold upon EGFR activation whereof clathrin‐associated adaptor complex AP‐2 showed 4.6‐fold enrichment. These proteins were further annotated with different cellular compartments. Finally, interactions of AP‐2 proteins and the newly discovered interaction of CIP2A could be verified. In conclusion, a powerful technique is presented that allowed identification and quantitative assessment of the EGFR interactome to provide further insight into EGFR signaling.  相似文献   

20.
    
The detection and quantification of insulin and its therapeutic analogs is important for medical, sports doping, and forensic applications. Synthetic variants contain slight sequence variations to affect bioavailability. To reduce sample handling bias, a universal extraction method is required for simultaneous extraction of endogenous and variant insulins with subsequent targeted quantification by LC‐MS. A mass spectrometric immunoassay (MSIA), a multiplexed assay for intact insulin and its analogues that couples immunoenrichment with high resolution and accurate mass (HR/AM) spectrometric detection across the clinical range is presented in this report. The assay is sensitive, selective, semi‐automated and can potentially be applied to detect new insulin isoforms allowing their further incorporation into second or third generation assays.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号