首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 718 毫秒
1.
MOTIVATION: One of the major problems in shotgun proteomics is the low peptide coverage when analyzing complex protein samples. Identifying more peptides, e.g. non-tryptic peptides, may increase the peptide coverage and improve protein identification and/or quantification that are based on the peptide identification results. Searching for all potential non-tryptic peptides is, however, time consuming for shotgun proteomics data from complex samples, and poses a challenge for a routine data analysis. RESULTS: We hypothesize that non-tryptic peptides are mainly created from the truncation of regular tryptic peptides before separation. We introduce the notion of truncatability of a tryptic peptide, i.e. the probability of the peptide to be identified in its truncated form, and build a predictor to estimate a peptide's truncatability from its sequence. We show that our predictions achieve useful accuracy, with the area under the ROC curve from 76% to 87%, and can be used to filter the sequence database for identifying truncated peptides. After filtering, only a limited number of tryptic peptides with the highest truncatability are retained for non-tryptic peptide searching. By applying this method to identification of semi-tryptic peptides, we show that a significant number of such peptides can be identified within a searching time comparable to that of tryptic peptide identification.  相似文献   

2.
MS‐based proteomics has emerged as a powerful tool in biological studies. The shotgun proteomics strategy, in which proteolytic peptides are analyzed in data‐dependent mode, enables a detection of the most comprehensive proteome (>10 000 proteins from whole‐cell lysate). The quantitative proteomics uses stable isotopes or label‐free method to measure relative protein abundance. The isotope labeling strategies are more precise and accurate compared to label‐free methods, but labeling procedures are complicated and expensive, and the sample number and types are also limited. Sequential window acquisition of all theoretical mass spectra (SWATH) is a recently developed technique, in which data‐independent acquisition is coupled with peptide spectral library match. In principle SWATH method is able to do label‐free quantification in an MRM‐like manner, which has higher quantification accuracy and precision. Previous data have demonstrated that SWATH can be used to quantify less complex systems, such as spiked‐in peptide mixture or protein complex. Our study first time assessed the quantification performance of SWATH method on proteome scale using a complex mouse‐cell lysate sample. In total 3600 proteins got identified and quantified without sample prefractionation. The SWATH method shows outstanding quantification precision, whereas the quantification accuracy becomes less perfect when protein abundances differ greatly. However, this inaccuracy does not prevent discovering biological correlates, because the measured signal intensities had linear relationship to the sample loading amounts; thus the SWATH method can predict precisely the significance of a protein. Our results prove that SWATH can provide precise label‐free quantification on proteome scale.  相似文献   

3.
Stable isotope labeling is at present one of the most powerful methods in quantitative proteomics. Stable isotope labeling has been performed at both the protein as well as the peptide level using either metabolic or chemical labeling. Here, we present a straightforward and cost-effective triplex quantification method that is based on stable isotope dimethyl labeling at the peptide level. Herein, all proteolytic peptides are chemically labeled at their alpha- and epsilon-amino groups. We use three different isotopomers of formaldehyde to enable the parallel analysis of three different samples. These labels provide a minimum of 4 Da mass difference between peaks in the generated peptide triplets. The method was evaluated based on the quantitative analysis of a cell lysate, using a typical "shotgun" proteomics experiment. While peptide complexity was increased by introducing three labels, still more than 1300 proteins could be identified using 60 microg of starting material, whereby more than 600 proteins could be quantified using at least four peptides per protein. The triplex labeling was further utilized to distinguish specific from aspecific cAMP binding proteins in a chemical proteomics experiment using immobilized cAMP. Thereby, differences in abundance ratio of more than two orders of magnitude could be quantified.  相似文献   

4.
Precise protein quantification is essential in comparative proteomics. Currently, quantification bias is inevitable when using proteotypic peptide‐based quantitative proteomics strategy for the differences in peptides measurability. To improve quantification accuracy, we proposed an “empirical rule for linearly correlated peptide selection (ERLPS)” in quantitative proteomics in our previous work. However, a systematic evaluation on general application of ERLPS in quantitative proteomics under diverse experimental conditions needs to be conducted. In this study, the practice workflow of ERLPS was explicitly illustrated; different experimental variables, such as, different MS systems, sample complexities, sample preparations, elution gradients, matrix effects, loading amounts, and other factors were comprehensively investigated to evaluate the applicability, reproducibility, and transferability of ERPLS. The results demonstrated that ERLPS was highly reproducible and transferable within appropriate loading amounts and linearly correlated response peptides should be selected for each specific experiment. ERLPS was used to proteome samples from yeast to mouse and human, and in quantitative methods from label‐free to O18/O16‐labeled and SILAC analysis, and enabled accurate measurements for all proteotypic peptide‐based quantitative proteomics over a large dynamic range.  相似文献   

5.
6.
The advent of algorithms for fragmentation spectrum-based label-free quantitative proteomics has enabled straightforward quantification of shotgun proteomic experiments. Despite the popularity of these approaches, few studies have been performed to assess their performance. We have therefore profiled the precision and the accuracy of three distinct relative label-free methods on both the protein and the proteome level. We derived our test data from two well-characterized publicly available quantitative data sets.  相似文献   

7.
The main goal of many proteomics experiments is an accurate and rapid quantification and identification of regulated proteins in complex biological samples. The bottleneck in quantitative proteomics remains the availability of efficient software to evaluate and quantify the tremendous amount of mass spectral data acquired during a proteomics project. A new software suite, ICPLQuant, has been developed to accurately quantify isotope‐coded protein label (ICPL)‐labeled peptides on the MS level during LC‐MALDI and peptide mass fingerprint experiments. The tool is able to generate a list of differentially regulated peptide precursors for subsequent MS/MS experiments, minimizing time‐consuming acquisition and interpretation of MS/MS data. ICPLQuant is based on two independent units. Unit 1 performs ICPL multiplex detection and quantification and proposes peptides to be identified by MS/MS. Unit 2 combines MASCOT MS/MS protein identification with the quantitative data and produces a protein/peptide list with all the relevant information accessible for further data mining. The accuracy of quantification, selection of peptides for MS/MS‐identification and the automated output of a protein list of regulated proteins are demonstrated by the comparative analysis of four different mixtures of three proteins (Ovalbumin, Horseradish Peroxidase and Rabbit Albumin) spiked into the complex protein background of the DGPF Proteome Marker.  相似文献   

8.
9.
Pachl F  Fellenberg K  Wagner C  Kuster B 《Proteomics》2012,12(9):1328-1332
Isobaric tagging using reagents such as tandem mass tags (TMT) and isobaric tags for relative and absolute quantification (iTRAQ) have become popular tools for mass spectrometry based quantitative proteomics. Because the peptide quantification information is collected in tandem mass spectra, the accuracy and precision of this method largely depend on the resolution with which precursor ions can be selected for the fragmentation and the specificity of the generated reporter ion. The latter can constitute an issue if near isobaric ion signals are present in such spectra because they may distort quantification results. We propose a simple remedy for this problem by identifying reporter ions via the accurate mass differences within a single tandem mass spectrum instead of applying fixed mass error tolerances for all tandem mass spectra. Our results show that this leads to unambiguous reporter ion identification and complete removal of interfering signals. This mode of data processing is easily implemented in software and offers advantages for protein quantification based on few peptides.  相似文献   

10.
Assessment of differential protein abundance from the observed properties of detected peptides is an essential part of protein profiling based on shotgun proteomics. However, the abundance observed for shared peptides may be due to contributions from multiple proteins that are affected differently by a given treatment. Excluding shared peptides eliminates this ambiguity but may significantly decrease the number of proteins for which abundance estimates can be obtained. Peptide sharing within a family of biologically related proteins does not cause ambiguity if family members have a common response to treatment. On the basis of this concept, we have developed an approach for including shared peptides in the analysis of differential protein abundance in protein profiling. Data from a recent proteomics study of lung tissue from mice exposed to lipopolysaccharide, cigarette smoke, and a combination of these agents are used to illustrate our method. Starting from data where about half of the implicated database protein involved shared peptides, 82% of the affected proteins were grouped into families, based on FASTA annotation, with closure on peptide sharing. In many cases, a common abundance relative to control was sufficient to explain ion-current peak areas for peptides, both unique and shared, that identified biologically related proteins in a peptide-sharing closure group. On the basis of these results, we propose that peptide-sharing closure groups provide a way to include abundance data for shared peptides in quantitative protein profiling by high-throughput mass spectrometry.  相似文献   

11.
Spectral libraries have emerged as a viable alternative to protein sequence databases for peptide identification. These libraries contain previously detected peptide sequences and their corresponding tandem mass spectra (MS/MS). Search engines can then identify peptides by comparing experimental MS/MS scans to those in the library. Many of these algorithms employ the dot product score for measuring the quality of a spectrum-spectrum match (SSM). This scoring system does not offer a clear statistical interpretation and ignores fragment ion m/z discrepancies in the scoring. We developed a new spectral library search engine, Pepitome, which employs statistical systems for scoring SSMs. Pepitome outperformed the leading library search tool, SpectraST, when analyzing data sets acquired on three different mass spectrometry platforms. We characterized the reliability of spectral library searches by confirming shotgun proteomics identifications through RNA-Seq data. Applying spectral library and database searches on the same sample revealed their complementary nature. Pepitome identifications enabled the automation of quality analysis and quality control (QA/QC) for shotgun proteomics data acquisition pipelines.  相似文献   

12.
Systems biology relies on data sets in which the same group of proteins is consistently identified and precisely quantified across multiple samples, a requirement that is only partially achieved by current proteomics approaches. Selected reaction monitoring (SRM)—also called multiple reaction monitoring—is emerging as a technology that ideally complements the discovery capabilities of shotgun strategies by its unique potential for reliable quantification of analytes of low abundance in complex mixtures. In an SRM experiment, a predefined precursor ion and one of its fragments are selected by the two mass filters of a triple quadrupole instrument and monitored over time for precise quantification. A series of transitions (precursor/fragment ion pairs) in combination with the retention time of the targeted peptide can constitute a definitive assay. Typically, a large number of peptides are quantified during a single LC‐MS experiment. This tutorial explains the application of SRM for quantitative proteomics, including the selection of proteotypic peptides and the optimization and validation of transitions. Furthermore, normalization and various factors affecting sensitivity and accuracy are discussed.  相似文献   

13.
LC‐ESI/MS/MS‐based shotgun proteomics is currently the most commonly used approach for the identification and quantification of proteins in large‐scale studies of biomarker discovery. In the past several years, the shotgun proteomics technologies have been refined toward further enhancement of proteome coverage. In the complex series of protocols involved in shotgun proteomics, however, loss of proteolytic peptides during the lyophilization step prior to the LC/MS/MS injection has been relatively neglected despite the fact that the dissolution of the hydrophobic peptides in lyophilized samples is difficult in 0.05–0.1% TFA or formic acid, causing substantial loss of precious peptide samples. In order to prevent the loss of peptide samples during this step, we devised a new protocol using Invitrosol (IVS), a commercially available surfactant compatible with ESI‐MS; by dissolving the lyophilized peptides in IVS, we show improved recovery of hydrophobic peptides, leading to enhanced coverage of proteome. Thus, the use of IVS in the recovery step of lyophilized peptides will help the shotgun proteomics analysis by expanding the proteome coverage, which would significantly promote the discovery and development of new diagnostic markers and therapeutic targets.  相似文献   

14.
All shotgun proteomics experiments rely on efficient proteolysis steps for sensitive peptide/protein identification and quantification. Previous reports suggest that the sequential tandem LysC/trypsin digest yields higher recovery of fully tryptic peptides than single‐tryptic proteolysis. Based on the previous studies, it is assumed that the advantageous effect of tandem proteolysis requires a high sample denaturation state for the initial LysC digest. Therefore, to date, all systematic assessments of LysC/trypsin proteolysis are done in chaotropic environments such as urea. Here, sole trypsin is compared with LysC/trypsin and it is shown that tandem digestion can be carried with high efficiency in Mass Spectrometry‐compatible detergents, thereby resulting in higher quantitative yields of fully cleaved peptides. It is further demonstrated that higher cleavage efficiency of tandem digests has a positive impact on absolute protein quantification using intensity‐based absolute quantification (iBAQ) values. The results of the examination of divergent urea tandem conditions imply that beneficial effects of the initial LysC digest do not depend on the sample denaturation state, but, are mainly caused by different target specificities of LysC and trypsin. The observed detergent compatibility enables tandem digestion schemes to be implemented in efficient cellular solubilization proteomics procedures without the need for buffer exchange to chaotropic environments.  相似文献   

15.
Targeted proteomics has gained significant popularity in mass spectrometry‐based protein quantification as a method to detect proteins of interest with high sensitivity, quantitative accuracy and reproducibility. However, with the emergence of a wide variety of targeted proteomics methods, some of them with high‐throughput capabilities, it is easy to overlook the essence of each method and to determine what makes each of them a targeted proteomics method. In this viewpoint, we revisit the main targeted proteomics methods and classify them in four categories differentiating those methods that perform targeted data acquisition from targeted data analysis, and those methods that are based on peptide ion data (MS1 targeted methods) from those that rely on the peptide fragments (MS2 targeted methods).  相似文献   

16.
N‐succinimidyloxycarbonylmethyl tris(2,4,6‐trimethoxyphenyl) phosphonium bromide (TMPP‐Ac‐OSu) reacts rapidly, mildly, and specifically with the N‐terminals of proteins and peptides. Thus, it can be developed as an ideal isotope‐coded tag to be used in quantitative proteomics. Here, we present a strategy for light and heavy TMPP‐based quantitative proteomic analysis, in which peptides in a mixture can be quantified using an on‐tip TMPP derivatization approach. To demonstrate the accuracy of this strategy, light and heavy TMPP‐labeled peptides were combined at different ratios and subsequently analyzed by LC‐MS/MS. The MS spectra and scatter plots show that peptide and protein ratios were both consistent with the mixed ratios. We observed a linear correlation between protein ratios and the predicted ratios. In comparison with SILAC method, the TMPP labeling method produced similarly accurate quantitative results with low CVs. In conclusion, our results suggest that this isotope‐coded TMPP method achieved accurate quantification and compatibility with IEF‐based separation. With the inherent advantages of TMPP derivatization, we believe that it holds great promise for future applications in quantitative proteomics analysis.  相似文献   

17.
Tandem MS (MS2) quantification using the series of N‐ and C‐terminal fragment ion pairs generated from isobaric‐labelled peptides was recently considered an accurate strategy in quantitative proteomics. However, the presence of multiplexed terminal fragment ion in MS2 spectra may reduce the efficiency of peptide identification, resulting in lower identification scores or even incorrect assignments. To address this issue, we developed a quantitative software tool, denoted isobaric tandem MS quantification (ITMSQ), to improve N‐ and C‐terminal fragment ion pairs based isobaric MS2 quantification. A spectrum splitting module was designed to separate the MS2 spectra from different samples, increasing the accuracy of both identification and quantification. ITMSQ offers a convenient interface through which parameters can be changed along with the labelling method, and the result files and all of the intermediate files can be exported. We performed an analysis of in vivo terminal amino acid labelling labelled HeLa samples and found that the numbers of quantified proteins and peptides increased by 13.64 and 27.52% after spectrum splitting, respectively. In conclusion, ITMSQ provides an accurate and reliable quantitative solutionfor N‐ and C‐terminal fragment ion pairs based isobaric MS2 quantitative methods.  相似文献   

18.
Membrane proteins are of particular interest in proteomics because of their potential therapeutic utility. Past proteomic approaches used to investigate membrane proteins have only been partially successful at providing a comprehensive analysis due to the inherently hydrophobic nature and low abundance for some of these proteins. Recently, these difficulties have been improved by analyzing membrane protein enriched samples using shotgun proteomics. In addition, the recent application of methanol-assisted trypsin digestion of membrane proteins has been shown to be a method to improve membrane protein identifications. In this study, a comparison of different concentrations of methanol was assessed for assisting membrane protein digestion with trypsin prior to analysis using a gel-based shotgun proteomics approach called peptide immobilized pH gradient isoelectric focusing (IPG-IEF). We demonstrate the use of peptide IEF on pH 3-10 IPG strips as the first dimension of two-dimensional shotgun proteomics for protein identifications from the membrane fraction of rat liver. Tryptic digestion of proteins was carried out in varying concentrations of methanol in 10 mM ammonium bicarbonate: 0% (v/v), 40% (v/v), and 60% (v/v). A total of 800 proteins were identified from 60% (v/v) methanol, which increased the protein identifications by 17% and 14% compared to 0% (v/v) methanol and 40% (v/v) methanol assisted digestion, respectively. In total, 1549 nonredundant proteins were identified from all three concentrations of methanol including 690 (42%) integral membrane proteins of which 626 of these proteins contained at least one transmembrane domain. Peptide IPG-IEF separation of peptides was successful as the peptides were separated into discrete pI regions with high resolution. The results from this study prove utility of 60% (v/v) methanol assisted digestion in conjunction with peptide IPG-IEF as an optimal shotgun proteomics technique for the separation and identification of previously unreported membrane proteins.  相似文献   

19.
Labeling‐based proteomics is a powerful method for detection of differentially expressed proteins (DEPs). The current data analysis platform typically relies on protein‐level ratios, which is obtained by summarizing peptide‐level ratios for each protein. In shotgun proteomics, however, some proteins are quantified with more peptides than others, and this reproducibility information is not incorporated into the differential expression (DE) analysis. Here, we propose a novel probabilistic framework EBprot that directly models the peptide‐protein hierarchy and rewards the proteins with reproducible evidence of DE over multiple peptides. To evaluate its performance with known DE states, we conducted a simulation study to show that the peptide‐level analysis of EBprot provides better receiver‐operating characteristic and more accurate estimation of the false discovery rates than the methods based on protein‐level ratios. We also demonstrate superior classification performance of peptide‐level EBprot analysis in a spike‐in dataset. To illustrate the wide applicability of EBprot in different experimental designs, we applied EBprot to a dataset for lung cancer subtype analysis with biological replicates and another dataset for time course phosphoproteome analysis of EGF‐stimulated HeLa cells with multiplexed labeling. Through these examples, we show that the peptide‐level analysis of EBprot is a robust alternative to the existing statistical methods for the DE analysis of labeling‐based quantitative datasets. The software suite is freely available on the Sourceforge website http://ebprot.sourceforge.net/ . All MS data have been deposited in the ProteomeXchange with identifier PXD001426 ( http://proteomecentral.proteomexchange.org/dataset/PXD001426/ ).  相似文献   

20.
Abstract A probability-based quantification framework is presented for the calculation of relative peptide and protein abundance in label-free and label-dependent LC-MS proteomics data. The results are accompanied by credible intervals and regulation probabilities. The algorithm takes into account data uncertainties via Poisson statistics modified by a noise contribution that is determined automatically during an initial normalization stage. Protein quantification relies on assignments of component peptides to the acquired data. These assignments are generally of variable reliability and may not be present across all of the experiments comprising an analysis. It is also possible for a peptide to be identified to more than one protein in a given mixture. For these reasons the algorithm accepts a prior probability of peptide assignment for each intensity measurement. The model is constructed in such a way that outliers of any type can be automatically reweighted. Two discrete normalization methods can be employed. The first method is based on a user-defined subset of peptides, while the second method relies on the presence of a dominant background of endogenous peptides for which the concentration is assumed to be unaffected. Normalization is performed using the same computational and statistical procedures employed by the main quantification algorithm. The performance of the algorithm will be illustrated on example data sets, and its utility demonstrated for typical proteomics applications. The quantification algorithm supports relative protein quantification based on precursor and product ion intensities acquired by means of data-dependent methods, originating from all common isotopically-labeled approaches, as well as label-free ion intensity-based data-independent methods.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号