首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 609 毫秒
1.
MHC class I (MHC‐I)‐bound ligands play a pivotal role in CD8 T cell immunity and are hence of major interest in understanding and designing immunotherapies. One of the most commonly utilized approaches for detecting MHC ligands is LC‐MS/MS. Unfortunately, the effectiveness of current algorithms to identify MHC ligands from LC‐MS/MS data is limited because the search algorithms used were originally developed for proteomics approaches detecting tryptic peptides. Consequently, the analysis often results in inflated false discovery rate (FDR) statistics and an overall decrease in the number of peptides that pass FDR filters. Andreatta et al. describe a new scoring tool (MS‐rescue) for peptides from MHC‐I immunopeptidome datasets. MS‐rescue incorporates the existence of MHC‐I peptide motifs to rescore peptides from ligandome data. The tool is demonstrated here using peptides assigned from LC‐MS/MS data with PEAKs software but can be deployed on data from any search algorithm. This new approach increased the number of peptides identified by up to 20–30% and promises to aid the discovery of novel MHC‐I ligands with immunotherapeutic potential.  相似文献   

2.
3.
LC combined with MS/MS analysis of complex mixtures of protein digests is a reliable and sensitive method for characterization of protein phosphorylation. Peptide retention times (RTs) measured during an LC‐MS/MS run depend on both the peptide sequence and the location of modified amino acids. These RTs can be predicted using the LC of biomacromolecules at critical conditions model (BioLCCC). Comparing the observed RTs to those obtained from the BioLCCC model can provide additional validation of MS/MS‐based peptide identifications to reduce the false discovery rate and to improve the reliability of phosphoproteome profiling. In this study, energies of interaction between phosphorylated residues and the surface of RP separation media for both “classic” alkyl C18 and polar‐embedded C18 stationary phases were experimentally determined and included in the BioLCCC model extended for phosphopeptide analysis. The RTs for phosphorylated peptides and their nonphosphorylated analogs were predicted using the extended BioLCCC model and compared with their experimental RTs. The extended model was evaluated using literary data and a complex phosphoproteome data set distributed through the Association of Biomolecular Resource Facilities Proteome Informatics Research Group 2010 study. The reported results demonstrate the capability of the extended BioLCCC model to predict RTs which may lead to improved sensitivity and reliability of LC‐MS/MS‐based phosphoproteome profiling.  相似文献   

4.
Changming Xu  Ning Li  Hui Liu  Jie Ma  Yunping Zhu  Hongwei Xie 《Proteomics》2012,12(23-24):3475-3484
Database searching based methods for label‐free quantification aim to reconstruct the peptide extracted ion chromatogram based on the identification information, which can limit the search space and thus make the data processing much faster. The random effect of the MS/MS sampling can be remedied by cross‐assignment among different runs. Here, we present a new label‐free fast quantitative analysis tool, LFQuant, for high‐resolution LC‐MS/MS proteomics data based on database searching. It is designed to accept raw data in two common formats (mzXML and Thermo RAW), and database search results from mainstream tools (MASCOT, SEQUEST, and X!Tandem), as input data. LFQuant can handle large‐scale label‐free data with fractionation such as SDS‐PAGE and 2D LC. It is easy to use and provides handy user interfaces for data loading, parameter setting, quantitative analysis, and quantitative data visualization. LFQuant was compared with two common quantification software packages, MaxQuant and IDEAL‐Q, on the replication data set and the UPS1 standard data set. The results show that LFQuant performs better than them in terms of both precision and accuracy, and consumes significantly less processing time. LFQuant is freely available under the GNU General Public License v3.0 at http://sourceforge.net/projects/lfquant/ .  相似文献   

5.
The peptide‐based quantitation accuracy and precision of LC‐ESI (QSTAR Elite) and LC‐MALDI (4800 MALDI TOF/TOF) were compared by analyzing identical Escherichia coli tryptic digests containing iTRAQ‐labeled peptides of defined abundances (1:1, 2.5:1, 5:1, and 10:1). Only 51.4% of QSTAR spectra were used for quantitation by ProteinPilot Software versus 66.7% of LC‐MALDI spectra. The average protein sequence coverages for LC‐ESI and LC‐MALDI were 24.0 and 18.2% (14.9 and 8.4 peptides per protein), respectively. The iTRAQ‐based expression ratios determined by ProteinPilot from the 57 467 ESI‐MS/MS and 26 085 MALDI‐MS/MS spectra were analyzed for measurement accuracy and reproducibility. When the relative abundances of peptides within a sample were increased from 1:1 to 10:1, the mean ratios calculated on both instruments differed by only 0.7–6.7% between platforms. In the 10:1 experiment, up to 64.7% of iTRAQ ratios from LC‐ESI MS/MS spectra failed S/N thresholds and were excluded from quantitation, while only 0.1% of the equivalent LC‐MALDI iTRAQ ratios were rejected. Re‐analysis of an archived LC‐MALDI sample set stored for 5 months generated 3715 MS/MS spectra for quantitation, compared with 3845 acquired originally, and the average ratios differed by only 3.1%. Overall, MS/MS‐based peptide quantitation performance of offline LC‐MALDI was comparable with on‐line LC‐ESI, which required threefold less time. However, offline LC‐MALDI allows the re‐analysis of archived HPLC‐separated samples.  相似文献   

6.
Database-searching programs generally identify only a fraction of the spectra acquired in a standard LC/MS/MS study of digested proteins. Subtle variations in database-searching algorithms for assigning peptides to MS/MS spectra have been known to provide different identification results. To leverage this variation, a probabilistic framework is developed for combining the results of multiple search engines. The scores for each search engine are first independently converted into peptide probabilities. These probabilities can then be readily combined across search engines using Bayesian rules and the expectation maximization learning algorithm. A significant gain in the number of peptides identified with high confidence with each additional search engine is demonstrated using several data sets of increasing complexity, from a control protein mixture to a human plasma sample, searched using SEQUEST, Mascot, and X! Tandem database-searching programs. The increased rate of peptide assignments also translates into a substantially larger number of protein identifications in LC/MS/MS studies compared to a typical analysis using a single database-search tool.  相似文献   

7.
Comprehensive proteome profiling of breast cancer tissue samples is challenging, as the tissue samples contain many proteins with varying concentrations and modifications. We report an effective sample preparation strategy combined with liquid chromatography (LC) electrospray ionization (ESI) quadrupole time-of-flight (QTOF) MS/MS for proteome analysis of human breast cancer tissue. The complexity of the breast cancer tissue proteome was reduced by using protein precipitation from a tissue extract, followed by sequential protein solubilization in solvents of different solubilizing strength. The individual fractions of protein mixtures or subproteomes were subjected to trypsin digestion and the resultant peptides were separated by strong cation exchange (SCX) chromatography, followed by reversed-phase capillary LC combined with high resolution and high accuracy ESI-QTOF MS/MS. This approach identified 14407 unique peptides from 3749 different proteins based on peptide matches with scores above the threshold scores at the 95% confidence level in MASCOT database search of the acquired MS/MS spectra. The false positive rate of peptide matches was determined to be 0.95% by using the target-decoy sequence search strategy. On the basis of gene ontology categorization, the identified proteins represented a wide variety of biological functions, cellular processes, and cellular locations.  相似文献   

8.
We demonstrate an approach for global quantitative analysis of protein mixtures using differential stable isotopic labeling of the enzyme-digested peptides combined with microbore liquid chromatography (LC) matrix-assisted laser desorption ionization (MALDI) mass spectrometry (MS). Microbore LC provides higher sample loading, compared to capillary LC, which facilitates the quantification of low abundance proteins in protein mixtures. In this work, microbore LC is combined with MALDI MS via a heated droplet interface. The compatibilities of two global peptide labeling methods (i.e., esterification to carboxylic groups and dimethylation to amine groups of peptides) with this LC-MALDI technique are evaluated. Using a quadrupole-time-of-flight mass spectrometer, MALDI spectra of the peptides in individual sample spots are obtained to determine the abundance ratio among pairs of differential isotopically labeled peptides. MS/MS spectra are subsequently obtained from the peptide pairs showing significant abundance differences to determine the sequences of selected peptides for protein identification. The peptide sequences determined from MS/MS database search are confirmed by using the overlaid fragment ion spectra generated from a pair of differentially labeled peptides. The effectiveness of this microbore LC-MALDI approach is demonstrated in the quantification and identification of peptides from a mixture of standard proteins as well as E. coli whole cell extract of known relative concentrations. It is shown that this approach provides a facile and economical means of comparing relative protein abundances from two proteome samples.  相似文献   

9.
High‐resolution MS/MS spectra of peptides can be deisotoped to identify monoisotopic masses of peptide fragments. The use of such masses should improve protein identification rates. However, deisotoping is not universally used and its benefits have not been fully explored. Here, MS2‐Deisotoper, a tool for use prior to database search, is used to identify monoisotopic peaks in centroided MS/MS spectra. MS2‐Deisotoper works by comparing the mass and relative intensity of each peptide fragment peak to every other peak of greater mass, and by applying a set of rules concerning mass and intensity differences. After comprehensive parameter optimization, it is shown that MS2‐Deisotoper can improve the number of peptide spectrum matches (PSMs) identified by up to 8.2% and proteins by up to 2.8%. It is effective with SILAC and non‐SILAC MS/MS data. The identification of unique peptide sequences is also improved, increasing the number of human proteoforms by 3.7%. Detailed investigation of results shows that deisotoping increases Mascot ion scores, improves FDR estimation for PSMs, and leads to greater protein sequence coverage. At a peptide level, it is found that the efficacy of deisotoping is affected by peptide mass and charge. MS2‐Deisotoper can be used via a user interface or as a command‐line tool.  相似文献   

10.
11.
Liquid chromatography coupled tandem mass spectrometry (LC‐MS/MS) is an important technique for detecting peptides in proteomics studies. Here, we present an open source software tool, termed IPeak, a peptide identification pipeline that is designed to combine the Percolator post‐processing algorithm and multi‐search strategy to enhance the sensitivity of peptide identifications without compromising accuracy. IPeak provides a graphical user interface (GUI) as well as a command‐line interface, which is implemented in JAVA and can work on all three major operating system platforms: Windows, Linux/Unix and OS X. IPeak has been designed to work with the mzIdentML standard from the Proteomics Standards Initiative (PSI) as an input and output, and also been fully integrated into the associated mzidLibrary project, providing access to the overall pipeline, as well as modules for calling Percolator on individual search engine result files. The integration thus enables IPeak (and Percolator) to be used in conjunction with any software packages implementing the mzIdentML data standard. IPeak is freely available and can be downloaded under an Apache 2.0 license at https://code.google.com/p/mzidentml‐lib/ .  相似文献   

12.
Searching a spectral library for the identification of protein MS/MS data has proven to be a fast and accurate method, while yielding a high identification rate. We investigated the potential to increase peptide discovery rate, with little increase in computational time, by constructing a workflow based on a sequence search with Phenyx followed by a library search with SpectraST. Searching a consensus library compiled from the search results of the prior Phenyx search increased the number of confidently matched spectra by up to 156%. Additionally matched spectra by SpectraST included noisy spectra, spectra representing missed cleaved peptides as well as spectra from post‐translationally modified peptides.  相似文献   

13.
Reversed-phase liquid chromatography (LC) directly coupled with electrospray-tandem mass spectrometry (MS/MS) is a successful choice to obtain a large number of product ion spectra from a complex peptide mixture. We describe a search validation program, ScoreRidge, developed for analysis of LC-MS/MS data. The program validates peptide assignments to product ion spectra resulting from usual probability-based searches against primary structure databases. The validation is based only on correlation between the measured LC elution time of each peptide and the deduced elution time from the amino acid sequence assigned to product ion spectra obtained from the MS/MS analysis of the peptide. Sufficient numbers of probable assignments gave a highly correlative curve. Any peptide assignments within a certain tolerance from the correlation curve were accepted for the following arrangement step to list identified proteins. Using this data validation program, host protein candidates responsible for interaction with human hepatitis B virus core protein were identified from a partially purified protein mixture. The present simple and practical program complements protein identification from usual product ion search algorithms and reduces manual interpretation of the search result data. It will lead to more explicit protein identification from complex peptide mixtures such as whole proteome digests from tissue samples.  相似文献   

14.
Modern nano‐HPLC systems are capable of extremely precise control of solvent gradients, allowing high‐resolution separation of peptides. Most proteomics laboratories use a simple linear analytical gradient for nano‐LC‐MS/MS experiments, though recent evidence indicates that optimized non‐linear gradients result in increased peptide and protein identifications from cell lysates. In concurrent work, we examined non‐linear gradients for the analysis of samples fractionated at the peptide level, where the distribution of peptide retention times often varies by fraction. We hypothesized that greater coverage of these samples could be achieved using per‐fraction optimized gradients. We demonstrate that the optimized gradients improve the distribution of peptides throughout the analysis. Using previous generation MS instrumentation, a considerable gain in peptide and protein identifications can be realized. With current MS platforms that have faster electronics and achieve shorter duty cycle, the improvement in identifications is smaller. Our gradient optimization method has been implemented in a simple graphical tool (GOAT) that is MS‐vendor independent, does not require peptide ID input, and is freely available for non‐commercial use at http://proteomics.swmed.edu/goat/  相似文献   

15.
Typically, detection of protein sequences in collision-induced dissociation (CID) tandem MS (MS2) dataset is performed by mapping identified peptide ions back to protein sequence by using the protein database search (PDS) engine. Finding a particular peptide sequence of interest in CID MS2 records very often requires manual evaluation of the spectrum, regardless of whether the peptide-associated MS2 scan is identified by PDS algorithm or not. We have developed a compact cross-platform database-free command-line utility, pepgrep, which helps to find an MS2 fingerprint for a selected peptide sequence by pattern-matching of modelled MS2 data using Peptide-to-MS2 scoring algorithm. pepgrep can incorporate dozens of mass offsets corresponding to a variety of post-translational modifications (PTMs) into the algorithm. Decoy peptide sequences are used with the tested peptide sequence to reduce false-positive results. The engine is capable of screening an MS2 data file at a high rate when using a cluster computing environment. The matched MS2 spectrum can be displayed by using built-in graphical application programming interface (API) or optionally recorded to file. Using this algorithm, we were able to find extra peptide sequences in studied CID spectra that were missed by PDS identification. Also we found pepgrep especially useful for examining a CID of small fractions of peptides resulting from, for example, affinity purification techniques. The peptide sequences in such samples are less likely to be positively identified by using routine protein-centric algorithm implemented in PDS. The software is freely available at http://bsproteomics.essex.ac.uk:8080/data/download/pepgrep-1.4.tgz.  相似文献   

16.
Proteomics research routinely involves identifying peptides and proteins via MS/MS sequence database search. Thus the database search engine is an integral tool in many proteomics research groups. Here, we introduce the Comet search engine to the existing landscape of commercial and open‐source database search tools. Comet is open source, freely available, and based on one of the original sequence database search tools that has been widely used for many years.  相似文献   

17.
18.
Phosphorylation is a reversible posttranslational protein modification which plays a pivotal role in intracellular signaling. Despite extensive efforts, phosphorylation site mapping of proteomes is still incomplete motivating the exploration of alternative methods that complement existing workflows. In this study, we compared tandem mass spectrometry (MS/MS) on matrix assisted laser desorption/ionization time‐of‐flight (MALDI‐TOF) and nano‐electrospray ionization (nESI) Orbitrap instruments with respect to their ability to identify phosphopeptides from complex proteome digests. Phosphopeptides were enriched from tryptic digests of cell lines using Fe‐IMAC column chromatography and subjected to LC‐MS/MS analysis. We found that the two analytical workflows exhibited considerable orthogonality. For instance, MALDI‐TOF MS/MS favored the identification of phosphopeptides encompassing clear motif signatures for acidic residue directed kinases. The extent of orthogonality of the two LC‐MS/MS systems was comparable to that of using alternative proteases such as Asp‐N, Arg‐C, chymotrypsin, Glu‐C and Lys‐C on just one LC‐MS/MS instrument. Notably, MALDI‐TOF MS/MS identified an unexpectedly high number and percentage of phosphotyrosine sites (~20% of all sites), possibly as a direct consequence of more efficient ionization. The data clearly show that LC‐MALDI MS/MS can be a useful complement to LC‐nESI MS/MS for phosphoproteome mapping and particularly so for acidic and phosphotyrosine containing peptides.  相似文献   

19.
We report an isotope labeling shotgun proteome analysis strategy to validate the spectrum-to-sequence assignments generated by using sequence-database searching for the construction of a more reliable MS/MS spectral library. This strategy is demonstrated in the analysis of the E. coli K12 proteome. In the workflow, E. coli cells were cultured in normal and (15)N-enriched media. The differentially labeled proteins from the cell extracts were subjected to trypsin digestion and two-dimensional liquid chromatography quadrupole time-of-flight tandem mass spectrometry (2D-LC QTOF MS/MS) analysis. The MS/MS spectra of the two samples were individually searched using Mascot against the E. coli proteome database to generate lists of peptide sequence matches. The two data sets were compared by overlaying the spectra of unlabeled and labeled matches of the same peptide sequence for validation. Two cutoff filters, one based on the number of common fragment ions and another one on the similarity of intensity patterns among the common ions, were developed and applied to the overlaid spectral pairs to reject the low quality or incorrectly assigned spectra. By examining 257,907 and 245,156 spectra acquired from the unlabeled and (15)N-labeled samples, respectively, an experimentally validated MS/MS spectral library of tryptic peptides was constructed for E. coli K12 that consisted of 9,302 unique spectra with unique sequence and charge state, representing 7,763 unique peptide sequences. This E. coli spectral library could be readily expanded, and the overall strategy should be applicable to other organisms. Even with this relatively small library, it was shown that more peptides could be identified with higher confidence using the spectral search method than by sequence-database searching.  相似文献   

20.
In proteomics, MS plays an essential role in identifying and quantifying proteins. To characterize mature target proteins from living cells, candidate proteins are often analyzed with PMF and MS/MS ion search methods in combination with computational search routines based on bioinformatics. In contrast to shotgun proteomics, which is widely used to identify proteins, proteomics based on the analysis of N- and C-terminal amino acid sequences (terminal proteomics) should render higher fidelity results because of the high information content of terminal sequence and potentially high throughput of the method not requiring very high sequence coverage to be achieved by extensive sequencing. In line with this expectation, we review recent advances in methods for N- and C-terminal amino acid sequencing of proteins. This review focuses mainly on the methods of N- and C-terminal analyses based on MALDI-TOF MS for its easy accessibility, with several complementary approaches using LC/MS/MS. We also describe problems associated with MS and possible remedies, including chemical and enzymatic procedures to enhance the fidelity of these methods.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号