Similar literature
 20 similar articles found (search time: 31 ms)
1.
The high-throughput nature of proteomics mass spectrometry is enabled by a productive combination of data acquisition protocols and the computational tools used to interpret the resulting spectra. One of the key components in mainstream protocols is the generation of tandem mass (MS/MS) spectra by peptide fragmentation using collision-induced dissociation, the approach currently used in the large majority of proteomics experiments to routinely identify hundreds to thousands of proteins from single mass spectrometry runs. Complementary to these, alternative peptide fragmentation methods such as electron capture/transfer dissociation and higher-energy collision dissociation have consistently achieved significant improvements in the identification of certain classes of peptides, proteins, and post-translational modifications. Recognizing these advantages, mass spectrometry instruments now conveniently support fine-tuned methods that automatically alternate between peptide fragmentation modes for either different types of peptides or for acquisition of multiple MS/MS spectra from each peptide. But although these developments have the potential to substantially improve peptide identification, their routine application requires corresponding adjustments to the software tools and procedures used for automated downstream processing. This review discusses the computational implications of alternative and alternate modes of MS/MS peptide fragmentation and addresses some practical aspects of using such protocols for identification of peptides and post-translational modifications.

2.
Advancement in proteomics research relies on the development of new, innovative tools for identifying and characterizing proteins. Here, we describe a protocol for analyzing peptides and proteins on a chromatographic timescale by coupling nanoflow reverse-phase (RP) liquid chromatography (LC) to electron-transfer dissociation (ETD) mass spectrometry. For this protocol, proteins can be proteolytically digested before ETD analysis, although digestion is not necessary for all applications. Proteins ...

3.
A concept of unique peptides (CUP) was proposed and implemented to identify whole-cell proteins from tandem mass spectrometry (MS/MS) ion spectra. A unique peptide is defined as a peptide, irrespective of its length, that exists in only one protein of a proteome of interest, even though it may appear more than once in that protein. Integrating CUP, a two-step whole-cell protein identification strategy was developed to further increase the confidence of identified proteins. A dataset containing 40,243 MS/MS ion spectra of Saccharomyces cerevisiae and protein identification tools including Mascot and SEQUEST were used to illustrate the proposed concept and strategy. Without CUP, SEQUEST identified 2.26-fold as many proteins as Mascot. When CUP was applied, SEQUEST identified 3.89-fold as many proteins bearing unique peptides as Mascot. Cross-comparing the two sets of identified proteins yielded only 89 common proteins derived from CUP. The key discrepancy between the identified proteins resulted from the filtering criteria employed by each protein identification tool. According to the origin of peptides classified by CUP and the commonality of proteins recognized by the protein identification tools, all identified proteins were cross-compared, resulting in four groups of proteins with different levels of assigned confidence.
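
The definition of a unique peptide given above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the simplified trypsin rule, the function names, and the toy proteome are all assumptions made for illustration, and the abstract itself does not restrict unique peptides to tryptic ones.

```python
import re
from collections import defaultdict


def tryptic_digest(sequence):
    """Cleave after K or R unless followed by P (simplified rule, an assumption)."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def unique_peptides(proteome):
    """Map each protein to the peptides that occur in no other protein.

    A peptide that repeats inside a single protein still counts as unique,
    matching the definition in the abstract.
    """
    owners = defaultdict(set)
    for name, seq in proteome.items():
        for pep in tryptic_digest(seq):
            owners[pep].add(name)
    result = {name: set() for name in proteome}
    for pep, names in owners.items():
        if len(names) == 1:
            result[next(iter(names))].add(pep)
    return result


# Hypothetical toy proteome, not real sequences.
toy = {"P1": "AAKCCRAAK", "P2": "AAKDDR", "P3": "EEKEEK"}
unique = unique_peptides(toy)
```

In this toy case the peptide AAK is shared by P1 and P2 and is discarded, while EEK is kept for P3 even though it occurs twice there.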

4.
De novo peptide sequencing via tandem mass spectrometry.   Cited by: 10 (self-citations: 0; citations by others: 10)
Peptide sequencing via tandem mass spectrometry (MS/MS) is one of the most powerful tools in proteomics for identifying proteins. Because complete genome sequences are accumulating rapidly, the recent trend in interpretation of MS/MS spectra has been database search. However, de novo MS/MS spectral interpretation remains an open problem typically involving manual interpretation by expert mass spectrometrists. We have developed a new algorithm, SHERENGA, for de novo interpretation that automatically learns fragment ion types and intensity thresholds from a collection of test spectra generated from any type of mass spectrometer. The test data are used to construct optimal path scoring in the graph representations of MS/MS spectra. A ranked list of high scoring paths corresponds to potential peptide sequences. SHERENGA is most useful for interpreting sequences of peptides resulting from unknown proteins and for validating the results of database search algorithms in fully automated, high-throughput peptide sequencing.

5.
One of the challenges associated with large-scale proteome analysis using tandem mass spectrometry (MS/MS) and automated database searching is to reduce the number of false positive identifications without sacrificing the number of true positives found. In this work, a systematic investigation of the effect of 2MEGA labeling (N-terminal dimethylation after lysine guanidination) on the proteome analysis of a membrane fraction of an Escherichia coli cell extract by 2-dimensional liquid chromatography MS/MS is presented. By a large-scale comparison of MS/MS spectra of native peptides with those from the 2MEGA-labeled peptides, the labeled peptides were found to undergo facile fragmentation with enhanced a1 or a1-related (a1-17 and a1-45) ions derived from all N-terminal amino acids in the MS/MS spectra; these ions are usually difficult to detect in the MS/MS spectra of nonderivatized peptides. The 2MEGA labeling alleviated the biased detection of arginine-terminated peptides that is often observed in MALDI and ESI MS experiments. 2MEGA labeling was found not only to increase the number of peptides and proteins identified but also to generate enhanced a1 or a1-related ions as a constraint to reduce the number of false positive identifications. In total, 640 proteins were identified from the E. coli membrane fraction, with each protein identified based on peptide mass and sequence match of one or more peptides using MASCOT database search algorithm from the MS/MS spectra generated by a quadrupole time-of-flight mass spectrometer. Among them, the subcellular locations of 336 proteins are presently known, including 258 membrane and membrane-associated proteins (76.8%). Among the classified proteins, there was a dramatic increase in the total number of integral membrane proteins identified in the 2MEGA-labeled sample (153 proteins) versus the unlabeled sample (77 proteins).

6.
Tandem mass spectrometry (MS/MS) is frequently used in the identification of peptides and proteins. Typical proteomic experiments rely on algorithms such as SEQUEST and MASCOT to compare thousands of tandem mass spectra against the theoretical fragment ion spectra of peptides in a database. The probabilities that these spectrum-to-sequence assignments are correct can be determined by statistical software such as PeptideProphet or through estimations based on reverse or decoy databases. However, many of the software applications that assign probabilities for MS/MS spectra to sequence matches were developed using training data sets from 3D ion-trap mass spectrometers. Given the variety of types of mass spectrometers that have become commercially available over the last 5 years, we sought to generate a data set of reference data covering multiple instrumentation platforms to facilitate both the refinement of existing computational approaches and the development of novel software tools. We analyzed the proteolytic peptides in a mixture of tryptic digests of 18 proteins, named the "ISB standard protein mix", using 8 different mass spectrometers. These include linear and 3D ion traps, two quadrupole time-of-flight platforms (qq-TOF), and two MALDI-TOF-TOF platforms. The resulting data set, which has been named the Standard Protein Mix Database, consists of over 1.1 million spectra in 150+ replicate runs on the mass spectrometers. The data were inspected for quality of separation and searched using SEQUEST. All data, including the native raw instrument and mzXML formats and the PeptideProphet validated peptide assignments, are available at http://regis-web.systemsbiology.net/PublicDatasets/.

7.
Despite a recent surge of interest in database-independent peptide identifications, accurate de novo peptide sequencing remains an elusive goal. While the recently introduced spectral network approach resulted in accurate peptide sequencing in low-complexity samples, its success depends on the chance of presence of spectra from overlapping peptides. On the other hand, while multistage mass spectrometry (collecting multiple MS³ spectra from each MS² spectrum) can be applied to all spectra in a complex sample, there are currently no software tools for de novo peptide sequencing by multistage mass spectrometry. We describe a rigorous probabilistic framework for analyzing spectra of overlapping peptides and show how to apply it for multistage mass spectrometry. Our software results in both accurate de novo peptide sequencing from multistage mass spectra (despite the inferior quality of MS³ spectra) and improved interpretation of spectral networks. We further study the problem of de novo peptide sequencing with accurate parent mass (but inaccurate fragment masses), the protocol that may soon become the dominant mode of spectral acquisition. Most existing peptide sequencing algorithms (based on the spectrum graph approach) do not track the accurate parent mass and are thus not equipped for solving this problem. We describe a de novo peptide sequencing algorithm aimed at this experimental protocol and show that it improves the sequencing accuracy on both tandem and multistage mass spectrometry.

8.
Mass spectrometry in three dimensions (MS3D) is a newly developed method for the determination of protein structures involving intramolecular chemical crosslinking of proteins, proteolytic digestion of the resulting adducts, identification of crosslinks by mass spectrometry (MS), peak assignment using theoretical mass lists, and computational reduction of crosslinks to a structure by distance geometry methods. To facilitate the unambiguous identification of crosslinked peptides from proteolytic digestion mixtures of crosslinked proteins by MS, we introduced double 18O isotopic labels into the crosslinking reagent to provide the crosslinked peptides with a characteristic isotope pattern. The presence of doublets separated by 4 Da in the mass spectra of these materials allowed ready discrimination between crosslinked and modified peptides, and uncrosslinked peptides using automated intelligent data acquisition (IDA) of MS/MS data. This should allow ready automation of the method for application to whole expressible proteomes.
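
The doublet criterion described above (peak pairs separated by 4 Da) lends itself to a simple peak-list scan. The following Python sketch is only an illustration under stated assumptions: the function name, the tolerance, the charge handling, and the peak values are hypothetical and do not come from the MS3D work.

```python
def find_doublets(mz_values, charge=1, spacing_da=4.0, tol=0.02):
    """Return (light, heavy) m/z pairs whose spacing is spacing_da / charge.

    The tolerance `tol` (in m/z units) is an assumed value for illustration.
    """
    peaks = sorted(mz_values)
    target = spacing_da / charge
    pairs = []
    for i, light in enumerate(peaks):
        for heavy in peaks[i + 1:]:
            delta = heavy - light
            if delta > target + tol:
                break  # peaks are sorted, so later peaks are even farther away
            if abs(delta - target) <= tol:
                pairs.append((light, heavy))
    return pairs


# Hypothetical centroided peak list (m/z values only).
peaks = [500.25, 504.26, 610.30, 612.31, 700.40, 704.40]
doublets = find_doublets(peaks)              # singly charged: 4 m/z units apart
doublets_2plus = find_doublets(peaks, charge=2)  # doubly charged: 2 m/z units apart
```

A real acquisition method would also compare peak intensities within each pair; that check is omitted here to keep the sketch short.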

9.
When analyzing proteins in complex samples using tandem mass spectrometry of peptides generated by proteolysis, the inference of proteins can be ambiguous, even with well-validated peptides. Unresolved questions include whether to show all possible proteins vs a minimal list, what to do when proteins are inferred ambiguously, and how to quantify peptides that bridge multiple proteins, each with distinguishing evidence. Here we describe IsoformResolver, a peptide-centric protein inference algorithm that clusters proteins in two ways, one based on peptides experimentally identified from MS/MS spectra, and the other based on peptides derived from an in silico digest of the protein database. MS/MS-derived protein groups report minimal list proteins in the context of all possible proteins, without redundantly listing peptides. In silico-derived protein groups pull together functionally related proteins, providing stable identifiers. The peptide-centric grouping strategy used by IsoformResolver allows proteins to be displayed together when they share peptides in common, providing a comprehensive yet concise way to organize protein profiles. It also summarizes information on spectral counts and is especially useful for comparing results from multiple LC-MS/MS experiments. Finally, we examine the relatedness of proteins within IsoformResolver groups and compare its performance to other protein inference software.
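
The idea of displaying proteins together when they share peptides can be sketched as a connected-components problem. The Python example below is a generic union-find grouping, not IsoformResolver's actual algorithm, which the abstract does not specify; the function name and the peptide-to-protein map are hypothetical.

```python
from collections import defaultdict


def group_proteins(peptide_to_proteins):
    """Cluster proteins linked, directly or transitively, by shared peptides."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[rb] = ra

    for proteins in peptide_to_proteins.values():
        proteins = list(proteins)
        if not proteins:
            continue
        find(proteins[0])  # register proteins supported by a single peptide
        for other in proteins[1:]:
            union(proteins[0], other)

    groups = defaultdict(set)
    for protein in list(parent):
        groups[find(protein)].add(protein)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical identifications: peptide -> proteins that contain it.
hits = {
    "PEPA": ["ISO1", "ISO2"],
    "PEPB": ["ISO2", "ISO3"],
    "PEPC": ["OTHER"],
}
protein_groups = group_proteins(hits)
```

The same function could be fed either experimentally identified peptides or peptides from an in silico digest, which mirrors the two clustering views mentioned in the abstract.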

10.
Protein catalogs containing a large number of proteins expressed in a variety of organs can be powerful tools for stem-cell research, because this requires accurate knowledge about how cells differentiate. Salivary gland progenitor (SGP) cells are somatic stem cells isolated from the salivary gland that can differentiate into hepatic or pancreatic cell lineages. Their differentiation state has been assessed by the expression of major protein markers, but to use these cells in regenerative medicine, it will be necessary to establish additional means of quality assessment. We examined the use of shotgun proteomics for porcine salivary gland (a source of SGP cells) and liver (a destination of differentiated SGP cells) for determining the state of SGP cell differentiation. Protein complexes from each organ were digested into peptides and separated by two-dimensional liquid chromatography involving strong cation-exchange chromatography followed by reversed-phase liquid chromatography. The separated peptides were analyzed by on-line electrospray ionization tandem mass spectrometry using a quadrupole-time of flight mass spectrometer (ESI Q-TOF MS/MS), and the spectra obtained were processed to search peptides against a mammalian database for protein identification. Using this method, we identified 117 proteins in porcine salivary gland and 154 proteins in porcine liver. Of these, 72 and 109 were specific to salivary gland and liver, respectively, and some of these were previously shown to be organ specific. The current study can be utilized in the future as a basis to study the pattern of differentiation in protein expression by stem cells.

11.
Highly complex protein mixtures can be directly analyzed after proteolysis by liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS). In this paper, we have utilized the combination of strong cation exchange (SCX) and reversed-phase (RP) chromatography to achieve two-dimensional separation prior to MS/MS. One milligram of whole yeast protein was proteolyzed and separated by SCX chromatography (2.1 mm i.d.) with fraction collection every minute during an 80-min elution. Eighty fractions were reduced in volume and then re-injected via an autosampler in an automated fashion using a vented-column (100 µm i.d.) approach for RP-LC-MS/MS analysis. More than 162,000 MS/MS spectra were collected with 26,815 matched to yeast peptides (7,537 unique peptides). A total of 1,504 yeast proteins were unambiguously identified in this single analysis. We present a comparison of this experiment with a previously published yeast proteome analysis by Yates and colleagues (Washburn, M. P.; Wolters, D.; Yates, J. R., III. Nat. Biotechnol. 2001, 19, 242-7). In addition, we report an in-depth analysis of the false-positive rates associated with peptide identification using the Sequest algorithm and a reversed yeast protein database. New criteria are proposed to decrease false-positives to less than 1% and to greatly reduce the need for manual interpretation while permitting more proteins to be identified.
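
The reversed-database approach to estimating false positives mentioned above can be sketched in a few lines of Python. This is a minimal illustration, assuming the decoys/targets ratio as the estimator (one common convention; the abstract does not state which formula the authors used). The `REV_` prefix, the function names, and the scores are hypothetical.

```python
def reverse_database(proteins):
    """Build decoy entries by reversing each sequence (REV_ prefix is an assumption)."""
    return {f"REV_{name}": seq[::-1] for name, seq in proteins.items()}


def fdr_at_threshold(psms, threshold):
    """Estimate the false-positive rate among target matches scoring >= threshold.

    `psms` is a list of (score, is_decoy) pairs from a search against the
    combined forward + reversed database.
    """
    targets = sum(1 for score, decoy in psms if score >= threshold and not decoy)
    decoys = sum(1 for score, decoy in psms if score >= threshold and decoy)
    return decoys / targets if targets else 0.0


def threshold_for_fdr(psms, max_fdr=0.01):
    """Return the lowest score cutoff whose estimated rate is <= max_fdr, or None."""
    best = None
    for cutoff in sorted({score for score, _ in psms}, reverse=True):
        if fdr_at_threshold(psms, cutoff) <= max_fdr:
            best = cutoff
    return best


# Hypothetical peptide-spectrum matches: (score, matched a reversed sequence?).
psms = [(4.0, False), (3.5, False), (3.0, False),
        (2.5, True), (2.0, False), (1.5, True)]
cutoff_1pct = threshold_for_fdr(psms, max_fdr=0.01)
```

With these toy scores, the first reversed-sequence hit appears at 2.5, so a 1% target forces the cutoff up to 3.0.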

12.
Mass spectrometry (MS) is a technique widely used in biological studies. It associates a spectrum with a biological sample. A spectrum consists of pairs of values (intensity, m/z), where intensity measures the abundance of biomolecules (such as proteins) with a given mass-to-charge ratio (m/z) present in the originating sample. In proteomics experiments, MS spectra are used to identify expression patterns in clinical samples that may be responsible for diseases. Recently, to improve the identification of peptides/proteins related to such patterns, the MS/MS process has been used, which performs a cascade of mass spectrometric analyses on selected peaks. This technique has been demonstrated to improve the identification and quantification of proteins/peptides in samples. Nevertheless, MS analysis deals with a huge amount of data, often affected by noise, and thus requires automatic data management systems. Tools have been developed, and are most often supplied with the instruments, that allow: (i) spectra analysis and visualization, (ii) pattern recognition, (iii) protein database querying, (iv) peptide/protein quantification and identification. Currently, most of the tools supporting these phases need to be optimized to improve the identification of proteins and their functions. In this article we survey applications that support spectrometrists and biologists in obtaining information from biological samples, analyzing the available software for the different phases. We consider different mass spectrometry techniques, and thus different requirements. We focus on tools for (i) data preprocessing, which prepares the results obtained from spectrometers for analysis; (ii) spectra analysis, representation and mining, aimed at identifying common and/or hidden patterns in spectra sets or at classifying data; (iii) database querying to identify peptides; and (iv) improving and boosting the identification and quantification of selected peaks.
We outline some open problems and report on requirements that represent new challenges for bioinformatics.

13.
Membrane proteins are fairly refractory to digestion, especially by trypsin, and less specific proteases, such as elastase and pepsin, are much more effective. However, database searching using nontryptic peptides is much less effective because of the lack of charge localization at the N and C termini and the absence of sequence specificity. We describe a method for N-terminal-specific labeling of peptides from nontryptic digestions of membrane proteins, which facilitates Mascot database searching and can be used for relative quantitation. The conditions for digestion have been optimized to obtain peptides of a suitable length for mass spectrometry (MS) fragmentation. We show the effectiveness of the method using a plasma membrane preparation from a leukemia cell line and demonstrate a large increase in the number of membrane proteins with small extramembrane domains identified in comparison to previously published methods.

14.
Hundreds of ribosomally synthesized cyclopeptides have been isolated from all domains of life, the vast majority having been reported in the last 15 years. Studies of cyclic peptides have highlighted their exceptional potential both as stable drug scaffolds and as biomedicines in their own right. Despite this, computational techniques for cyclopeptide identification are still in their infancy, with many such peptides remaining uncharacterized. Tandem mass spectrometry has occupied a niche role in cyclopeptide identification, taking over from traditional techniques such as nuclear magnetic resonance spectroscopy (NMR). MS/MS studies require only picogram quantities of peptide (compared to milligrams for NMR studies) and are applicable to complex samples, abolishing the requirement for time-consuming chromatographic purification. While database search tools such as Sequest and Mascot have become standard tools for the MS/MS identification of linear peptides, they are not applicable to cyclopeptides, due to the parent mass shift resulting from cyclization and different fragmentation patterns of cyclic peptides. In this paper, we describe the development of a novel database search methodology to aid in the identification of cyclopeptides by mass spectrometry and evaluate its utility in identifying two peptide rings from Helianthus annuus, a bacterial cannibalism factor from Bacillus subtilis, and a θ-defensin from Rhesus macaque.

15.
We report on the analysis of endogenous peptides in cerebrospinal fluid (CSF) by mass spectrometry. A method was developed for preparation of peptide extracts from CSF. Analysis of the extracts by offline LC-MALDI MS resulted in the detection of 3,000-4,000 peptide-like features. Out of these, 730 peptides were identified by MS/MS. The majority of these peptides have not been previously reported in CSF. The identified peptides were found to originate from 104 proteins, of which several have been reported to be involved in different disorders of the central nervous system. These results support the notion that CSF peptidomics may be a viable complement to proteomics in the search for biomarkers of CNS disorders.

16.
17.
Protein identification by mass spectrometry is mainly based on MS/MS spectra and the accuracy of molecular mass determination. However, the high complexity and dynamic range of proteomic samples from any species surpass the separation capacity and detection power of the most advanced multidimensional liquid chromatographs and mass spectrometers. Only a tiny portion of the signals is selected for MS/MS experiments, and a considerable number of them still do not provide reliable peptide identification. In this article, an in silico analysis of a novel methodology for peptide and protein identification is described. The approach uses mass accuracy, isoelectric point (pI), retention time (t(R)) and N-terminal amino acid determination as protein identification criteria, independently of high-quality MS/MS spectra. When the methodology was combined with selective isolation methods, the number of unique peptides and identified proteins increased. Finally, to demonstrate the feasibility of the methodology, an OFFGEL-LC-MS/MS experiment was also implemented. We compared the more reliable peptides identified with MS/MS information with the peptides identified with three experimental features (pI, t(R), molecular mass). Also, two theoretical assumptions from MS/MS identification (selective isolation of peptides and N-terminal amino acid) were analyzed. Our results show that, using the information provided by these features and selective isolation methods, we could find 93% of the high-confidence proteins identified by MS/MS, with a false-positive rate lower than 5%.

18.
Post-translational modifications are used by cells to control the functions of proteins. Phosducin-like protein (PhLP) is a regulator of G-protein signaling that is post-translationally modified via phosphorylation. Phosphorylation of PhLP initiates its degradation by the 26S proteasome in serum-stimulated cells. In this report, we show that PhLP is phosphorylated in serum-stimulated Chinese hamster ovary (CHO) cells. Through the use of tandem mass spectrometry (MS/MS), the specific amino acids phosphorylated can be identified. A PhLP-myc-His construct was purified and phosphorylated by serum-stimulated CHO extract. The resulting protein was digested with trypsin and the peptides were identified by liquid chromatography-tandem mass spectrometry (LC-MS/MS). Automated collision-induced dissociation data acquisition was compared with LC-MS/MS of manually chosen parents. In general, LC-MS/MS is superior for parent ions chosen manually, with the notable exception that automated fragmentation employs dynamic collision energy, which can result in higher quality collision-induced dissociation. Using the LC-MS/MS methods, four phosphorylation sites on PhLP were positively identified.

19.
Multiplexed tandem mass spectrometry (MS/MS) has recently been demonstrated as a means to increase the throughput of peptide identification in liquid chromatography (LC) MS/MS experiments. In this approach, a set of parent species is dissociated simultaneously and measured in a single spectrum (in the same manner that a single parent ion is conventionally studied), providing a gain in sensitivity and throughput proportional to the number of species that can be simultaneously addressed. In the present work, simulations performed using the Caenorhabditis elegans predicted proteins database show that multiplexed MS/MS data allow the identification of tryptic peptides from mixtures of up to ten peptides from a single dataset with only three "y" or "b" fragments per peptide and a mass accuracy of 2.5 to 5 ppm. At this level of database and data complexity, 98% of the 500 peptides considered in the simulation were correctly identified. This compares favorably with the rates obtained for classical MS/MS at more modest mass measurement accuracy. LC multiplexed Fourier transform-ion cyclotron resonance MS/MS data obtained from a 66 kDa protein (bovine serum albumin) tryptic digest sample are presented to illustrate the approach, and confirm that peptides can be effectively identified from the C. elegans database to which the protein sequence had been appended.
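
The matching of "b" and "y" fragments at a ppm-level mass accuracy can be illustrated with a short Python sketch. It computes singly charged fragment m/z values from standard monoisotopic residue masses and counts how many fall within a ppm tolerance of observed peaks. The function names, the "at least three fragments" rule as coded here, and the observed peak list are assumptions for illustration, not the simulation described in the abstract.

```python
# Standard monoisotopic residue masses (Da).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565
PROTON = 1.007276


def fragment_ions(peptide):
    """Return (b_ions, y_ions) as singly charged m/z lists."""
    masses = [MONO[aa] for aa in peptide]
    b = [sum(masses[:i]) + PROTON for i in range(1, len(masses))]
    y = [sum(masses[i:]) + WATER + PROTON for i in range(1, len(masses))]
    return b, y


def count_matches(peptide, observed_mz, ppm=5.0):
    """Count theoretical b/y fragments matched by any observed peak within `ppm`."""
    b, y = fragment_ions(peptide)
    hits = 0
    for theo in b + y:
        if any(abs(obs - theo) / theo * 1e6 <= ppm for obs in observed_mz):
            hits += 1
    return hits


def is_identified(peptide, observed_mz, ppm=5.0, min_fragments=3):
    """Accept a candidate if at least `min_fragments` b/y ions are matched."""
    return count_matches(peptide, observed_mz, ppm) >= min_fragments


# Hypothetical multiplexed peak list containing fragments of more than one peptide.
observed = [129.0658, 147.1128, 218.1499, 300.0000]
gak_hits = count_matches("GAK", observed)
```

Because the peak list is shared by all co-fragmented parents, each candidate peptide is scored independently against the same observed peaks.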

20.
Two-dimensional liquid chromatography (2D-LC) coupled on-line with electrospray ionization tandem mass spectrometry (2D-LC-ESI-MS/MS) is a new platform for the analysis and identification of proteomes. Peptides are separated by 2D-LC and then analyzed by tandem MS. The MS/MS data are searched against a database for protein identification. In one 2D-LC-ESI-MS/MS run, we obtained not only the structural information of peptides directly from MS/MS, but also the retention times of peptides eluted from the LC. Information on the chromatographic behavior of peptides can assist protein identification in this new platform for proteomics. The retention time of the matching peptides of each identified protein was predicted from the hydrophobic contribution of each amino acid in reversed-phase liquid chromatography (RPLC). Using this strategy, proteins were identified by four types of information: peptide mass fingerprinting (PMF), sequence query, MS/MS ion search, and the predicted retention time. This additional information obtained from LC could assist protein identification at no extra experimental cost.
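
Predicting retention time from per-residue hydrophobic contributions can be sketched as a simple additive model. The Python example below is illustrative only: the coefficient values are arbitrary placeholders (not published retention coefficients), and the linear form, the function names, and the acceptance window are assumptions rather than the method used in the paper.

```python
# Placeholder per-residue contributions; real values would be fitted to
# calibration peptides or taken from a published coefficient table.
PLACEHOLDER_COEFF = {"A": 1.0, "L": 8.0, "K": -2.0, "G": 0.0, "F": 10.0, "E": 0.5}


def predict_rt(peptide, coeff=PLACEHOLDER_COEFF, slope=1.0, intercept=0.0):
    """Predicted retention time = intercept + slope * sum of residue contributions."""
    return intercept + slope * sum(coeff[aa] for aa in peptide)


def rt_consistent(peptide, observed_rt, window=2.0, **model):
    """True if the observed retention time lies within `window` of the prediction."""
    return abs(predict_rt(peptide, **model) - observed_rt) <= window


predicted = predict_rt("GALK")          # 0.0 + 1.0 + 8.0 - 2.0
accepted = rt_consistent("GALK", 8.5)   # within the 2.0 window
rejected = rt_consistent("GALK", 12.0)  # outside the window
```

Used this way, the retention-time check acts as an additional filter on candidate peptide matches and requires no extra measurement beyond the LC run itself.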
