首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Reversed-phase liquid chromatography (LC) directly coupled with electrospray-tandem mass spectrometry (MS/MS) is a successful choice to obtain a large number of product ion spectra from a complex peptide mixture. We describe a search validation program, ScoreRidge, developed for analysis of LC-MS/MS data. The program validates peptide assignments to product ion spectra resulting from usual probability-based searches against primary structure databases. The validation is based only on correlation between the measured LC elution time of each peptide and the deduced elution time from the amino acid sequence assigned to product ion spectra obtained from the MS/MS analysis of the peptide. Sufficient numbers of probable assignments gave a highly correlative curve. Any peptide assignments within a certain tolerance from the correlation curve were accepted for the following arrangement step to list identified proteins. Using this data validation program, host protein candidates responsible for interaction with human hepatitis B virus core protein were identified from a partially purified protein mixture. The present simple and practical program complements protein identification from usual product ion search algorithms and reduces manual interpretation of the search result data. It will lead to more explicit protein identification from complex peptide mixtures such as whole proteome digests from tissue samples.  相似文献   

2.
MOTIVATION: The identification of peptides by tandem mass spectrometry (MS/MS) is a central method of proteomics research, but due to the complexity of MS/MS data and the large databases searched, the accuracy of peptide identification algorithms remains limited. To improve the accuracy of identification we applied a machine-learning approach using a hidden Markov model (HMM) to capture the complex and often subtle links between a peptide sequence and its MS/MS spectrum. Model: Our model, HMM_Score, represents ion types as HMM states and calculates the maximum joint probability for a peptide/spectrum pair using emission probabilities from three factors: the amino acids adjacent to each fragmentation site, the mass dependence of ion types and the intensity dependence of ion types. The Viterbi algorithm is used to calculate the most probable assignment between ion types in a spectrum and a peptide sequence, then a correction factor is added to account for the propensity of the model to favor longer peptides. An expectation value is calculated based on the model score to assess the significance of each peptide/spectrum match. RESULTS: We trained and tested HMM_Score on three data sets generated by two different mass spectrometer types. For a reference data set recently reported in the literature and validated using seven identification algorithms, HMM_Score produced 43% more positive identification results at a 1% false positive rate than the best of two other commonly used algorithms, Mascot and X!Tandem. HMM_Score is a highly accurate platform for peptide identification that works well for a variety of mass spectrometer and biological sample types. AVAILABILITY: The program is freely available on ProteomeCommons via an OpenSource license. See http://bioinfo.unc.edu/downloads/ for the download link.  相似文献   

3.
Tandem mass spectrometry using precursor ion selection (MS/MS) is an invaluable tool for structural elucidation of small molecules. In non-targeted metabolite profiling studies, instrument duty cycle limitations and experimental costs have driven efforts towards alternate approaches. Recently, researchers have begun to explore methods for collecting indiscriminant MS/MS (idMS/MS) data in which the fragmentation process does not involve precursor ion isolation. While this approach has many advantages, importantly speed, sensitivity and coverage, confident assignment of precursor–product ion relationships is challenging, which has inhibited broad adoption of the technique. Here, we present an approach that uses open source software to improve the assignment of precursor–product relationships in idMS/MS data by appending a dataset-wide correlational analysis to existing tools. The utility of the approach was demonstrated using a dataset of standard compounds spiked into a malt-barley background, as well as unspiked human serum. The workflow was able to recreate idMS/MS spectra which are highly similar to standard MS/MS spectra of authentic standards, even in the presence of a complex matrix background. The application of this approach has the potential to generate high quality idMS/MS spectra for each detectable molecular feature, which will streamline the identification process for non-targeted metabolite profiling studies.  相似文献   

4.
Protein identification by interrogation of databases requires a comprehensive compilation of modified amino acids forms. Here, we describe the chemical oxidation of carboxyamidomethyl cysteine to the sulfoxide and sulfone forms, species that may add more complexity to peptide analyses. They can be easily distinguished by tandem mass spectrometry (MS/MS) due to their characteristic pattern of side chain neutral eliminations either from the parent ion or ion series that generate dehydroalanine as detected by MS(3). This finding was supported by the MS(n) spectra recorded for a peptide isolated from a mixture of tryptic peptides and for a derivatized/oxidized synthetic peptide with a different sequence. These modifications and their diagnostic neutral losses should be included in the list of chemical modifications and in algorithms designed for the automatic sequencing of peptides and database searching.  相似文献   

5.
Although tandem mass spectrometry (MS/MS) has become an integral part of proteomics, intensity patterns in MS/MS spectra are rarely weighted heavily in most widely used algorithms because they are not yet fully understood. Here a knowledge mining approach is demonstrated to discover fragmentation intensity patterns and elucidate the chemical factors behind such patterns. Fragmentation intensity information from 28 330 ion trap peptide MS/MS spectra of different charge states and sequences went through unsupervised clustering using a penalized K-means algorithm. Without any prior chemistry assumptions, four clusters with distinctive fragmentation patterns were obtained. A decision tree was generated to investigate peptide sequence motif and charge state status that caused these fragmentation patterns. This data-mining scheme is generally applicable for any large data sets. It bypasses the common prior knowledge constraints and reports on the overall peptide fragmentation behavior. It improves the understanding of gas-phase peptide dissociation and provides a foundation for new or improved protein identification algorithms.  相似文献   

6.
Large numbers of MS/MS peptide spectra generated in proteomics experiments require efficient, sensitive and specific algorithms for peptide identification. In the Open Mass Spectrometry Search Algorithm (OMSSA), specificity is calculated by a classic probability score using an explicit model for matching experimental spectra to sequences. At default thresholds, OMSSA matches more spectra from a standard protein cocktail than a comparable algorithm. OMSSA is designed to be faster than published algorithms in searching large MS/MS datasets.  相似文献   

7.
Liquid chromatography–mass spectrometry (LC–MS) is a commonly used analytical platform for non-targeted metabolite profiling experiments. Although data acquisition, processing and statistical analyses are almost routine in such experiments, further annotation and subsequent identification of chemical compounds are not. For identification, tandem mass spectra provide valuable information towards the structure of chemical compounds. These are typically acquired online, in data-dependent mode, or offline, using handcrafted acquisition methods and manually extracted from raw data. Here, we present several methods to fast-track and improve both the acquisition and processing of LC–MS/MS data. Our nearly online (nearline) data-dependent tandem MS strategy creates a minimal set of LC–MS/MS acquisition methods for relevant features revealed by a preceding non-targeted profiling experiment. Using different filtering criteria, such as intensity or ion type, the acquisition of irrelevant spectra is minimized. Afterwards, LC–MS/MS raw data are processed with feature detection and grouping algorithms. The extracted tandem mass spectra can be used for both library search and de-novo identification methods. The algorithms are implemented in the R package MetShot and support the export to Bruker, Agilent or Waters QTOF instruments and the vendor-independent TraML standard. We evaluate the performance of our workflow on a Bruker micrOTOF-Q by comparison of automatically acquired and extracted tandem mass spectra obtained from a mixture of natural product standards against manually extracted reference spectra. Using Arabidopsis thaliana wild-type and biosynthetic gene knockout plants, we characterize the metabolic products of a biosynthetic pathway and demonstrate the integration of our approach into a typical non-targeted metabolite profiling workflow.  相似文献   

8.

Liquid chromatography–mass spectrometry (LC–MS) is a commonly used analytical platform for non-targeted metabolite profiling experiments. Although data acquisition, processing and statistical analyses are almost routine in such experiments, further annotation and subsequent identification of chemical compounds are not. For identification, tandem mass spectra provide valuable information towards the structure of chemical compounds. These are typically acquired online, in data-dependent mode, or offline, using handcrafted acquisition methods and manually extracted from raw data. Here, we present several methods to fast-track and improve both the acquisition and processing of LC–MS/MS data. Our nearly online (nearline) data-dependent tandem MS strategy creates a minimal set of LC–MS/MS acquisition methods for relevant features revealed by a preceding non-targeted profiling experiment. Using different filtering criteria, such as intensity or ion type, the acquisition of irrelevant spectra is minimized. Afterwards, LC–MS/MS raw data are processed with feature detection and grouping algorithms. The extracted tandem mass spectra can be used for both library search and de-novo identification methods. The algorithms are implemented in the R package MetShot and support the export to Bruker, Agilent or Waters QTOF instruments and the vendor-independent TraML standard. We evaluate the performance of our workflow on a Bruker micrOTOF-Q by comparison of automatically acquired and extracted tandem mass spectra obtained from a mixture of natural product standards against manually extracted reference spectra. Using Arabidopsis thaliana wild-type and biosynthetic gene knockout plants, we characterize the metabolic products of a biosynthetic pathway and demonstrate the integration of our approach into a typical non-targeted metabolite profiling workflow.

  相似文献   

9.
Tandem mass spectrometry (MS/MS) is frequently used in the identification of peptides and proteins. Typical proteomic experiments rely on algorithms such as SEQUEST and MASCOT to compare thousands of tandem mass spectra against the theoretical fragment ion spectra of peptides in a database. The probabilities that these spectrum-to-sequence assignments are correct can be determined by statistical software such as PeptideProphet or through estimations based on reverse or decoy databases. However, many of the software applications that assign probabilities for MS/MS spectra to sequence matches were developed using training data sets from 3D ion-trap mass spectrometers. Given the variety of types of mass spectrometers that have become commercially available over the last 5 years, we sought to generate a data set of reference data covering multiple instrumentation platforms to facilitate both the refinement of existing computational approaches and the development of novel software tools. We analyzed the proteolytic peptides in a mixture of tryptic digests of 18 proteins, named the "ISB standard protein mix", using 8 different mass spectrometers. These include linear and 3D ion traps, two quadrupole time-of-flight platforms (qq-TOF), and two MALDI-TOF-TOF platforms. The resulting data set, which has been named the Standard Protein Mix Database, consists of over 1.1 million spectra in 150+ replicate runs on the mass spectrometers. The data were inspected for quality of separation and searched using SEQUEST. All data, including the native raw instrument and mzXML formats and the PeptideProphet validated peptide assignments, are available at http://regis-web.systemsbiology.net/PublicDatasets/.  相似文献   

10.
Depository of low-molecular-weight compounds or metabolites detected in various organisms in a non-targeted manner is indispensable for metabolomics research. Due to the diverse chemical compounds, various mass spectrometry (MS) setups with state-of-the-art technologies have been used. Over the past two decades, we have analyzed various biological samples by using gas chromatography-mass spectrometry, liquid chromatography-mass spectrometry, or capillary electrophoresis-mass spectrometry, and archived the datasets in the depository MassBase (http://webs2.kazusa.or.jp/massbase/). As the format of MS datasets depends on the MS setup used, we converted each raw binary dataset of the mass chromatogram to text file format, and thereafter, information of the chromatograph peak was extracted in the text file from the converted file. In total, the depository comprises 46,493 datasets, of which 38,750 belong to the plant species and 7,743 are authentic or mixed chemicals as well as other sources (microorganisms, animals, and foods), as on August 1, 2020. All files in the depository can be downloaded in bulk from the website. Mass chromatograms of 90 plant species obtained by LC-Fourier transform ion cyclotron resonance MS or Orbitrap MS, which detect the ionized molecules with high accuracy allowing speculation of chemical compositions, were converted to text files by the software PowerGet, and the chemical annotation of each peak was added. The processed datasets were deposited in the annotation database KomicMarket2 (http://webs2.kazusa.or.jp/km2/). The archives provide fundamental resources for comparative metabolomics and functional genomics, which may result in deeper understanding of living organisms.  相似文献   

11.
Characterization of betaines using electrospray MS/MS   总被引:3,自引:0,他引:3  
Betaines are an important class of naturally occurring compounds that function as compatible solutes or osmoprotectants. Because of the permanent positive charge on the quaternary ammonium moiety, mass spectrometric analysis has been approached by desorption methods, including fast atom bombardment and plasma desorption mass spectrometry. Here we show that electrospray ionization MS gives comparable results to plasma desorption MS for a range of authentic betaine standards and betaines purified from plant extracts by ion exchange chromatography. A distinct advantage of electrospray ionization MS over plasma desorption MS is the capability of obtaining product ion spectra via MS/MS of selected parent ions, and hence structural information to discriminate between ions of identical mass.  相似文献   

12.
Chromatographed peptide signals form the basis of further data processing that eventually results in functional information derived from data‐dependent bottom‐up proteomics assays. We seek to rank LC/MS parent ions by the quality of their extracted ion chromatograms. Ranked extracted ion chromatograms act as an intuitive physical/chemical preselection filter to improve the quality of MS/MS fragment scans submitted for database search. We identify more than 4900 proteins when considering detector shifts of less than 7 ppm. High quality parent ions for which the database search yields no hits become candidates for subsequent unrestricted analysis for PTMs. Following this rational approach, we prioritize identification of more than 5000 spectrum matches from modified peptides and confirmed the presence of acetylaldehyde‐modified His/Lys. We present a logical workflow that scores data‐dependent selected ion chromatograms and leverage information about semianalytical LC/LC dimension prior to MS. Our method can be successfully used to identify unexpected modifications in peptides with excellent chromatography characteristics, independent of fragmentation pattern and activation methods. We illustrate analysis of ion chromatograms detected in two different modes by RF linear ion trap and electrostatic field orbitrap.  相似文献   

13.
Proteomics uses tandem mass spectrometers and correlation algorithms to match peptides and their fragment spectra to amino acid sequences. The replication of multiple liquid chromatography experiments with electrospray ionization of peptides and tandem mass spectrometry (LC–ESI–MS/MS) produces large sets of MS/MS spectra. There is a need to assess the quality of large sets of experimental results by statistical comparison with that of random expectation. Classical frequency-based statistics such as goodness-of-fit tests for peptide-to-protein distributions could be used to calculate the probability that an entire set of experimental results has arisen by random chance. The frequency distributions of authentic MS/MS spectra from human blood were compared with those of false positive MS/MS spectra generated by a computer, or instrument noise, using the chi-square test. Here the mechanics of the chi-square test to compare the results in toto from a set of LC–ESI–MS/MS experiments with those of random expectation is detailed. The chi-square analysis of authentic spectra demonstrates unambiguously that the analysis of blood proteins separated by partition chromatography prior to tryptic digestions has a low probability that the cumulative peptide-to-protein distribution is the same as that of random or noise false positive spectra.  相似文献   

14.
Nakamura T  Dohmae N  Takio K 《Proteomics》2004,4(9):2558-2566
We describe a new approach for the characterization of a digested protein complex with quantitative aspects. Accurate masses of tryptic peptides in the digested complex were acquired by nano-liquid chromatography Fourier transform-ion cyclotron resonance mass spectrometry (MS). The conditions of the electrospray ion source were alternated to acquire normal and fragment-ion-rich mass spectra concurrently. This, alternating-scan method, which includes no tandem mass spectrometry (MS/MS), allowed us to retain the integrity of the mass chromatograms and averted missed peptides due to MS and MS/MS switching. Tentative assignments of accurate peptide masses were verified with the concurrently acquired fragment-ion-rich spectra, and the identities of the protein components were established. For each identified protein component, mass chromatograms attributable to the validated accurate peptide masses were extracted, and the peak areas of multiple mass chromatograms were standardized. The standardized peak areas appeared to reasonably reflect the molar ratio of the protein components in standard mixtures. This new approach was successfully applied to the characterization of a cyanobacterial photosystem II complex preparation. A clear difference in the standardized peak areas was observed between the two groups of identified components, namely eight stoichiometric photosystem II proteins and two minor copurified phycobiliproteins.  相似文献   

15.
Protein and peptide mass analysis and amino acid sequencing by mass spectrometry is widely used for identification and annotation of post-translational modifications (PTMs) in proteins. Modification-specific mass increments, neutral losses or diagnostic fragment ions in peptide mass spectra provide direct evidence for the presence of post-translational modifications, such as phosphorylation, acetylation, methylation or glycosylation. However, the commonly used database search engines are not always practical for exhaustive searches for multiple modifications and concomitant missed proteolytic cleavage sites in large-scale proteomic datasets, since the search space is dramatically expanded. We present a formal definition of the problem of searching databases with tandem mass spectra of peptides that are partially (sub-stoichiometrically) modified. In addition, an improved search algorithm and peptide scoring scheme that includes modification specific ion information from MS/MS spectra was implemented and tested using the Virtual Expert Mass Spectrometrist (VEMS) software. A set of 2825 peptide MS/MS spectra were searched with 16 variable modifications and 6 missed cleavages. The scoring scheme returned a large set of post-translationally modified peptides including precise information on modification type and position. The scoring scheme was able to extract and distinguish the near-isobaric modifications of trimethylation and acetylation of lysine residues based on the presence and absence of diagnostic neutral losses and immonium ions. In addition, the VEMS software contains a range of new features for analysis of mass spectrometry data obtained in large-scale proteomic experiments. Windows binaries are available at http://www.yass.sdu.dk/.  相似文献   

16.
Tandem mass spectrometry (MS/MS) has emerged as a cornerstone of proteomics owing in part to robust spectral interpretation algorithms. Widely used algorithms do not fully exploit the intensity patterns present in mass spectra. Here, we demonstrate that intensity pattern modeling improves peptide and protein identification from MS/MS spectra. We modeled fragment ion intensities using a machine-learning approach that estimates the likelihood of observed intensities given peptide and fragment attributes. From 1,000,000 spectra, we chose 27,000 with high-quality, nonredundant matches as training data. Using the same 27,000 spectra, intensity was similarly modeled with mismatched peptides. We used these two probabilistic models to compute the relative likelihood of an observed spectrum given that a candidate peptide is matched or mismatched. We used a 'decoy' proteome approach to estimate incorrect match frequency, and demonstrated that an intensity-based method reduces peptide identification error by 50-96% without any loss in sensitivity.  相似文献   

17.
Spectral libraries have emerged as a viable alternative to protein sequence databases for peptide identification. These libraries contain previously detected peptide sequences and their corresponding tandem mass spectra (MS/MS). Search engines can then identify peptides by comparing experimental MS/MS scans to those in the library. Many of these algorithms employ the dot product score for measuring the quality of a spectrum-spectrum match (SSM). This scoring system does not offer a clear statistical interpretation and ignores fragment ion m/z discrepancies in the scoring. We developed a new spectral library search engine, Pepitome, which employs statistical systems for scoring SSMs. Pepitome outperformed the leading library search tool, SpectraST, when analyzing data sets acquired on three different mass spectrometry platforms. We characterized the reliability of spectral library searches by confirming shotgun proteomics identifications through RNA-Seq data. Applying spectral library and database searches on the same sample revealed their complementary nature. Pepitome identifications enabled the automation of quality analysis and quality control (QA/QC) for shotgun proteomics data acquisition pipelines.  相似文献   

18.
19.
Mass spectrometers that provide high mass accuracy such as FT-ICR instruments are increasingly used in proteomic studies. Although the importance of accurately determined molecular masses for the identification of biomolecules is generally accepted, its role in the analysis of shotgun proteomic data has not been thoroughly studied. To gain insight into this role, we used a hybrid linear quadrupole ion trap/FT-ICR (LTQ FT) mass spectrometer for LC-MS/MS analysis of a highly complex peptide mixture derived from a fraction of the yeast proteome. We applied three data-dependent MS/MS acquisition methods. The FT-ICR part of the hybrid mass spectrometer was either not exploited, used only for survey MS scans, or also used for acquiring selected ion monitoring scans to optimize mass accuracy. MS/MS data were assigned with the SEQUEST algorithm, and peptide identifications were validated by estimating the number of incorrect assignments using the composite target/decoy database search strategy. We developed a simple mass calibration strategy exploiting polydimethylcyclosiloxane background ions as calibrant ions. This strategy allowed us to substantially improve mass accuracy without reducing the number of MS/MS spectra acquired in an LC-MS/MS run. The benefits of high mass accuracy were greatest for assigning MS/MS spectra with low signal-to-noise ratios and for assigning phosphopeptides. Confident peptide identification rates from these data sets could be doubled by the use of mass accuracy information. It was also shown that improving mass accuracy at a cost to the MS/MS acquisition rate substantially lowered the sensitivity of LC-MS/MS analyses. The use of FT-ICR selected ion monitoring scans to maximize mass accuracy reduced the number of protein identifications by 40%.  相似文献   

20.
A widely used method for protein identification couples prefractionation of protein samples by one-dimensional (1D) PAGE with LC/MS/MS. We developed a new label-free quantitative algorithm by combining measurements of spectral counting, ion intensity, and peak area on 1D PAGE-based proteomics. This algorithm has several improvements over other label-free quantitative algorithms: (i) Errors in peak detection are reduced because the retention time is based on each LC/MS/MS run and actual precursor m/z. (ii) Detection sensitivity is increased because protein quantification is based on the combination of peptide count, ion intensity, and peak area. (iii) Peak intensity and peak area are calculated in each LC/MS/MS run for all slices from 1D PAGE for every single identified protein and visualized as a Western blot image. The sensitivity and accuracy of this algorithm were demonstrated by using standard curves (17.4 fmol to 8.7 pmol), complex protein mixtures (30 fmol to 1.16 pmol) of known composition, and spiked protein (34.8 fmol to 17.4 pmol) in complex proteins. We studied the feasibility of this approach using the secretome of angiotensin II (Ang II)-stimulated vascular smooth muscle cells (VSMCs). From the VSMC-conditioned medium, 629 proteins were identified including 212 putative secreted proteins. 26 proteins were differently expressed in control and Ang II-stimulated VSMCs, including 18 proteins not previously reported. Proteins related to cell growth (CYR61, protein NOV, and clusterin) were increased, whereas growth arrest-specific 6 (GAS6) and growth/differentiation factor 6 were decreased by Ang II stimulation. Ang II-stimulated changes of plasminogen activator inhibitor-1, GAS6, cathepsin B, and periostin were validated by Western blot. In conclusion, a novel label-free quantitative analysis of 1D PAGE-LC/MS/MS-based proteomics has been successfully applied to the identification of new potential mediators of Ang II action and may provide an alternative to traditional protein staining methods.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号