首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 78 毫秒
1.
Protonated peptides derived from proline‐rich proteins (PRP) are often difficult to sequence by standard collision‐induced dissociation (CID) mass spectrometry (MS) due to preferential amide bond cleavage N‐terminal to proline. In connection with bovine spongiform encephalopathy regulations, proteolytic products derived from the PRP collagen have been suggested as markers for contamination of animal feedstuffs with processed animal protein (Fernandez Ocaña, M. et al., Analyst 2004, 129, 111–115). Herein, we report the identification of these marker peptides using the strategy of C‐terminal sequencing by CID MS from their sodium and lithium adducts. Upon fragmentation a new cationized peptide was produced that is one C‐terminal amino acid shorter in length. This dissociation pathway allowed for the facile identification of the C‐terminal residue by matrix‐assisted laser desorption/ionization tandem time‐of‐flight mass spectrometry. Each newly formed cationized peptide was further fragmented by up to seven stages of electrospray ionization ion trap MS. Proline‐rich C‐terminal sequence tags were established which permitted successful database identification of collagen alpha type I proteins.  相似文献   

2.
An Z  Chen Y  Koomen JM  Merkler DJ 《Proteomics》2012,12(2):173-182
Amidation is a post-translational modification found at the C-terminus of ~50% of all neuropeptide hormones. Cleavage of the C(α)-N bond of a C-terminal glycine yields the α-amidated peptide in a reaction catalyzed by peptidylglycine α-amidating monooxygenase (PAM). The mass of an α-amidated peptide decreases by 58 Da relative to its precursor. The amino acid sequences of an α-amidated peptide and its precursor differ only by the C-terminal glycine meaning that the peptides exhibit similar RP-HPLC properties and tandem mass spectral (MS/MS) fragmentation patterns. Growth of cultured cells in the presence of a PAM inhibitor ensured the coexistence of α-amidated peptides and their precursors. A strategy was developed for precursor and α-amidated peptide pairing (PAPP): LC-MS/MS data of peptide extracts were scanned for peptide pairs that differed by 58 Da in mass, but had similar RP-HPLC retention times. The resulting peptide pairs were validated by checking for similar fragmentation patterns in their MS/MS data prior to identification by database searching or manual interpretation. This approach significantly reduced the number of spectra requiring interpretation, decreasing the computing time required for database searching and enabling manual interpretation of unidentified spectra. Reported here are the α-amidated peptides identified from AtT-20 cells using the PAPP method.  相似文献   

3.
Multiplexed tandem mass spectrometry (MS/MS) has recently been demonstrated as a means to increase the throughput of peptide identification in liquid chromatography (LC) MS/MS experiments. In this approach, a set of parent species is dissociated simultaneously and measured in a single spectrum (in the same manner that a single parent ion is conventionally studied), providing a gain in sensitivity and throughput proportional to the number of species that can be simultaneously addressed. In the present work, simulations performed using the Caenorhabditis elegans predicted proteins database show that multiplexed MS/MS data allow the identification of tryptic peptides from mixtures of up to ten peptides from a single dataset with only three "y" or "b" fragments per peptide and a mass accuracy of 2.5 to 5 ppm. At this level of database and data complexity, 98% of the 500 peptides considered in the simulation were correctly identified. This compares favorably with the rates obtained for classical MS/MS at more modest mass measurement accuracy. LC multiplexed Fourier transform-ion cyclotron resonance MS/MS data obtained from a 66 kDa protein (bovine serum albumin) tryptic digest sample are presented to illustrate the approach, and confirm that peptides can be effectively identified from the C. elegans database to which the protein sequence had been appended.  相似文献   

4.
One of the challenges associated with large-scale proteome analysis using tandem mass spectrometry (MS/MS) and automated database searching is to reduce the number of false positive identifications without sacrificing the number of true positives found. In this work, a systematic investigation of the effect of 2MEGA labeling (N-terminal dimethylation after lysine guanidination) on the proteome analysis of a membrane fraction of an Escherichia coli cell extract by 2-dimensional liquid chromatography MS/MS is presented. By a large-scale comparison of MS/MS spectra of native peptides with those from the 2MEGA-labeled peptides, the labeled peptides were found to undergo facile fragmentation with enhanced a1 or a1-related (a(1)-17 and a(1)-45) ions derived from all N-terminal amino acids in the MS/MS spectra; these ions are usually difficult to detect in the MS/MS spectra of nonderivatized peptides. The 2MEGA labeling alleviated the biased detection of arginine-terminated peptides that is often observed in MALDI and ESI MS experiments. 2MEGA labeling was found not only to increase the number of peptides and proteins identified but also to generate enhanced a1 or a1-related ions as a constraint to reduce the number of false positive identifications. In total, 640 proteins were identified from the E. coli membrane fraction, with each protein identified based on peptide mass and sequence match of one or more peptides using MASCOT database search algorithm from the MS/MS spectra generated by a quadrupole time-of-flight mass spectrometer. Among them, the subcellular locations of 336 proteins are presently known, including 258 membrane and membrane-associated proteins (76.8%). Among the classified proteins, there was a dramatic increase in the total number of integral membrane proteins identified in the 2MEGA-labeled sample (153 proteins) versus the unlabeled sample (77 proteins).  相似文献   

5.
Advances in time-of-flight mass spectrometry allow unit mass resolution of proteins and peptides up to about 6000 Da molecular weight. Identification of larger proteins and study of their posttranslational or experimental modifications by mass analysis is greatly enhanced by cleavage into smaller fragments. Most membrane proteins are difficult to mass analyze because of their high hydrophobicity, typical expression in low quantities, and because the detergents commonly used for solubilization may be deleterious to mass analysis. Cleavage with cyanogen bromide is beneficial for analysis of membrane proteins since the methionine cleavage sites are typically located in hydrophobic domains and cleavage at these points reduces the size of the hydrophobic fragments. Cyanogen bromide also gives high cleavage yields and introduces only volatile contaminants. Even after cleavage membrane proteins often contain fragments that are difficult to chromatograph. Matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) is capable of analyzing complex mixtures without chromatography. We present a MALDI MS method that quickly and reliably identifies the cyanogen bromide fragments and posttranslational modifications of reduced and alkylated bovine rhodopsin from as little as 30 pmol of rhodopsin in detergent-solubilized retinal rod disk membranes, using 1-5 pmol of digest per sample. The amino acid sequences of some of the peptides in the digest were confirmed by post source decomposition MS analysis of the same samples. The method appears to be general and applicable to the analysis of membrane proteins and the protein composition of membrane preparations.  相似文献   

6.
Protein identification has been greatly facilitated by database searches against protein sequences derived from product ion spectra of peptides. This approach is primarily based on the use of fragment ion mass information contained in a MS/MS spectrum. Unambiguous protein identification from a spectrum with low sequence coverage or poor spectral quality can be a major challenge. We present a two-dimensional (2D) mass spectrometric method in which the numbers of nitrogen atoms in the molecular ion and the fragment ions are used to provide additional discriminating power for much improved protein identification and de novo peptide sequencing. The nitrogen number is determined by analyzing the mass difference of corresponding peak pairs in overlaid spectra of (15)N-labeled and unlabeled peptides. These peptides are produced by enzymatic or chemical cleavage of proteins from cells grown in (15)N-enriched and normal media, respectively. It is demonstrated that, using 2D information, i.e., m/z and its associated nitrogen number, this method can, not only confirm protein identification results generated by MS/MS database searching, but also identify peptides that are not possible to identify by database searching alone. Examples are presented of analyzing Escherichia coli K12 extracts that yielded relatively poor MS/MS spectra, presumably from the digests of low abundance proteins, which can still give positive protein identification using this method. Additionally, this 2D MS method can facilitate spectral interpretation for de novo peptide sequencing and identification of posttranslational or other chemical modifications. We envision that this method should be particularly useful for proteome expression profiling of organelles or cells that can be grown in (15)N-enriched media.  相似文献   

7.
Trypsin (EC 3.4.21.4) is the protease of choice for proteome analysis using mass spectrometry of peptides in sample digests. In this work, trypsin from Streptomyces griseus (SGT) was purified to homogeneity from pronase. The enzyme was evaluated in in-gel digestion of protein standards followed by matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) analyses of the digests. We recognized a remarkable cleavage performance of SGT. The number of produced and matching tryptic peptides was higher than in the case of commonly used bovine trypsin (BT) and allowed us to obtain higher identification scores in database searches. Interestingly, SGT was found to also generate nonspecific peptides whose sequencing by MALDI-TOF/TOF tandem mass spectrometry (MS/MS) revealed a partial F-X, Y-X, and W-X cleavage specificity. To suppress autolysis, either arginine or arginine plus lysine residues in SGT were modified by chemical reagents. In consequence, the autolytic pattern of SGT was reduced significantly, but specific activity dropped dramatically. As demonstrated by relative quantification of peptides at different times, SGT is more stable at 37 °C than is its bovine counterpart. We conclude that SGT represents a convenient alternative for proteomic applications involving protein digestion. Moreover, parallel digestions of sample aliquots by SGT and BT provide the possibility of combining partially different results (unique matching peptides) to improve protein identification.  相似文献   

8.
The identification of ubiquitin (Ub) and Ub‐like protein (Ubl) conjugation sites is important in understanding their roles in biological pathway regulations. However, unambiguously and sensitively identifying Ub/Ubl conjugation sites through high‐throughput MS remains challenging. We introduce an improved workflow for identifying Ub/Ubl conjugation sites based on the ChopNSpice and X!Tandem software. ChopNSpice is modified to generate Ub/Ubl conjugation peptides in the form of a cross‐link. A combinatorial FASTA database can be acquired using the modified ChopNSpice (MchopNSpice). The modified X!Tandem (UblSearch) introduces a new fragmentation model for the Ub/Ubl conjugation peptides to match unambiguously the MS/MS spectra with linear peptides or Ub/Ubl conjugation peptides using the combinatorial FASTA database. The novel workflow exhibited better performance in analyzing an Ub and Ubl spectral library and a large‐scale Trypanosoma cruzi small Ub‐related modifier dataset compared with the original ChopNSpice method. The proposed workflow is more suitable for processing large‐scale MS datasets of Ub/Ubl modification. MchopNSpice and UblSearch are freely available under the GNU General Public License v3.0 at http://sourceforge.net/projects/maublsearch .  相似文献   

9.

Background  

Often high-quality MS/MS spectra of tryptic peptides do not match to any database entry because of only partially sequenced genomes and therefore, protein identification requires de novo peptide sequencing. To achieve protein identification of the economically important but still unsequenced plant pathogenic oomycete Plasmopara halstedii, we first evaluated the performance of three different de novo peptide sequencing algorithms applied to a protein digests of standard proteins using a quadrupole TOF (QStar Pulsar i).  相似文献   

10.
Wang H  Dass C 《Peptides》2002,23(12):2143-2150
A method based upon a combination of fast high-performance liquid chromatography (HPLC) and electrospray ionization (ESI)–mass spectrometry (MS) is developed for the analysis of bioactive peptides in bovine adrenal medulla. The fast HPLC uses a short column (33 mm×4.6 mm) packed with nonporous silica-based C-18 stationary phase. Prior to HPLC separation, the medulla was homogenized and the peptide-rich fraction was isolated from it by solid-phase extraction. In-source collision-induced dissociation and tandem MS were used to obtain the sequence of the suspected peptides. Several peptides, including Met–Enk, Leu–Enk, Leu–Enk–Lys, bovine adrenal medullary (BAM)-12 (Met–Enk–RRVGRPE), Leu–Enk–Arg, and YGGT, were unambiguously identified. The first four peptides are the products of proenkephalin A precursor protein and Leu–Enk–Arg belongs to the dynorphin family and is derived from proenkephalin B (prodynorphin) precursor. The database search revealed that YGGT is a part of the sequence of five different precursor proteins.  相似文献   

11.
While tandem mass spectrometry (MS/MS) is routinely used to identify proteins from complex mixtures, certain types of proteins present unique challenges for MS/MS analyses. The major wheat gluten proteins, gliadins and glutenins, are particularly difficult to distinguish by MS/MS. Each of these groups contains many individual proteins with similar sequences that include repetitive motifs rich in proline and glutamine. These proteins have few cleavable tryptic sites, often resulting in only one or two tryptic peptides that may not provide sufficient information for identification. Additionally, there are less than 14,000 complete protein sequences from wheat in the current NCBInr release. In this paper, MS/MS methods were optimized for the identification of the wheat gluten proteins. Chymotrypsin and thermolysin as well as trypsin were used to digest the proteins and the collision energy was adjusted to improve fragmentation of chymotryptic and thermolytic peptides. Specialized databases were constructed that included protein sequences derived from contigs from several assemblies of wheat expressed sequence tags (ESTs), including contigs assembled from ESTs of the cultivar under study. Two different search algorithms were used to interrogate the database and the results were analyzed and displayed using a commercially available software package (Scaffold). We examined the effect of protein database content and size on the false discovery rate. We found that as database size increased above 30,000 sequences there was a decrease in the number of proteins identified. Also, the type of decoy database influenced the number of proteins identified. Using three enzymes, two search algorithms and a specialized database allowed us to greatly increase the number of detected peptides and distinguish proteins within each gluten protein group.  相似文献   

12.
Highly protonated histone-derived peptides impede a sufficient mass spectrometry (MS)-based epigenetic analysis because their relatively low m/z, due to a high degree of proton addition to peptides, would make it difficult to analyze the resulting complex MS/MS spectra. To reduce the degree of protonations, we have developed a new interface, the Ionization Variable Unit (IVU), in which peptides are ionized under a vaporized organic solvent. It is demonstrated that the doubly charged histone tail H2B peptide, PEPAKSAPAPKKGSKKAVTKAQKK (m/z 1238.243, +2), which was not detectable before, can be detected by using the IVU interface and sequenced.  相似文献   

13.
Recent advances in instrument control and enrichment procedures have enabled us to quantify large numbers of phosphoproteins and record site-specific phosphorylation events. An intriguing problem that has arisen with these advances is to accurately validate where phosphorylation events occur, if possible, in an automated manner. The problem is difficult because MS/MS spectra of phosphopeptides are generally more complicated than those of unmodified peptides. For large scale studies, the problem is even more evident because phosphorylation sites are based on single peptide identifications in contrast to protein identifications where at least two peptides from the same protein are required for identification. To address this problem we have developed an integrated strategy that increases the reliability and ease for phosphopeptide validation. We have developed an off-line titanium dioxide (TiO(2)) selective phosphopeptide enrichment procedure for crude cell lysates. Following enrichment, half of the phosphopeptide fractionated sample is enzymatically dephosphorylated, after which both samples are subjected to LC-MS/MS. From the resulting MS/MS analyses, the dephosphorylated peptide is used as a reference spectrum against the original phosphopeptide spectrum, in effect generating two peptide spectra for the same amino acid sequence, thereby enhancing the probability of a correct identification. The integrated procedure is summarized as follows: 1) enrichment for phosphopeptides by TiO(2) chromatography, 2) dephosphorylation of half the sample, 3) LC-MS/MS-based analysis of phosphopeptides and corresponding dephosphorylated peptides, 4) comparison of peptide elution profiles before and after dephosphorylation to confirm phosphorylation, and 5) comparison of MS/MS spectra before and after dephosphorylation to validate the phosphopeptide and its phosphorylation site. This phosphopeptide identification represents a major improvement as compared with identifications based only on single MS/MS spectra and probability-based database searches. We investigated an applicability of this method to crude cell lysates and demonstrate its application on the large scale analysis of phosphorylation sites in differentiating mouse myoblast cells.  相似文献   

14.
Searching spectral libraries in MS/MS is an important new approach to improving the quality of peptide and protein identification. The idea relies on the observation that ion intensities in an MS/MS spectrum of a given peptide are generally reproducible across experiments, and thus, matching between spectra from an experiment and the spectra of previously identified peptides stored in a spectral library can lead to better peptide identification compared to the traditional database search. However, the use of libraries is greatly limited by their coverage of peptide sequences: even for well‐studied organisms a large fraction of peptides have not been previously identified. To address this issue, we propose to expand spectral libraries by predicting the MS/MS spectra of peptides based on the spectra of peptides with similar sequences. We first demonstrate that the intensity patterns of dominant fragment ions between similar peptides tend to be similar. In accordance with this observation, we develop a neighbor‐based approach that first selects peptides that are likely to have spectra similar to the target peptide and then combines their spectra using a weighted K‐nearest neighbor method to accurately predict fragment ion intensities corresponding to the target peptide. This approach has the potential to predict spectra for every peptide in the proteome. When rigorous quality criteria are applied, we estimate that the method increases the coverage of spectral libraries available from the National Institute of Standards and Technology by 20–60%, although the values vary with peptide length and charge state. We find that the overall best search performance is achieved when spectral libraries are supplemented by the high quality predicted spectra.  相似文献   

15.
Although trypsin remains the most commonly used protease in MS, other proteases may be employed for increasing peptide coverage or generating overlapping peptides. Knowledge of the accurate specificity rules of these proteases is helpful for database search tools to detect peptides, and becomes crucial when label‐free MS is used to discover in vivo proteolytic cleavages. Since in vivo cleavages are inferred by subtracting digestion‐induced cleavages from all observed cleavages, it is important to ensure that the specificity rule used to identify digestion‐induced cleavages are broad enough to capture even minor cleavages produced in digestion, to avoid erroneously identifying them as in vivo cleavages. In this study, we describe MS‐Proteolysis, a software tool for identifying putative sites of in vivo proteolytic cleavage using label‐free MS. The tool is used in conjunction with digestion by trypsin and three other proteases, whose specificity rules are revised and extended before inferring proteolytic cleavages. Finally, we show that comparative analysis of multiple proteases can be used to detect putative in vivo proteolytic sites on a proteome‐wide scale.  相似文献   

16.
Unconventional epitopes presented by HLA class I complexes are emerging targets for T cell targeted immunotherapies. Their identification by mass spectrometry (MS) required development of novel methods to cope with the large number of theoretical candidates. Methods to identify post-translationally spliced peptides led to a broad range of outcomes. We here investigated the impact of three common database search engines – that is, Mascot, Mascot+Percolator, and PEAKS DB – as final identification step, as well as the features of target database on the ability to correctly identify non-spliced and cis-spliced peptides. We used ground truth datasets measured by MS to benchmark methods’ performance and extended the analysis to HLA class I immunopeptidomes. PEAKS DB showed better precision and recall of cis-spliced peptides and larger number of identified peptides in HLA class I immunopeptidomes than the other search engine strategies. The better performance of PEAKS DB appears to result from better discrimination between target and decoy hits and hence a more robust FDR estimation, and seems independent to peptide and spectrum features here investigated.  相似文献   

17.
Membrane proteins are fairly refractory to digestion especially by trypsin, and less specific proteases, such as elastase and pepsin, are much more effective. However, database searching using nontryptic peptides is much less effective because of the lack of charge localization at the N and C termini and the absence of sequence specificity. We describe a method for N-terminal-specific labeling of peptides from nontryptic digestions of membrane proteins, which facilitates Mascot database searching and can be used for relative quantitation. The conditions for digestion have been optimized to obtain peptides of a suitable length for mass spectrometry (MS) fragmentation. We show the effectiveness of the method using a plasma membrane preparation from a leukemia cell line and demonstrate a large increase in the number of membrane proteins, with small extra-membranar domains being identified in comparison to previous published methods.  相似文献   

18.
利用反相高效液相色谱 (RP HPLC)和电喷雾串联质谱 (ESI MS MS)联用技术直接对模式蛋白分子 (牛血清白蛋白 ,BSA)的胰蛋白酶酶解产物进行分离和测定 .获得的一系列BSA酶解片段的一级 (MS)和二级 (MS MS)质谱数据经分析软件处理后 ,分别在不同处理和不同参数条件下 ,用 3种不同的方法通过网上蛋白质数据库进行蛋白质搜寻鉴定 .结果显示 ,3种搜寻法都能正确地鉴定该蛋白质 ,其中以利用MS数据的肽质量指纹谱搜寻法 (PMF法 )较为快捷方便 ,但鉴定结果易受数据处理和数据库搜寻鉴定时参数设置等因素的影响 ;利用未解析MS MS数据 (rawMS MSdata)的搜寻法可在较宽的搜寻参数变化范围内获得明确的鉴定结果 ;而借助从头测序 (denovosequencing)结果的序列搜寻法 (sequencequery)则显示出更高的专一性 ,利用较少酶解片段数据就能得到稳定和明确的鉴定结果 ,搜寻参数变化的影响很小 .就酶解条件、数据处理和搜寻参数设置对蛋白质鉴定结果的影响展开详细的讨论 ,为蛋白质组学研究中的数据处理和库搜寻鉴定积累了可借鉴的资料  相似文献   

19.
Orthogonal analysis of amino acid substitutions as a result of SNPs in existing proteomic datasets provides a critical foundation for the emerging field of population-based proteomics. Large-scale proteomics datasets, derived from shotgun tandem MS analysis of complex cellular protein mixtures, contain many unassigned spectra that may correspond to alternate alleles coded by SNPs. The purpose of this work was to identify tandem MS spectra in LC-MS/MS shotgun proteomics datasets that may represent coding nonsynonymous SNPs (nsSNP). To this end, we generated a tryptic peptide database created from allelic information found in NCBI's dbSNP. We searched this database with tandem MS spectra of tryptic peptides from DU4475 breast tumor cells that had been fractioned by pI in the first-dimension and reverse-phase LC in the second dimension. In all we identified 629 nsSNPs, of which 36 were of alternate SNP alleles not found in the reference NCBI or IPI protein databases. Searches for SNP-peptides carry a high risk of false positives due both to mass shifts caused by modifications and because of multiple representations of the same peptide within the genome. In this work, false positives were filtered using a novel peptide pI prediction algorithm and characterized using a decoy database developed by random substitution of similarly sized reference peptides. Secondary validation by sequencing of corresponding genomic DNA confirmed the presence of the predicted SNP in 8 of 10 SNP-peptides. This work highlights that the usefulness of interpreting unassigned spectra as polymorphisms is highly reliant on the ability to detect and filter false positives.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号