Found 20 similar articles; search took 15 ms.
1.
James C. Wright Chaoqin Du Qiang Feng Xun Xu Jyoti S. Choudhary Jun Wang 《Proteomics》2014,14(9):1011-1014
Protein identification by MS/MS is an important technique in proteome studies. The Open Mass Spectrometry Search Algorithm (OMSSA) is an open‐source search engine that can be used to identify MS/MS spectra acquired in these experiments. Here, we present a software tool, termed OMSSAPercolator, which interfaces OMSSA with Percolator, a post‐search machine learning method for rescoring database search results. We demonstrate that it outperforms the standard OMSSA scoring scheme and provides reliable significance measures. OMSSAPercolator is written in Java and can be readily used as a standalone tool or integrated into existing data analysis pipelines. OMSSAPercolator is freely available and can be downloaded at http://sourceforge.net/projects/omssapercolator/.
2.
Guilin Li Fawaz Ghali Andrew R. Jones Lukas Käll Shaohang Xu Ruo Zhou Zhe Ren Qiang Feng Xun Xu Jun Wang 《Proteomics》2015,15(17):2916-2920
Liquid chromatography coupled tandem mass spectrometry (LC‐MS/MS) is an important technique for detecting peptides in proteomics studies. Here, we present an open source software tool, termed IPeak, a peptide identification pipeline that combines the Percolator post‐processing algorithm with a multi‐search strategy to enhance the sensitivity of peptide identifications without compromising accuracy. IPeak provides a graphical user interface (GUI) as well as a command‐line interface; it is implemented in Java and runs on all three major operating system platforms: Windows, Linux/Unix and OS X. IPeak has been designed to work with the mzIdentML standard from the Proteomics Standards Initiative (PSI) as input and output, and has also been fully integrated into the associated mzidLibrary project, providing access to the overall pipeline as well as modules for calling Percolator on individual search engine result files. The integration thus enables IPeak (and Percolator) to be used in conjunction with any software package implementing the mzIdentML data standard. IPeak is freely available and can be downloaded under an Apache 2.0 license at https://code.google.com/p/mzidentml-lib/.
3.
Although mass spectrometers are capable of providing high mass accuracy data, assignment of the true monoisotopic precursor ion mass is complicated during data-dependent ion selection for LC-MS/MS analysis of complex mixtures. The complication arises when chromatographic peak widths for a given analyte exceed the time required to acquire a precursor ion mass spectrum. The result is that many measured monoisotopic masses are misassigned because they are calculated from a single mass spectrum with poor ion statistics, based on only a fraction of the total available ions for a given analyte. Such data in turn produce errors in automated database searches, where the precursor m/z value is one search parameter. We propose here a postacquisition approach to correct misassigned monoisotopic m/z values that involves peak detection over the entire elution profile and correction of the precursor ion monoisotopic mass. By using this approach to reprocess shotgun proteomic data, we increased peptide sequence assignments by 10% while reducing the estimated false positive ratio from 1% to 0.2%. We also show that 4% of the salvaged identifications may be accounted for by correction of mixed tandem mass spectra resulting from fragmentation of multiple peptides simultaneously, a situation which we refer to as accidental CID.
4.
Li N Wu S Zhang C Chang C Zhang J Ma J Li L Qian X Xu P Zhu Y He F 《Proteomics》2012,12(11):1720-1725
In this study, we presented a quality control tool named PepDistiller to facilitate the validation of MASCOT search results. By including the number of tryptic termini, and integrating a refined false discovery rate (FDR) calculation method, we demonstrated the improved sensitivity of peptide identifications obtained from semitryptic search results. Based on the analysis of a complex data set, approximately 7% more peptide identifications were obtained using PepDistiller than using MASCOT Percolator. Moreover, the refined method generated lower FDR estimations than the percentage of incorrect target (PIT) fixed method applied in Percolator. Using a standard data set, we further demonstrated the increased accuracy of the refined FDR estimations relative to the PIT-fixed FDR estimations. PepDistiller is fast and convenient to use, and is freely available for academic access. The software can be downloaded from http://www.bprc.ac.cn/pepdistiller.
5.
Biomphalaria glabrata is an important host in the transmission of human schistosomiasis in the Caribbean and South America. It is therefore of interest to analyse the proteome of Biomphalaria glabrata hemolymph to identify immunity-related proteins involved in the host-pathogen relationship. We used shotgun proteomic and bioinformatic analyses of the non-depleted and depleted [0.5 and 0.75% trifluoroacetic acid (TFA) depletion] hemolymph of B. glabrata (LE strain). The analysis identified 148 proteins in the hemolymph: 148 in the 0.5% TFA-depleted sample, 62 in the 0.75% TFA-depleted sample, and only 59 in non-depleted hemolymph. A number of the proteins identified in the hemolymph of this schistosomiasis snail vector are linked to immunity-related functions. This provides insights into schistosome-snail interaction.
6.
Phillip A. Wilmarth Michael A. Riviere Larry L. David 《Journal of ocular biology, diseases, and informatics》2009,2(4):223-234
Analysis of shotgun proteomics datasets requires techniques to distinguish correct peptide identifications from incorrect identifications, such as linear discriminant functions and target/decoy protein databases. We report an efficient, flexible proteomic analysis workflow pipeline that implements these techniques to control both peptide and protein false discovery rates. We demonstrate its performance by analyzing two-dimensional liquid chromatography separations of lens proteins from human, mouse, bovine, and chicken lenses. We compared the use of International Protein Index databases to UniProt databases and no-enzyme SEQUEST searches to tryptic searches. Sequences present in the International Protein Index databases allowed detection of several novel crystallins. An alternate start codon isoform of βA4 was found in human lens. The minor crystallin γN was detected for the first time in bovine and chicken lenses. Chicken γS was identified and is the first member of the γ-crystallin family observed in avian lenses.
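The target/decoy idea mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes each peptide-spectrum match (PSM) is reduced to a (score, is_decoy) pair, that higher scores are better, and that the false discovery rate is estimated with the common decoys/targets ratio. The function name target_decoy_fdr is hypothetical.

```python
def target_decoy_fdr(psms, threshold):
    """Estimate the FDR among target PSMs scoring at or above `threshold`.

    `psms` is an iterable of (score, is_decoy) pairs. This data layout and the
    decoys/targets estimator are assumptions made for illustration only.
    """
    targets = 0
    decoys = 0
    for score, is_decoy in psms:
        if score < threshold:
            continue  # PSM rejected by the score filter
        if is_decoy:
            decoys += 1
        else:
            targets += 1
    if targets == 0:
        return 0.0  # nothing accepted, so no false discoveries to report
    # Each accepted decoy hit stands in for one expected false target hit.
    return decoys / targets
```

Raising the threshold typically lowers the estimated FDR at the cost of accepting fewer identifications, which is the trade-off such workflows tune.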
7.
The in‐depth analysis of complex proteome samples requires fractionation of the sample into subsamples prior to LC‐MS/MS in shotgun proteomics experiments. We have established a 3D workflow for shotgun proteomics that relies on protein separation by 1D PAGE, gel fractionation, trypsin digestion, and peptide separation by in‐gel IEF, prior to RP‐HPLC‐MS/MS. Our results show that applying peptide IEF can significantly increase the number of proteins identified from PAGE subfractionation. This method delivers deeper proteome coverage and provides a large degree of flexibility in experimentally approaching highly complex mixtures by still relying on protein separation according to molecular weight in the first dimension.
8.
Yusuke Kawashima Naoyuki Takahashi Mamoru Satoh Tatsuya Saito Sayaka Kado Fumio Nomura Hiroyuki Matsumoto Yoshio Kodera 《Proteomics》2013,13(5):751-755
LC‐ESI/MS/MS‐based shotgun proteomics is currently the most commonly used approach for the identification and quantification of proteins in large‐scale biomarker discovery studies. In the past several years, shotgun proteomics technologies have been refined to further enhance proteome coverage. In the complex series of protocols involved in shotgun proteomics, however, loss of proteolytic peptides during the lyophilization step prior to the LC/MS/MS injection has been relatively neglected, even though hydrophobic peptides in lyophilized samples are difficult to dissolve in 0.05–0.1% TFA or formic acid, causing substantial loss of precious peptide samples. To prevent the loss of peptide samples during this step, we devised a new protocol using Invitrosol (IVS), a commercially available surfactant compatible with ESI‐MS; by dissolving the lyophilized peptides in IVS, we show improved recovery of hydrophobic peptides, leading to enhanced proteome coverage. Thus, the use of IVS in the recovery step of lyophilized peptides will help shotgun proteomics analysis by expanding proteome coverage, which would significantly promote the discovery and development of new diagnostic markers and therapeutic targets.
9.
Quentin Giai Gianetto Yohann Couté Christophe Bruley Thomas Burger 《Proteomics》2016,16(14):1955-1960
Selecting proteins with significant differential abundance is the cornerstone of many relative quantitative proteomics experiments. To do so, a trade‐off between p‐value thresholding and fold‐change thresholding can be made by means of a specific parameter, named the fudge factor and classically denoted s0. We have observed that this fudge factor is routinely diverted from its original (and statistically valid) use, leading to substantial distortion of the distribution of p‐values and jeopardizing the protein differential analysis as well as the subsequent biological conclusions. In this article, we provide a comprehensive viewpoint on this issue, as well as some guidelines to circumvent it.
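As an illustration of how a fudge factor trades p‐value thresholding against fold‐change thresholding, the sketch below computes a two‐sample statistic with s0 added to the denominator. The abstract does not give a formula; the form d = (mean_a - mean_b) / (s + s0), with s the pooled standard error of the difference, is an assumption based on the usual SAM‐style definition, and the function name moderated_statistic is hypothetical.

```python
import math
from statistics import mean


def moderated_statistic(group_a, group_b, s0=0.0):
    """Two-sample statistic with a fudge factor s0 in the denominator.

    Assumed form (not taken from the abstract): d = (mean_a - mean_b) / (s + s0),
    where s is the pooled standard error of the difference in means.
    With s0 = 0 this reduces to the ordinary equal-variance t-statistic.
    """
    n_a, n_b = len(group_a), len(group_b)
    mean_a, mean_b = mean(group_a), mean(group_b)
    # Pooled within-group sum of squares.
    ss = sum((x - mean_a) ** 2 for x in group_a)
    ss += sum((x - mean_b) ** 2 for x in group_b)
    pooled_var = ss / (n_a + n_b - 2)
    std_err = math.sqrt(pooled_var * (1.0 / n_a + 1.0 / n_b))
    # A larger s0 shrinks the statistic most for proteins with tiny variance,
    # so ranking is driven more by fold change and less by the p-value.
    # Note: raises ZeroDivisionError if std_err + s0 is exactly zero.
    return (mean_a - mean_b) / (std_err + s0)
```

Under this assumed form, the statistic no longer follows a t‐distribution once s0 is nonzero, which is one way to see why reading p‐values off it can distort their distribution.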
10.
Carvalho PC Fischer JS Xu T Cociorva D Balbuena TS Valente RH Perales J Yates JR Barbosa VC 《Proteomics》2012,12(7):944-949
The search engine processor (SEPro) is a tool for filtering, organizing, sharing, and displaying peptide spectrum matches. It employs a novel three-tier Bayesian approach that uses layers of spectrum, peptide, and protein logic to lead the data to converge to a single list of reliable protein identifications. SEPro is integrated into the PatternLab for proteomics environment, where an arsenal of tools for analyzing shotgun proteomic data is provided. By using the semi-labeled decoy approach for benchmarking, we show that SEPro significantly outperforms a commercially available competitor.
11.
12.
McQueen P Spicer V Rydzak T Sparling R Levin D Wilkins JA Krokhin O 《Proteomics》2012,12(8):1160-1169
We have developed a real-time graphic-processor-unit-based search engine capable of high-quality peptide identifications in <500 μs per spectrum. The steps of peptide/protein identification, in-silico prediction of all possible tryptic peptides from these proteins, and the prediction of their expected retention times and m/z values take less than 5 s per cycle over ~3000 MS/MS spectra. This lays the foundation for information-dependent acquisition with exclusion lists generated on-the-fly, as the instrument continues to acquire data. While a complete evaluation of the dynamic exclusion system requires the participation of instrument vendors, we conducted a series of model experiments using a whole cell tryptic digest of the bacterium Clostridium thermocellum. We ran a series of five iterative LC-MS/MS runs, adding a new exclusion list at each of four chromatographic "tripping points": the elution times of the four standard peptides spiked into the sample. Retention times of these standard peptides were also used for real-time "chromatographic calibration." The dynamic exclusion approach gave a ≈ 5% increase in confident protein identifications (for a typical 2 h LC-MS/MS run), and reduced the average number of identified peptides per protein from 4.7 to 2.9. Its application to a twofold shorter gradient gave a ≈ 17% increase in proteins identified. Further improvements are possible for instruments with better mass accuracy, by employing a more accurate retention prediction algorithm and by developing a better understanding of the possible chemical modifications and fragmentations produced during electrospray ionization.
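The in-silico prediction of tryptic peptides mentioned in this abstract can be sketched as follows. This is a plain CPU illustration, not the authors' GPU implementation; it assumes the commonly used trypsin rule (cleave C-terminal to K or R, except when the next residue is P), and the function name tryptic_peptides and its missed_cleavages parameter are hypothetical.

```python
def tryptic_peptides(sequence, missed_cleavages=0):
    """Return in-silico tryptic peptides of a protein sequence.

    Assumed rule: cleave after K or R unless the next residue is P.
    `missed_cleavages` is the maximum number of skipped sites per peptide.
    """
    # Positions where the sequence is cut, including both termini.
    sites = [0]
    for i in range(len(sequence) - 1):
        if sequence[i] in "KR" and sequence[i + 1] != "P":
            sites.append(i + 1)
    sites.append(len(sequence))

    peptides = []
    for start in range(len(sites) - 1):
        # Extend each peptide across up to `missed_cleavages` extra sites.
        for extra in range(missed_cleavages + 1):
            end = start + 1 + extra
            if end >= len(sites):
                break
            peptides.append(sequence[sites[start]:sites[end]])
    return peptides
```

For example, tryptic_peptides("AKRPGKC") yields the fully cleaved peptides AK, RPGK and C; the R followed by P is not treated as a cleavage site under the assumed rule.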
13.
Abdulqader A. Alhaider Nervana Bayoumy Evelyn Argo Abdel G. M. A. Gader David A. Stead 《Proteomics》2012,12(22):3403-3406
14.
Anna A. Lobas Dmitry S. Karpov Arthur T. Kopylov Elizaveta M. Solovyeva Mark V. Ivanov Irina Y. Ilina Vassily N. Lazarev Ksenia G. Kuznetsova Ekaterina V. Ilgisonis Victor G. Zgoda Mikhail V. Gorshkov Sergei A. Moshkovskii 《Proteomics》2016,16(14):1980-1991
Genomic and proteomic data were integrated into the proteogenomic workflow to identify coding genomic variants of the Human Embryonic Kidney 293 (HEK‐293) cell line at the proteome level. Shotgun proteome data published by Geiger et al. (2012), Chick et al. (2015), and obtained in this work for HEK‐293 were searched against the customized genomic database generated using exome data published by Lin et al. (2014). Overall, 112 unique variants were identified at the proteome level out of ~1200 coding variants annotated in the exome. Seven identified variants were shared between all three proteomic datasets considered, and 27 variants were found in any two datasets. Some of the variants found belonged to widely known genomic polymorphisms originating from the germline, while the others more likely resulted from somatic mutations. At least eight of the proteins bearing amino acid variants were annotated as cancer‐related, including the p53 tumor suppressor. In all the shotgun datasets considered, the variant peptides were less likely to be identified than the wild‐type ones, at a ratio of 1:2.5, when compared with the corresponding theoretical peptides. This can be explained by the presence of so‐called “passenger” mutations in genes that were never expressed in HEK‐293 cells. All MS data have been deposited in the ProteomeXchange with the dataset identifier PXD002613 (http://proteomecentral.proteomexchange.org/dataset/PXD002613).
15.
Paul D. Piehowski Vladislav A. Petyuk John D. Sandoval Kristin E. Burnum Gary R. Kiebel Matthew E. Monroe Gordon A. Anderson David G. Camp II Richard D. Smith 《Proteomics》2013,13(5):766-770
For bottom‐up proteomics, there is a wide variety of database‐searching algorithms in use for matching peptide sequences to tandem MS spectra. Likewise, numerous strategies are employed to produce a confident list of peptide identifications from the different search algorithm outputs. Here we introduce a grid‐search approach for determining optimal database filtering criteria in shotgun proteomics data analyses that is easily adaptable to any search. Systematic Trial and Error Parameter Selection, referred to as STEPS, utilizes user‐defined parameter ranges to test a wide array of parameter combinations to arrive at an optimal “parameter set” for data filtering, thus maximizing confident identifications. The benefits of this approach in terms of numbers of true‐positive identifications are demonstrated using datasets derived from immunoaffinity‐depleted blood serum and a bacterial cell lysate, two common proteomics sample types.
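A grid search over filtering criteria of the kind this abstract describes can be sketched as below. This is not the STEPS implementation: the two filter parameters (a minimum score and a maximum precursor mass error in ppm), the dictionary layout of each identification, the decoy-based FDR ceiling, and the function name grid_search_filters are all assumptions chosen for illustration.

```python
from itertools import product


def grid_search_filters(psms, score_cutoffs, ppm_cutoffs, max_fdr=0.01):
    """Try every (min_score, max_ppm) pair and keep the best one.

    `psms` is a list of dicts with keys "score", "ppm" and "is_decoy"
    (a hypothetical layout). "Best" here means the most target hits while the
    decoy-based FDR estimate stays at or below `max_fdr`; ties keep the first
    combination encountered. Returns None if no combination qualifies.
    """
    best = None
    for min_score, max_ppm in product(score_cutoffs, ppm_cutoffs):
        passing = [
            p for p in psms
            if p["score"] >= min_score and abs(p["ppm"]) <= max_ppm
        ]
        decoys = sum(1 for p in passing if p["is_decoy"])
        targets = len(passing) - decoys
        if targets == 0:
            continue  # this combination accepts no target identifications
        fdr = decoys / targets
        if fdr <= max_fdr and (best is None or targets > best["targets"]):
            best = {
                "min_score": min_score,
                "max_ppm": max_ppm,
                "targets": targets,
                "fdr": fdr,
            }
    return best
```

The cost grows with the product of the range sizes, so in practice the user-defined ranges bound how many combinations are evaluated.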
16.
Joanna Nynca Georg J. Arnold Thomas Fröhlich Kathrin Otte Andrzej Ciereszko 《Proteomics》2014,14(12):1569-1573
17.
Katharina Nöbauer Karin Hummel Corina Mayrhofer Maike Ahrens Francis M.C. Setyabudi Markus Schmidt‐Heydt Martin Eisenacher Ebrahim Razzazi‐Fazeli 《Proteomics》2017,17(9)
Mass spectrometric identification of proteins in species lacking validated sequence information is a major problem in veterinary science. In the present study, we used ochratoxin A producing Penicillium verrucosum to identify and quantitatively analyze proteins of an organism for which no protein information is yet available. The work presented here aimed to provide a comprehensive protein identification of P. verrucosum using shotgun proteomics. We were able to identify 3631 proteins in an “ab initio” translated database from DNA sequences of P. verrucosum. Additionally, a sequential window acquisition of all theoretical fragment‐ion spectra analysis was done to find differentially regulated proteins at two different time points of the growth curve. We compared the proteins at the beginning (day 3) and at the end of the log phase (day 12).
18.
Juraj Lenco Marek Link Vojtech Tambor Jitka Zaková Lukas Cerveny and Jiri Stulik 《Proteomics》2009,9(10):2875-2882
Proteomics has been shown to contribute significantly to the investigation of the pathogenicity of the extremely infectious bacterium Francisella tularensis. In this study, the authors employed iTRAQ quantitative proteomic analysis to monitor alterations in the proteomes of F. tularensis ssp. holarctica live vaccine strain and F. tularensis ssp. tularensis SCHU S4 associated with cultivation at different temperatures or in the stationary phase. Exploratory statistical analysis of the correlated production of the identified proteins revealed novel candidates for virulence factors that were regulated in a similar manner to the genes encoded in the Francisella Pathogenicity Island. Moreover, the assessment of the adaptation of the live vaccine strain and the SCHU S4 strain to the examined stimuli uncovered differences in their physiological responses to the stationary phase of growth.
19.
Lee J Jiang W Qiao Y Cho YI Woo MO Chin JH Kwon SW Hong SS Choi IY Koh HJ 《Proteomics》2011,11(3):455-468
To survey protein expression patterns in the reduced culm number (RCN) rice, a comparative shotgun proteomic analysis was conducted. For large-scale protein identification, multidimensional protein identification technology (MudPIT) coupled with pre-fractionation of plant shoot proteins led to the identification of 3004 non-redundant rice proteins. By statistically comparing relative amounts of 1353 reproducibly identified proteins between the RCN rice and the wild-type rice, 44 differentially expressed proteins were detected, of which 42 were increased and 2 were decreased in the RCN rice. These proteins appear to have roles in glycolysis, the tricarboxylic acid cycle, secondary metabolism, nutrient recycling, and nucleotide metabolism and repair. Consequently, we hypothesized that the RCN rice might fail to maintain sugar nutrient homeostasis. This was confirmed by the observation that the sucrose concentration was increased significantly in the RCN rice compared with the wild-type rice. Also, the RCN rice showed a hypersensitive response to exogenous sucrose treatment.
20.
The emergence of shotgun proteomics has facilitated the numerous biological discoveries made by proteomic studies. However, comprehensive proteomic analysis remains challenging and shotgun proteomics is a continually changing field. This review details the recent developments in shotgun proteomics and describes emerging technologies that will influence shotgun proteomics going forward. In addition, proteomic studies of integral membrane proteins remain challenging due to the hydrophobic nature of integral membrane proteins and their generally low abundance levels. However, many strategies have been developed for enriching, isolating and separating membrane proteins for proteomic analysis that have moved this field forward. In summary, while shotgun proteomics is a widely used and mature technology, the continued pace of improvements in mass spectrometry and proteomic technology and methods indicates that future studies will have an even greater impact on biological discovery.