首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
We investigated the combination of weak anion exchange (WAX) fractionation and on-line reversed-phase liquid chromatography (RPLC) separation using a 12 T FTICR mass spectrometer for the detection of intact proteins from a Shewanella oneidensis MR-1 cell lysate. This work aimed at optimizing intact protein detection for profiling proteins at a level that incorporates their modification state. A total of 715 intact proteins were detected, and the combined results from the WAX fractions and the unfractionated cell lysate were aligned using LC-MS features to facilitate protein abundance measurements. Protein identifications and post-translational modifications were assigned for approximately 10% of the detected proteins by comparing intact protein mass measurements to proteins identified in peptide MS/MS analysis of an aliquot of the same fraction. Intact proteins were also detected for S. oneidensis lysates obtained from cells grown on 13C-, 15N-depleted media under aerobic and sub-oxic conditions. The strategy can be readily applied for measuring differential protein abundances and provides a platform for high-throughput selection of biologically relevant targets for further characterization.  相似文献   

2.
Recent studies have revealed a relationship between protein abundance and sampling statistics, such as sequence coverage, peptide count, and spectral count, in label-free liquid chromatography-tandem mass spectrometry (LC-MS/MS) shotgun proteomics. The use of sampling statistics offers a promising method of measuring relative protein abundance and detecting differentially expressed or coexpressed proteins. We performed a systematic analysis of various approaches to quantifying differential protein expression in eukaryotic Saccharomyces cerevisiae and prokaryotic Rhodopseudomonas palustris label-free LC-MS/MS data. First, we showed that, among three sampling statistics, the spectral count has the highest technical reproducibility, followed by the less-reproducible peptide count and relatively nonreproducible sequence coverage. Second, we used spectral count statistics to measure differential protein expression in pairwise experiments using five statistical tests: Fisher's exact test, G-test, AC test, t-test, and LPE test. Given the S. cerevisiae data set with spiked proteins as a benchmark and the false positive rate as a metric, our evaluation suggested that the Fisher's exact test, G-test, and AC test can be used when the number of replications is limited (one or two), whereas the t-test is useful with three or more replicates available. Third, we generalized the G-test to increase the sensitivity of detecting differential protein expression under multiple experimental conditions. Out of 1622 identified R. palustris proteins in the LC-MS/MS experiment, the generalized G-test detected 1119 differentially expressed proteins under six growth conditions. Finally, we studied correlated expression of these 1119 proteins by analyzing pairwise expression correlations and by delineating protein clusters according to expression patterns. Through pairwise expression correlation analysis, we demonstrated that proteins co-located in the same operon were much more strongly coexpressed than those from different operons. Combining cluster analysis with existing protein functional annotations, we identified six protein clusters with known biological significance. In summary, the proposed generalized G-test using spectral count sampling statistics is a viable methodology for robust quantification of relative protein abundance and for sensitive detection of biologically significant differential protein expression under multiple experimental conditions in label-free shotgun proteomics.  相似文献   

3.
4.
Liquid chromatography-mass spectrometry (LC-MS)-based proteomics is becoming an increasingly important tool in characterizing the abundance of proteins in biological samples of various types and across conditions. Effects of disease or drug treatments on protein abundance are of particular interest for the characterization of biological processes and the identification of biomarkers. Although state-of-the-art instrumentation is available to make high-quality measurements and commercially available software is available to process the data, the complexity of the technology and data presents challenges for bioinformaticians and statisticians. Here, we describe a pipeline for the analysis of quantitative LC-MS data. Key components of this pipeline include experimental design (sample pooling, blocking, and randomization) as well as deconvolution and alignment of mass chromatograms to generate a matrix of molecular abundance profiles. An important challenge in LC-MS-based quantitation is to be able to accurately identify and assign abundance measurements to members of protein families. To address this issue, we implement a novel statistical method for inferring the relative abundance of related members of protein families from tryptic peptide intensities. This pipeline has been used to analyze quantitative LC-MS data from multiple biomarker discovery projects. We illustrate our pipeline here with examples from two of these studies, and show that the pipeline constitutes a complete workable framework for LC-MS-based differential quantitation. Supplementary material is available at http://iec01.mie.utoronto.ca/~thodoros/Bukhman/.  相似文献   

5.
6.
Melanson JE  Avery SL  Pinto DM 《Proteomics》2006,6(16):4466-4474
Peptide dimethylation with isotopically coded formaldehydes was evaluated as a potential alternative to techniques such as the iTRAQ method for comparative proteomics. The isotopic labeling strategy and custom-designed protein quantitation software were tested using protein standards and then applied to measure proteins levels associated with Alzheimer's disease (AD). The method provided high accuracy (10% error), precision (14% RSD) and coverage (70%) when applied to the analysis of a standard solution of BSA by LC-MS/MS. The technique was then applied to measure protein abundance levels in brain tissue afflicted with AD relative to normal brain tissue. 2-D LC-MS analysis identified 548 unique proteins (p<0.05). Of these, 349 were quantified with two or more peptides that met the statistical criteria used in this study. Several classes of proteins exhibited significant changes in abundance. For example, elevated levels of antioxidant proteins and decreased levels of mitochondrial electron transport proteins were observed. The results demonstrate the utility of the labeling method for high-throughput quantitative analysis.  相似文献   

7.
Comparing a protein's concentrations across two or more treatments is the focus of many proteomics studies. A frequent source of measurements for these comparisons is a mass spectrometry (MS) analysis of a protein's peptide ions separated by liquid chromatography (LC) following its enzymatic digestion. Alas, LC-MS identification and quantification of equimolar peptides can vary significantly due to their unequal digestion, separation, and ionization. This unequal measurability of peptides, the largest source of LC-MS nuisance variation, stymies confident comparison of a protein's concentration across treatments. Our objective is to introduce a mixed-effects statistical model for comparative LC-MS proteomics studies. We describe LC-MS peptide abundance with a linear model featuring pivotal terms that account for unequal peptide LC-MS measurability. We advance fitting this model to an often incomplete LC-MS data set with REstricted Maximum Likelihood (REML) estimation, producing estimates of model goodness-of-fit, treatment effects, standard errors, confidence intervals, and protein relative concentrations. We illustrate the model with an experiment featuring a known dilution series of a filamentous ascomycete fungus Trichoderma reesei protein mixture. For 781 of the 1546 T. reesei proteins with sufficient data coverage, the fitted mixed-effects models capably described the LC-MS measurements. The LC-MS measurability terms effectively accounted for this major source of uncertainty. Ninety percent of the relative concentration estimates were within 0.5-fold of the true relative concentrations. Akin to the common ratio method, this model also produced biased estimates, albeit less biased. Bias decreased significantly, both absolutely and relative to the ratio method, as the number of observed peptides per protein increased. Mixed-effects statistical modeling offers a flexible, well-established methodology for comparative proteomics studies integrating common experimental designs with LC-MS sample processing plans. It favorably accounts for the unequal LC-MS measurability of peptides and produces informative quantitative comparisons of a protein's concentration across treatments with objective measures of uncertainties.  相似文献   

8.
Microbes are known to regulate both gene expression and protein activity through the use of post-translational modifications (PTMs). Common PTMs involved in cellular signaling and gene control include methylations, acetylations, and phosphorylations, whereas oxidations have been implicated as an indicator of stress. Shewanella oneidensis MR-1 is a Gram-negative bacterium that demonstrates both respiratory versatility and the ability to sense and adapt to diverse environmental conditions. The data set used in this study consisted of tandem mass spectra derived from midlog phase aerobic cultures of S. oneidensis either native or shocked with 1 mM chromate [Cr(VI)]. In this study, three algorithms (DBDigger, Sequest, and InsPecT) were evaluated for their ability to scrutinize shotgun proteomic data for evidence of PTMs. The use of conservative scoring filters for peptides or proteins versus creating a subdatabase first from a nonmodification search was evaluated with DBDigger. The use of higher-scoring filters for peptide identifications was found to result in optimal identifications of PTM peptides with a 2% false discovery rate (FDR) for the total data set using the DBDigger algorithm. However, the FDR climbs to unacceptably high levels when only PTM peptides are considered. Sequest was evaluated as a method for confirming PTM peptides putatively identified using DBDigger; however, there was a low identification rate ( approximately 25%) for the searched spectra. InsPecT was found to have a much lower, and thus more acceptable, FDR than DBDigger for PTM peptides. Comparisons between InsPecT and DBDigger were made with respect to both the FDR and PTM peptide identifications. As a demonstration of this approach, a number of S. oneidensis chemotaxis proteins as well as low-abundance signal transduction proteins were identified as being post-translationally modified in response to chromate challenge.  相似文献   

9.
There is an increasing interest in the quantitative proteomic measurement of the protein contents of substantially similar biological samples, e.g. for the analysis of cellular response to perturbations over time or for the discovery of protein biomarkers from clinical samples. Technical limitations of current proteomic platforms such as limited reproducibility and low throughput make this a challenging task. A new LC-MS-based platform is able to generate complex peptide patterns from the analysis of proteolyzed protein samples at high throughput and represents a promising approach for quantitative proteomics. A crucial component of the LC-MS approach is the accurate evaluation of the abundance of detected peptides over many samples and the identification of peptide features that can stratify samples with respect to their genetic, physiological, or environmental origins. We present here a new software suite, SpecArray, that generates a peptide versus sample array from a set of LC-MS data. A peptide array stores the relative abundance of thousands of peptide features in many samples and is in a format identical to that of a gene expression microarray. A peptide array can be subjected to an unsupervised clustering analysis to stratify samples or to a discriminant analysis to identify discriminatory peptide features. We applied the SpecArray to analyze two sets of LC-MS data: one was from four repeat LC-MS analyses of the same glycopeptide sample, and another was from LC-MS analysis of serum samples of five male and five female mice. We demonstrate through these two study cases that the SpecArray software suite can serve as an effective software platform in the LC-MS approach for quantitative proteomics.  相似文献   

10.
Miniature microbial fuel cells (mini-MFCs) were used to monitor the current generated by Shewanella oneidensis DSP10 under both anaerobic and aerobic conditions when exposed to glucose as a potential electron donor. In addition to glucose, other carbon fuels including fructose, sucrose, acetate, and ascorbic acid were also tested. When the anolyte containing S. oneidensis was grown in the presence of oxygen, power densities of 270+/-10, 350+/-20, and 120+/-10 W/m(3) were recorded from the mini-MFC for glucose, fructose, and ascorbic acid electron donors, respectively, while sucrose and acetate produced no response. The power produced from glucose decreased considerably (相似文献   

11.
The use of liquid chromatography – mass spectrometry (LC-MS) for the characterization of proteins can provide a plethora of information related to their structure, including amino acid sequence determination and analysis of posttranslational modifications. The variety of LC-MS based applications has led to the use of LC-MS characterization of therapeutic proteins and monoclonal antibodies as an integral part of the regulatory approval process. However, the improper use of an LC-MS system, related to intrinsic instrument limitations, improper tuning parameters, or poorly optimized methods may result in the production of low quality data. Improper system performance may arise from subtle changes in operating conditions that limit the ability to detect low abundance species. To address this issue, we systematically evaluated LC-MS/MS operating parameters to identify a set of metrics that can be used in a workflow to determine if a system is suitable for its intended purpose. Development of this workflow utilized a bovine serum albumin (BSA) digest standard spiked with synthetic peptides present at 0.1% to 100% of the BSA digest peptide concentration to simulate the detection of low abundance species using a traditional bottom-up workflow and data-dependent MS2 acquisition. BSA sequence coverage, a commonly used indicator for instrument performance did not effectively identify settings that led to limited dynamic range or poorer absolute mass accuracy on 2 separate LC-MS systems. Additional metrics focusing on the detection limit and sensitivity for peptide identification were determined to be necessary to establish system suitability for protein therapeutic characterization by LC-MS.  相似文献   

12.
《MABS-AUSTIN》2013,5(6):1104-1117
The use of liquid chromatography – mass spectrometry (LC-MS) for the characterization of proteins can provide a plethora of information related to their structure, including amino acid sequence determination and analysis of posttranslational modifications. The variety of LC-MS based applications has led to the use of LC-MS characterization of therapeutic proteins and monoclonal antibodies as an integral part of the regulatory approval process. However, the improper use of an LC-MS system, related to intrinsic instrument limitations, improper tuning parameters, or poorly optimized methods may result in the production of low quality data. Improper system performance may arise from subtle changes in operating conditions that limit the ability to detect low abundance species. To address this issue, we systematically evaluated LC-MS/MS operating parameters to identify a set of metrics that can be used in a workflow to determine if a system is suitable for its intended purpose. Development of this workflow utilized a bovine serum albumin (BSA) digest standard spiked with synthetic peptides present at 0.1% to 100% of the BSA digest peptide concentration to simulate the detection of low abundance species using a traditional bottom-up workflow and data-dependent MS2 acquisition. BSA sequence coverage, a commonly used indicator for instrument performance did not effectively identify settings that led to limited dynamic range or poorer absolute mass accuracy on 2 separate LC-MS systems. Additional metrics focusing on the detection limit and sensitivity for peptide identification were determined to be necessary to establish system suitability for protein therapeutic characterization by LC-MS.  相似文献   

13.
14.
Abstract A probability-based quantification framework is presented for the calculation of relative peptide and protein abundance in label-free and label-dependent LC-MS proteomics data. The results are accompanied by credible intervals and regulation probabilities. The algorithm takes into account data uncertainties via Poisson statistics modified by a noise contribution that is determined automatically during an initial normalization stage. Protein quantification relies on assignments of component peptides to the acquired data. These assignments are generally of variable reliability and may not be present across all of the experiments comprising an analysis. It is also possible for a peptide to be identified to more than one protein in a given mixture. For these reasons the algorithm accepts a prior probability of peptide assignment for each intensity measurement. The model is constructed in such a way that outliers of any type can be automatically reweighted. Two discrete normalization methods can be employed. The first method is based on a user-defined subset of peptides, while the second method relies on the presence of a dominant background of endogenous peptides for which the concentration is assumed to be unaffected. Normalization is performed using the same computational and statistical procedures employed by the main quantification algorithm. The performance of the algorithm will be illustrated on example data sets, and its utility demonstrated for typical proteomics applications. The quantification algorithm supports relative protein quantification based on precursor and product ion intensities acquired by means of data-dependent methods, originating from all common isotopically-labeled approaches, as well as label-free ion intensity-based data-independent methods.  相似文献   

15.
16.
Shewanella oneidensis MR-1 is a gram-negative facultative aerobic bacterium living at oxic-anoxic interfaces in nature. The plasticity of terminal electron-acceptors used under anaerobic conditions is huge, but the adaptation to these different environmental conditions remains unclear. In this work, we used a proteomic approach to study the protein content when the organism is grown under anaerobic respiration conditions on insoluble ferric oxide. By analysis of two-dimensional gel patterns of soluble protein extracts, we discovered 20 differentially displayed proteins. The protein spots were further analyzed by mass spectrometry for which we used, in addition to nano-high-performance liquid chromatography coupled to an electrospray ionization-quadrupole-time of flight instrument, a recently introduced matrix-assisted laser desorption/ionization (MALDI) tandem-time of flight mass spectrometer. The instrument allows the acquisition of high quality spectra, in both the mass spectrometry and tandem mass spectrometry mode, and is therefore able to identify protein spots unambiguously. Advantageous to electrospray ionization is a minimised sample handling, inherent to MALDI ionization, and the presence of high energy fragmentation ions, generating sequence information that also can differentiate isobaric amino acids. With this strategy, we could point out a regulatory protein that is up-regulated under iron(III) respiration. This protein, the aerobic respiration control protein (ArcA), has been reported as being a regulator during anaerobiosis in other species. To our knowledge, this is the first report of the possible involvement of ArcA from S. oneidensis MR-1 in the reduction of ferric oxide.  相似文献   

17.
18.
Ectopic pregnancy (EP) and normal intrauterine pregnancy (IUP) serum proteomes were quantitatively compared to systematically identify candidate biomarkers. A 3-D biomarker discovery strategy consisting of abundant protein immunodepletion, SDS gels, LC-MS/MS, and label-free quantitation of MS signal intensities identified 70 candidate biomarkers with differences between groups greater than 2.5-fold. Further statistical analyses of peptide quantities were used to select the most promising 12 biomarkers for further study, which included known EP biomarkers, novel EP biomarkers (ADAM12 and ISM2), and five specific isoforms of the pregnancy specific beta-1-glycoprotein family. Technical replicates showed good reproducibility and protein intensities from the label-free discovery analysis compared favorably with reported abundance levels of several known reference serum proteins over at least 3 orders of magnitude. Similarly, relative abundances of candidate biomarkers from the label-free discovery analysis were consistent with relative abundances from pilot validation assays performed for five of the 12 most promising biomarkers using label-free multiple reaction monitoring of both the patient serum pools used for discovery and the individual samples that constituted these pools. These results demonstrate robust, reproducible, in-depth 3-D serum proteome discovery, and subsequent pilot-scale validation studies can be achieved readily using label-free quantitation strategies.  相似文献   

19.
LC-MS/MS has emerged as the method of choice for the identification and quantification of protein sample mixtures. For very complex samples such as complete proteomes, the most commonly used LC-MS/MS method, data-dependent acquisition (DDA) precursor selection, is of limited utility. The limited scan speed of current mass spectrometers along with the highly redundant selection of the most intense precursor ions generates a bias in the pool of identified proteins toward those of higher abundance. A directed LC-MS/MS approach that alleviates the limitations of DDA precursor ion selection by decoupling peak detection and sequencing of selected precursor ions is presented. In the first stage of the strategy, all detectable peptide ion signals are extracted from high resolution LC-MS feature maps or aligned sets of feature maps. The selected features or a subset thereof are subsequently sequenced in sequential, non-redundant directed LC-MS/MS experiments, and the MS/MS data are mapped back to the original LC-MS feature map in a fully automated manner. The strategy, implemented on an LTQ-FT MS platform, allowed the specific sequencing of 2,000 features per analysis and enabled the identification of more than 1,600 phosphorylation sites using a single reversed phase separation dimension without the need for time-consuming prefractionation steps. Compared with conventional DDA LC-MS/MS experiments, a substantially higher number of peptides could be identified from a sample, and this increase was more pronounced for low intensity precursor ions.  相似文献   

20.
Proteome comparison of cell lines derived from cancer and normal breast epithelium provide opportunities to identify differentially expressed proteins and pathways associated with specific phenotypes. We employed 16O/18O peptide labeling, FT-ICR MS, and an accurate mass and time (AMT) tag strategy to simultaneously compare the relative abundance of hundreds of proteins in non-cancer and cancer cell lines derived from breast tissue. A cell line reference panel allowed relative protein abundance comparisons among multiple cell lines and across multiple experiments. A peptide database generated from multidimensional LC separations and MS/MS analysis was used for subsequent AMT tag-based peptide identifications. This peptide database represented a total of 2299 proteins, including 514 that were quantified in five cell lines using the AMT tag and 16O/18O strategies. Eighty-six proteins showed at least a threefold protein abundance change between cancer and non-cancer cell lines. Hierarchical clustering of protein abundance ratios revealed that several groups of proteins were differentially expressed between the cancer cell lines.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号