首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
Zhang J  Xu X  Gao M  Yang P  Zhang X 《Proteomics》2007,7(4):500-512
The current "shotgun" proteomic analysis, strong cation exchange-RPLC-MS/MS system, is a widely used method for proteome research. Currently, it is not suitable for complicated protein sample analysis, like mammal tissues or cells. To increase the protein identification confidence and number, an additional separation dimension for sample fractionation is necessary to be coupled prior to current multi-dimensional protein identification technology (MudPIT). In this work, SEC was elaborately selected and applied for sample prefractionation in consideration of its non-bias against sample and variety of choice of mobile phases. The analysis of the global lysate of normal human liver tissue sample provided by the China Human Liver Proteome Project, were performed to compare the proteome coverage, sequence coverage (peptide per protein identification) and protein identification efficiency in MudPIT, 3-D LC-MS/MS identification strategy with preproteolytic and postproteolytic fractionation. It was demonstrated that 3-D LC-MS/MS utilizing protein level fractionation was the most effective method. A MASCOT search using the MS/MS results acquired by QSTAR(XL) identified 1622 proteins from 3-D LC-MS/MS identification approaches. A primary analysis on molecular weight, pI and grand average hydrophobicity value distribution of the identified proteins in different approaches was made to further evaluate the 3-D LC-MS/MS analysis strategy.  相似文献   

2.
This study describes a new protein digestion protocol in which a variety of detergents can be used to solubilize membrane proteins and facilitate trypsin digestion with higher efficiency. In this protocol, proteins are dissolved in solutions containing various detergents and directly incorporated into a polyacrylamide gel matrix without electrophoresis. Detergents are subsequently eliminated from the gel matrix while proteins are still immobilized in the gel matrix. After in-gel digestion of proteins, LC-MS/MS is used to analyze the extracted peptides for protein identification. The uniqueness of the protocol is that it allows usage of a variety of detergents in the starting solution without interfering with LC-MS/MS analysis. We hereby demonstrate that different detergents, including ionic SDS, non-ionic Triton X-100 and n-octyl beta-d-glucopyranoside, and zwitterionic CHAPS, can be used to achieve maximum solubilization of membrane proteins with minimal interference with LC-MS/MS analysis. Enhanced digestions, i.e. improved number and intensity of detected peptides, are also demonstrated for digestion-resistant proteins such as myoglobin, ubiquitin, and bacteriorhodopsin. An additional advantage of the Tube-Gel digestion protocol is that, even without electrophoresis separation, it allows high throughput analysis of complex protein mixtures when coupled with LC-MS/MS. The protocol was used to analyze a complex membrane protein mixture prepared from prostate cancer cells. The protocol involves only a single digestion and 2.5 h of LC-MS/MS analysis and identified 178 membrane proteins. In comparison, the same membrane fraction was resolved by SDS-PAGE, and 20 gel slices were excised and individually digested and analyzed by LC-MS/MS. The more elaborate effort demanded more than 50 h of LC-MS/MS analysis and identified 268 proteins. The new Tube-Gel digestion protocol is an alternative method for high throughput analysis of membrane proteins.  相似文献   

3.

Background

Quantitative proteomics holds great promise for identifying proteins that are differentially abundant between populations representing different physiological or disease states. A range of computational tools is now available for both isotopically labeled and label-free liquid chromatography mass spectrometry (LC-MS) based quantitative proteomics. However, they are generally not comparable to each other in terms of functionality, user interfaces, information input/output, and do not readily facilitate appropriate statistical data analysis. These limitations, along with the array of choices, present a daunting prospect for biologists, and other researchers not trained in bioinformatics, who wish to use LC-MS-based quantitative proteomics.

Results

We have developed Corra, a computational framework and tools for discovery-based LC-MS proteomics. Corra extends and adapts existing algorithms used for LC-MS-based proteomics, and statistical algorithms, originally developed for microarray data analyses, appropriate for LC-MS data analysis. Corra also adapts software engineering technologies (e.g. Google Web Toolkit, distributed processing) so that computationally intense data processing and statistical analyses can run on a remote server, while the user controls and manages the process from their own computer via a simple web interface. Corra also allows the user to output significantly differentially abundant LC-MS-detected peptide features in a form compatible with subsequent sequence identification via tandem mass spectrometry (MS/MS). We present two case studies to illustrate the application of Corra to commonly performed LC-MS-based biological workflows: a pilot biomarker discovery study of glycoproteins isolated from human plasma samples relevant to type 2 diabetes, and a study in yeast to identify in vivo targets of the protein kinase Ark1 via phosphopeptide profiling.

Conclusion

The Corra computational framework leverages computational innovation to enable biologists or other researchers to process, analyze and visualize LC-MS data with what would otherwise be a complex and not user-friendly suite of tools. Corra enables appropriate statistical analyses, with controlled false-discovery rates, ultimately to inform subsequent targeted identification of differentially abundant peptides by MS/MS. For the user not trained in bioinformatics, Corra represents a complete, customizable, free and open source computational platform enabling LC-MS-based proteomic workflows, and as such, addresses an unmet need in the LC-MS proteomics field.  相似文献   

4.
Reversed-phase liquid chromatography (LC) directly coupled with electrospray-tandem mass spectrometry (MS/MS) is a successful choice to obtain a large number of product ion spectra from a complex peptide mixture. We describe a search validation program, ScoreRidge, developed for analysis of LC-MS/MS data. The program validates peptide assignments to product ion spectra resulting from usual probability-based searches against primary structure databases. The validation is based only on correlation between the measured LC elution time of each peptide and the deduced elution time from the amino acid sequence assigned to product ion spectra obtained from the MS/MS analysis of the peptide. Sufficient numbers of probable assignments gave a highly correlative curve. Any peptide assignments within a certain tolerance from the correlation curve were accepted for the following arrangement step to list identified proteins. Using this data validation program, host protein candidates responsible for interaction with human hepatitis B virus core protein were identified from a partially purified protein mixture. The present simple and practical program complements protein identification from usual product ion search algorithms and reduces manual interpretation of the search result data. It will lead to more explicit protein identification from complex peptide mixtures such as whole proteome digests from tissue samples.  相似文献   

5.
Affinity purification of protein complexes followed by identification using liquid chromatography/mass spectrometry (LC-MS/MS) is a robust method to study the fundamental process of protein interaction. Although affinity isolation reduces the complexity of the sample, fractionation prior to LC-MS/MS analysis is still necessary to maximize protein coverage. In this study, we compared the protein coverage obtained via LC-MS/MS analysis of protein complexes prefractionated using two commonly employed methods, SDS-PAGE and strong cation exchange chromatography (SCX). The two complexes analyzed focused on the nuclear proteins Bmi-1 and GATA3 that were expressed within the cells at low and high levels, respectively. Prefractionation of the complexes at the peptide level using SCX consistently resulted in the identification of approximately 3-fold more proteins compared to separation at the protein level using SDS-PAGE. The increase in the number of identified proteins was especially pronounced for the Bmi-1 complex, where the target protein was expressed at a low level. The data show that prefractionation of affinity isolated protein complexes using SCX prior to LC-MS/MS analysis significantly increases the number of identified proteins and individual protein coverage, particularly for target proteins expressed at low levels.  相似文献   

6.
Viral hemorrhagic fever is a clinical syndrome that poses serious global health threat. Among the causative agents, dengue virus (DV) has the highest incidence rate and its infection is the major cause of viral hemorrhagic fever in the world. Although the pathophysiological mechanisms of DV-induced diseases are not yet understood, it is well accepted that liver is a site of viral replication. In this study, we used proteomics to analyze infection of a hepatic cell lineage, HepG2, with DV, focusing on the secreted proteins. 1D-electrophoresis and liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) were used, allowing the identification of a total of 107 proteins, among which 35 were found only in control secretome and 24 only in infected cells secretome. To validate these data, we performed 2D-eletrophoresis followed by MALDI-TOF/TOF, resulting in the identification of 20 proteins, 8 of them confirming LC-MS/MS results. We discuss the results obtained taking into account the proteins previously described in the secretome of HepG2 cells, proteins present in human plasma and proteins of interest for dengue pathogenesis. Altogether the data presented here provide clues for the progress in the understanding of the role of liver secretion in the progression of the disease.  相似文献   

7.
A "one-pot" alternative method for processing proteins and isolating peptide mixtures from bacterial samples is presented for liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis and data reduction. The conventional in-solution digestion of the protein contents of bacteria is compared to a small disposable filter unit placed inside a centrifuge vial for processing and digestion of bacterial proteins. Each processing stage allows filtration of excess reactants and unwanted byproduct while retaining the proteins. Upon addition of trypsin, the peptide mixture solution is passed through the filter while retaining the trypsin enzyme. The peptide mixture is then analyzed by LC-MS/MS with an in-house BACid algorithm for a comparison of the experimental unique peptides to a constructed proteome database of bacterial genus, specie, and strain entries. The concentration of bacteria was varied from 10 × 10(7) to 3.3 × 10(3) cfu/mL for analysis of the effect of concentration on the ability of the sample processing, LC-MS/MS, and data analysis methods to identify bacteria. The protein processing method and dilution procedure result in reliable identification of pure suspensions and mixtures at high and low bacterial concentrations.  相似文献   

8.
LC-MS/MS has emerged as the method of choice for the identification and quantification of protein sample mixtures. For very complex samples such as complete proteomes, the most commonly used LC-MS/MS method, data-dependent acquisition (DDA) precursor selection, is of limited utility. The limited scan speed of current mass spectrometers along with the highly redundant selection of the most intense precursor ions generates a bias in the pool of identified proteins toward those of higher abundance. A directed LC-MS/MS approach that alleviates the limitations of DDA precursor ion selection by decoupling peak detection and sequencing of selected precursor ions is presented. In the first stage of the strategy, all detectable peptide ion signals are extracted from high resolution LC-MS feature maps or aligned sets of feature maps. The selected features or a subset thereof are subsequently sequenced in sequential, non-redundant directed LC-MS/MS experiments, and the MS/MS data are mapped back to the original LC-MS feature map in a fully automated manner. The strategy, implemented on an LTQ-FT MS platform, allowed the specific sequencing of 2,000 features per analysis and enabled the identification of more than 1,600 phosphorylation sites using a single reversed phase separation dimension without the need for time-consuming prefractionation steps. Compared with conventional DDA LC-MS/MS experiments, a substantially higher number of peptides could be identified from a sample, and this increase was more pronounced for low intensity precursor ions.  相似文献   

9.

Background  

Mass Spectrometry coupled to Liquid Chromatography (LC-MS) is commonly used to analyze the protein content of biological samples in large scale studies. The data resulting from an LC-MS experiment is huge, highly complex and noisy. Accordingly, it has sparked new developments in Bioinformatics, especially in the fields of algorithm development, statistics and software engineering. In a quantitative label-free mass spectrometry experiment, crucial steps are the detection of peptide features in the mass spectra and the alignment of samples by correcting for shifts in retention time. At the moment, it is difficult to compare the plethora of algorithms for these tasks. So far, curated benchmark data exists only for peptide identification algorithms but no data that represents a ground truth for the evaluation of feature detection, alignment and filtering algorithms.  相似文献   

10.
The identification and validation of the targets of active compounds identified in cell-based assays is an important step in preclinical drug development. New analytical approaches that combine drug affinity pull-down assays with mass spectrometry (MS) could lead to the identification of new targets and druggable pathways. In this work, we investigate a drug-target system consisting of ampicillin- and penicillin-binding proteins (PBPs) to evaluate and compare different amino-reactive resins for the immobilization of the affinity compound and mass spectrometric methods to identify proteins from drug affinity pull-down assays. First, ampicillin was immobilized onto various amino-reactive resins, which were compared in the ampicillin-PBP model with respect to their nonspecific binding of proteins from an Escherichia coli membrane extract. Dynal M-270 magnetic beads were chosen to further study the system as a model for capturing and identifying the targets of ampicillin, PBPs that were specifically and covalently bound to the immobilized ampicillin. The PBPs were identified, after in situ digestion of proteins bound to ampicillin directly on the beads, by using either one-dimensional (1-D) or two-dimensional (2-D) liquid chromatography (LC) separation techniques followed by tandem mass spectrometry (MS/MS) analysis. Alternatively, an elution with N-lauroylsarcosine (sarcosyl) from the ampicillin beads followed by in situ digestion and 2-D LC-MS/MS analysis identified proteins potentially interacting noncovalently with the PBPs or the ampicillin. The in situ approach required only little time, resources, and sample for the analysis. The combination of drug affinity pull-down assays with in situ digestion and 2-D LC-MS/MS analysis is a useful tool in obtaining complex information about a primary drug target as well as its protein interactors.  相似文献   

11.
Genes that encode glycosylphosphatidylinositol anchored proteins (GPI-APs) constitute an estimated 1-2% of eukaryote genomes. Current computational methods for the prediction of GPI-APs are sensitive and specific; however, the analysis of the processing site (omega- or omega-site) of GPI-APs is still challenging. Only 10% of the proteins that are annotated as GPI-APs have the omega-site experimentally verified. We describe an integrated computational and experimental proteomics approach for the identification and characterization of GPI-APs that provides the means to identify GPI-APs and the derived GPI-anchored peptides in LC-MS/MS data sets. The method takes advantage of sequence features of GPI-APs and the known core structure of the GPI-anchor. The first stage of the analysis encompasses LC-MS/MS based protein identification. The second stage involves prediction of the processing sites of the identified GPI-APs and prediction of the corresponding terminal tryptic peptides. The third stage calculates possible GPI structures on the peptides from stage two. The fourth stage calculates the scores by comparing the theoretical spectra of the predicted GPI-peptides against the observed MS/MS spectra. Automated identification of C-terminal GPI-peptides from porcine membrane dipeptidase, folate receptor and CD59 in complex LC-MS/MS data sets demonstrates the sensitivity and specificity of this integrated computational and experimental approach.  相似文献   

12.
A method was developed for the liquid chromatographic-mass spectrometric (LC-MS) identification of extremely neurotoxic toxins. The method combines sample treatment in a safety containment and analysis of detoxified material in a common laboratory facility. The method was applied to the characterization of neat tetanus toxin and subsequent identification of the toxin in cell lysate supernatants and culture supernatants from different Clostridium tetani bacteria strains. Characterization of the neat toxin was accomplished by (1) accurate mass measurement of enzyme digest fragments of the toxin and (2) tandem mass spectrometric (MS/MS) amino acid sequencing of selected peptides. Accurate mass measurement proved no longer feasible for the analysis of supernatants, due to the overwhelming presence of peptides from proteins other than toxin. Even when high-molecular-weight proteins were filtered from the lysates and treated, the retained protein fraction yielded too many peptides. However, MS/MS could successfully be applied when the findings from the characterization of neat toxin were employed. Thus, LC-MS/MS of selected precursor ions from trypsin digest fragments yielded specific sequence data for identification of the toxin. This procedure provided reliable identification of the toxin at levels above 1 microg/ml and within a day. Investigations with the method developed will be extended to the botulinum neurotoxins.  相似文献   

13.
Protein identification is a key and essential step in mass spectrometry (MS) based proteome research. To date, there are many protein identification strategies that employ either MS data or MS/MS data for database searching. While MS-based methods provide wider coverage than MS/MS-based methods, their identification accuracy is lower since MS data have less information than MS/MS data. Thus, it is desired to design more sophisticated algorithms that achieve higher identification accuracy using MS data. Peptide Mass Fingerprinting (PMF) has been widely used to identify single purified proteins from MS data for many years. In this paper, we extend this technology to protein mixture identification. First, we formulate the problem of protein mixture identification as a Partial Set Covering (PSC) problem. Then, we present several algorithms that can solve the PSC problem efficiently. Finally, we extend the partial set covering model to both MS/MS data and the combination of MS data and MS/MS data. The experimental results on simulated data and real data demonstrate the advantages of our method: 1) it outperforms previous MS-based approaches significantly; 2) it is useful in the MS/MS-based protein inference; and 3) it combines MS data and MS/MS data in a unified model such that the identification performance is further improved.  相似文献   

14.
LC-MS/MS has demonstrated potential for detecting plant pathogens. Unlike PCR or ELISA, LC-MS/MS does not require pathogen-specific reagents for the detection of pathogen-specific proteins and peptides. However, the MS/MS approach we and others have explored does require a protein sequence reference database and database-search software to interpret tandem mass spectra. To evaluate the limitations of database composition on pathogen identification, we analyzed proteins from cultured Ustilago maydis, Phytophthora sojae, Fusarium graminearum, and Rhizoctonia solani by LC-MS/MS. When the search database did not contain sequences for a target pathogen, or contained sequences to related pathogens, target pathogen spectra were reliably matched to protein sequences from nontarget organisms, giving an illusion that proteins from nontarget organisms were identified. Our analysis demonstrates that when database-search software is used as part of the identification process, a paradox exists whereby additional sequences needed to detect a wide variety of possible organisms may lead to more cross-species protein matches and misidentification of pathogens.  相似文献   

15.
SELDI-TOF MS has been demonstrated as a powerful tool for biomarker discovery. However, a major disadvantage of SELDI-TOF MS is the lack of direct identification of the discriminatory peaks discovered. We describe a novel experimental identification strategy where peptides/proteins captured to a weak cation exchange ProteinArray surface (CM10) are eluted, and thereafter identified by utilizing a sensitive LC-MS/MS (i.e. LTQ Orbitrap). A mixture of four known proteins was used to test the novel experimental approach described, and all four proteins were successfully identified. Additionally, a biomarker candidate previously discovered in plasma of Atlantic cod (Gadus morhua) by SELDI-TOF MS was identified. Thus, this study indicated that a combination of on-chip elution and a highly sensitive LC-MS/MS system can be an alternative approach to identify biomarker candidates discovered by use of SELDI-TOF MS.  相似文献   

16.
Protein phosphorylation is a key post-translational modification that governs biological processes. Despite the fact that a number of analytical strategies have been exploited for the characterization of protein phosphorylation, the identification of protein phosphorylation sites is still challenging. We proposed here an alternative approach to mine phosphopeptide signals generated from a mixture of proteins when liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis is involved. The approach combined dephosphorylation reaction, accurate mass measurements from a quadrupole/time-of-flight mass spectrometer, and a computing algorithm to differentiate possible phosphopeptide signals obtained from the LC-MS analyses by taking advantage of the mass shift generated by alkaline phosphatase treatment. The retention times and m/z values of these selected LC-MS signals were used to facilitate subsequent LC-MS/MS experiments for phosphorylation site determination. Unlike commonly used neutral loss scan experiments for phosphopeptide detection, this strategy may not bias against tyrosine-phosphorylated peptides. We have demonstrated the applicability of this strategy to sequence more, in comparison with conventional data-dependent LC-MS/MS experiments, phosphopeptides in a mixture of alpha- and beta-caseins. The analytical scheme was applied to characterize the nasopharyngeal carcinoma (NPC) cellular phosphoproteome and yielded 221 distinct phosphorylation sites. Our data presented in this paper demonstrated the merits of computation in mining phosphopeptide signals from a complex mass spectrometric data set.  相似文献   

17.
Assembling peptides identified from LC-MS/MS spectra into a list of proteins is a critical step in analyzing shotgun proteomics data. As one peptide sequence can be mapped to multiple proteins in a database, na?ve protein assembly can substantially overstate the number of proteins found in samples. We model the peptide-protein relationships in a bipartite graph and use efficient graph algorithms to identify protein clusters with shared peptides and to derive the minimal list of proteins. We test the effects of this parsimony analysis approach using MS/MS data sets generated from a defined human protein mixture, a yeast whole cell extract, and a human serum proteome after MARS column depletion. The results demonstrate that the bipartite parsimony technique not only simplifies protein lists but also improves the accuracy of protein identification. We use bipartite graphs for the visualization of the protein assembly results to render the parsimony analysis process transparent to users. Our approach also groups functionally related proteins together and improves the comprehensibility of the results. We have implemented the tool in the IDPicker package. The source code and binaries for this protein assembly pipeline are available under Mozilla Public License at the following URL: http://www.mc.vanderbilt.edu/msrc/bioinformatics/.  相似文献   

18.
We describe an integrated suite of algorithms and software for general accurate mass and time (AMT) tagging data analysis of mass spectrometry data. The AMT approach combines identifications from liquid chromatography (LC) tandem mass spectrometry (MS/MS) data with peptide accurate mass and retention time locations from high-resolution LC-MS data. Our workflow includes the traditional AMT approach, in which MS/MS identifications are located in external databases, as well as methods based on more recent hybrid instruments such as the LTQ-FT or Orbitrap, where MS/MS identifications are embedded with the MS data. We demonstrate our AMT workflow's utility for general data synthesis by combining data from two dissimilar biospecimens. Specifically, we demonstrate its use relevant to serum biomarker discovery by identifying which peptides sequenced by MS/MS analysis of tumor tissue may also be present in the plasma of tumor-bearing and control mice. The analysis workflow, referred to as msInspect/AMT, extends and combines existing open-source platforms for LC-MS/MS (CPAS) and LC-MS (msInspect) data analysis and is available in an unrestricted open-source distribution.  相似文献   

19.
Cho H  Smalley DM  Theodorescu D  Ley K  Lee JK 《Proteomics》2007,7(20):3681-3692
LC-MS/MS with certain labeling techniques such as isotope-coded affinity tag (ICAT) enables quantitative analysis of paired protein samples. However, current identification and quantification of differentially expressed peptides (and proteins) are not reliable for large proteomics screening of complex biological samples. The number of replicates is often limited because of the high cost of experiments and the limited supply of samples. Traditionally, a simple fold change cutoff is used, which results in a high rate of false positives. Standard statistical methods such as the two-sample t-test are unreliable and severely underpowered due to high variability in LC-MS/MS data, especially when only a small number of replicates are available. Using an advanced error pooling technique, we propose a novel statistical method that can reliably identify differentially expressed proteins while maintaining a high sensitivity, particularly with a small number of replicates. The proposed method was applied both to an extensive simulation study and a proteomics comparison between microparticles (MPs) generated from platelet (platelet MPs) and MPs isolated from plasma (plasma MPs). In these studies, we show a significant improvement of our statistical analysis in the identification of proteins that are differentially expressed but not detected by other statistical methods. In particular, several important proteins - two peptides for beta-globin and three peptides for von Willebrand Factor (vWF) - were identified with very small false discovery rates (FDRs) by our method, while none was significant when other conventional methods were used. These proteins have been reported with their important roles in microparticles in human blood cells: vWF is a platelet and endothelial cell product that binds to P-selectin, GP1b, and GP IIb/IIIa, and beta-globin is one of the peptides of hemoglobin involved in the transportation of oxygen by red blood cells.  相似文献   

20.
We developed a new approach that employs a novel computer algorithm for the sensitive and high-throughput analysis of tertiary and quaternary interaction sites from chemically cross-linked proteins or multi-protein complexes. First, we directly analyze the digests of the chemically cross-linked proteins using only high-accuracy LC-MS/MS data. We analyze these data using a computer algorithm, we term X!Link, to find cross-links between two peptides. Our algorithm is rapid, taking only a few seconds to analyze approximately 5000 MS/MS spectra. We applied this algorithm to analyze cross-linked sites generated chemically using the amino specific reagent, BS3, in both cytochrome c and the mitochondrial division dynamin mutant, Dnm1G385D, which exists as a stable homodimer. From cytochrome c, a well-established test protein, we identified a total of 31 cross-links, 21 interpeptide and 10 intrapeptide cross-links, in 257 MS/MS spectra from a single LC-MS/MS data set. The high sensitivity of this technique is indicated by the fact that all 19 lysines in cytochrome c were detected as a cross-link product and 33% of all the Lys pairs within 20 A were also observed as a cross-link. Analysis of the cross-linked dimeric form of Dnm1G385D identified a total of 46 cross-links, 38 interpeptide and 8 intrapeptide cross-links, in 98 MS/MS spectra in a single LC-MS/MS data set. These results represent the most abundant cross-links identified in a single protein or protein dimer to date. Statistical analysis suggests a 1% false discovery rate after optimization of filtering parameters. Further analysis of the cross-links identified using our approach indicates that careful manual inspection is important for the correct assignment of cross-linking sites when multiple cross-linkable sites or several similar sequences exist. In summary, we have developed a sensitive MS-based approach to identify peptide-peptide cross-links that does not require isotopic labeling or comparison with non-cross-linked controls, making it faster and simpler than current methodologies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号