首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The identification of proteins separated on two-dimensional gels is most commonly performed by trypsin digestion and subsequent matrix-assisted laser desorption ionization (MALDI) with time-of-flight (TOF). Recently, atmospheric pressure (AP) MALDI coupled to an ion trap (IT) has emerged as a convenient method to obtain tandem mass spectra (MS/MS) from samples on MALDI target plates. In the present work, we investigated the feasibility of using the two methodologies in line as a standard method for protein identification. In this setup, the high mass accuracy MALDI-TOF spectra are used to calibrate the peptide precursor masses in the lower mass accuracy AP-MALDI-IT MS/MS spectra. Several software tools were developed to automate the analysis process. Two sets of MALDI samples, consisting of 142 and 421 gel spots, respectively, were analyzed in a highly automated manner. In the first set, the protein identification rate increased from 61% for MALDI-TOF only to 85% for MALDI-TOF combined with AP-MALDI-IT. In the second data set the increase in protein identification rate was from 44% to 58%. AP-MALDI-IT MS/MS spectra were in general less effective than the MALDI-TOF spectra for protein identification, but the combination of the two methods clearly enhanced the confidence in protein identification.  相似文献   

2.
Protein identification using 2D-LC-MS/MS   总被引:3,自引:0,他引:3  
Multidimensional liquid chromatography techniques have been coupled to tandem mass spectrometry to provide a robust method to identify proteins in complex mixtures. Data acquisition is interfaced directly with search algorithms for identification through cross-correlation with databases. This review describes the most recent advances in methodologies for protein identification by mass spectrometry and describes the limitations of the application of the technologies.  相似文献   

3.

Background  

The observed molecular weight of a protein on a 1D polyacrylamide gel can provide meaningful insight into its biological function. Differences between a protein's observed molecular weight and that predicted by its full length amino acid sequence can be the result of different types of post-translational events, such as alternative splicing (AS), endoproteolytic processing (EPP), and post-translational modifications (PTMs). The characterization of these events is one of the important goals of total proteome profiling (TPP). LC/MS/MS has emerged as one of the primary tools for TPP, but since this method identifies tryptic fragments of proteins, it has not generally been used for large-scale determination of the molecular weight of intact proteins in complex mixtures.  相似文献   

4.
It is an established fact that allelic variation and post-translational modifications create different variants of proteins, which are observed as isoelectric and size subspecies in two-dimensional gel based proteomics. Here we explore the stromal proteome of spinach and Arabidopsis chloroplast and show that clustering of mass spectra is a useful tool for investigating such variants and detecting modified peptides with amino acid substitutions or post-translational modifications. This study employs data mining by hierarchical clustering of MALDI-MS spectra, using the web version of the SPECLUST program (http://bioinfo.thep.lu.se/speclust.html). The tool can also be used to remove peaks of contaminating proteins and to improve protein identification, especially for species without a fully sequenced genome. Mutually exclusive peptide peaks within a cluster provide a good starting point for MS/MS investigation of modified peptides, here exemplified by the identification of an A to E substitution that accounts for the isoelectric heterogeneity in protein isoforms.  相似文献   

5.
In proteomics, tandem mass spectrometry is the key technology for peptide sequencing. However, partially due to the deficiency of peptide identification software, a large portion of the tandem mass spectra are discarded in almost all proteomics centers because they are not interpretable. The problem is more acute with the lower quality data from low end but more popular devices such as the ion trap instruments. In order to deal with the noisy and low quality data, this paper develops a systematic machine learning approach to construct a robust linear scoring function, whose coefficients are determined by a linear programming. A prototype, PRIMA, was implemented. When tested with large benchmarks of varying qualities, PRIMA consistently has higher accuracy than commonly used software MASCOT, SEQUEST and X! Tandem.  相似文献   

6.
Zhang N  Chen R  Young N  Wishart D  Winter P  Weiner JH  Li L 《Proteomics》2007,7(4):484-493
Both organic solvent and surfactant have been used for dissolving membrane proteins for shotgun proteomics. In this work, two methods of protein solubilization, namely using 60% methanol or 1% SDS, to dissolve and analyze the inner membrane fraction of an Escherichia coli K12 cell lysate were compared. A total of 358 proteins (1417 unique peptides) from the methanol-solubilized protein mixture and 299 proteins (892 peptides) from the SDS-solubilized sample-were identified by using trypsin digestion and 2-D LC-ESI MS/MS. It was found that the methanol method detected more hydrophobic peptides, resulting in a greater number of proteins identified, than the SDS method. We found that 159 out of 358 proteins (44%) and 120 out of 299 proteins (40%) detected from the methanol- and SDS-solubilized samples, respectively, are integral membrane proteins. Among the 190 integral membrane proteins 70 were identified exclusively in the methanol-solubilized sample, 89 were identified by both methods, and only 31 proteins were exclusively identified by the SDS method. It is shown that the integral membrane proteins reflected the theoretical proteome for number of transmembrane helices, length, functional class, and topology, indicating there was no bias in the proteins identified.  相似文献   

7.
Protein identification using mass spectrometry is an indispensable computational tool in the life sciences. A dramatic increase in the use of proteomic strategies to understand the biology of living systems generates an ongoing need for more effective, efficient, and accurate computational methods for protein identification. A wide range of computational methods, each with various implementations, are available to complement different proteomic approaches. A solid knowledge of the range of algorithms available and, more critically, the accuracy and effectiveness of these techniques is essential to ensure as many of the proteins as possible, within any particular experiment, are correctly identified. Here, we undertake a systematic review of the currently available methods and algorithms for interpreting, managing, and analyzing biological data associated with protein identification. We summarize the advances in computational solutions as they have responded to corresponding advances in mass spectrometry hardware. The evolution of scoring algorithms and metrics for automated protein identification are also discussed with a focus on the relative performance of different techniques. We also consider the relative advantages and limitations of different techniques in particular biological contexts. Finally, we present our perspective on future developments in the area of computational protein identification by considering the most recent literature on new and promising approaches to the problem as well as identifying areas yet to be explored and the potential application of methods from other areas of computational biology.  相似文献   

8.
The use of nLC-ESI-MS/MS in shotgun proteomics experiments and GeLC-MS/MS analysis is well accepted and routinely available in most proteomics laboratories. However, the same cannot be said for nLC-MALDI MS/MS, which has yet to experience such widespread acceptance, despite the fact that the MALDI technology offers several critical advantages over ESI. As an illustration, in an analysis of moderately complex sample of E. coli proteins, the use MALDI in addition to ESI in GeLC-MS/MS resulted in a 16% average increase in protein identifications, while with more complex samples the number of additional protein identifications increased by an average of 45%. The size of the unique peptides identified by MALDI was, on average, 25% larger than the unique peptides identified by ESI, and they were found to be slightly more hydrophilic. The insensitivity of MALDI to the presence of ionization suppression agents was shown to be a significant advantage, suggesting it be used as a complement to ESI when ion suppression is a possibility. Furthermore, the higher resolution of the TOF/TOF instrument improved the sensitivity, accuracy, and precision of the data over that obtained using only ESI-based iTRAQ experiments using a linear ion trap. Nevertheless, accurate data can be generated with either instrument. These results demonstrate that coupling nanoLC with both ESI and MALDI ionization interfaces improves proteome coverage, reduces the deleterious effects of ionization suppression agents, and improves quantitation, particularly in complex samples.  相似文献   

9.
To understand physiological processes, insight into protein complexes is very important. Through a combination of blue native gel electrophoresis and LC-MS/MS, we were able to isolate protein complexes and identify their potential subunits from Nicotiana tabacum cv. Bright Yellow-2. For this purpose, a bioanalytical approach was used that works without a priori knowledge of the interacting proteins. Different clustering methods (e.g., k-means and hierarchical clustering) and a biclustering approach were evaluated according to their ability to group proteins by their migration profile and to correlate the proteins to a specific complex. The biclustering approach was identified as a very powerful tool for the exploration of protein complexes of whole cell lysates since it allows for the promiscuous nature of proteins. Furthermore, it searches for associations between proteins that co-occur frequently throughout the BN gel, which increases the confidence of the putative associations between co-migrating proteins. The statistical significance and biological relevance of the profile clusters were verified using functional gene ontology annotation. The proof of concept for identifying protein complexes by our BN PAGE/LC-MS/MS approach is provided through the analysis of known protein complexes. Both well characterized long-lived protein complexes as well as potential temporary sequential multi-enzyme complexes were characterized.  相似文献   

10.
We derive the optimal number of peaks (defined as the minimum number that provides the required efficiency of spectra identification) in the theoretical spectra as a function of (i) the experimental accuracy, sigma, of the measured ratio m/z; (ii) experimental spectrum density; (iii) size of the database; (iv) number of peaks in the theoretical spectra; and (v) types of ions that the peaks represent. We show that if theoretical spectra are constructed including b and y ions alone, then for sigma = 0.5, which is typical for high-throughput data, peptide chains of eight amino acids or longer can be identified based on the positions of peaks alone, at a rate of false identification below 1%. To discriminate between shorter peptides, additional (e.g., intensity-inferred) information is necessary. We derive the dependence of the probability of false identification on the number of peaks in the theoretical spectra and on the types of ions that the peaks represent. Our results suggest that the class of mass spectrum identification problems, for which more elaborate development of fragmentation rules (such as intensity model) is required, can be reduced to the problems that involve homologous peptides.  相似文献   

11.
The subject of this tutorial is protein identification and characterisation by database searching of MS/MS Data. Peptide Mass Fingerprinting is excluded because it is covered in a separate tutorial. Practical aspects of database searching are emphasised, such as choice of sequence database, effect of mass tolerance, and how to identify post-translational modifications. The relationship between sensitivity and specificity is discussed, as is the challenge of using peptide match information to infer which proteins were present in the sample. Since these tutorials are introductory in nature, most references are to reviews, rather than primary research papers. Some familiarity with mass spectrometry and protein chemistry is assumed. There is an accompanying slide presentation, including speaker notes, and a collection of web-based, practical exercises, designed to reinforce key points. This Tutorial is part of the International Proteomics Tutorial Programme (IPTP 6).  相似文献   

12.
A notable inefficiency of shotgun proteomics experiments is the repeated rediscovery of the same identifiable peptides by sequence database searching methods, which often are time-consuming and error-prone. A more precise and efficient method, in which previously observed and identified peptide MS/MS spectra are catalogued and condensed into searchable spectral libraries to allow new identifications by spectral matching, is seen as a promising alternative. To that end, an open-source, functionally complete, high-throughput and readily extensible MS/MS spectral searching tool, SpectraST, was developed. A high-quality spectral library was constructed by combining the high-confidence identifications of millions of spectra taken from various data repositories and searched using four sequence search engines. The resulting library consists of over 30,000 spectra for Saccharomyces cerevisiae. Using this library, SpectraST vastly outperforms the sequence search engine SEQUEST in terms of speed and the ability to discriminate good and bad hits. A unique advantage of SpectraST is its full integration into the popular Trans Proteomic Pipeline suite of software, which facilitates user adoption and provides important functionalities such as peptide and protein probability assignment, quantification, and data visualization. This method of spectral library searching is especially suited for targeted proteomics applications, offering superior performance to traditional sequence searching.  相似文献   

13.

Background

Bloodstream infections are responsible for thousands of deaths each year. The rapid identification of the microorganisms causing these infections permits correct therapeutic management that will improve the prognosis of the patient. In an attempt to reduce the time spent on this step, microorganism identification devices have been developed, including the VITEK® 2 system, which is currently used in routine clinical microbiology laboratories.

Methods

This study evaluated the accuracy of the VITEK® 2 system in the identification of 400 microorganisms isolated from blood cultures and compared the results to those obtained with conventional phenotypic and genotypic methods. In parallel to the phenotypic identification methods, the DNA of these microorganisms was extracted directly from the blood culture bottles for genotypic identification by the polymerase chain reaction (PCR) and DNA sequencing.

Results

The automated VITEK® 2 system correctly identified 94.7 % (379/400) of the isolates. The YST and GN cards resulted in 100 % correct identifications of yeasts (15/15) and Gram-negative bacilli (165/165), respectively. The GP card correctly identified 92.6 % (199/215) of Gram-positive cocci, while the ANC card was unable to correctly identify any Gram-positive bacilli (0/5).

Conclusions

The performance of the VITEK® 2 system was considered acceptable and statistical analysis showed that the system is a suitable option for routine clinical microbiology laboratories to identify different microorganisms.
  相似文献   

14.
Pinus radiata is one of the most economically important forest tree species, with a worldwide production of around 370 million m (3) of wood per year. Current selection of elite trees to be used in conservation and breeding programes requires the physiological and molecular characterization of available populations. To identify key proteins related to tree growth, productivity and responses to environmental factors, a proteomic approach is being utilized. In this paper, we present the first report of the 2-DE protein reference map of physiologically mature P. radiata needles, as a basis for subsequent differential expression proteomic studies related to growth, development, biomass production and responses to stresses. After TCA/acetone protein extraction of needle tissue, 549 +/- 21 well-resolved spots were detected in Coommassie-stained gels within the 5-8 pH and 10-100 kDa M(r) ranges. The analytical and biological variance determined for 450 spots were of 31 and 42%, respectively. After LC/MS/MS analysis of in-gel tryptic digested spots, proteins were identified by using the novel Paragon algorithm that tolerates amino acid substitution in the first-pass search. It allowed the confident identification of 115 out of the 150 protein spots subjected to MS, quite unusual high percentage for a poor sequence database, as is the case of P. radiata. Proteins were classified into 12 or 18 groups based on their corresponding cell component or biological process/pathway categories, respectively. Carbohydrate metabolism and photosynthetic enzymes predominate in the 2-DE protein profile of P. radiata needles.  相似文献   

15.
16.
Peptide detectability is defined as the probability that a peptide is identified in an LC-MS/MS experiment and has been useful in providing solutions to protein inference and label-free quantification. Previously, predictors for peptide detectability trained on standard or complex samples were proposed. Although the models trained on complex samples may benefit from the large training data sets, it is unclear to what extent they are affected by the unequal abundances of identified proteins. To address this challenge and improve detectability prediction, we present a new algorithm for the iterative learning of peptide detectability from complex mixtures. We provide evidence that the new method approximates detectability with useful accuracy and, based on its design, can be used to interpret the outcome of other learning strategies. We studied the properties of peptides from the bacterium Deinococcus radiodurans and found that at standard quantities, its tryptic peptides can be roughly classified as either detectable or undetectable, with a relatively small fraction having medium detectability. We extend the concept of detectability from peptides to proteins and apply the model to predict the behavior of a replicate LC-MS/MS experiment from a single analysis. Finally, our study summarizes a theoretical framework for peptide/protein identification and label-free quantification.  相似文献   

17.
Duan J  Liang Z  Yang C  Zhang J  Zhang L  Zhang W  Zhang Y 《Proteomics》2006,6(2):412-419
A monolithic enzymatic microreactor was prepared in a fused-silica capillary by in situ polymerization of acrylamide, glycidyl methacrylate (GMA) and ethylene dimethacrylate (EDMA) in the presence of a binary porogenic mixture of dodecanol and cyclohexanol, followed by ammonia solution treatment, glutaraldehyde activation and trypsin modification. The choice of acrylamide as co-monomer was found useful to improve the efficiency of trypsin modification, thus, to increase the enzyme activity. The optimized microreactor offered very low back pressure, enabling the fast digestion of proteins flowing through the reactor. The performance of the monolithic microreactor was demonstrated with the digestion of cytochrome c at high flow rate. The digests were then characterized by CE and HPLC-MS/MS with the sequence coverage of 57.7%. The digestion efficiency was found over 230 times as high as that of the conventional method. In addition, for the first time, protein digestion carried out in a mixture of water and ACN was compared with the conventional aqueous reaction using MS/MS detection, and the former solution was found more compatible and more efficient for protein digestion.  相似文献   

18.
Enrichment is essential for phosphoproteome analysis because phosphorylated proteins are usually present in cells in low abundance. Recently, titanium dioxide (TiO2) has been demonstrated to enrich phosphopeptides from simple peptide mixtures with high specificity; however, the technology has not been optimized. In the present study, significant non-specific bindings were observed when proteome samples were applied to TiO2 columns. Column wash with an NH4Glu solution after loading peptide mixtures significantly increased the efficiency of TiO2 phosphopeptide enrichment with a recovery of up to 84%. Also, for proteome samples, more than a 2-fold increase in unique phosphopeptide identifications has been achieved. The use of NH4Glu for a TiO2 column wash does not significantly reduce the phosphopeptide recovery. A total of 858 phosphopeptides corresponding to 1034 distinct phosphosites has been identified from HeLa cells using the improved TiO2 enrichment procedure in combination with data-dependent neutral loss nano-RPLC-MS2-MS3 analysis. While 41 and 35% of the phosphopeptides were identified only by MS2 and MS3, respectively, 24% was identified by both MS2 and MS3. Cross-validation of the phosphopeptide assignment by MS2 and MS3 scans resulted in the highest confidence in identification (99.5%). Many phosphosites identified in this study appear to be novel, including sites from antigen Ki-67, nucleolar phosphoprotein p130, and Treacle protein. The study also indicates that evaluation of confidence levels for phosphopeptide identification via the reversed sequence database searching strategy might underestimate the false positive rate.  相似文献   

19.
Here we have addressed common issues of resolution in two-dimensional polyacrylamide gel electrophoresis (2DE) experiments including proteins 'stacked' at pH extremes, unresolved peptides migrating at the front of separation, and areas of the 2D gel obscured by high abundance proteins. Postfractionation, by selective application of well-established electrophoretic separations immediately following standard 2DE, yields markedly improved resolution in these traditional problem areas using no more specialized equipment or techniques than SDS-PAGE itself.  相似文献   

20.
Tandem mass spectrometry-based proteomics experiments produce large amounts of raw data, and different database search engines are needed to reliably identify all the proteins from this data. Here, we present Compid, an easy-to-use software tool that can be used to integrate and compare protein identification results from two search engines, Mascot and Paragon. Additionally, Compid enables extraction of information from large Mascot result files that cannot be opened via the Web interface and calculation of general statistical information about peptide and protein identifications in a data set. To demonstrate the usefulness of this tool, we used Compid to compare Mascot and Paragon database search results for mitochondrial proteome sample of human keratinocytes. The reports generated by Compid can be exported and opened as Excel documents or as text files using configurable delimiters, allowing the analysis and further processing of Compid output with a multitude of programs. Compid is freely available and can be downloaded from http://users.utu.fi/lanatr/compid. It is released under an open source license (GPL), enabling modification of the source code. Its modular architecture allows for creation of supplementary software components e.g. to enable support for additional input formats and report categories.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号