首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
Knowledge of elaborate structures of protein complexes is fundamental for understanding their functions and regulations. Although cross-linking coupled with mass spectrometry (MS) has been presented as a feasible strategy for structural elucidation of large multisubunit protein complexes, this method has proven challenging because of technical difficulties in unambiguous identification of cross-linked peptides and determination of cross-linked sites by MS analysis. In this work, we developed a novel cross-linking strategy using a newly designed MS-cleavable cross-linker, disuccinimidyl sulfoxide (DSSO). DSSO contains two symmetric collision-induced dissociation (CID)-cleavable sites that allow effective identification of DSSO-cross-linked peptides based on their distinct fragmentation patterns unique to cross-linking types (i.e. interlink, intralink, and dead end). The CID-induced separation of interlinked peptides in MS/MS permits MS3 analysis of single peptide chain fragment ions with defined modifications (due to DSSO remnants) for easy interpretation and unambiguous identification using existing database searching tools. Integration of data analyses from three generated data sets (MS, MS/MS, and MS3) allows high confidence identification of DSSO cross-linked peptides. The efficacy of the newly developed DSSO-based cross-linking strategy was demonstrated using model peptides and proteins. In addition, this method was successfully used for structural characterization of the yeast 20 S proteasome complex. In total, 13 non-redundant interlinked peptides of the 20 S proteasome were identified, representing the first application of an MS-cleavable cross-linker for the characterization of a multisubunit protein complex. Given its effectiveness and simplicity, this cross-linking strategy can find a broad range of applications in elucidating the structural topology of proteins and protein complexes.Proteins form stable and dynamic multisubunit complexes under different physiological conditions to maintain cell viability and normal cell homeostasis. Detailed knowledge of protein interactions and protein complex structures is fundamental to understanding how individual proteins function within a complex and how the complex functions as a whole. However, structural elucidation of large multisubunit protein complexes has been difficult because of a lack of technologies that can effectively handle their dynamic and heterogeneous nature. Traditional methods such as nuclear magnetic resonance (NMR) analysis and x-ray crystallography can yield detailed information on protein structures; however, NMR spectroscopy requires large quantities of pure protein in a specific solvent, whereas x-ray crystallography is often limited by the crystallization process.In recent years, chemical cross-linking coupled with mass spectrometry (MS) has become a powerful method for studying protein interactions (13). Chemical cross-linking stabilizes protein interactions through the formation of covalent bonds and allows the detection of stable, weak, and/or transient protein-protein interactions in native cells or tissues (49). In addition to capturing protein interacting partners, many studies have shown that chemical cross-linking can yield low resolution structural information about the constraints within a molecule (2, 3, 10) or protein complex (1113). The application of chemical cross-linking, enzymatic digestion, and subsequent mass spectrometric and computational analyses for the elucidation of three-dimensional protein structures offers distinct advantages over traditional methods because of its speed, sensitivity, and versatility. Identification of cross-linked peptides provides distance constraints that aid in constructing the structural topology of proteins and/or protein complexes. Although this approach has been successful, effective detection and accurate identification of cross-linked peptides as well as unambiguous assignment of cross-linked sites remain extremely challenging due to their low abundance and complicated fragmentation behavior in MS analysis (2, 3, 10, 14). Therefore, new reagents and methods are urgently needed to allow unambiguous identification of cross-linked products and to improve the speed and accuracy of data analysis to facilitate its application in structural elucidation of large protein complexes.A number of approaches have been developed to facilitate MS detection of low abundance cross-linked peptides from complex mixtures. These include selective enrichment using affinity purification with biotinylated cross-linkers (1517) and click chemistry with alkyne-tagged (18) or azide-tagged (19, 20) cross-linkers. In addition, Staudinger ligation has recently been shown to be effective for selective enrichment of azide-tagged cross-linked peptides (21). Apart from enrichment, detection of cross-linked peptides can be achieved by isotope-labeled (2224), fluorescently labeled (25), and mass tag-labeled cross-linking reagents (16, 26). These methods can identify cross-linked peptides with MS analysis, but interpretation of the data generated from interlinked peptides (two peptides connected with the cross-link) by automated database searching remains difficult. Several bioinformatics tools have thus been developed to interpret MS/MS data and determine interlinked peptide sequences from complex mixtures (12, 14, 2732). Although promising, further developments are still needed to make such data analyses as robust and reliable as analyzing MS/MS data of single peptide sequences using existing database searching tools (e.g. Protein Prospector, Mascot, or SEQUEST).Various types of cleavable cross-linkers with distinct chemical properties have been developed to facilitate MS identification and characterization of cross-linked peptides. These include UV photocleavable (33), chemical cleavable (19), isotopically coded cleavable (24), and MS-cleavable reagents (16, 26, 3438). MS-cleavable cross-linkers have received considerable attention because the resulting cross-linked products can be identified based on their characteristic fragmentation behavior observed during MS analysis. Gas-phase cleavage sites result in the detection of a “reporter” ion (26), single peptide chain fragment ions (3538), or both reporter and fragment ions (16, 34). In each case, further structural characterization of the peptide product ions generated during the cleavage reaction can be accomplished by subsequent MSn1 analysis. Among these linkers, the “fixed charge” sulfonium ion-containing cross-linker developed by Lu et al. (37) appears to be the most attractive as it allows specific and selective fragmentation of cross-linked peptides regardless of their charge and amino acid composition based on their studies with model peptides.Despite the availability of multiple types of cleavable cross-linkers, most of the applications have been limited to the study of model peptides and single proteins. Additionally, complicated synthesis and fragmentation patterns have impeded most of the known MS-cleavable cross-linkers from wide adaptation by the community. Here we describe the design and characterization of a novel and simple MS-cleavable cross-linker, DSSO, and its application to model peptides and proteins and the yeast 20 S proteasome complex. In combination with new software developed for data integration, we were able to identify DSSO-cross-linked peptides from complex peptide mixtures with speed and accuracy. Given its effectiveness and simplicity, we anticipate a broader application of this MS-cleavable cross-linker in the study of structural topology of other protein complexes using cross-linking and mass spectrometry.  相似文献   

2.
A complete understanding of the biological functions of large signaling peptides (>4 kDa) requires comprehensive characterization of their amino acid sequences and post-translational modifications, which presents significant analytical challenges. In the past decade, there has been great success with mass spectrometry-based de novo sequencing of small neuropeptides. However, these approaches are less applicable to larger neuropeptides because of the inefficient fragmentation of peptides larger than 4 kDa and their lower endogenous abundance. The conventional proteomics approach focuses on large-scale determination of protein identities via database searching, lacking the ability for in-depth elucidation of individual amino acid residues. Here, we present a multifaceted MS approach for identification and characterization of large crustacean hyperglycemic hormone (CHH)-family neuropeptides, a class of peptide hormones that play central roles in the regulation of many important physiological processes of crustaceans. Six crustacean CHH-family neuropeptides (8–9.5 kDa), including two novel peptides with extensive disulfide linkages and PTMs, were fully sequenced without reference to genomic databases. High-definition de novo sequencing was achieved by a combination of bottom-up, off-line top-down, and on-line top-down tandem MS methods. Statistical evaluation indicated that these methods provided complementary information for sequence interpretation and increased the local identification confidence of each amino acid. Further investigations by MALDI imaging MS mapped the spatial distribution and colocalization patterns of various CHH-family neuropeptides in the neuroendocrine organs, revealing that two CHH-subfamilies are involved in distinct signaling pathways.Neuropeptides and hormones comprise a diverse class of signaling molecules involved in numerous essential physiological processes, including analgesia, reward, food intake, learning and memory (1). Disorders of the neurosecretory and neuroendocrine systems influence many pathological processes. For example, obesity results from failure of energy homeostasis in association with endocrine alterations (2, 3). Previous work from our lab used crustaceans as model organisms found that multiple neuropeptides were implicated in control of food intake, including RFamides, tachykinin related peptides, RYamides, and pyrokinins (46).Crustacean hyperglycemic hormone (CHH)1 family neuropeptides play a central role in energy homeostasis of crustaceans (717). Hyperglycemic response of the CHHs was first reported after injection of crude eyestalk extract in crustaceans. Based on their preprohormone organization, the CHH family can be grouped into two sub-families: subfamily-I containing CHH, and subfamily-II containing molt-inhibiting hormone (MIH) and mandibular organ-inhibiting hormone (MOIH). The preprohormones of the subfamily-I have a CHH precursor related peptide (CPRP) that is cleaved off during processing; and preprohormones of the subfamily-II lack the CPRP (9). Uncovering their physiological functions will provide new insights into neuroendocrine regulation of energy homeostasis.Characterization of CHH-family neuropeptides is challenging. They are comprised of more than 70 amino acids and often contain multiple post-translational modifications (PTMs) and complex disulfide bridge connections (7). In addition, physiological concentrations of these peptide hormones are typically below picomolar level, and most crustacean species do not have available genome and proteome databases to assist MS-based sequencing.MS-based neuropeptidomics provides a powerful tool for rapid discovery and analysis of a large number of endogenous peptides from the brain and the central nervous system. Our group and others have greatly expanded the peptidomes of many model organisms (3, 1833). For example, we have discovered more than 200 neuropeptides with several neuropeptide families consisting of as many as 20–40 members in a simple crustacean model system (5, 6, 2531, 34). However, a majority of these neuropeptides are small peptides with 5–15 amino acid residues long, leaving a gap of identifying larger signaling peptides from organisms without sequenced genome. The observed lack of larger size peptide hormones can be attributed to the lack of effective de novo sequencing strategies for neuropeptides larger than 4 kDa, which are inherently more difficult to fragment using conventional techniques (3437). Although classical proteomics studies examine larger proteins, these tools are limited to identification based on database searching with one or more peptides matching without complete amino acid sequence coverage (36, 38).Large populations of neuropeptides from 4–10 kDa exist in the nervous systems of both vertebrates and invertebrates (9, 39, 40). Understanding their functional roles requires sufficient molecular knowledge and a unique analytical approach. Therefore, developing effective and reliable methods for de novo sequencing of large neuropeptides at the individual amino acid residue level is an urgent gap to fill in neurobiology. In this study, we present a multifaceted MS strategy aimed at high-definition de novo sequencing and comprehensive characterization of the CHH-family neuropeptides in crustacean central nervous system. The high-definition de novo sequencing was achieved by a combination of three methods: (1) enzymatic digestion and LC-tandem mass spectrometry (MS/MS) bottom-up analysis to generate detailed sequences of proteolytic peptides; (2) off-line LC fractionation and subsequent top-down MS/MS to obtain high-quality fragmentation maps of intact peptides; and (3) on-line LC coupled to top-down MS/MS to allow rapid sequence analysis of low abundance peptides. Combining the three methods overcomes the limitations of each, and thus offers complementary and high-confidence determination of amino acid residues. We report the complete sequence analysis of six CHH-family neuropeptides including the discovery of two novel peptides. With the accurate molecular information, MALDI imaging and ion mobility MS were conducted for the first time to explore their anatomical distribution and biochemical properties.  相似文献   

3.
Database search programs are essential tools for identifying peptides via mass spectrometry (MS) in shotgun proteomics. Simultaneously achieving high sensitivity and high specificity during a database search is crucial for improving proteome coverage. Here we present JUMP, a new hybrid database search program that generates amino acid tags and ranks peptide spectrum matches (PSMs) by an integrated score from the tags and pattern matching. In a typical run of liquid chromatography coupled with high-resolution tandem MS, more than 95% of MS/MS spectra can generate at least one tag, whereas the remaining spectra are usually too poor to derive genuine PSMs. To enhance search sensitivity, the JUMP program enables the use of tags as short as one amino acid. Using a target-decoy strategy, we compared JUMP with other programs (e.g. SEQUEST, Mascot, PEAKS DB, and InsPecT) in the analysis of multiple datasets and found that JUMP outperformed these preexisting programs. JUMP also permitted the analysis of multiple co-fragmented peptides from “mixture spectra” to further increase PSMs. In addition, JUMP-derived tags allowed partial de novo sequencing and facilitated the unambiguous assignment of modified residues. In summary, JUMP is an effective database search algorithm complementary to current search programs.Peptide identification by tandem mass spectra is a critical step in mass spectrometry (MS)-based1 proteomics (1). Numerous computational algorithms and software tools have been developed for this purpose (26). These algorithms can be classified into three categories: (i) pattern-based database search, (ii) de novo sequencing, and (iii) hybrid search that combines database search and de novo sequencing. With the continuous development of high-performance liquid chromatography and high-resolution mass spectrometers, it is now possible to analyze almost all protein components in mammalian cells (7). In contrast to rapid data collection, it remains a challenge to extract accurate information from the raw data to identify peptides with low false positive rates (specificity) and minimal false negatives (sensitivity) (8).Database search methods usually assign peptide sequences by comparing MS/MS spectra to theoretical peptide spectra predicted from a protein database, as exemplified in SEQUEST (9), Mascot (10), OMSSA (11), X!Tandem (12), Spectrum Mill (13), ProteinProspector (14), MyriMatch (15), Crux (16), MS-GFDB (17), Andromeda (18), BaMS2 (19), and Morpheus (20). Some other programs, such as SpectraST (21) and Pepitome (22), utilize a spectral library composed of experimentally identified and validated MS/MS spectra. These methods use a variety of scoring algorithms to rank potential peptide spectrum matches (PSMs) and select the top hit as a putative PSM. However, not all PSMs are correctly assigned. For example, false peptides may be assigned to MS/MS spectra with numerous noisy peaks and poor fragmentation patterns. If the samples contain unknown protein modifications, mutations, and contaminants, the related MS/MS spectra also result in false positives, as their corresponding peptides are not in the database. Other false positives may be generated simply by random matches. Therefore, it is of importance to remove these false PSMs to improve dataset quality. One common approach is to filter putative PSMs to achieve a final list with a predefined false discovery rate (FDR) via a target-decoy strategy, in which decoy proteins are merged with target proteins in the same database for estimating false PSMs (2326). However, the true and false PSMs are not always distinguishable based on matching scores. It is a problem to set up an appropriate score threshold to achieve maximal sensitivity and high specificity (13, 27, 28).De novo methods, including Lutefisk (29), PEAKS (30), NovoHMM (31), PepNovo (32), pNovo (33), Vonovo (34), and UniNovo (35), identify peptide sequences directly from MS/MS spectra. These methods can be used to derive novel peptides and post-translational modifications without a database, which is useful, especially when the related genome is not sequenced. High-resolution MS/MS spectra greatly facilitate the generation of peptide sequences in these de novo methods. However, because MS/MS fragmentation cannot always produce all predicted product ions, only a portion of collected MS/MS spectra have sufficient quality to extract partial or full peptide sequences, leading to lower sensitivity than achieved with the database search methods.To improve the sensitivity of the de novo methods, a hybrid approach has been proposed to integrate peptide sequence tags into PSM scoring during database searches (36). Numerous software packages have been developed, such as GutenTag (37), InsPecT (38), Byonic (39), DirecTag (40), and PEAKS DB (41). These methods use peptide tag sequences to filter a protein database, followed by error-tolerant database searching. One restriction in most of these algorithms is the requirement of a minimum tag length of three amino acids for matching protein sequences in the database. This restriction reduces the sensitivity of the database search, because it filters out some high-quality spectra in which consecutive tags cannot be generated.In this paper, we describe JUMP, a novel tag-based hybrid algorithm for peptide identification. The program is optimized to balance sensitivity and specificity during tag derivation and MS/MS pattern matching. JUMP can use all potential sequence tags, including tags consisting of only one amino acid. When we compared its performance to that of two widely used search algorithms, SEQUEST and Mascot, JUMP identified ∼30% more PSMs at the same FDR threshold. In addition, the program provides two additional features: (i) using tag sequences to improve modification site assignment, and (ii) analyzing co-fragmented peptides from mixture MS/MS spectra.  相似文献   

4.
Laserspray ionization (LSI) mass spectrometry (MS) allows, for the first time, the analysis of proteins directly from tissue using high performance atmospheric pressure ionization mass spectrometers. Several abundant and numerous lower abundant protein ions with molecular masses up to ∼20,000 Da were detected as highly charged ions from delipified mouse brain tissue mounted on a common microscope slide and coated with 2,5-dihydroxyacetophenone as matrix. The ability of LSI to produce multiply charged ions by laser ablation at atmospheric pressure allowed protein analysis at 100,000 mass resolution on an Orbitrap Exactive Fourier transform mass spectrometer. A single acquisition was sufficient to identify the myelin basic protein N-terminal fragment directly from tissue using electron transfer dissociation on a linear trap quadrupole (LTQ) Velos. The high mass resolution and mass accuracy, also obtained with a single acquisition, are useful in determining protein molecular weights and from the electron transfer dissociation data in confirming database-generated sequences. Furthermore, microscopy images of the ablated areas show matrix ablation of ∼15 μm-diameter spots in this study. The results suggest that LSI-MS at atmospheric pressure potentially combines speed of analysis and imaging capability common to matrix-assisted laser desorption/ionization and soft ionization, multiple charging, improved fragmentation, and cross-section analysis common to electrospray ionization.Tissue imaging by mass spectrometry (MS) is proving useful in areas such as detecting tumor margins, determining sites of high drug uptake, and mapping signaling molecules in brain tissue (18). Imaging using secondary ion mass spectrometry is well established but is only marginally useful with intact molecular mass measurements from biological tissue (911). Matrix-assisted laser desorption/ionization (MALDI)-MS operating under vacuum conditions has been used for tissue imaging with success, especially for abundant components such as membrane lipids, drug metabolites, and proteins (1214). Spatial resolution of ∼20 μm has been achieved (15), and the MALDI-MS method has been applied in an attempt to shed light on Parkinson disease (16, 17), muscular dystrophy (18), obesity, and cancer (12, 19).Unfortunately, there are disadvantages in using vacuum-based MS for tissue imaging in relation to analysis of unadulterated tissue. Also, the mass spectrometers used in these studies frequently have much lower mass resolution and mass accuracy than are available with atmospheric pressure ionization (API)1 instruments and are not as widely available. Because the vacuum ionization methods produce singly charged ions, mass-selected fragmentation methods provide only limited information, especially for proteins. In addition, no advanced fragmentation such as electron transfer dissociation (ETD) (2022) is available for confident protein confirmation or identification. Atmospheric pressure (AP) MALDI can be coupled to high performance mass spectrometers but suffers from sensitivity issues for tissue imaging where high spatial resolution is desired (23). AP MALDI also primarily produces singly charged ions (24, 25). Thus, mass and cross-section analysis of intact proteins has yet to be accomplished using AP MALDI because of intrinsic mass range limitations of API instruments, which frequently have a mass-to-charge (m/z) limit of <4000. Thus, new improved methods of mass-specific tissue imaging, especially at AP, are needed.The potential of laserspray ionization (LSI) (Scheme 1) (2633) for protein tissue analysis is reported here. LSI has advantages relative to other MS-based methods, including speed of analysis, laser ablation of small volumes, more relevant AP conditions, extended mass range and improved fragmentation through multiple charging, and the ability to obtain cross-section data for proteins on appropriate instrumentation. The applicability of LSI for high mass compounds on high performance API mass spectrometers (Orbitrap Exactive and SYNAPT G2) has been demonstrated producing ESI-like multiply protonated ions (2628). The first experiments showing sequence analysis by ETD using the LSI method were successfully carried out on a Thermo Fisher Scientific (San Jose, CA) LTQ-ETD mass spectrometer (26). Nearly complete sequence coverage was obtained for ubiquitin, an important regulatory protein. Applying ETD fragmentation to LSI-MS analyses potentially provides a new method for studying biological processes, including the mapping of phosphorylation, glycosylation, and ubiquitination sites from intact proteins and directly from tissue.Open in a separate windowScheme 1.Overview of LSI-MS operated in transmission geometry.Furthermore, unlike ESI and related ESI-based methods such as desorption-ESI (34), the LSI method has been shown to allow analysis of lipids in tissue from ablated areas <80 μm (30). In comparison with literature reports for AP MALDI at the same stage of development (35), LSI is more than an order of magnitude more sensitive and is capable of analyzing proteins on high resolution mass spectrometers as was demonstrated by obtaining full-acquisition mass spectra at 100,000 mass resolution (FWHH, m/z 200) after application of only 20 fmol of bovine pancreas insulin in the matrix 2,5-dihydroxyacetophenone (2,5-DHAP) onto a glass microscope slide (33). The analysis speed of LSI was demonstrated by obtaining mass spectra of five samples in 8 s (32). Here, we show the utility of LSI for intact peptide and protein analyses directly from mouse brain tissue. The ability to obtain a protein mass spectrum directly from mouse brain tissue in a single laser shot at 100,000 mass resolution and with ETD fragmentation is demonstrated.  相似文献   

5.
A decoding algorithm is tested that mechanistically models the progressive alignments that arise as the mRNA moves past the rRNA tail during translation elongation. Each of these alignments provides an opportunity for hybridization between the single-stranded, -terminal nucleotides of the 16S rRNA and the spatially accessible window of mRNA sequence, from which a free energy value can be calculated. Using this algorithm we show that a periodic, energetic pattern of frequency 1/3 is revealed. This periodic signal exists in the majority of coding regions of eubacterial genes, but not in the non-coding regions encoding the 16S and 23S rRNAs. Signal analysis reveals that the population of coding regions of each bacterial species has a mean phase that is correlated in a statistically significant way with species () content. These results suggest that the periodic signal could function as a synchronization signal for the maintenance of reading frame and that codon usage provides a mechanism for manipulation of signal phase.[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32]  相似文献   

6.
7.
The combination of chemical cross-linking and mass spectrometry has recently been shown to constitute a powerful tool for studying protein–protein interactions and elucidating the structure of large protein complexes. However, computational methods for interpreting the complex MS/MS spectra from linked peptides are still in their infancy, making the high-throughput application of this approach largely impractical. Because of the lack of large annotated datasets, most current approaches do not capture the specific fragmentation patterns of linked peptides and therefore are not optimal for the identification of cross-linked peptides. Here we propose a generic approach to address this problem and demonstrate it using disulfide-bridged peptide libraries to (i) efficiently generate large mass spectral reference data for linked peptides at a low cost and (ii) automatically train an algorithm that can efficiently and accurately identify linked peptides from MS/MS spectra. We show that using this approach we were able to identify thousands of MS/MS spectra from disulfide-bridged peptides through comparison with proteome-scale sequence databases and significantly improve the sensitivity of cross-linked peptide identification. This allowed us to identify 60% more direct pairwise interactions between the protein subunits in the 20S proteasome complex than existing tools on cross-linking studies of the proteasome complexes. The basic framework of this approach and the MS/MS reference dataset generated should be valuable resources for the future development of new tools for the identification of linked peptides.The study of protein–protein interactions is crucial to understanding how cellular systems function because proteins act in concert through a highly organized set of interactions. Most cellular processes are carried out by large macromolecular assemblies and regulated through complex cascades of transient protein–protein interactions (1). In the past several years numerous high-throughput studies have pioneered the systematic characterization of protein–protein interactions in model organisms (24). Such studies mainly utilize two techniques: the yeast two-hybrid system, which aims at identifying binary interactions (5), and affinity purification combined with tandem mass spectrometry analysis for the identification of multi-protein assemblies (68). Together these led to a rapid expansion of known protein–protein interactions in human and other model organisms. Patche and Aloy recently estimated that there are more than one million interactions catalogued to date (9).But despite rapid progress, most current techniques allow one to determine only whether proteins interact, which is only the first step toward understanding how proteins interact. A more complete picture comes from characterizing the three-dimensional structures of protein complexes, which provide mechanistic insights that govern how interactions occur and the high specificity observed inside the cell. Traditionally the gold-standard methods used to solve protein structures are x-ray crystallography and NMR, and there have been several efforts similar to structural genomics (10) aiming to comprehensively solve the structures of protein complexes (11, 12). Although there has been accelerated growth of structures for protein monomers in the Protein Data Bank in recent years (11), the growth of structures for protein complexes has remained relatively small (9). Many factors, including their large size, transient nature, and dynamics of interactions, have prevented many complexes from being solved via traditional approaches in structural biology. Thus, the development of complementary analytical techniques with which to probe the structure of large protein complexes continues to evolve (1318).Recent developments have advanced the analysis of protein structures and interaction by combining cross-linking and tandem mass spectrometry (17, 1924). The basic idea behind this technique is to capture and identify pairs of amino acid residues that are spatially close to each other. When these linked pairs of residues are from the same protein (intraprotein cross-links), they provide distance constraints that help one infer the possible conformations of protein structures. Conversely, when pairs of residues come from different proteins (interprotein cross-links), they provide information about how proteins interact with one another. Although cross-linking strategies date back almost a decade (25, 26), difficulty in analyzing the complex MS/MS spectrum generated from linked peptides made this approach challenging, and therefore it was not widely used. With recent advances in mass spectrometry instrumentation, there has been renewed interest in employing this strategy to determine protein structures and identify protein–protein interactions. However, most studies thus far have been focused on purified protein complexes. With today''s mass spectrometers being capable of analyzing tens of thousands of spectra in a single experiment, it is now potentially feasible to extend this approach to the analysis of complex biological samples. Researchers have tried to realize this goal using both experimental and computational approaches. Indeed, a plethora of chemical cross-linking reagents are now available for stabilizing these complexes, and some are designed to allow for easier peptide identification when employed in concert with MS analysis (20, 27, 28). There have also been several recent efforts to develop computational methods for the automatic identification of linked peptides from MS/MS spectra (2936). However, because of the lack of large annotated training data, most approaches to date either borrow fragmentation models learned from unlinked, linear peptides or learn the fragmentation statistics from training data of limited size (30, 37), which might not generalize well across different samples. In some cases it is possible to generate relatively large training data, but it is often very labor intensive and involves hundreds of separate LC-MS/MS runs (36). Here, employing disulfide-bridged peptides as an example, we propose a novel method that uses a combinatorial peptide library to (a) efficiently generate a large mass spectral reference dataset for linked peptides and (b) use these data to automatically train our new algorithm, MXDB, which can efficiently and accurately identify linked peptides from MS/MS spectra.  相似文献   

8.
Protein–protein interactions (PPIs) are fundamental to the structure and function of protein complexes. Resolving the physical contacts between proteins as they occur in cells is critical to uncovering the molecular details underlying various cellular activities. To advance the study of PPIs in living cells, we have developed a new in vivo cross-linking mass spectrometry platform that couples a novel membrane-permeable, enrichable, and MS-cleavable cross-linker with multistage tandem mass spectrometry. This strategy permits the effective capture, enrichment, and identification of in vivo cross-linked products from mammalian cells and thus enables the determination of protein interaction interfaces. The utility of the developed method has been demonstrated by profiling PPIs in mammalian cells at the proteome scale and the targeted protein complex level. Our work represents a general approach for studying in vivo PPIs and provides a solid foundation for future studies toward the complete mapping of PPI networks in living systems.Protein–protein interactions (PPIs)1 play a key role in defining protein functions in biological systems. Aberrant PPIs can have drastic effects on biochemical activities essential to cell homeostasis, growth, and proliferation, and thereby lead to various human diseases (1). Consequently, PPI interfaces have been recognized as a new paradigm for drug development. Therefore, mapping PPIs and their interaction interfaces in living cells is critical not only for a comprehensive understanding of protein function and regulation, but also for describing the molecular mechanisms underlying human pathologies and identifying potential targets for better therapeutics.Several strategies exist for identifying and mapping PPIs, including yeast two-hybrid, protein microarray, and affinity purification mass spectrometry (AP-MS) (25). Thanks to new developments in sample preparation strategies, mass spectrometry technologies, and bioinformatics tools, AP-MS has become a powerful and preferred method for studying PPIs at the systems level (69). Unlike other approaches, AP-MS experiments allow the capture of protein interactions directly from their natural cellular environment, thus better retaining native protein structures and biologically relevant interactions. In addition, a broader scope of PPI networks can be obtained with greater sensitivity, accuracy, versatility, and speed. Despite the success of this very promising technique, AP-MS experiments can lead to the loss of weak/transient interactions and/or the reorganization of protein interactions during biochemical manipulation under native purification conditions. To circumvent these problems, in vivo chemical cross-linking has been successfully employed to stabilize protein interactions in native cells or tissues prior to cell lysis (1016). The resulting covalent bonds formed between interacting partners allow affinity purification under stringent and fully denaturing conditions, consequently reducing nonspecific background while preserving stable and weak/transient interactions (1216). Subsequent mass spectrometric analysis can reveal not only the identities of interacting proteins, but also cross-linked amino acid residues. The latter provides direct molecular evidence describing the physical contacts between and within proteins (17). This information can be used for computational modeling to establish structural topologies of proteins and protein complexes (1722), as well as for generating experimentally derived protein interaction network topology maps (23, 24). Thus, cross-linking mass spectrometry (XL-MS) strategies represent a powerful and emergent technology that possesses unparalleled capabilities for studying PPIs.Despite their great potential, current XL-MS studies that have aimed to identify cross-linked peptides have been mostly limited to in vitro cross-linking experiments, with few successfully identifying protein interaction interfaces in living cells (24, 25). This is largely because XL-MS studies remain challenging due to the inherent difficulty in the effective MS detection and accurate identification of cross-linked peptides, as well as in unambiguous assignment of cross-linked residues. In general, cross-linked products are heterogeneous and low in abundance relative to non-cross-linked products. In addition, their MS fragmentation is too complex to be interpreted using conventional database searching tools (17, 26). It is noted that almost all of the current in vivo PPI studies utilize formaldehyde cross-linking because of its membrane permeability and fast kinetics (1016). However, in comparison to the most commonly used amine reactive NHS ester cross-linkers, identification of formaldehyde cross-linked peptides is even more challenging because of its promiscuous nonspecific reactivity and extremely short spacer length (27). Therefore, further developments in reagents and methods are urgently needed to enable simple MS detection and effective identification of in vivo cross-linked products, and thus allow the mapping of authentic protein contact sites as established in cells, especially for protein complexes.Various efforts have been made to address the limitations of XL-MS studies, resulting in new developments in bioinformatics tools for improved data interpretation (2832) and new designs of cross-linking reagents for enhanced MS analysis of cross-linked peptides (24, 3339). Among these approaches, the development of new cross-linking reagents holds great promise for mapping PPIs on the systems level. One class of cross-linking reagents containing an enrichment handle have been shown to allow selective isolation of cross-linked products from complex mixtures, boosting their detectability by MS (3335, 4042). A second class of cross-linkers containing MS-cleavable bonds have proven to be effective in facilitating the unambiguous identification of cross-linked peptides (3639, 43, 44), as the resulting cross-linked products can be identified based on their characteristic and simplified fragmentation behavior during MS analysis. Therefore, an ideal cross-linking reagent would possess the combined features of both classes of cross-linkers. To advance the study of in vivo PPIs, we have developed a new XL-MS platform based on a novel membrane-permeable, enrichable, and MS-cleavable cross-linker, Azide-A-DSBSO (azide-tagged, acid-cleavable disuccinimidyl bis-sulfoxide), and multistage tandem mass spectrometry (MSn). This new XL-MS strategy has been successfully employed to map in vivo PPIs from mammalian cells at both the proteome scale and the targeted protein complex level.  相似文献   

9.
Significant progress in instrumentation and sample preparation approaches have recently expanded the potential of MALDI imaging mass spectrometry to the analysis of phospholipids and other endogenous metabolites naturally occurring in tissue specimens. Here we explore some of the requirements necessary for the successful analysis and imaging of phospholipids from thin tissue sections of various dimensions by MALDI time-of-flight mass spectrometry. We address methodology issues relative to the imaging of whole-body sections such as those cut from model laboratory animals, sections of intermediate dimensions typically prepared from individual organs, as well as the requirements for imaging areas of interests from these sections at a cellular scale spatial resolution. We also review existing limitations of MALDI imaging MS technology relative to compound identification. Finally, we conclude with a perspective on important issues relative to data exploitation and management that need to be solved to maximize biological understanding of the tissue specimen investigated.Since its introduction in the late 90s (1), MALDI imaging mass spectrometry (MS) technology has witnessed a phenomenal expansion. Initially introduced for the mapping of intact proteins from fresh frozen tissue sections (2), imaging MS is now routinely applied to a wide range of different compounds including peptides, proteins, lipids, metabolites, and xenobiotics (37). Numerous compound-specific sample preparation protocols and analytical strategies have been developed. These include tissue sectioning and handling (814), automated matrix deposition approaches and data acquisition strategies (1521), and the emergence of in situ tissue chemistries (2225). Originally performed on sections cut from fresh frozen tissue specimens, methodologies incorporating an in situ enzymatic digestion step prior to matrix application have been optimized to access the proteome locked in formalin-fixed paraffin-embedded tissue biopsies (2529). The possibility to use tissues preserved using non-cross-linking approaches has also been demonstrated (3032). These methodologies are of high importance for the study of numerous diseases because they potentially allow the retrospective analysis for biomarker validation and discovery of the millions of tissue biopsies currently stored worldwide in tissue banks and repositories.In the past decade, instrumentation for imaging MS has also greatly evolved. Whereas the first MS images were collected with time-of-flight instruments (TOF) capable of repetition rates of a few hertz, modern systems are today capable of acquiring data in the kilohertz range and above with improved sensitivity, mass resolving power, and accuracy, significantly reducing acquisition time and improving image quality (33, 34). Beyond time-of-flight analyzers, other MALDI-based instruments have been used such as ion traps (3537), Qq TOF instruments (3840), and trap-TOF (16, 41). Ion mobility technology has also been used in conjunction with imaging MS (4244). More recently, MALDI FT/ICR and Orbitrap mass spectrometers have been demonstrated to be extremely valuable instruments for the performance of imaging MS at very high mass resolving power (4547). These non-TOF-based systems have proven to be extremely powerful for the imaging of lower molecular weight compounds such as lipids, drugs, and metabolites. Home-built instrumentation and analytical approaches to probe tissues at higher spatial resolution (1–10 μm) have also been described (4850). In parallel to instrumentation developments, automated data acquisition, image visualization, and processing software packages have now also been developed by most manufacturers.To date, a wide range of biological systems have been studied using imaging MS as a primary methodology. Of strong interest are the organization and identification of the molecular composition of diseased tissues in direct correlation with the underlying histology and how it differs from healthy tissues. Such an approach has been used for the study of cancers (5154), neurologic disorders (5557), and other diseases (58, 59). The clinical potential of the imaging MS technology is enormous (7, 60, 61). Results give insights into the onset and progression of diseases, identify novel sets of disease-specific markers, and can provide a molecular confirmation of diagnosis as well as aide in outcome prediction (6264). Imaging MS has also been extensively used to study the development, functioning, and aging of different organs such as the kidney, prostate, epididymis, and eye lens (6570). Beyond the study of isolated tissues or organs, whole-body sections from several model animals such as leeches, mice, and rats have been investigated (7174). For these analyses, specialized instrumentation and protocols are necessary for tissue sectioning and handling (72, 73). Whole-body imaging MS opens the door to the study of the localization and accumulation of administered pharmaceuticals and their known metabolites at the level of entire organisms as well as the monitoring of their efficacy or toxicity as a function of time or dose (72, 73, 75, 76).There is considerable interest in determining the identification and localization of small biomolecules such as lipids in tissues because they are involved in many essential biological functions including cell signaling, energy storage, and membrane structure and function. Defects in lipid metabolism play a role in many diseases such as muscular dystrophy and cardiovascular disease. Phospholipids in tissues have been intensively studied by several groups (37, 40, 7783). In this respect, for optimal recovery of signal, several variables such as the choice of matrix for both imaging and fragmentation, solvent system, and instrument polarity have been investigated (20, 84). Particularly, the use of lithium cation adducts to facilitate phospholipid identification by tandem MS directly from tissue has also been reported (85). Of significant interest is the recent emergence of two new solvent-free matrix deposition approaches that perform exceptionally well for phospholipid imaging analyses. The first approach, described by Hankin et al. (86), consists in depositing the matrix on the sections through a sublimation process. The described sublimation system consists of sublimation glassware, a heated sand or oil bath (100–200 °C), and a primary vacuum pump (∼5 × 10−2 torr). Within a few minutes of initiating the sublimation process, an exceptionally homogeneous film of matrix forms on the section. The thickness of the matrix may be controlled by regulating pressure, temperature, and sublimation time. The second approach, described by Puolitaival et al.(87), uses a fine mesh sieve (≤20 μm) to filter finely ground matrix on the tissue sections. Agitation of the sieve results in passage of the matrix through the mesh and the deposition of a fairly homogeneous layer of submicrometer matrix crystals of the surface of the sections. The matrix density on the sections is controlled by direct observation using a standard light microscope. This matrix deposition approach was also found to be ideal to image certain drug compounds (88, 89). Both strategies allow very rapid production of homogeneous matrix coatings on tissue sections with a fairly inexpensive setup. Signal recovery was found to be comparable with those obtained by conventional spray deposition. With the appropriate size sublimation device or sieve, larger sections with dimensions of several centimeters such as those cut from mouse or rat whole bodies can also be rapidly and homogeneously coated.Here we present several examples of MALDI imaging MS of phospholipids from tissue sections using TOF mass spectrometers over a wide range of dimensions from whole-body sections (several centimeters), to individual organs (several millimeters), down to high spatial resolution imaging of selected tissue areas (hundreds of micrometers) at 10-μm lateral resolution and below. For all of these dimension ranges, technological considerations and practical aspects are discussed. In light of the imaging MS results, we also address issues faced for compound identification by tandem MS analysis performed directly on the sections. Finally, we discuss under “Perspective” our vision of the future of the field as well as the technological improvements and analytical tools that need to be improved upon and developed.  相似文献   

10.
11.
A Boolean network is a model used to study the interactions between different genes in genetic regulatory networks. In this paper, we present several algorithms using gene ordering and feedback vertex sets to identify singleton attractors and small attractors in Boolean networks. We analyze the average case time complexities of some of the proposed algorithms. For instance, it is shown that the outdegree-based ordering algorithm for finding singleton attractors works in time for , which is much faster than the naive time algorithm, where is the number of genes and is the maximum indegree. We performed extensive computational experiments on these algorithms, which resulted in good agreement with theoretical results. In contrast, we give a simple and complete proof for showing that finding an attractor with the shortest period is NP-hard.[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32]  相似文献   

12.
The use of ultraviolet photodissociation (UVPD) for the activation and dissociation of peptide anions is evaluated for broader coverage of the proteome. To facilitate interpretation and assignment of the resulting UVPD mass spectra of peptide anions, the MassMatrix database search algorithm was modified to allow automated analysis of negative polarity MS/MS spectra. The new UVPD algorithms were developed based on the MassMatrix database search engine by adding specific fragmentation pathways for UVPD. The new UVPD fragmentation pathways in MassMatrix were rigorously and statistically optimized using two large data sets with high mass accuracy and high mass resolution for both MS1 and MS2 data acquired on an Orbitrap mass spectrometer for complex Halobacterium and HeLa proteome samples. Negative mode UVPD led to the identification of 3663 and 2350 peptides for the Halo and HeLa tryptic digests, respectively, corresponding to 655 and 645 peptides that were unique when compared with electron transfer dissociation (ETD), higher energy collision-induced dissociation, and collision-induced dissociation results for the same digests analyzed in the positive mode. In sum, 805 and 619 proteins were identified via UVPD for the Halobacterium and HeLa samples, respectively, with 49 and 50 unique proteins identified in contrast to the more conventional MS/MS methods. The algorithm also features automated charge determination for low mass accuracy data, precursor filtering (including intact charge-reduced peaks), and the ability to combine both positive and negative MS/MS spectra into a single search, and it is freely open to the public. The accuracy and specificity of the MassMatrix UVPD search algorithm was also assessed for low resolution, low mass accuracy data on a linear ion trap. Analysis of a known mixture of three mitogen-activated kinases yielded similar sequence coverage percentages for UVPD of peptide anions versus conventional collision-induced dissociation of peptide cations, and when these methods were combined into a single search, an increase of up to 13% sequence coverage was observed for the kinases. The ability to sequence peptide anions and cations in alternating scans in the same chromatographic run was also demonstrated. Because ETD has a significant bias toward identifying highly basic peptides, negative UVPD was used to improve the identification of the more acidic peptides in conjunction with positive ETD for the more basic species. In this case, tryptic peptides from the cytosolic section of HeLa cells were analyzed by polarity switching nanoLC-MS/MS utilizing ETD for cation sequencing and UVPD for anion sequencing. Relative to searching using ETD alone, positive/negative polarity switching significantly improved sequence coverages across identified proteins, resulting in a 33% increase in unique peptide identifications and more than twice the number of peptide spectral matches.The advent of new high-performance tandem mass spectrometers equipped with the most versatile collision- and electron-based activation methods and ever more powerful database search algorithms has catalyzed tremendous progress in the field of proteomics (14). Despite these advances in instrumentation and methodologies, there are few methods that fully exploit the information available from the acidic proteome or acidic regions of proteins. Typical high-throughput, bottom-up workflows consist of the chromatographic separation of complex mixtures of digested proteins followed by online mass spectrometry (MS) and MSn analysis. This bottom-up approach remains the most popular strategy for protein identification, biomarker discovery, quantitative proteomics, and elucidation of post-translational modifications. To date, proteome characterization via mass spectrometry has overwhelmingly focused on the analysis of peptide cations (5), resulting in an inherent bias toward basic peptides that easily ionize under acidic mobile phase conditions and positive polarity MS settings. Given that ∼50% of peptides/proteins are naturally acidic (6) and that many of the most important post-translational modifications (e.g. phosphorylation, acetylation, sulfonation, etc.) significantly decrease the isoelectric points of peptides (7, 8), there is a compelling need for better analytical methodologies for characterization of the acidic proteome.A principal reason for the shortage of methods for peptide anion characterization is the lack of MS/MS techniques suitable for the efficient and predictable dissociation of peptide anions. Although there are a growing array of new ion activation methods for the dissociation of peptides, most have been developed for the analysis of positively charged peptides. Collision-induced dissociation (CID)1 of peptide anions, for example, often yields unpredictable or uninformative fragmentation behavior, with spectra dominated by neutral losses from both precursor and product ions (9), resulting in insufficient peptide sequence information. The two most promising new electron-based methods, electron-capture dissociation and electron-transfer dissociation (ETD), are applicable only to positively charged ions, not to anions (1013). Because of the known inadequacy of CID and the lack of feasibility of electron-capture dissociation and ETD for peptide anion sequencing, several alternative MSn methods have been developed recently. Electron detachment dissociation using high-energy electrons to induce backbone cleavages was developed for peptide anions (14, 15). Another new technique, negative ETD, entails reactions of radical cation reagents with peptide anions to promote electron transfer from the peptide to the reagent that causes radical-directed dissociation (16, 17). Activated-electron photodetachment dissociation, an MS3 technique, uses UV irradiation to produce intact peptide radical anions, which are then collisionally activated (18, 19). Although they represent inroads in the characterization of peptide anions, these methods also suffer from several significant shortcomings. Electron detachment dissociation and activated-electron photodetachment dissociation are both low-efficiency methods that require long averaging cycles and activation times that range from half a second to multiple seconds, impeding the integration of these methods with chromatographic timescales (1419). In addition, the fragmentation patterns frequently yield many high-abundance neutral losses from product ions, which clutter the spectra (1417), and few sequence ions (14, 18, 19). Recently, we reported the use of 193-nm photons (ultraviolet photodissociation (UVPD)) for peptide anion activation, which was shown to yield rich and predictable fragmentation patterns with high sequence coverage on a fast liquid chromatographic timeline (20). This method showed promise for a range of peptide charge states (i.e. from 3- to 1-), as well as for both unmodified and phosphorylated species.Several widely used or commercial database searching techniques are available for automated “bottom-up” analysis of peptide cations; SEQUEST (21), MASCOT (22), OMSSA (23), X! Tandem (24), and MASPIC (25) are all popular choices and yield comparable results (26). MassMatrix (27), a recently introduced searching algorithm, uses a mass accuracy sensitive probability-based scoring scheme for both the total number of matched product ions and the total abundance of matched products. This searching method also utilizes LC retention times to filter false positive peptide matches (28) and has been shown to yield results comparable to or better than those obtained with SEQUEST, MASCOT, OMSSA, and X! Tandem (29). Despite the ongoing innovation in automated peptide cation analysis, there is a lack of publically available methods for automated peptide anion analysis.In this work, we have modified the mass accuracy sensitive probabilistic MassMatrix algorithms to allow database searching of negative polarity MS/MS spectra. The algorithm is specific to the fragmentation behavior generated from 193-nm UVPD of peptide anions. The UVPD pathways in MassMatrix were rigorously and statistically optimized using two large data sets with high mass accuracy and high mass resolution for both MS1 and MS2 data acquired on an Orbitrap mass spectrometer for complex HeLa and Halo proteome samples. For low mass accuracy/low mass resolution data, we also incorporated a charge-state-filtering algorithm that identifies the charge state of each MS/MS spectrum based on the fragmentation patterns prior to searching. MassMatrix not only can analyze both positive and negative polarity LC-MS/MS files separately, but also can combine files from different polarities and different dissociation methods into a single search, thus maximizing the information content for a given proteomics experiment. The explicit incorporation of mass accuracy in the scores for the UVPD MS/MS spectra of peptide anions increases peptide assignments and identifications. Finally, we showcase the utility of integrating MassMatrix searching with positive/negative polarity MS/MS switching (i.e. data-dependent positive ETD and negative UVPD during a single proteomic LC-MS/MS run). MassMatrix is available to the public as a free search engine online.  相似文献   

13.
14.
The orbitrap mass analyzer combines high sensitivity, high resolution, and high mass accuracy in a compact format. In proteomics applications, it is used in a hybrid configuration with a linear ion trap (LTQ-Orbitrap) where the linear trap quadrupole (LTQ) accumulates, isolates, and fragments peptide ions. Alternatively, isolated ions can be fragmented by higher energy collisional dissociation. A recently introduced stand-alone orbitrap analyzer (Exactive) also features a higher energy collisional dissociation cell but cannot isolate ions. Here we report that this instrument can efficiently characterize protein mixtures by alternating MS and “all-ion fragmentation” (AIF) MS/MS scans in a manner similar to that previously described for quadrupole time-of-flight instruments. We applied the peak recognition algorithms of the MaxQuant software at both the precursor and product ion levels. Assignment of fragment ions to co-eluting precursor ions was facilitated by high resolution (100,000 at m/z 200) and high mass accuracy. For efficient fragmentation of different mass precursors, we implemented a stepped collision energy procedure with cumulative MS readout. AIF on the Exactive identified 45 of 48 proteins in an equimolar protein standard mixture and all of them when using a small database. The technique also identified proteins with more than 100-fold abundance differences in a high dynamic range standard. When applied to protein identification in gel slices, AIF unambiguously characterized an immunoprecipitated protein that was barely visible by Coomassie staining and quantified it relative to contaminating proteins. AIF on a benchtop orbitrap instrument is therefore an attractive technology for a wide range of proteomics analyses.Mass spectrometry (MS)-based proteomics is commonly performed in a “shotgun” format where proteins are digested to peptides, which are separated and analyzed by liquid chromatography-tandem mass spectrometry (LC-MS/MS) (1, 2). Many peptides typically co-elute from the column and are selected for fragmentation on the basis of their abundance (“data dependent acquisition”). The precursor mass, which can be determined with high mass accuracy in most current instruments, together with a list of fragment ions, which are often determined at lower mass accuracy, are together used to identify the peptide in a sequence database. This scheme is the basis of most of current proteomics research from the identification of single protein bands to the comprehensive characterization of entire proteomes. To minimize stochastic effects from the selection of peptides for fragmentation and to maximize coverage in complex mixtures, very high sequencing speed is desirable. Although this is achievable, it requires complex instrumentation, and there is still no guarantee that all peptides in a mixture are fragmented and identified. Illustrating this challenge, when the Association of Biomolecular Resource Facilities (ABRF)1 and the Human Proteome Organisation (HUPO) conducted studies of protein identification success in different laboratories, results were varying (4, 5).2 Despite using state of the art proteomics workflows, often with extensive fractionation, only a few laboratories correctly identified all of the proteins in an equimolar 49-protein mixture (ABRF) or a 20-protein mixture (HUPO).As an alternative to data-dependent shotgun proteomics, the mass spectrometer can be operated to fragment the entire mass range of co-eluting analytes. This approach has its roots in precursor ion scanning techniques in which all precursors were fragmented simultaneously either in the source region or in the collision cell, and the appearance of specific “reporter ions” for a modification of interest was recorded (68). Several groups reported the identification of peptides from MS scans in conjunction with MS/MS scans without precursor ion selection (912). Yates and co-workers (13) pursued an intermediate strategy by cycling through the mass range in 10 m/z fragmentation windows. The major challenge of data-independent acquisition is that the direct relationship between precursor and fragments is lost. In most of the above studies, this problem was alleviated by making use of the fact that precursors and fragments have to “co-elute.”In recent years, data-independent proteomics has mainly been pursued on the quadrupole TOF platform where it has been termed MSE in analogy to MS2, MS3, and MSn techniques used for fragmenting one peptide at a time. Geromanos and co-workers (1416) applied MSE to absolute quantification of proteins in mixtures. Another study showed excellent protein coverage of yeast enolase with data-independent peptide fragmentation where enolase peptide intensities varied over 2 orders of magnitude (17). In a recent comparison of data-dependent and -independent peptide fragmentation, the authors concluded that fragmentation information was highly comparable (18, 19).Recently, the orbitrap mass analyzer (2023) has been introduced in a benchtop format without the linear ion trap that normally performs ion accumulation, fragmentation, and analysis of the fragments. This instrument, termed Exactive, was developed for small molecule applications such as metabolite analysis. It can be obtained with a higher energy collisional dissociation (HCD) cell (24), enabling efficient fragmentation but no precursor ion selection. This option is called “all-ion fragmentation” (AIF) by the manufacturer, and this is the term that we use below. We reasoned that the high resolution (100,000 compared with 10,000 in quadrupole TOF) and mass accuracy of this device in both the MS and MS/MS modes might facilitate the analysis of the complex fragmentation spectra generated by dissociating several precursors simultaneously. The simplicity and compactness of this instrumentation platform would then make it interesting for diverse proteomics applications.  相似文献   

15.
Quantifying the similarity of spectra is an important task in various areas of spectroscopy, for example, to identify a compound by comparing sample spectra to those of reference standards. In mass spectrometry based discovery proteomics, spectral comparisons are used to infer the amino acid sequence of peptides. In targeted proteomics by selected reaction monitoring (SRM) or SWATH MS, predetermined sets of fragment ion signals integrated over chromatographic time are used to identify target peptides in complex samples. In both cases, confidence in peptide identification is directly related to the quality of spectral matches. In this study, we used sets of simulated spectra of well-controlled dissimilarity to benchmark different spectral comparison measures and to develop a robust scoring scheme that quantifies the similarity of fragment ion spectra. We applied the normalized spectral contrast angle score to quantify the similarity of spectra to objectively assess fragment ion variability of tandem mass spectrometric datasets, to evaluate portability of peptide fragment ion spectra for targeted mass spectrometry across different types of mass spectrometers and to discriminate target assays from decoys in targeted proteomics. Altogether, this study validates the use of the normalized spectral contrast angle as a sensitive spectral similarity measure for targeted proteomics, and more generally provides a methodology to assess the performance of spectral comparisons and to support the rational selection of the most appropriate similarity measure. The algorithms used in this study are made publicly available as an open source toolset with a graphical user interface.In “bottom-up” proteomics, peptide sequences are identified by the information contained in their fragment ion spectra (1). Various methods have been developed to generate peptide fragment ion spectra and to match them to their corresponding peptide sequences. They can be broadly grouped into discovery and targeted methods. In the widely used discovery (also referred to as shotgun) proteomic approach, peptides are identified by establishing peptide to spectrum matches via a method referred to as database searching. Each acquired fragment ion spectrum is searched against theoretical peptide fragment ion spectra computed from the entries of a specified sequence database, whereby the database search space is constrained to a user defined precursor mass tolerance (2, 3). The quality of the match between experimental and theoretical spectra is typically expressed with multiple scores. These include the number of matching or nonmatching fragments, the number of consecutive fragment ion matches among others. With few exceptions (47) commonly used search engines do not use the relative intensities of the acquired fragment ion signals even though this information could be expected to strengthen the confidence of peptide identification because the relative fragment ion intensity pattern acquired under controlled fragmentation conditions can be considered as a unique “fingerprint” for a given precursor. Thanks to community efforts in acquiring and sharing large number of datasets, the proteomes of some species are now essentially mapped out and experimental fragment ion spectra covering entire proteomes are increasingly becoming accessible through spectral databases (816). This has catalyzed the emergence of new proteomics strategies that differ from classical database searching in that they use prior spectral information to identify peptides. Those comprise inclusion list sequencing (directed sequencing), spectral library matching, and targeted proteomics (17). These methods explicitly use the information contained in empirical fragment ion spectra, including the fragment ion signal intensity to identify the target peptide. For these methods, it is therefore of highest importance to accurately control and quantify the degree of reproducibility of the fragment ion spectra across experiments, instruments, labs, methods, and to quantitatively assess the similarity of spectra. To date, dot product (1824), its corresponding arccosine spectral contrast angle (2527) and (Pearson-like) spectral correlation (2831), and other geometrical distance measures (18, 32), have been used in the literature for assessing spectral similarity. These measures have been used in different contexts including shotgun spectra clustering (19, 26), spectral library searching (18, 20, 21, 24, 25, 2729), cross-instrument fragmentation comparisons (22, 30) and for scoring transitions in targeted proteomics analyses such as selected reaction monitoring (SRM)1 (23, 31). However, to our knowledge, those scores have never been objectively benchmarked for their performance in discriminating well-defined levels of dissimilarities between spectra. In particular, similarity scores obtained by different methods have not yet been compared for targeted proteomics applications, where the sensitive discrimination of highly similar spectra is critical for the confident identification of targeted peptides.In this study, we have developed a method to objectively assess the similarity of fragment ion spectra. We provide an open-source toolset that supports these analyses. Using a computationally generated benchmark spectral library with increasing levels of well-controlled spectral dissimilarity, we performed a comprehensive and unbiased comparison of the performance of the main scores used to assess spectral similarity in mass spectrometry.We then exemplify how this method, in conjunction with its corresponding benchmarked perturbation spectra set, can be applied to answer several relevant questions for MS-based proteomics. As a first application, we show that it can efficiently assess the absolute levels of peptide fragmentation variability inherent to any given mass spectrometer. By comparing the instrument''s intrinsic fragmentation conservation distribution to that of the benchmarked perturbation spectra set, nominal values of spectral similarity scores can indeed be translated into a more directly understandable percentage of variability inherent to the instrument fragmentation. As a second application, we show that the method can be used to derive an absolute measure to estimate the conservation of peptide fragmentation between instruments or across proteomics methods. This allowed us to quantitatively evaluate, for example, the transferability of fragment ion spectra acquired by data dependent analysis in a first instrument into a fragment/transition assay list used for targeted proteomics applications (e.g. SRM or targeted extraction of data independent acquisition SWATH MS (33)) on another instrument. Third, we used the method to probe the fragmentation patterns of peptides carrying a post-translation modification (e.g. phosphorylation) by comparing the spectra of modified peptide with those of their unmodified counterparts. Finally, we used the method to determine the overall level of fragmentation conservation that is required to support target-decoy discrimination and peptide identification in targeted proteomics approaches such as SRM and SWATH MS.  相似文献   

16.
17.
A variety of high-throughput methods have made it possible to generate detailed temporal expression data for a single gene or large numbers of genes. Common methods for analysis of these large data sets can be problematic. One challenge is the comparison of temporal expression data obtained from different growth conditions where the patterns of expression may be shifted in time. We propose the use of wavelet analysis to transform the data obtained under different growth conditions to permit comparison of expression patterns from experiments that have time shifts or delays. We demonstrate this approach using detailed temporal data for a single bacterial gene obtained under 72 different growth conditions. This general strategy can be applied in the analysis of data sets of thousands of genes under different conditions.[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29]  相似文献   

18.
Insulin plays a central role in the regulation of vertebrate metabolism. The hormone, the post-translational product of a single-chain precursor, is a globular protein containing two chains, A (21 residues) and B (30 residues). Recent advances in human genetics have identified dominant mutations in the insulin gene causing permanent neonatal-onset DM2 (14). The mutations are predicted to block folding of the precursor in the ER of pancreatic β-cells. Although expression of the wild-type allele would in other circumstances be sufficient to maintain homeostasis, studies of a corresponding mouse model (57) suggest that the misfolded variant perturbs wild-type biosynthesis (8, 9). Impaired β-cell secretion is associated with ER stress, distorted organelle architecture, and cell death (10). These findings have renewed interest in insulin biosynthesis (1113) and the structural basis of disulfide pairing (1419). Protein evolution is constrained not only by structure and function but also by susceptibility to toxic misfolding.Insulin plays a central role in the regulation of vertebrate metabolism. The hormone, the post-translational product of a single-chain precursor, is a globular protein containing two chains, A (21 residues) and B (30 residues). Recent advances in human genetics have identified dominant mutations in the insulin gene causing permanent neonatal-onset DM2 (14). The mutations are predicted to block folding of the precursor in the ER of pancreatic β-cells. Although expression of the wild-type allele would in other circumstances be sufficient to maintain homeostasis, studies of a corresponding mouse model (57) suggest that the misfolded variant perturbs wild-type biosynthesis (8, 9). Impaired β-cell secretion is associated with ER stress, distorted organelle architecture, and cell death (10). These findings have renewed interest in insulin biosynthesis (1113) and the structural basis of disulfide pairing (1419). Protein evolution is constrained not only by structure and function but also by susceptibility to toxic misfolding.  相似文献   

19.
20.
It remains extraordinarily challenging to elucidate endogenous protein-protein interactions and proximities within the cellular milieu. The dynamic nature and the large range of affinities of these interactions augment the difficulty of this undertaking. Among the most useful tools for extracting such information are those based on affinity capture of target bait proteins in combination with mass spectrometric readout of the co-isolated species. Although highly enabling, the utility of affinity-based methods is generally limited by difficulties in distinguishing specific from nonspecific interactors, preserving and isolating all unique interactions including those that are weak, transient, or rapidly exchanging, and differentiating proximal interactions from those that are more distal. Here, we have devised and optimized a set of methods to address these challenges. The resulting pipeline involves flash-freezing cells in liquid nitrogen to preserve the cellular environment at the moment of freezing; cryomilling to fracture the frozen cells into intact micron chunks to allow for rapid access of a chemical reagent and to stabilize the intact endogenous subcellular assemblies and interactors upon thawing; and utilizing the high reactivity of glutaraldehyde to achieve sufficiently rapid stabilization at low temperatures to preserve native cellular interactions. In the course of this work, we determined that relatively low molar ratios of glutaraldehyde to reactive amines within the cellular milieu were sufficient to preserve even labile and transient interactions. This mild treatment enables efficient and rapid affinity capture of the protein assemblies of interest under nondenaturing conditions, followed by bottom-up MS to identify and quantify the protein constituents. For convenience, we have termed this approach Stabilized Affinity Capture Mass Spectrometry. Here, we demonstrate that Stabilized Affinity Capture Mass Spectrometry allows us to stabilize and elucidate local, distant, and transient protein interactions within complex cellular milieux, many of which are not observed in the absence of chemical stabilization.Insights into many cellular processes require detailed information about interactions between the participating proteins. However, the analysis of such interactions can be challenging because of the often-diverse physicochemical properties and the abundances of the constituent proteins, as well as the sometimes wide range of affinities and complex dynamics of the interactions. One of the key challenges has been acquiring information concerning transient, low affinity interactions in highly complex cellular milieux (3, 4).Methods that allow elucidation of such information include co-localization microscopy (5), fluorescence protein Förster resonance energy transfer (4), immunoelectron microscopy (5), yeast two-hybrid (6), and affinity capture (7, 8). Among these, affinity capture (AC)1 has the unique potential to detect all specific in vivo interactions simultaneously, including those that interact both directly and indirectly. In recent times, the efficacy of such affinity isolation experiments has been greatly enhanced through the use of sensitive modern mass spectrometric protein identification techniques (9). Nevertheless, AC suffers from several shortcomings. These include the problem of 1) distinguishing specific from nonspecific interactors (10, 11); 2) preserving and isolating all unique interactions including those that are weak and/or transient, as well as those that exchange rapidly (10, 12, 13); and 3) differentiating proximal from more distant interactions (14).We describe here an approach to address these issues, which makes use of chemical stabilization of protein assemblies in the complex cellular milieu prior to AC. Chemical stabilization is an emerging technique for stabilizing and elucidating protein associations both in vitro (1520) and in vivo (3, 12, 14, 2129), with mass spectrometric (MS) readout of the AC proteins and their connectivities. Such chemical stabilization methods are indeed well-established and are often used in electron microscopy for preserving complexes and subcellular structures both in the cellular milieu (3) and in purified complexes (30, 31), wherein the most reliable, stable, and established stabilization reagents is glutaraldehyde. Recently, glutaraldehyde has been applied in the “GraFix” protocol in which purified protein complexes are subjected to centrifugation through a density gradient that also contains a gradient of glutaraldehyde (30, 31), allowing for optimal stabilization of authentic complexes and minimization of nonspecific associations and aggregation. GraFix has also been combined with mass spectrometry on purified complexes bound to EM grids to obtain a compositional analysis of the complexes (32), thereby raising the possibility that glutaraldehyde can be successfully utilized in conjunction with AC in complex cellular milieux directly.In this work, we present a robust pipeline for determining specific protein-protein interactions and proximities from cellular milieux. The first steps of the pipeline involve the well-established techniques of flash freezing the cells of interest in liquid nitrogen and cryomilling, which have been known for over a decade (33, 34) to preserve the cellular environment, as well as having shown outstanding performance when used in analysis of macromolecular interactions in yeast (3539), bacterial (40, 41), trypanosome (42), mouse (43), and human (4447) systems. The resulting frozen powder, composed of intact micron chunks of cells that have great surface area and outstanding solvent accessibility, is well suited for rapid low temperature chemical stabilization using glutaraldehyde. We selected glutaraldehyde for our procedure based on the fact that it is a very reactive stabilizing reagent, even at lower temperatures, and because it has already been shown to stabilize enzymes in their functional state (4850). We employed highly efficient, rapid, single stage affinity capture (36, 51) for isolation and bottom-up MS for analysis of the macromolecular assemblies of interest (5254). For convenience, we have termed this approach Stabilized Affinity-Capture Mass Spectrometry (SAC-MS).  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号