首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
2.
A complete understanding of the biological functions of large signaling peptides (>4 kDa) requires comprehensive characterization of their amino acid sequences and post-translational modifications, which presents significant analytical challenges. In the past decade, there has been great success with mass spectrometry-based de novo sequencing of small neuropeptides. However, these approaches are less applicable to larger neuropeptides because of the inefficient fragmentation of peptides larger than 4 kDa and their lower endogenous abundance. The conventional proteomics approach focuses on large-scale determination of protein identities via database searching, lacking the ability for in-depth elucidation of individual amino acid residues. Here, we present a multifaceted MS approach for identification and characterization of large crustacean hyperglycemic hormone (CHH)-family neuropeptides, a class of peptide hormones that play central roles in the regulation of many important physiological processes of crustaceans. Six crustacean CHH-family neuropeptides (8–9.5 kDa), including two novel peptides with extensive disulfide linkages and PTMs, were fully sequenced without reference to genomic databases. High-definition de novo sequencing was achieved by a combination of bottom-up, off-line top-down, and on-line top-down tandem MS methods. Statistical evaluation indicated that these methods provided complementary information for sequence interpretation and increased the local identification confidence of each amino acid. Further investigations by MALDI imaging MS mapped the spatial distribution and colocalization patterns of various CHH-family neuropeptides in the neuroendocrine organs, revealing that two CHH-subfamilies are involved in distinct signaling pathways.Neuropeptides and hormones comprise a diverse class of signaling molecules involved in numerous essential physiological processes, including analgesia, reward, food intake, learning and memory (1). Disorders of the neurosecretory and neuroendocrine systems influence many pathological processes. For example, obesity results from failure of energy homeostasis in association with endocrine alterations (2, 3). Previous work from our lab used crustaceans as model organisms found that multiple neuropeptides were implicated in control of food intake, including RFamides, tachykinin related peptides, RYamides, and pyrokinins (46).Crustacean hyperglycemic hormone (CHH)1 family neuropeptides play a central role in energy homeostasis of crustaceans (717). Hyperglycemic response of the CHHs was first reported after injection of crude eyestalk extract in crustaceans. Based on their preprohormone organization, the CHH family can be grouped into two sub-families: subfamily-I containing CHH, and subfamily-II containing molt-inhibiting hormone (MIH) and mandibular organ-inhibiting hormone (MOIH). The preprohormones of the subfamily-I have a CHH precursor related peptide (CPRP) that is cleaved off during processing; and preprohormones of the subfamily-II lack the CPRP (9). Uncovering their physiological functions will provide new insights into neuroendocrine regulation of energy homeostasis.Characterization of CHH-family neuropeptides is challenging. They are comprised of more than 70 amino acids and often contain multiple post-translational modifications (PTMs) and complex disulfide bridge connections (7). In addition, physiological concentrations of these peptide hormones are typically below picomolar level, and most crustacean species do not have available genome and proteome databases to assist MS-based sequencing.MS-based neuropeptidomics provides a powerful tool for rapid discovery and analysis of a large number of endogenous peptides from the brain and the central nervous system. Our group and others have greatly expanded the peptidomes of many model organisms (3, 1833). For example, we have discovered more than 200 neuropeptides with several neuropeptide families consisting of as many as 20–40 members in a simple crustacean model system (5, 6, 2531, 34). However, a majority of these neuropeptides are small peptides with 5–15 amino acid residues long, leaving a gap of identifying larger signaling peptides from organisms without sequenced genome. The observed lack of larger size peptide hormones can be attributed to the lack of effective de novo sequencing strategies for neuropeptides larger than 4 kDa, which are inherently more difficult to fragment using conventional techniques (3437). Although classical proteomics studies examine larger proteins, these tools are limited to identification based on database searching with one or more peptides matching without complete amino acid sequence coverage (36, 38).Large populations of neuropeptides from 4–10 kDa exist in the nervous systems of both vertebrates and invertebrates (9, 39, 40). Understanding their functional roles requires sufficient molecular knowledge and a unique analytical approach. Therefore, developing effective and reliable methods for de novo sequencing of large neuropeptides at the individual amino acid residue level is an urgent gap to fill in neurobiology. In this study, we present a multifaceted MS strategy aimed at high-definition de novo sequencing and comprehensive characterization of the CHH-family neuropeptides in crustacean central nervous system. The high-definition de novo sequencing was achieved by a combination of three methods: (1) enzymatic digestion and LC-tandem mass spectrometry (MS/MS) bottom-up analysis to generate detailed sequences of proteolytic peptides; (2) off-line LC fractionation and subsequent top-down MS/MS to obtain high-quality fragmentation maps of intact peptides; and (3) on-line LC coupled to top-down MS/MS to allow rapid sequence analysis of low abundance peptides. Combining the three methods overcomes the limitations of each, and thus offers complementary and high-confidence determination of amino acid residues. We report the complete sequence analysis of six CHH-family neuropeptides including the discovery of two novel peptides. With the accurate molecular information, MALDI imaging and ion mobility MS were conducted for the first time to explore their anatomical distribution and biochemical properties.  相似文献   

3.
Significant progress in instrumentation and sample preparation approaches have recently expanded the potential of MALDI imaging mass spectrometry to the analysis of phospholipids and other endogenous metabolites naturally occurring in tissue specimens. Here we explore some of the requirements necessary for the successful analysis and imaging of phospholipids from thin tissue sections of various dimensions by MALDI time-of-flight mass spectrometry. We address methodology issues relative to the imaging of whole-body sections such as those cut from model laboratory animals, sections of intermediate dimensions typically prepared from individual organs, as well as the requirements for imaging areas of interests from these sections at a cellular scale spatial resolution. We also review existing limitations of MALDI imaging MS technology relative to compound identification. Finally, we conclude with a perspective on important issues relative to data exploitation and management that need to be solved to maximize biological understanding of the tissue specimen investigated.Since its introduction in the late 90s (1), MALDI imaging mass spectrometry (MS) technology has witnessed a phenomenal expansion. Initially introduced for the mapping of intact proteins from fresh frozen tissue sections (2), imaging MS is now routinely applied to a wide range of different compounds including peptides, proteins, lipids, metabolites, and xenobiotics (37). Numerous compound-specific sample preparation protocols and analytical strategies have been developed. These include tissue sectioning and handling (814), automated matrix deposition approaches and data acquisition strategies (1521), and the emergence of in situ tissue chemistries (2225). Originally performed on sections cut from fresh frozen tissue specimens, methodologies incorporating an in situ enzymatic digestion step prior to matrix application have been optimized to access the proteome locked in formalin-fixed paraffin-embedded tissue biopsies (2529). The possibility to use tissues preserved using non-cross-linking approaches has also been demonstrated (3032). These methodologies are of high importance for the study of numerous diseases because they potentially allow the retrospective analysis for biomarker validation and discovery of the millions of tissue biopsies currently stored worldwide in tissue banks and repositories.In the past decade, instrumentation for imaging MS has also greatly evolved. Whereas the first MS images were collected with time-of-flight instruments (TOF) capable of repetition rates of a few hertz, modern systems are today capable of acquiring data in the kilohertz range and above with improved sensitivity, mass resolving power, and accuracy, significantly reducing acquisition time and improving image quality (33, 34). Beyond time-of-flight analyzers, other MALDI-based instruments have been used such as ion traps (3537), Qq TOF instruments (3840), and trap-TOF (16, 41). Ion mobility technology has also been used in conjunction with imaging MS (4244). More recently, MALDI FT/ICR and Orbitrap mass spectrometers have been demonstrated to be extremely valuable instruments for the performance of imaging MS at very high mass resolving power (4547). These non-TOF-based systems have proven to be extremely powerful for the imaging of lower molecular weight compounds such as lipids, drugs, and metabolites. Home-built instrumentation and analytical approaches to probe tissues at higher spatial resolution (1–10 μm) have also been described (4850). In parallel to instrumentation developments, automated data acquisition, image visualization, and processing software packages have now also been developed by most manufacturers.To date, a wide range of biological systems have been studied using imaging MS as a primary methodology. Of strong interest are the organization and identification of the molecular composition of diseased tissues in direct correlation with the underlying histology and how it differs from healthy tissues. Such an approach has been used for the study of cancers (5154), neurologic disorders (5557), and other diseases (58, 59). The clinical potential of the imaging MS technology is enormous (7, 60, 61). Results give insights into the onset and progression of diseases, identify novel sets of disease-specific markers, and can provide a molecular confirmation of diagnosis as well as aide in outcome prediction (6264). Imaging MS has also been extensively used to study the development, functioning, and aging of different organs such as the kidney, prostate, epididymis, and eye lens (6570). Beyond the study of isolated tissues or organs, whole-body sections from several model animals such as leeches, mice, and rats have been investigated (7174). For these analyses, specialized instrumentation and protocols are necessary for tissue sectioning and handling (72, 73). Whole-body imaging MS opens the door to the study of the localization and accumulation of administered pharmaceuticals and their known metabolites at the level of entire organisms as well as the monitoring of their efficacy or toxicity as a function of time or dose (72, 73, 75, 76).There is considerable interest in determining the identification and localization of small biomolecules such as lipids in tissues because they are involved in many essential biological functions including cell signaling, energy storage, and membrane structure and function. Defects in lipid metabolism play a role in many diseases such as muscular dystrophy and cardiovascular disease. Phospholipids in tissues have been intensively studied by several groups (37, 40, 7783). In this respect, for optimal recovery of signal, several variables such as the choice of matrix for both imaging and fragmentation, solvent system, and instrument polarity have been investigated (20, 84). Particularly, the use of lithium cation adducts to facilitate phospholipid identification by tandem MS directly from tissue has also been reported (85). Of significant interest is the recent emergence of two new solvent-free matrix deposition approaches that perform exceptionally well for phospholipid imaging analyses. The first approach, described by Hankin et al. (86), consists in depositing the matrix on the sections through a sublimation process. The described sublimation system consists of sublimation glassware, a heated sand or oil bath (100–200 °C), and a primary vacuum pump (∼5 × 10−2 torr). Within a few minutes of initiating the sublimation process, an exceptionally homogeneous film of matrix forms on the section. The thickness of the matrix may be controlled by regulating pressure, temperature, and sublimation time. The second approach, described by Puolitaival et al.(87), uses a fine mesh sieve (≤20 μm) to filter finely ground matrix on the tissue sections. Agitation of the sieve results in passage of the matrix through the mesh and the deposition of a fairly homogeneous layer of submicrometer matrix crystals of the surface of the sections. The matrix density on the sections is controlled by direct observation using a standard light microscope. This matrix deposition approach was also found to be ideal to image certain drug compounds (88, 89). Both strategies allow very rapid production of homogeneous matrix coatings on tissue sections with a fairly inexpensive setup. Signal recovery was found to be comparable with those obtained by conventional spray deposition. With the appropriate size sublimation device or sieve, larger sections with dimensions of several centimeters such as those cut from mouse or rat whole bodies can also be rapidly and homogeneously coated.Here we present several examples of MALDI imaging MS of phospholipids from tissue sections using TOF mass spectrometers over a wide range of dimensions from whole-body sections (several centimeters), to individual organs (several millimeters), down to high spatial resolution imaging of selected tissue areas (hundreds of micrometers) at 10-μm lateral resolution and below. For all of these dimension ranges, technological considerations and practical aspects are discussed. In light of the imaging MS results, we also address issues faced for compound identification by tandem MS analysis performed directly on the sections. Finally, we discuss under “Perspective” our vision of the future of the field as well as the technological improvements and analytical tools that need to be improved upon and developed.  相似文献   

4.
Database search programs are essential tools for identifying peptides via mass spectrometry (MS) in shotgun proteomics. Simultaneously achieving high sensitivity and high specificity during a database search is crucial for improving proteome coverage. Here we present JUMP, a new hybrid database search program that generates amino acid tags and ranks peptide spectrum matches (PSMs) by an integrated score from the tags and pattern matching. In a typical run of liquid chromatography coupled with high-resolution tandem MS, more than 95% of MS/MS spectra can generate at least one tag, whereas the remaining spectra are usually too poor to derive genuine PSMs. To enhance search sensitivity, the JUMP program enables the use of tags as short as one amino acid. Using a target-decoy strategy, we compared JUMP with other programs (e.g. SEQUEST, Mascot, PEAKS DB, and InsPecT) in the analysis of multiple datasets and found that JUMP outperformed these preexisting programs. JUMP also permitted the analysis of multiple co-fragmented peptides from “mixture spectra” to further increase PSMs. In addition, JUMP-derived tags allowed partial de novo sequencing and facilitated the unambiguous assignment of modified residues. In summary, JUMP is an effective database search algorithm complementary to current search programs.Peptide identification by tandem mass spectra is a critical step in mass spectrometry (MS)-based1 proteomics (1). Numerous computational algorithms and software tools have been developed for this purpose (26). These algorithms can be classified into three categories: (i) pattern-based database search, (ii) de novo sequencing, and (iii) hybrid search that combines database search and de novo sequencing. With the continuous development of high-performance liquid chromatography and high-resolution mass spectrometers, it is now possible to analyze almost all protein components in mammalian cells (7). In contrast to rapid data collection, it remains a challenge to extract accurate information from the raw data to identify peptides with low false positive rates (specificity) and minimal false negatives (sensitivity) (8).Database search methods usually assign peptide sequences by comparing MS/MS spectra to theoretical peptide spectra predicted from a protein database, as exemplified in SEQUEST (9), Mascot (10), OMSSA (11), X!Tandem (12), Spectrum Mill (13), ProteinProspector (14), MyriMatch (15), Crux (16), MS-GFDB (17), Andromeda (18), BaMS2 (19), and Morpheus (20). Some other programs, such as SpectraST (21) and Pepitome (22), utilize a spectral library composed of experimentally identified and validated MS/MS spectra. These methods use a variety of scoring algorithms to rank potential peptide spectrum matches (PSMs) and select the top hit as a putative PSM. However, not all PSMs are correctly assigned. For example, false peptides may be assigned to MS/MS spectra with numerous noisy peaks and poor fragmentation patterns. If the samples contain unknown protein modifications, mutations, and contaminants, the related MS/MS spectra also result in false positives, as their corresponding peptides are not in the database. Other false positives may be generated simply by random matches. Therefore, it is of importance to remove these false PSMs to improve dataset quality. One common approach is to filter putative PSMs to achieve a final list with a predefined false discovery rate (FDR) via a target-decoy strategy, in which decoy proteins are merged with target proteins in the same database for estimating false PSMs (2326). However, the true and false PSMs are not always distinguishable based on matching scores. It is a problem to set up an appropriate score threshold to achieve maximal sensitivity and high specificity (13, 27, 28).De novo methods, including Lutefisk (29), PEAKS (30), NovoHMM (31), PepNovo (32), pNovo (33), Vonovo (34), and UniNovo (35), identify peptide sequences directly from MS/MS spectra. These methods can be used to derive novel peptides and post-translational modifications without a database, which is useful, especially when the related genome is not sequenced. High-resolution MS/MS spectra greatly facilitate the generation of peptide sequences in these de novo methods. However, because MS/MS fragmentation cannot always produce all predicted product ions, only a portion of collected MS/MS spectra have sufficient quality to extract partial or full peptide sequences, leading to lower sensitivity than achieved with the database search methods.To improve the sensitivity of the de novo methods, a hybrid approach has been proposed to integrate peptide sequence tags into PSM scoring during database searches (36). Numerous software packages have been developed, such as GutenTag (37), InsPecT (38), Byonic (39), DirecTag (40), and PEAKS DB (41). These methods use peptide tag sequences to filter a protein database, followed by error-tolerant database searching. One restriction in most of these algorithms is the requirement of a minimum tag length of three amino acids for matching protein sequences in the database. This restriction reduces the sensitivity of the database search, because it filters out some high-quality spectra in which consecutive tags cannot be generated.In this paper, we describe JUMP, a novel tag-based hybrid algorithm for peptide identification. The program is optimized to balance sensitivity and specificity during tag derivation and MS/MS pattern matching. JUMP can use all potential sequence tags, including tags consisting of only one amino acid. When we compared its performance to that of two widely used search algorithms, SEQUEST and Mascot, JUMP identified ∼30% more PSMs at the same FDR threshold. In addition, the program provides two additional features: (i) using tag sequences to improve modification site assignment, and (ii) analyzing co-fragmented peptides from mixture MS/MS spectra.  相似文献   

5.
Protein–protein interactions (PPIs) are fundamental to the structure and function of protein complexes. Resolving the physical contacts between proteins as they occur in cells is critical to uncovering the molecular details underlying various cellular activities. To advance the study of PPIs in living cells, we have developed a new in vivo cross-linking mass spectrometry platform that couples a novel membrane-permeable, enrichable, and MS-cleavable cross-linker with multistage tandem mass spectrometry. This strategy permits the effective capture, enrichment, and identification of in vivo cross-linked products from mammalian cells and thus enables the determination of protein interaction interfaces. The utility of the developed method has been demonstrated by profiling PPIs in mammalian cells at the proteome scale and the targeted protein complex level. Our work represents a general approach for studying in vivo PPIs and provides a solid foundation for future studies toward the complete mapping of PPI networks in living systems.Protein–protein interactions (PPIs)1 play a key role in defining protein functions in biological systems. Aberrant PPIs can have drastic effects on biochemical activities essential to cell homeostasis, growth, and proliferation, and thereby lead to various human diseases (1). Consequently, PPI interfaces have been recognized as a new paradigm for drug development. Therefore, mapping PPIs and their interaction interfaces in living cells is critical not only for a comprehensive understanding of protein function and regulation, but also for describing the molecular mechanisms underlying human pathologies and identifying potential targets for better therapeutics.Several strategies exist for identifying and mapping PPIs, including yeast two-hybrid, protein microarray, and affinity purification mass spectrometry (AP-MS) (25). Thanks to new developments in sample preparation strategies, mass spectrometry technologies, and bioinformatics tools, AP-MS has become a powerful and preferred method for studying PPIs at the systems level (69). Unlike other approaches, AP-MS experiments allow the capture of protein interactions directly from their natural cellular environment, thus better retaining native protein structures and biologically relevant interactions. In addition, a broader scope of PPI networks can be obtained with greater sensitivity, accuracy, versatility, and speed. Despite the success of this very promising technique, AP-MS experiments can lead to the loss of weak/transient interactions and/or the reorganization of protein interactions during biochemical manipulation under native purification conditions. To circumvent these problems, in vivo chemical cross-linking has been successfully employed to stabilize protein interactions in native cells or tissues prior to cell lysis (1016). The resulting covalent bonds formed between interacting partners allow affinity purification under stringent and fully denaturing conditions, consequently reducing nonspecific background while preserving stable and weak/transient interactions (1216). Subsequent mass spectrometric analysis can reveal not only the identities of interacting proteins, but also cross-linked amino acid residues. The latter provides direct molecular evidence describing the physical contacts between and within proteins (17). This information can be used for computational modeling to establish structural topologies of proteins and protein complexes (1722), as well as for generating experimentally derived protein interaction network topology maps (23, 24). Thus, cross-linking mass spectrometry (XL-MS) strategies represent a powerful and emergent technology that possesses unparalleled capabilities for studying PPIs.Despite their great potential, current XL-MS studies that have aimed to identify cross-linked peptides have been mostly limited to in vitro cross-linking experiments, with few successfully identifying protein interaction interfaces in living cells (24, 25). This is largely because XL-MS studies remain challenging due to the inherent difficulty in the effective MS detection and accurate identification of cross-linked peptides, as well as in unambiguous assignment of cross-linked residues. In general, cross-linked products are heterogeneous and low in abundance relative to non-cross-linked products. In addition, their MS fragmentation is too complex to be interpreted using conventional database searching tools (17, 26). It is noted that almost all of the current in vivo PPI studies utilize formaldehyde cross-linking because of its membrane permeability and fast kinetics (1016). However, in comparison to the most commonly used amine reactive NHS ester cross-linkers, identification of formaldehyde cross-linked peptides is even more challenging because of its promiscuous nonspecific reactivity and extremely short spacer length (27). Therefore, further developments in reagents and methods are urgently needed to enable simple MS detection and effective identification of in vivo cross-linked products, and thus allow the mapping of authentic protein contact sites as established in cells, especially for protein complexes.Various efforts have been made to address the limitations of XL-MS studies, resulting in new developments in bioinformatics tools for improved data interpretation (2832) and new designs of cross-linking reagents for enhanced MS analysis of cross-linked peptides (24, 3339). Among these approaches, the development of new cross-linking reagents holds great promise for mapping PPIs on the systems level. One class of cross-linking reagents containing an enrichment handle have been shown to allow selective isolation of cross-linked products from complex mixtures, boosting their detectability by MS (3335, 4042). A second class of cross-linkers containing MS-cleavable bonds have proven to be effective in facilitating the unambiguous identification of cross-linked peptides (3639, 43, 44), as the resulting cross-linked products can be identified based on their characteristic and simplified fragmentation behavior during MS analysis. Therefore, an ideal cross-linking reagent would possess the combined features of both classes of cross-linkers. To advance the study of in vivo PPIs, we have developed a new XL-MS platform based on a novel membrane-permeable, enrichable, and MS-cleavable cross-linker, Azide-A-DSBSO (azide-tagged, acid-cleavable disuccinimidyl bis-sulfoxide), and multistage tandem mass spectrometry (MSn). This new XL-MS strategy has been successfully employed to map in vivo PPIs from mammalian cells at both the proteome scale and the targeted protein complex level.  相似文献   

6.
Knowledge of elaborate structures of protein complexes is fundamental for understanding their functions and regulations. Although cross-linking coupled with mass spectrometry (MS) has been presented as a feasible strategy for structural elucidation of large multisubunit protein complexes, this method has proven challenging because of technical difficulties in unambiguous identification of cross-linked peptides and determination of cross-linked sites by MS analysis. In this work, we developed a novel cross-linking strategy using a newly designed MS-cleavable cross-linker, disuccinimidyl sulfoxide (DSSO). DSSO contains two symmetric collision-induced dissociation (CID)-cleavable sites that allow effective identification of DSSO-cross-linked peptides based on their distinct fragmentation patterns unique to cross-linking types (i.e. interlink, intralink, and dead end). The CID-induced separation of interlinked peptides in MS/MS permits MS3 analysis of single peptide chain fragment ions with defined modifications (due to DSSO remnants) for easy interpretation and unambiguous identification using existing database searching tools. Integration of data analyses from three generated data sets (MS, MS/MS, and MS3) allows high confidence identification of DSSO cross-linked peptides. The efficacy of the newly developed DSSO-based cross-linking strategy was demonstrated using model peptides and proteins. In addition, this method was successfully used for structural characterization of the yeast 20 S proteasome complex. In total, 13 non-redundant interlinked peptides of the 20 S proteasome were identified, representing the first application of an MS-cleavable cross-linker for the characterization of a multisubunit protein complex. Given its effectiveness and simplicity, this cross-linking strategy can find a broad range of applications in elucidating the structural topology of proteins and protein complexes.Proteins form stable and dynamic multisubunit complexes under different physiological conditions to maintain cell viability and normal cell homeostasis. Detailed knowledge of protein interactions and protein complex structures is fundamental to understanding how individual proteins function within a complex and how the complex functions as a whole. However, structural elucidation of large multisubunit protein complexes has been difficult because of a lack of technologies that can effectively handle their dynamic and heterogeneous nature. Traditional methods such as nuclear magnetic resonance (NMR) analysis and x-ray crystallography can yield detailed information on protein structures; however, NMR spectroscopy requires large quantities of pure protein in a specific solvent, whereas x-ray crystallography is often limited by the crystallization process.In recent years, chemical cross-linking coupled with mass spectrometry (MS) has become a powerful method for studying protein interactions (13). Chemical cross-linking stabilizes protein interactions through the formation of covalent bonds and allows the detection of stable, weak, and/or transient protein-protein interactions in native cells or tissues (49). In addition to capturing protein interacting partners, many studies have shown that chemical cross-linking can yield low resolution structural information about the constraints within a molecule (2, 3, 10) or protein complex (1113). The application of chemical cross-linking, enzymatic digestion, and subsequent mass spectrometric and computational analyses for the elucidation of three-dimensional protein structures offers distinct advantages over traditional methods because of its speed, sensitivity, and versatility. Identification of cross-linked peptides provides distance constraints that aid in constructing the structural topology of proteins and/or protein complexes. Although this approach has been successful, effective detection and accurate identification of cross-linked peptides as well as unambiguous assignment of cross-linked sites remain extremely challenging due to their low abundance and complicated fragmentation behavior in MS analysis (2, 3, 10, 14). Therefore, new reagents and methods are urgently needed to allow unambiguous identification of cross-linked products and to improve the speed and accuracy of data analysis to facilitate its application in structural elucidation of large protein complexes.A number of approaches have been developed to facilitate MS detection of low abundance cross-linked peptides from complex mixtures. These include selective enrichment using affinity purification with biotinylated cross-linkers (1517) and click chemistry with alkyne-tagged (18) or azide-tagged (19, 20) cross-linkers. In addition, Staudinger ligation has recently been shown to be effective for selective enrichment of azide-tagged cross-linked peptides (21). Apart from enrichment, detection of cross-linked peptides can be achieved by isotope-labeled (2224), fluorescently labeled (25), and mass tag-labeled cross-linking reagents (16, 26). These methods can identify cross-linked peptides with MS analysis, but interpretation of the data generated from interlinked peptides (two peptides connected with the cross-link) by automated database searching remains difficult. Several bioinformatics tools have thus been developed to interpret MS/MS data and determine interlinked peptide sequences from complex mixtures (12, 14, 2732). Although promising, further developments are still needed to make such data analyses as robust and reliable as analyzing MS/MS data of single peptide sequences using existing database searching tools (e.g. Protein Prospector, Mascot, or SEQUEST).Various types of cleavable cross-linkers with distinct chemical properties have been developed to facilitate MS identification and characterization of cross-linked peptides. These include UV photocleavable (33), chemical cleavable (19), isotopically coded cleavable (24), and MS-cleavable reagents (16, 26, 3438). MS-cleavable cross-linkers have received considerable attention because the resulting cross-linked products can be identified based on their characteristic fragmentation behavior observed during MS analysis. Gas-phase cleavage sites result in the detection of a “reporter” ion (26), single peptide chain fragment ions (3538), or both reporter and fragment ions (16, 34). In each case, further structural characterization of the peptide product ions generated during the cleavage reaction can be accomplished by subsequent MSn1 analysis. Among these linkers, the “fixed charge” sulfonium ion-containing cross-linker developed by Lu et al. (37) appears to be the most attractive as it allows specific and selective fragmentation of cross-linked peptides regardless of their charge and amino acid composition based on their studies with model peptides.Despite the availability of multiple types of cleavable cross-linkers, most of the applications have been limited to the study of model peptides and single proteins. Additionally, complicated synthesis and fragmentation patterns have impeded most of the known MS-cleavable cross-linkers from wide adaptation by the community. Here we describe the design and characterization of a novel and simple MS-cleavable cross-linker, DSSO, and its application to model peptides and proteins and the yeast 20 S proteasome complex. In combination with new software developed for data integration, we were able to identify DSSO-cross-linked peptides from complex peptide mixtures with speed and accuracy. Given its effectiveness and simplicity, we anticipate a broader application of this MS-cleavable cross-linker in the study of structural topology of other protein complexes using cross-linking and mass spectrometry.  相似文献   

7.
The combination of chemical cross-linking and mass spectrometry has recently been shown to constitute a powerful tool for studying protein–protein interactions and elucidating the structure of large protein complexes. However, computational methods for interpreting the complex MS/MS spectra from linked peptides are still in their infancy, making the high-throughput application of this approach largely impractical. Because of the lack of large annotated datasets, most current approaches do not capture the specific fragmentation patterns of linked peptides and therefore are not optimal for the identification of cross-linked peptides. Here we propose a generic approach to address this problem and demonstrate it using disulfide-bridged peptide libraries to (i) efficiently generate large mass spectral reference data for linked peptides at a low cost and (ii) automatically train an algorithm that can efficiently and accurately identify linked peptides from MS/MS spectra. We show that using this approach we were able to identify thousands of MS/MS spectra from disulfide-bridged peptides through comparison with proteome-scale sequence databases and significantly improve the sensitivity of cross-linked peptide identification. This allowed us to identify 60% more direct pairwise interactions between the protein subunits in the 20S proteasome complex than existing tools on cross-linking studies of the proteasome complexes. The basic framework of this approach and the MS/MS reference dataset generated should be valuable resources for the future development of new tools for the identification of linked peptides.The study of protein–protein interactions is crucial to understanding how cellular systems function because proteins act in concert through a highly organized set of interactions. Most cellular processes are carried out by large macromolecular assemblies and regulated through complex cascades of transient protein–protein interactions (1). In the past several years numerous high-throughput studies have pioneered the systematic characterization of protein–protein interactions in model organisms (24). Such studies mainly utilize two techniques: the yeast two-hybrid system, which aims at identifying binary interactions (5), and affinity purification combined with tandem mass spectrometry analysis for the identification of multi-protein assemblies (68). Together these led to a rapid expansion of known protein–protein interactions in human and other model organisms. Patche and Aloy recently estimated that there are more than one million interactions catalogued to date (9).But despite rapid progress, most current techniques allow one to determine only whether proteins interact, which is only the first step toward understanding how proteins interact. A more complete picture comes from characterizing the three-dimensional structures of protein complexes, which provide mechanistic insights that govern how interactions occur and the high specificity observed inside the cell. Traditionally the gold-standard methods used to solve protein structures are x-ray crystallography and NMR, and there have been several efforts similar to structural genomics (10) aiming to comprehensively solve the structures of protein complexes (11, 12). Although there has been accelerated growth of structures for protein monomers in the Protein Data Bank in recent years (11), the growth of structures for protein complexes has remained relatively small (9). Many factors, including their large size, transient nature, and dynamics of interactions, have prevented many complexes from being solved via traditional approaches in structural biology. Thus, the development of complementary analytical techniques with which to probe the structure of large protein complexes continues to evolve (1318).Recent developments have advanced the analysis of protein structures and interaction by combining cross-linking and tandem mass spectrometry (17, 1924). The basic idea behind this technique is to capture and identify pairs of amino acid residues that are spatially close to each other. When these linked pairs of residues are from the same protein (intraprotein cross-links), they provide distance constraints that help one infer the possible conformations of protein structures. Conversely, when pairs of residues come from different proteins (interprotein cross-links), they provide information about how proteins interact with one another. Although cross-linking strategies date back almost a decade (25, 26), difficulty in analyzing the complex MS/MS spectrum generated from linked peptides made this approach challenging, and therefore it was not widely used. With recent advances in mass spectrometry instrumentation, there has been renewed interest in employing this strategy to determine protein structures and identify protein–protein interactions. However, most studies thus far have been focused on purified protein complexes. With today''s mass spectrometers being capable of analyzing tens of thousands of spectra in a single experiment, it is now potentially feasible to extend this approach to the analysis of complex biological samples. Researchers have tried to realize this goal using both experimental and computational approaches. Indeed, a plethora of chemical cross-linking reagents are now available for stabilizing these complexes, and some are designed to allow for easier peptide identification when employed in concert with MS analysis (20, 27, 28). There have also been several recent efforts to develop computational methods for the automatic identification of linked peptides from MS/MS spectra (2936). However, because of the lack of large annotated training data, most approaches to date either borrow fragmentation models learned from unlinked, linear peptides or learn the fragmentation statistics from training data of limited size (30, 37), which might not generalize well across different samples. In some cases it is possible to generate relatively large training data, but it is often very labor intensive and involves hundreds of separate LC-MS/MS runs (36). Here, employing disulfide-bridged peptides as an example, we propose a novel method that uses a combinatorial peptide library to (a) efficiently generate a large mass spectral reference dataset for linked peptides and (b) use these data to automatically train our new algorithm, MXDB, which can efficiently and accurately identify linked peptides from MS/MS spectra.  相似文献   

8.
We report a novel strategy for studying synaptic pathology by concurrently measuring levels of four SNARE complex proteins from individual brain tissue samples. This method combines affinity purification and mass spectrometry and can be applied directly for studies of SNARE complex proteins in multiple species or modified to target other key elements in neuronal function. We use the technique to demonstrate altered levels of presynaptic proteins in Alzheimer disease patients and prion-infected mice.One prominent pathological feature of neuropsychiatric disorders such as Alzheimer disease (AD)1 is severe synaptic loss (13). Previous reports of AD patients have shown that presynaptic dysfunction might occur early in the disease process (1, 4). Cortical synapse pathology has also been shown to correlate to the severity of dementia more closely than other pathological hallmarks of AD such as plaques and neurofibrillary tangles (5, 6). The SNARE proteins are essential components for the regulation of neurotransmitter exocytosis at the presynaptic site (7). Animal models suggest that changed expression or modification of SNARE complex proteins (synaptosomal-associated protein 25 (SNAP-25), syntaxin-1, and vesicle-associated membrane protein (VAMP)) alters synaptic function and is an interesting target for the development of therapeutics for neuropsychiatric illness (8, 9). The constituents of the SNARE complex are either localized in synaptic vesicles (VAMPs) or anchored at the presynaptic plasma membrane (SNAP-25 and syntaxin). The SNARE proteins are tightly assembled, and subsequent neurotransmitter release of the complex is quickly dissociated by N-ethylmaleimide-sensitive factor (7, 1012). Because they are both strongly associated into complexes and membrane associated, the SNARE proteins are difficult to analyze via mass spectrometry, which is incompatible with most detergents necessary for the solubilization of proteins. Each SNARE complex protein exists in several isoforms that are differently distributed within the central nervous system (1318). Post-translational modifications and truncated variants of the SNARE proteins make investigation of the protein expression even more complicated.In this study we developed an approach for the characterization and concurrent quantification of SNARE complex proteins that combines affinity purification by immunoprecipitation and mass spectrometry (IP-MS). We used precipitation with monoclonal antibodies against SNAP-25 to target the SNARE complex proteins and nanoflow LC–tandem mass spectrometry (LC-MS/MS) to characterize the co-immunoprecipitated interaction partners. Selected reaction monitoring (SRM) on a triple quadrupole mass spectrometer coupled to a microflow LC system was used for quantification of the SNARE proteins. To demonstrate the usability of the IP-MS method, we performed a comparison of SNARE complex protein levels in brain tissue from AD patients and age-matched controls, as well as a study of SNARE complex protein levels in brain tissue from prion-infected mice.  相似文献   

9.
Comprehensive proteomic profiling of biological specimens usually requires multidimensional chromatographic peptide fractionation prior to mass spectrometry. However, this approach can suffer from poor reproducibility because of the lack of standardization and automation of the entire workflow, thus compromising performance of quantitative proteomic investigations. To address these variables we developed an online peptide fractionation system comprising a multiphasic liquid chromatography (LC) chip that integrates reversed phase and strong cation exchange chromatography upstream of the mass spectrometer (MS). We showed superiority of this system for standardizing discovery and targeted proteomic workflows using cancer cell lysates and nondepleted human plasma. Five-step multiphase chip LC MS/MS acquisition showed clear advantages over analyses of unfractionated samples by identifying more peptides, consuming less sample and often improving the lower limits of quantitation, all in highly reproducible, automated, online configuration. We further showed that multiphase chip LC fractionation provided a facile means to detect many N- and C-terminal peptides (including acetylated N terminus) that are challenging to identify in complex tryptic peptide matrices because of less favorable ionization characteristics. Given as much as 95% of peptides were detected in only a single salt fraction from cell lysates we exploited this high reproducibility and coupled it with multiple reaction monitoring on a high-resolution MS instrument (MRM-HR). This approach increased target analyte peak area and improved lower limits of quantitation without negatively influencing variance or bias. Further, we showed a strategy to use multiphase LC chip fractionation LC-MS/MS for ion library generation to integrate with SWATHTM data-independent acquisition quantitative workflows. All MS data are available via ProteomeXchange with identifier PXD001464.Mass spectrometry based proteomic quantitation is an essential technique used for contemporary, integrative biological studies. Whether used in discovery experiments or for targeted biomarker applications, quantitative proteomic studies require high reproducibility at many levels. It requires reproducible run-to-run peptide detection, reproducible peptide quantitation, reproducible depth of proteome coverage, and ideally, a high degree of cross-laboratory analytical reproducibility. Mass spectrometry centered proteomics has evolved steadily over the past decade, now mature enough to derive extensive draft maps of the human proteome (1, 2). Nonetheless, a key requirement yet to be realized is to ensure that quantitative proteomics can be carried out in a timely manner while satisfying the aforementioned challenges associated with reproducibility. This is especially important for recent developments using data independent MS quantitation and multiple reaction monitoring on high-resolution MS (MRM-HR)1 as they are both highly dependent on LC peptide retention time reproducibility and precursor detectability, while attempting to maximize proteome coverage (3). Strategies usually employed to increase the depth of proteome coverage utilize various sample fractionation methods including gel-based separation, affinity enrichment or depletion, protein or peptide chemical modification-based enrichment, and various peptide chromatography methods, particularly ion exchange chromatography (410). In comparison to an unfractionated “naive” sample, the trade-off in using these enrichments/fractionation approaches are higher risk of sample losses, introduction of undesired chemical modifications (e.g. oxidation, deamidation, N-terminal lactam formation), and the potential for result skewing and bias, as well as numerous time and human resources required to perform the sample preparation tasks. Online-coupled approaches aim to minimize those risks and address resource constraints. A widely practiced example of the benefits of online sample fractionation has been the decade long use of combining strong cation exchange chromatography (SCX) with C18 reversed-phase (RP) for peptide fractionation (known as MudPIT – multidimensional protein identification technology), where SCX and RP is performed under the same buffer conditions and the SCX elution performed with volatile organic cations compatible with reversed phase separation (11). This approach greatly increases analyte detection while avoiding sample handling losses. The MudPIT approach has been widely used for discovery proteomics (1214), and we have previously shown that multiphasic separations also have utility for targeted proteomics when configured for selected reaction monitoring MS (SRM-MS). We showed substantial advantages of MudPIT-SRM-MS with reduced ion suppression, increased peak areas and lower limits of detection (LLOD) compared with conventional RP-SRM-MS (15).To improve the reproducibility of proteomic workflows, increase throughput and minimize sample loss, numerous microfluidic devices have been developed and integrated for proteomic applications (16, 17). These devices can broadly be classified into two groups: (1) microfluidic chips for peptide separation (1825) and; (2) proteome reactors that combine enzymatic processing with peptide based fractionation (2630). Because of the small dimension of these devices, they are readily able to integrate into nanoLC workflows. Various applications have been described including increasing proteome coverage (22, 27, 28) and targeting of phosphopeptides (24, 31, 32), glycopeptides and released glycans (29, 33, 34).In this work, we set out to take advantage of the benefits of multiphasic peptide separations and address the reproducibility needs required for high-throughput comparative proteomics using a variety of workflows. We integrated a multiphasic SCX and RP column in a “plug-and-play” microfluidic chip format for online fractionation, eliminating the need for users to make minimal dead volume connections between traps and columns. We show the flexibility of this format to provide robust peptide separation and reproducibility using conventional and topical mass spectrometry workflows. This was undertaken by coupling the multiphase liquid chromatography (LC) chip to a fast scanning Q-ToF mass spectrometer for data dependent MS/MS, data independent MS (SWATH) and for targeted proteomics using MRM-HR, showing clear advantages for repeatable analyses compared with conventional proteomic workflows.  相似文献   

10.
Based on conventional data-dependent acquisition strategy of shotgun proteomics, we present a new workflow DeMix, which significantly increases the efficiency of peptide identification for in-depth shotgun analysis of complex proteomes. Capitalizing on the high resolution and mass accuracy of Orbitrap-based tandem mass spectrometry, we developed a simple deconvolution method of “cloning” chimeric tandem spectra for cofragmented peptides. Additional to a database search, a simple rescoring scheme utilizes mass accuracy and converts the unwanted cofragmenting events into a surprising advantage of multiplexing. With the combination of cloning and rescoring, we obtained on average nine peptide-spectrum matches per second on a Q-Exactive workbench, whereas the actual MS/MS acquisition rate was close to seven spectra per second. This efficiency boost to 1.24 identified peptides per MS/MS spectrum enabled analysis of over 5000 human proteins in single-dimensional LC-MS/MS shotgun experiments with an only two-hour gradient. These findings suggest a change in the dominant “one MS/MS spectrum - one peptide” paradigm for data acquisition and analysis in shotgun data-dependent proteomics. DeMix also demonstrated higher robustness than conventional approaches in terms of lower variation among the results of consecutive LC-MS/MS runs.Shotgun proteomics analysis based on a combination of high performance liquid chromatography and tandem mass spectrometry (MS/MS) (1) has achieved remarkable speed and efficiency (27). In a single four-hour long high performance liquid chromatography-MS/MS run, over 40,000 peptides and 5000 proteins can be identified using a high-resolution Orbitrap mass spectrometer with data-dependent acquisition (DDA)1 (2, 3). However, in a typical LC-MS analysis of unfractionated human cell lysate, over 100,000 individual peptide isotopic patterns can be detected (4), which corresponds to simultaneous elution of hundreds of peptides. With this complexity, a mass spectrometer needs to achieve ≥25 Hz MS/MS acquisition rate to fully sample all the detectable peptides, and ≥17 Hz to cover reasonably abundant ones (4). Although this acquisition rate is reachable by modern time-of-flight (TOF) instruments, the reported DDA identification results do not encompass all expected peptides. Recently, the next-generation Orbitrap instrument, working at 20 Hz MS/MS acquisition rate, demonstrated nearly full profiling of yeast proteome using an 80 min gradient, which opened the way for comprehensive analysis of human proteome in a time efficient manner (5).During the high performance liquid chromatography-MS/MS DDA analysis of complex samples, high density of co-eluting peptides results in a high probability for two or more peptides to overlap within an MS/MS isolation window. With the commonly used ±1.0–2.0 Th isolation windows, most MS/MS spectra are chimeric (4, 810), with cofragmenting precursors being naturally multiplexed. However, as has been discussed previously (9, 10), the cofragmentation events are currently ignored in most of the conventional analysis workflows. According to the prevailing assumption of “one MS/MS spectrum–one peptide,” chimeric MS/MS spectra are generally unwelcome in DDA, because the product ions from different precursors may interfere with the assignment of MS/MS fragment identities, increasing the rate of false discoveries in database search (8, 9). In some studies, the precursor isolation width was set as narrow as ±0.35 Th to prevent unwanted ions from being coselected, fragmented or detected (4, 5).On the contrary, multiplexing by cofragmentation is considered to be one of the solid advantages in data-independent acquisition (DIA) (1013). In several commonly used DIA methods, the precursor ion selection windows are set much wider than in DDA: from 25 Th as in SWATH (12), to extremely broad range as in AIF (13). In order to use the benefit of MS/MS multiplexing in DDA, several approaches have been proposed to deconvolute chimeric MS/MS spectra. In “alternative peptide identification” method implemented in Percolator (14), a machine learning algorithm reranks and rescores peptide-spectrum matches (PSMs) obtained from one or more MS/MS search engines. But the deconvolution in Percolator is limited to cofragmented peptides with masses differing from the target peptide by the tolerance of the database search, which can be as narrow as a few ppm. The “active demultiplexing” method proposed by Ledvina et al. (15) actively separates MS/MS data from several precursors using masses of complementary fragments. However, higher-energy collisional dissociation often produces MS/MS spectra with too few complementary pairs for reliable peptide identification. The “MixDB” method introduces a sophisticated new search engine, also with a machine learning algorithm (9). And the “second peptide identification” method implemented in Andromeda/MaxQuant workflow (16) submits the same dataset to the search engine several times based on the list of chromatographic peptide features, subtracting assigned MS/MS peaks after each identification round. This approach is similar to the ProbIDTree search engine that also performed iterative identification while removing assigned peaks after each round of identification (17).One important factor for spectral deconvolution that has not been fully utilized in most conventional workflows is the excellent mass accuracy achievable with modern high-resolution mass spectrometry (18). An Orbitrap Fourier-transform mass spectrometer can provide mass accuracy in the range of hundreds of ppb (parts per billion) for mass peaks with high signal-to-noise (S/N) ratio (19). However, the mass error of peaks with lower S/N ratios can be significantly higher and exceed 1 ppm. Despite this dependence of the mass accuracy from the S/N level, most MS and MS/MS search engines only allow users to set hard cut-off values for the mass error tolerances. Moreover, some search engines do not provide the option of choosing a relative error tolerance for MS/MS fragments. Such negligent treatment of mass accuracy reduces the analytical power of high accuracy experiments (18).Identification results coming from different MS/MS search engines are sometimes not consistent because of different statistical assumptions used in scoring PSMs. Introduction of tools integrating the results of different search engines (14, 20, 21) makes the data interpretation even more complex and opaque for the user. The opposite trend—simplification of MS/MS data interpretation—is therefore a welcome development. For example, an extremely straightforward algorithm recently proposed by Wenger et al. (22) demonstrated a surprisingly high performance in peptide identification, even though it is only marginally more complex than simply counting the number of matches of theoretical fragment peaks in high resolution MS/MS, without any a priori statistical assumption.In order to take advantage of natural multiplexing of MS/MS spectra in DDA, as well as properly utilize high accuracy of Orbitrap-based mass spectrometry, we developed a simple and robust data analysis workflow DeMix. It is presented in Fig. 1 as an expansion of the conventional workflow. Principles of some of the processes used by the workflow are borrowed from other approaches, including the custom-made mass peak centroiding (20), chromatographic feature detection (19, 20), and two-pass database search with the first limited pass to provide a “software lock mass” for mass scale recalibration (23).Open in a separate windowFig. 1.An overview of the DeMix workflow that expands the conventional workflow, shown by the dashed line. Processes are colored in purple for TOPP, red for search engine (Morpheus/Mascot/MS-GF+), and blue for in-house programs.In DeMix workflow, the deconvolution of chimeric MS/MS spectra consists of simply “cloning” an MS/MS spectrum if a potential cofragmented peptide is detected. The list of candidate peptide precursors is generated from chromatographic feature detection, as in the MaxQuant/Andromeda workflow (16, 19), but using The OpenMS Proteomics Pipeline (TOPP) (20, 24). During the cloning, the precursor is replaced by the new candidate, but no changes in the MS/MS fragment list are made, and therefore the cloned MS/MS spectra remain chimeric. Processing such spectra requires a search engine tolerant to the presence of unassigned peaks, as such peaks are always expected when multiple precursors cofragment. Thus, we chose Morpheus (22) as a search engine. Based on the original search algorithm, we implement a reformed scoring scheme: Morpheus-AS (advanced scoring). It inherits all the basic principles from Morpheus but deeper utilizes the high mass accuracy of the data. This kind of database search removes the necessity of spectral processing for physical separation of MS/MS data into multiple subspectra (15), or consecutive subtraction of peaks (16, 17).Despite the fact that DeMix workflow is largely a combination of known approaches, it provides remarkable improvement compared with the state-of-the-art. On our Orbitrap Q-Exactive workbench, testing on a benchmark dataset of two-hour single-dimension LC-MS/MS experiments from HeLa cell lysate, we identified on average 1.24 peptide per MS/MS spectrum, breaking the “one MS/MS spectrum–one peptide” paradigm on the level of whole data set. At 1% false discovery rate (FDR), we obtained on average nine PSMs per second (at the actual acquisition rate of ca. seven MS/MS spectra per second), and detected 40 human proteins per minute.  相似文献   

11.
Laserspray ionization (LSI) mass spectrometry (MS) allows, for the first time, the analysis of proteins directly from tissue using high performance atmospheric pressure ionization mass spectrometers. Several abundant and numerous lower abundant protein ions with molecular masses up to ∼20,000 Da were detected as highly charged ions from delipified mouse brain tissue mounted on a common microscope slide and coated with 2,5-dihydroxyacetophenone as matrix. The ability of LSI to produce multiply charged ions by laser ablation at atmospheric pressure allowed protein analysis at 100,000 mass resolution on an Orbitrap Exactive Fourier transform mass spectrometer. A single acquisition was sufficient to identify the myelin basic protein N-terminal fragment directly from tissue using electron transfer dissociation on a linear trap quadrupole (LTQ) Velos. The high mass resolution and mass accuracy, also obtained with a single acquisition, are useful in determining protein molecular weights and from the electron transfer dissociation data in confirming database-generated sequences. Furthermore, microscopy images of the ablated areas show matrix ablation of ∼15 μm-diameter spots in this study. The results suggest that LSI-MS at atmospheric pressure potentially combines speed of analysis and imaging capability common to matrix-assisted laser desorption/ionization and soft ionization, multiple charging, improved fragmentation, and cross-section analysis common to electrospray ionization.Tissue imaging by mass spectrometry (MS) is proving useful in areas such as detecting tumor margins, determining sites of high drug uptake, and mapping signaling molecules in brain tissue (18). Imaging using secondary ion mass spectrometry is well established but is only marginally useful with intact molecular mass measurements from biological tissue (911). Matrix-assisted laser desorption/ionization (MALDI)-MS operating under vacuum conditions has been used for tissue imaging with success, especially for abundant components such as membrane lipids, drug metabolites, and proteins (1214). Spatial resolution of ∼20 μm has been achieved (15), and the MALDI-MS method has been applied in an attempt to shed light on Parkinson disease (16, 17), muscular dystrophy (18), obesity, and cancer (12, 19).Unfortunately, there are disadvantages in using vacuum-based MS for tissue imaging in relation to analysis of unadulterated tissue. Also, the mass spectrometers used in these studies frequently have much lower mass resolution and mass accuracy than are available with atmospheric pressure ionization (API)1 instruments and are not as widely available. Because the vacuum ionization methods produce singly charged ions, mass-selected fragmentation methods provide only limited information, especially for proteins. In addition, no advanced fragmentation such as electron transfer dissociation (ETD) (2022) is available for confident protein confirmation or identification. Atmospheric pressure (AP) MALDI can be coupled to high performance mass spectrometers but suffers from sensitivity issues for tissue imaging where high spatial resolution is desired (23). AP MALDI also primarily produces singly charged ions (24, 25). Thus, mass and cross-section analysis of intact proteins has yet to be accomplished using AP MALDI because of intrinsic mass range limitations of API instruments, which frequently have a mass-to-charge (m/z) limit of <4000. Thus, new improved methods of mass-specific tissue imaging, especially at AP, are needed.The potential of laserspray ionization (LSI) (Scheme 1) (2633) for protein tissue analysis is reported here. LSI has advantages relative to other MS-based methods, including speed of analysis, laser ablation of small volumes, more relevant AP conditions, extended mass range and improved fragmentation through multiple charging, and the ability to obtain cross-section data for proteins on appropriate instrumentation. The applicability of LSI for high mass compounds on high performance API mass spectrometers (Orbitrap Exactive and SYNAPT G2) has been demonstrated producing ESI-like multiply protonated ions (2628). The first experiments showing sequence analysis by ETD using the LSI method were successfully carried out on a Thermo Fisher Scientific (San Jose, CA) LTQ-ETD mass spectrometer (26). Nearly complete sequence coverage was obtained for ubiquitin, an important regulatory protein. Applying ETD fragmentation to LSI-MS analyses potentially provides a new method for studying biological processes, including the mapping of phosphorylation, glycosylation, and ubiquitination sites from intact proteins and directly from tissue.Open in a separate windowScheme 1.Overview of LSI-MS operated in transmission geometry.Furthermore, unlike ESI and related ESI-based methods such as desorption-ESI (34), the LSI method has been shown to allow analysis of lipids in tissue from ablated areas <80 μm (30). In comparison with literature reports for AP MALDI at the same stage of development (35), LSI is more than an order of magnitude more sensitive and is capable of analyzing proteins on high resolution mass spectrometers as was demonstrated by obtaining full-acquisition mass spectra at 100,000 mass resolution (FWHH, m/z 200) after application of only 20 fmol of bovine pancreas insulin in the matrix 2,5-dihydroxyacetophenone (2,5-DHAP) onto a glass microscope slide (33). The analysis speed of LSI was demonstrated by obtaining mass spectra of five samples in 8 s (32). Here, we show the utility of LSI for intact peptide and protein analyses directly from mouse brain tissue. The ability to obtain a protein mass spectrum directly from mouse brain tissue in a single laser shot at 100,000 mass resolution and with ETD fragmentation is demonstrated.  相似文献   

12.
It remains extraordinarily challenging to elucidate endogenous protein-protein interactions and proximities within the cellular milieu. The dynamic nature and the large range of affinities of these interactions augment the difficulty of this undertaking. Among the most useful tools for extracting such information are those based on affinity capture of target bait proteins in combination with mass spectrometric readout of the co-isolated species. Although highly enabling, the utility of affinity-based methods is generally limited by difficulties in distinguishing specific from nonspecific interactors, preserving and isolating all unique interactions including those that are weak, transient, or rapidly exchanging, and differentiating proximal interactions from those that are more distal. Here, we have devised and optimized a set of methods to address these challenges. The resulting pipeline involves flash-freezing cells in liquid nitrogen to preserve the cellular environment at the moment of freezing; cryomilling to fracture the frozen cells into intact micron chunks to allow for rapid access of a chemical reagent and to stabilize the intact endogenous subcellular assemblies and interactors upon thawing; and utilizing the high reactivity of glutaraldehyde to achieve sufficiently rapid stabilization at low temperatures to preserve native cellular interactions. In the course of this work, we determined that relatively low molar ratios of glutaraldehyde to reactive amines within the cellular milieu were sufficient to preserve even labile and transient interactions. This mild treatment enables efficient and rapid affinity capture of the protein assemblies of interest under nondenaturing conditions, followed by bottom-up MS to identify and quantify the protein constituents. For convenience, we have termed this approach Stabilized Affinity Capture Mass Spectrometry. Here, we demonstrate that Stabilized Affinity Capture Mass Spectrometry allows us to stabilize and elucidate local, distant, and transient protein interactions within complex cellular milieux, many of which are not observed in the absence of chemical stabilization.Insights into many cellular processes require detailed information about interactions between the participating proteins. However, the analysis of such interactions can be challenging because of the often-diverse physicochemical properties and the abundances of the constituent proteins, as well as the sometimes wide range of affinities and complex dynamics of the interactions. One of the key challenges has been acquiring information concerning transient, low affinity interactions in highly complex cellular milieux (3, 4).Methods that allow elucidation of such information include co-localization microscopy (5), fluorescence protein Förster resonance energy transfer (4), immunoelectron microscopy (5), yeast two-hybrid (6), and affinity capture (7, 8). Among these, affinity capture (AC)1 has the unique potential to detect all specific in vivo interactions simultaneously, including those that interact both directly and indirectly. In recent times, the efficacy of such affinity isolation experiments has been greatly enhanced through the use of sensitive modern mass spectrometric protein identification techniques (9). Nevertheless, AC suffers from several shortcomings. These include the problem of 1) distinguishing specific from nonspecific interactors (10, 11); 2) preserving and isolating all unique interactions including those that are weak and/or transient, as well as those that exchange rapidly (10, 12, 13); and 3) differentiating proximal from more distant interactions (14).We describe here an approach to address these issues, which makes use of chemical stabilization of protein assemblies in the complex cellular milieu prior to AC. Chemical stabilization is an emerging technique for stabilizing and elucidating protein associations both in vitro (1520) and in vivo (3, 12, 14, 2129), with mass spectrometric (MS) readout of the AC proteins and their connectivities. Such chemical stabilization methods are indeed well-established and are often used in electron microscopy for preserving complexes and subcellular structures both in the cellular milieu (3) and in purified complexes (30, 31), wherein the most reliable, stable, and established stabilization reagents is glutaraldehyde. Recently, glutaraldehyde has been applied in the “GraFix” protocol in which purified protein complexes are subjected to centrifugation through a density gradient that also contains a gradient of glutaraldehyde (30, 31), allowing for optimal stabilization of authentic complexes and minimization of nonspecific associations and aggregation. GraFix has also been combined with mass spectrometry on purified complexes bound to EM grids to obtain a compositional analysis of the complexes (32), thereby raising the possibility that glutaraldehyde can be successfully utilized in conjunction with AC in complex cellular milieux directly.In this work, we present a robust pipeline for determining specific protein-protein interactions and proximities from cellular milieux. The first steps of the pipeline involve the well-established techniques of flash freezing the cells of interest in liquid nitrogen and cryomilling, which have been known for over a decade (33, 34) to preserve the cellular environment, as well as having shown outstanding performance when used in analysis of macromolecular interactions in yeast (3539), bacterial (40, 41), trypanosome (42), mouse (43), and human (4447) systems. The resulting frozen powder, composed of intact micron chunks of cells that have great surface area and outstanding solvent accessibility, is well suited for rapid low temperature chemical stabilization using glutaraldehyde. We selected glutaraldehyde for our procedure based on the fact that it is a very reactive stabilizing reagent, even at lower temperatures, and because it has already been shown to stabilize enzymes in their functional state (4850). We employed highly efficient, rapid, single stage affinity capture (36, 51) for isolation and bottom-up MS for analysis of the macromolecular assemblies of interest (5254). For convenience, we have termed this approach Stabilized Affinity-Capture Mass Spectrometry (SAC-MS).  相似文献   

13.
A decoding algorithm is tested that mechanistically models the progressive alignments that arise as the mRNA moves past the rRNA tail during translation elongation. Each of these alignments provides an opportunity for hybridization between the single-stranded, -terminal nucleotides of the 16S rRNA and the spatially accessible window of mRNA sequence, from which a free energy value can be calculated. Using this algorithm we show that a periodic, energetic pattern of frequency 1/3 is revealed. This periodic signal exists in the majority of coding regions of eubacterial genes, but not in the non-coding regions encoding the 16S and 23S rRNAs. Signal analysis reveals that the population of coding regions of each bacterial species has a mean phase that is correlated in a statistically significant way with species () content. These results suggest that the periodic signal could function as a synchronization signal for the maintenance of reading frame and that codon usage provides a mechanism for manipulation of signal phase.[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32]  相似文献   

14.
Selected reaction monitoring mass spectrometry (SRM-MS) is playing an increasing role in quantitative proteomics and biomarker discovery studies as a method for high throughput candidate quantification and verification. Although SRM-MS offers advantages in sensitivity and quantification compared with other MS-based techniques, current SRM technologies are still challenged by detection and quantification of low abundance proteins (e.g. present at ∼10 ng/ml or lower levels in blood plasma). Here we report enhanced detection sensitivity and reproducibility for SRM-based targeted proteomics by coupling a nanospray ionization multicapillary inlet/dual electrodynamic ion funnel interface to a commercial triple quadrupole mass spectrometer. Because of the increased efficiency in ion transmission, significant enhancements in overall signal intensities and improved limits of detection were observed with the new interface compared with the original interface for SRM measurements of tryptic peptides from proteins spiked into non-depleted mouse plasma over a range of concentrations. Overall, average SRM peak intensities were increased by ∼70-fold. The average level of detection for peptides also improved by ∼10-fold with notably improved reproducibility of peptide measurements as indicated by the reduced coefficients of variance. The ability to detect proteins ranging from 40 to 80 ng/ml within mouse plasma was demonstrated for all spiked proteins without the application of front-end immunoaffinity depletion and fractionation. This significant improvement in detection sensitivity for low abundance proteins in complex matrices is expected to enhance a broad range of SRM-MS applications including targeted protein and metabolite validation.Although mass spectrometry (MS)-based proteomics is a promising high throughput technology for biomarker discovery and validation (15), only a handful of cancer biomarkers have been approved by the United States Food and Drug Administration for clinical use in the last decade (6, 7). Assuming that low abundance biomarkers do exist in the biofluids to be studied, the success of biomarker discovery efforts primarily depends on the sensitivity, accuracy, and robustness of the measurement technologies; the quality and size of patient cohorts and clinical samples and execution within the context of an overall difficult and expensive path to clinical application that encompasses discovery, verification, and validation stages (1, 5, 810). A multiplexed assay platform increasingly considered for biomarker verification is selected reaction monitoring (SRM)1 by tandem mass spectrometry using e.g. a triple quadrupole (QqQ) mass spectrometer to attain high throughput quantitative measurements of targeted proteins in complex matrices (1, 11, 12).SRM utilizes two stages of mass filtering by selecting a specific analyte ion of interest (precursor ion) in the first stage followed by a specific fragment ion derived from the precursor (fragment ion) filter in the second stage after collision-activated dissociation. Typically, several transitions (precursor/fragment ion pairs) are monitored for greater selectivity and confidence in a targeted peptide assay, and large numbers of peptides can be monitored during a single LC-MS/MS analysis. The two-stage mass selection by individual quadrupoles enables more rapid and continuous monitoring of specific ions derived from analytes of interest such as peptides and leads to significantly enhanced detection sensitivity and quantitative accuracy compared with broad (i.e. non-targeted) LC-MS or LC-MS/MS measurements (11, 12). Both the sensitivity and selectivity of SRM-MS make this technique well suited for the targeted detection and quantification of low abundance proteins in highly complex biofluids (1316). The precision and reproducibility of SRM-based measurements of proteins in plasma across different laboratories have recently been assessed (17).Despite its promise, present SRM measurements still do not provide sufficient sensitivity for reliable detection and quantification of low abundance proteins in biofluids (e.g. present in plasma at ∼10 ng/ml or lower levels) primarily because of factors related to high sample complexity and the large dynamic range of relative protein abundances (7, 18, 19). Given sufficient selectivity, the sensitivity achievable is generally related to the peptide MS and MS/MS signal intensities obtained. One of the key factors limiting peptide MS intensities is the significant ion losses encountered between the electrospray ionization (ESI) source and the interface to the mass spectrometer. In typical LC-ESI-MS interfaces, the mass spectrometer inlet (e.g. heated capillary followed by a skimmer) presently provides total ion utilization and ion transmission efficiencies on the order of ∼1% (20) due to a combination of limited ion sampling from the atmospheric pressure ion source into the inlet and inefficient transmission of ions entering the first reduced pressure stage of the mass spectrometer.The electrodynamic ion funnel (21), which has been developed to efficiently capture, focus, and transmit ions to the high vacuum region of the mass spectrometer, is expected to provide a large benefit to SRM analyses. The original ion funnel interfaces, which operated at a maximum of ∼5 torr, were able to enhance signal intensities for a variety of MS analyzers (2224) by replacing the inefficient skimmer interface. Although achieving near lossless ion transmission to high vacuum, losses at the atmospheric pressure interface went unmitigated. More recently, a high pressure ion funnel interface capable of operating at a pressure of ∼30 torr was introduced (25). The higher operating pressures accommodated greater gas loads and enabled more efficient ion sampling from atmospheric pressure through a multicapillary inlet. With a dual ion funnel interface comprising a high pressure ion funnel with a heated multicapillary inlet followed by a standard ion funnel operated at 1–2 torrs, highly efficient ion sampling from atmospheric pressure to high vacuum is readily achieved.In this study, we report the enhanced sensitivity and reproducibility of SRM-based targeted proteomics measurements achieved by implementing a dual stage electrodynamic ion funnel interface that incorporates a multicapillary inlet with a triple quadrupole mass spectrometer. A series of LC-SRM-MS measurements were made using mouse plasma samples spiked with various concentrations of tryptic peptides from five standard proteins to evaluate the improvements in detection sensitivity and reproducibility attained by this modified interface relative to a standard Thermo (single capillary inlet/skimmer) interface. A ∼10-fold improvement in the limit of detection (LOD) as well as improved measurement reproducibility was achieved.  相似文献   

15.
The field of proteomics has evolved hand-in-hand with technological advances in LC-MS/MS systems, now enabling the analysis of very deep proteomes in a reasonable time. However, most applications do not deal with full cell or tissue proteomes but rather with restricted subproteomes relevant for the research context at hand or resulting from extensive fractionation. At the same time, investigation of many conditions or perturbations puts a strain on measurement capacity. Here, we develop a high-throughput workflow capable of dealing with large numbers of low or medium complexity samples and specifically aim at the analysis of 96-well plates in a single day (15 min per sample). We combine parallel sample processing with a modified liquid chromatography platform driving two analytical columns in tandem, which are coupled to a quadrupole Orbitrap mass spectrometer (Q Exactive HF). The modified LC platform eliminates idle time between measurements, and the high sequencing speed of the Q Exactive HF reduces required measurement time. We apply the pipeline to the yeast chromatin remodeling landscape and demonstrate quantification of 96 pull-downs of chromatin complexes in about 1 day. This is achieved with only 500 μg input material, enabling yeast cultivation in a 96-well format. Our system retrieved known complex-members and the high throughput allowed probing with many bait proteins. Even alternative complex compositions were detectable in these very short gradients. Thus, sample throughput, sensitivity and LC/MS-MS duty cycle are improved severalfold compared with established workflows. The pipeline can be extended to different types of interaction studies and to other medium complexity proteomes.Shotgun proteomics is concerned with the identification and quantification of proteins (13). Prior to analysis, the proteins are digested into peptides, resulting in highly complex mixtures. To deal with this complexity, the peptides are separated by liquid chromatography followed by online analysis with mass spectrometry (MS), today facilitating the characterization of almost complete cell line proteomes in a short time (35). In addition to the characterization of entire proteomes, there is also a great demand for analyzing low or medium complexity samples. Given the trend toward a systems biology view, relatively larges sets of samples often have to be measured. One such category of lower complexity protein mixtures occurs in the determination of physical interaction partners of a protein of interest, which requires the identification and quantification of the proteins “pulled-down” or immunoprecipitated via a bait protein. Protein interactions are essential for almost all biological processes and orchestrate a cell''s behavior by regulating enzymes, forming macromolecular assemblies and functionalizing multiprotein complexes that are capable of more complex behavior than the sum of their parts. The human genome has almost 20,000 protein encoding genes, and it has been estimated that 80% of the proteins engage in complex interactions and that 130,000 to 650,000 protein interactions can take place in a human cell (6, 7). These numbers demonstrate a clear need for systematic and high-throughput mapping of protein–protein interactions (PPIs) to understand these complexes.The introduction of generic methods to detect PPIs, such as the yeast two-hybrid screen (Y2H) (8) or affinity purification combined with mass spectrometry (AP-MS)1 (9), have revolutionized the protein interactomics field. AP-MS in particular has emerged as an important tool to catalogue interactions with the aim of better understanding basic biochemical mechanisms in many different organisms (1017). It can be performed under near-physiological conditions and is capable of identifying functional protein complexes (18). In addition, the combination of affinity purification with quantitative mass spectrometry has greatly improved the discrimination of true interactors from unspecific background binders, a long-standing challenge in the AP-MS field (1921). Nowadays, quantitative AP-MS is employed to address many different biological questions, such as detection of dynamic changes in PPIs upon perturbation (2225) or the impact of posttranslational signaling on PPIs (26, 27). Recent developments even make it possible to provide abundances and stoichiometry information of the bait and prey proteins under study, combined with quantitative data from very deep cellular proteomes. Furthermore, sample preparation in AP-MS can now be performed in high-throughput formats capable of producing hundreds of samples per day. With such throughput in sample generation, the LC-MS/MS part of the AP-MS pipeline has become a major bottleneck for large studies, limiting throughput to a small fraction of the available samples. In principle, this limitation could be circumvented by multiplexing analysis via isotope-labeling strategies (28, 29) or by drastically reducing the measurement time per sample (3032). The former strategy requires exquisite control of the processing steps and has not been widely implemented yet. The latter strategy depends on mass spectrometers with sufficiently high sequencing speed to deal with the pull-down in a very short time. Since its introduction about 10 years ago (33), the Orbitrap mass spectrometer has featured ever-faster sequencing capabilities, with the Q Exactive HF now reaching a peptide sequencing speed of up to 17 Hz (34). This should now make it feasible to substantially lower the amount of time spent per measurement.Although very short LC-MS/MS runs can in principle be used for high-throughput analyses, they usually lead to a drop in LC-MS duty cycle. This is because each sample needs initial washing, loading, and equilibration steps, independent of gradient time, which takes a substantial percentage for most LC setups - typically at least 15–20 min. To achieve a more efficient LC-MS duty cycle, while maintaining high sensitivity, a second analytical column can be introduced. This enables the parallelization of several steps related to sample loading and to the LC operating steps, including valve switching. Such dual analytical column or “double-barrel: setups have been described for various applications and platforms (30, 3539).Starting from the reported performance and throughput of workflows that are standard today (16, 21, 4042), we asked if it would be possible to obtain a severalfold increase in both sample throughput and sensitivity, as well as a considerable reduction in overall wet lab costs and working time. Specifically, our goal was to quantify 96 medium complexity samples in a single day. Such a number of samples can be processed with a 96-well plate, which currently is the format of choice for highly parallelized sample preparation workflows, often with a high degree of automation. We investigated which advances were needed in sample preparation, liquid chromatography, and mass spectrometry. Based on our findings, we developed a parallelized platform for high-throughput sample preparation and LC-MS/MS analysis, which we applied to pull-down samples from the yeast chromatin remodeling landscape. The extent of retrieval of known complex members served as a quality control of the developed pipeline.  相似文献   

16.
17.
18.
Insulin plays a central role in the regulation of vertebrate metabolism. The hormone, the post-translational product of a single-chain precursor, is a globular protein containing two chains, A (21 residues) and B (30 residues). Recent advances in human genetics have identified dominant mutations in the insulin gene causing permanent neonatal-onset DM2 (14). The mutations are predicted to block folding of the precursor in the ER of pancreatic β-cells. Although expression of the wild-type allele would in other circumstances be sufficient to maintain homeostasis, studies of a corresponding mouse model (57) suggest that the misfolded variant perturbs wild-type biosynthesis (8, 9). Impaired β-cell secretion is associated with ER stress, distorted organelle architecture, and cell death (10). These findings have renewed interest in insulin biosynthesis (1113) and the structural basis of disulfide pairing (1419). Protein evolution is constrained not only by structure and function but also by susceptibility to toxic misfolding.Insulin plays a central role in the regulation of vertebrate metabolism. The hormone, the post-translational product of a single-chain precursor, is a globular protein containing two chains, A (21 residues) and B (30 residues). Recent advances in human genetics have identified dominant mutations in the insulin gene causing permanent neonatal-onset DM2 (14). The mutations are predicted to block folding of the precursor in the ER of pancreatic β-cells. Although expression of the wild-type allele would in other circumstances be sufficient to maintain homeostasis, studies of a corresponding mouse model (57) suggest that the misfolded variant perturbs wild-type biosynthesis (8, 9). Impaired β-cell secretion is associated with ER stress, distorted organelle architecture, and cell death (10). These findings have renewed interest in insulin biosynthesis (1113) and the structural basis of disulfide pairing (1419). Protein evolution is constrained not only by structure and function but also by susceptibility to toxic misfolding.  相似文献   

19.
20.
A Boolean network is a model used to study the interactions between different genes in genetic regulatory networks. In this paper, we present several algorithms using gene ordering and feedback vertex sets to identify singleton attractors and small attractors in Boolean networks. We analyze the average case time complexities of some of the proposed algorithms. For instance, it is shown that the outdegree-based ordering algorithm for finding singleton attractors works in time for , which is much faster than the naive time algorithm, where is the number of genes and is the maximum indegree. We performed extensive computational experiments on these algorithms, which resulted in good agreement with theoretical results. In contrast, we give a simple and complete proof for showing that finding an attractor with the shortest period is NP-hard.[1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32]  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号