共查询到20条相似文献,搜索用时 31 毫秒
1.
Shi Yan Lothar Brecker Chunsheng Jin Alexander Titz Martin Dragosits Niclas G. Karlsson Verena Jantsch Iain B. H. Wilson Katharina Paschinger 《Molecular & cellular proteomics : MCP》2015,14(8):2111-2125
The N-glycosylation of the model nematode Caenorhabditis elegans has proven to be highly variable and rather complex; it is an example to contradict the existing impression that “simple” organisms possess also a rather simple glycomic capacity. In previous studies in a number of laboratories, N-glycans with up to four fucose residues have been detected. However, although the linkage of three fucose residues to the N,N′-diacetylchitobiosyl core has been proven by structural and enzymatic analyses, the nature of the fourth fucose has remained uncertain. By constructing a triple mutant with deletions in the three genes responsible for core fucosylation (fut-1, fut-6 and fut-8), we have produced a nematode strain lacking products of these enzymes, but still retaining maximally one fucose residue on its N-glycans. Using mass spectrometry and HPLC in conjunction with chemical and enzymatic treatments as well as NMR, we examined a set of α-mannosidase-resistant N-glycans. Within this glycomic subpool, we can reveal that the core β-mannose can be trisubstituted and so carries not only the ubiquitous α1,3- and α1,6-mannose residues, but also a “bisecting” β-galactose, which is substoichiometrically modified with fucose or methylfucose. In addition, the α1,3-mannose can also be α-galactosylated. Our data, showing the presence of novel N-glycan modifications, will enable more targeted studies to understand the biological functions and interactions of nematode glycans.Nematodes represent, along with arthropods, one of the largest groups of animals to exist on the planet; 25.000 species are described, but the existence of up to one million has been estimated (1, 2). They have various ecological niches and include free-living “worms” in the soil, fungivorous, entomopathogenic, and necromenic species as well as parasites of plants and mammals, which share the basic conserved body plan (more-or-less a digestive tube surrounded with muscle, whether larger or smaller). There are five major clades (Rhabditina, Enoplia, Spirurina, Tylenchina, and Dorylaimia) (2), yet the glycosylation of only a few nematode species has been studied with an inevitable focus on the model nematode Caenorhabditis elegans and parasitic species (3). Thereby, the use of C. elegans mutants has been highly valuable in dissecting aspects of nematode N-glycan biosynthesis and revealing the in vivo substrates for certain glycosyltransferases (4).As many nematodes are parasites, their interactions with the immune systems of their hosts have attracted attention; particularly, there are relationships between autoimmunity, allergy, vaccination, and helminth infections. The “old friends” hypothesis seeks to understand the evolutionary factors that have shaped the immune system and to explain correlations between lifestyles in the developed world and modern “epidemics,” which are due to immunological misbalance (5–7). Promising data have suggested that “worm therapy” may bring advantages to some patients with Crohn''s disease or allergies (8, 9); however, such approaches are controversial. Nevertheless, crude extracts even of Caenorhabditis elegans were shown to induce a glycan-dependent Th2 response (10), whereas the excretory-secretory products of some nematodes also have immunomodulatory activity (11). Furthermore, the native glycoproteins of some nematodes have proven effective in vaccination trials, whereas recombinant forms are not, which is suggestive that post-translational modifications may have a role in an efficacious immune response (12).As at least some of the molecules relevant to nematode immunomodulation or vaccination are glycoproteins, a proper understanding of nematode glycosylation is of biomedical and veterinary relevance. Over the years, it has become apparent that the core chitobiosyl region of nematode N-glycans is subject to a range of modifications, with up to three core fucose residues being present (α1,3- and α1,6-linked on the reducing-terminal “proximal” GlcNAc and α1,3-linked on the second “distal” GlcNAc). However, up to four fucose residues have been detected on C. elegans N-glycans and the exact nature of the linkage of the fourth fucose has remained obscure despite work in our own and other laboratories (3, 13–15). Combined with the latest knowledge regarding the specificity of C. elegans core fucosyltransferases (13, 16, 17) as well as our recent data regarding the exact structures of N-glycans from the C. elegans double hexosaminidase mutant and other nematodes (18–20), we concluded that some models for the tri- and tetrafucosylated N-glycans were incorrect. By preparing a triple mutant unable to core fucosylate its N-glycans, we generated a C. elegans strain containing maximally one fucose residue on the N-linked oligosaccharides. Thereby a pool of unusual mannosidase-resistant N-glycans was identified and, using mass spectrometry (MS) and NMR, we reveal their modification with bisecting galactose frequently capped with fucose or methylfucose. 相似文献
2.
Antonio Palmeri Gabriele Ausiello Fabrizio Ferrè Manuela Helmer-Citterich Pier Federico Gherardini 《Molecular & cellular proteomics : MCP》2014,13(9):2198-2212
Phosphorylation is a widespread post-translational modification that modulates the function of a large number of proteins. Here we show that a significant proportion of all the domains in the human proteome is significantly enriched or depleted in phosphorylation events. A substantial improvement in phosphosites prediction is achieved by leveraging this observation, which has not been tapped by existing methods. Phosphorylation sites are often not shared between multiple occurrences of the same domain in the proteome, even when the phosphoacceptor residue is conserved. This is partly because of different functional constraints acting on the same domain in different protein contexts. Moreover, by augmenting domain alignments with structural information, we were able to provide direct evidence that phosphosites in protein-protein interfaces need not be positionally conserved, likely because they can modulate interactions simply by sitting in the same general surface area.Phosphorylation, the most widespread protein post-translational modification, is an important regulator of protein function. The addition of phosphate groups on serine, threonine, and tyrosine residues can modulate the activity of the target protein by inducing complex conformational changes, by modifying protein electrostatics, and by regulating domain-peptide interactions, as in 14-3-3 or SH2 domains, that specifically recognize phosphorylated residues. The standard experimental technique for the high-throughput identification of phosphorylation sites is mass spectrometry (1).Phosphorylation is catalyzed by protein kinases, a family that in humans comprises ∼540 members (2, 3). It is well understood that these enzymes recognize specific sequence motifs in their substrates (4, 5). Accordingly the sequence around the phosphorylation site is undisputedly the most important feature for phosphosite prediction (6, 7). However the “context,” in a broad sense, where these motifs occur is also important as sequence alone is not enough to achieve the observed specificity of phosphorylation. Therefore, several studies have characterized multiple aspects of phosphosites such as their preference for loops and disordered regions (reviewed in (8)), or the tendency of phosphoserines and phosphothreonines to occur in clusters (9), and these features have been used to improve the performance of phosphosite predictors (6, 7, 10–12). Moreover placing kinases and substrates in the context of protein interaction networks has been shown to improve the prediction of phosphorylation by specific kinases (13).Perhaps one of the most puzzling observations when looking at the phosphoproteome as a whole, is the fact that a large proportion of phosphorylation sites is poorly conserved. This has led to various hypotheses. First some sites may represent nonfunctional, possibly low-stoichiometry, phosphorylation events that are picked up because of the sensitivity of mass-spectrometry (14, 15). Indeed functionally characterized sites and those matching known kinase motifs are more conserved on average (15–17). However, although in biology function often equates with conservation, there could be genuinely functional fast-evolving phosphosites, that are responsible for species-specific differences in signaling and regulation. Moreover in some cases, especially in the regulation of protein-protein interactions, the exact position of the phosphosites may be unimportant (18, 19).Here we explore the issues of “context” and “conservation” of phosphorylation sites from the perspective of protein domains. To this end, we assembled a comprehensive database of phosphosites from publicly available sources and studied their proteome distribution with respect to the location and identity of protein domains. We focus on the human phosphoproteome because it has been very well characterized in a multitude of low- and high-throughput experiments, thus providing the opportunity for a comprehensive, proteome-wide, study. In particular, the issues we want to address are the following:
- Are specific domain types preferentially phosphorylated? Or conversely are some domains specifically depleted of phosphorylation sites?
- Can the domain context be used to improve the prediction of phosphorylation sites?
- What is the conservation pattern of phosphosites when looking at multiple instances of the same domain in the proteome?
3.
4.
Lee A. Beaudette Stephen Davies Phillip M. Fedorak Owen P. Ward Michael A. Pickard 《Applied and environmental microbiology》1998,64(6):2020
Two methods were used to compare the biodegradation of six polychlorinated biphenyl (PCB) congeners by 12 white rot fungi. Four fungi were found to be more active than Phanerochaete chrysosporium ATCC 24725. Biodegradation of the following congeners was monitored by gas chromatography: 2,3-dichlorobiphenyl, 4,4′-dichlorobiphenyl, 2,4′,5-trichlorobiphenyl (2,4′,5-TCB), 2,2′,4,4′-tetrachlorobiphenyl, 2,2′,5,5′-tetrachlorobiphenyl, and 2,2′,4,4′,5,5′-hexachlorobiphenyl. The congener tested for mineralization was 2,4′,5-[U-14C]TCB. Culture supernatants were also assayed for lignin peroxidase and manganese peroxidase activities. Of the fungi tested, two strains of Bjerkandera adusta (UAMH 8258 and UAMH 7308), one strain of Pleurotus ostreatus (UAMH 7964), and Trametes versicolor UAMH 8272 gave the highest biodegradation and mineralization. P. chrysosporium ATCC 24725, a strain frequently used in studies of PCB degradation, gave the lowest mineralization and biodegradation activities of the 12 fungi reported here. Low but detectable levels of lignin peroxidase and manganese peroxidase activity were present in culture supernatants, but no correlation was observed among any combination of PCB congener biodegradation, mineralization, and lignin peroxidase or manganese peroxidase activity. With the exception of P. chrysosporium, congener loss ranged from 40 to 96%; however, these values varied due to nonspecific congener binding to fungal biomass and glassware. Mineralization was much lower, ≤11%, because it measures a complete oxidation of at least part of the congener molecule but the results were more consistent and therefore more reliable in assessment of PCB biodegradation.Polychlorinated biphenyls (PCBs) are produced by chlorination of biphenyl, resulting in up to 209 different congeners. Commercial mixtures range from light oily fluids to waxes, and their physical properties make them useful as heat transfer fluids, hydraulic fluids, solvent extenders, plasticizers, flame retardants, organic diluents, and dielectric fluids (1, 21). Approximately 24 million lb are in the North American environment (19). The stability and hydrophobic nature of these compounds make them a persistent environmental hazard.To date, bacterial transformations have been the main focus of PCB degradation research. Aerobic bacteria use a biphenyl-induced dioxygenase enzyme system to attack less-chlorinated congeners (mono- to hexachlorobiphenyls) (1, 5, 7, 8, 22). Although more-chlorinated congeners are recalcitrant to aerobic bacterial degradation, microorganisms in anaerobic river sediments reductively dechlorinate these compounds, mainly removing the meta and para chlorines (1, 6, 10, 33, 34).The degradation of PCBs by white rot fungi has been known since 1985 (11, 18). Many fungi have been tested for their ability to degrade PCBs, including the white rot fungi Coriolus versicolor (18), Coriolopsis polysona (41), Funalia gallica (18), Hirneola nigricans (35), Lentinus edodes (35), Phanerochaete chrysosporium (3, 11, 14, 17, 18, 35, 39, 41–43), Phlebia brevispora (18), Pleurotus ostreatus (35, 43), Poria cinerescens (18), Px strain (possibly Lentinus tigrinus) (35), and Trametes versicolor (41, 43). There have also been studies of PCB metabolism by ectomycorrhizal fungi (17) and other fungi such as Aspergillus flavus (32), Aspergillus niger (15), Aureobasidium pullulans (18), Candida boidinii (35), Candida lipolytica (35), Cunninghamella elegans (16), and Saccharomyces cerevisiae (18, 38). The mechanism of PCB biodegradation has not been definitively determined for any fungi. White rot fungi produce several nonspecific extracellular enzymes which have been the subject of extensive research. These nonspecific peroxidases are normally involved in lignin degradation but can oxidize a wide range of aromatic compounds including polycyclic aromatic hydrocarbons (37). Two peroxidases, lignin peroxidase (LiP) and Mn peroxidase (MnP), are secreted into the environment of the fungus under conditions of nitrogen limitation in P. chrysosporium (23, 25, 27, 29) but are not stress related in fungi such as Bjerkandera adusta or T. versicolor (12, 30).Two approaches have been used to determine the biodegradability of PCBs by fungi: (i) loss of the parent congener analyzed by gas chromatography (GC) (17, 32, 35, 42, 43) and (ii) mineralization experiments in which the 14C of the universally labeled 14C parent congener is recovered as 14CO2 (11, 14, 18, 39, 41). In the first method, the loss of a peak on a chromatogram makes it difficult to decide whether the PCB is being partly degraded, mineralized, adsorbed to the fungal biomass, or bound to glassware, soil particles, or wood chips. Even when experiments with killed-cell and abiotic controls are performed, the extraction efficiency and standard error can make data difficult to interpret. For example, recoveries can range anywhere from 40 to 100% depending on the congener used and the fungus being investigated (17). On the other hand, recovery of significant amounts of 14CO2 from the cultures incubated with a 14C substrate provides definitive proof of fungal metabolism. There appears to be only one report relating data from these two techniques (18), and in that study, [U-14C]Aroclor 1254, rather than an individual congener, was used.In this study, we examined the ability of 12 white rot fungal strains to metabolize selected PCB congeners to determine which strains were the most active degraders. Included in this group was P. chrysosporium ATCC 24725, a strain used extensively in PCB studies (3, 14, 18, 35, 39, 41–43). Six PCB congeners were selected to give a range of chlorine substitutions and therefore a range of potential biodegradability which was monitored by GC. One of the chosen congeners was 14C labeled and used in studies to compare the results from a mineralization method with those from the GC method. 相似文献
5.
6.
Caleen B. Ramsook Cho Tan Melissa C. Garcia Raymond Fung Gregory Soybelman Ryan Henry Anna Litewka Shanique O'Meally Henry N. Otoo Roy A. Khalaf Anne M. Dranginis Nand K. Gaur Stephen A. Klotz Jason M. Rauceo Chong K. Jue Peter N. Lipke 《Eukaryotic cell》2010,9(3):393-404
The occurrence of highly conserved amyloid-forming sequences in Candida albicans Als proteins (H. N. Otoo et al., Eukaryot. Cell 7:776–782, 2008) led us to search for similar sequences in other adhesins from C. albicans and Saccharomyces cerevisiae. The β-aggregation predictor TANGO found highly β-aggregation-prone sequences in almost all yeast adhesins. These sequences had an unusual amino acid composition: 77% of their residues were β-branched aliphatic amino acids Ile, Thr, and Val, which is more than 4-fold greater than their prevalence in the S. cerevisiae proteome. High β-aggregation potential peptides from S. cerevisiae Flo1p and C. albicans Eap1p rapidly formed insoluble amyloids, as determined by Congo red absorbance, thioflavin T fluorescence, and fiber morphology. As examples of the amyloid-forming ability of the native proteins, soluble glycosylphosphatidylinositol (GPI)-less fragments of C. albicans Als5p and S. cerevisiae Muc1p also formed amyloids within a few days under native conditions at nM concentrations. There was also evidence of amyloid formation in vivo: the surfaces of cells expressing wall-bound Als1p, Als5p, Muc1p, or Flo1p were birefringent and bound the fluorescent amyloid-reporting dye thioflavin T. Both of these properties increased upon aggregation of the cells. In addition, amyloid binding dyes strongly inhibited aggregation and flocculation. The results imply that amyloid formation is an intrinsic property of yeast cell adhesion proteins from many gene families and that amyloid formation is an important component of cellular aggregation mediated by these proteins.Protein amyloids are characteristic of pathological conditions, including neurodegenerative diseases (4, 11, 17, 38). These protein aggregates can also occur naturally in adhesive bacterial curli (3), melanosomes (14), condensed peptide hormone arrays (24), as regulatory prions in yeast (2, 5), and fungal hydrophobins, which are nonantigenic coats to some fungi (1, 33, 39). Nevertheless, such natural occurrences are relatively few, considering the negative free energy for amyloid formation (28).We have recently discovered that there are amyloid-forming sequences in the cell surface Als adhesins of Candida albicans. Cells that express these adhesins aggregate readily, and the aggregation has amyloid-like properties, including protein conformational shifting, surface birefringence, and ability to bind the amyloid-active dyes Congo red and amino-naphthalene sulfonic acid (ANS) (29). A five- to seven-residue sequence in Als1p, Als3p, and Als5p has extremely high potential for formation of β-aggregates, according to the protein state prediction program TANGO (13, 27, 31). Such β-aggregates include amyloids, which are ordered structures with paracrystalline regions of stacked parallel β-strands that are perpendicular to the long axis of micrometer-long fibrils. The strands are stabilized by interaction of identical sequences from many protein molecules (31, 32). Where TANGO analyses have shown that specific sequences have β-aggregate potentials greater than 20%, an insoluble β-aggregate state is likely to form. These β-aggregates nucleate formation of amyloids if the proteins can associate to form fibers (13, 27, 31). Sequences in the conserved 127-residue T region of Als1p, Als3p, and Als5p have β-aggregation potentials of >90% (27). An oligopeptide with this sequence, as well as 412- and 645-residue fragments of Als5p formed authentic amyloids, as determined by characteristic dye binding and fiber morphology. The amyloid-forming sequences were rich in the β-branched amino acids Thr, Val, and Ile. This amino acid composition is unusual among proteins in general, but is common in the Thr-rich mid-piece domains of yeast adhesins.Yeasts display many cell-wall-bound adhesins that mediate colonial and biofilm interactions as well as host-pathogen binding (9, 21, 41). Such adhesins have a common mosaic structure. In general, the adhesins have N-terminal globular binding domains (often immunoglobulin-like or lectin-like), Thr-rich mid-piece sequences including tandem repeats, and 300- to 800-residue heavily glycosylated Ser and Thr-rich “stalk” domains near the C-terminal domain that extend the active regions from the surface of the wall. The adhesins are covalently cross-linked to wall polysaccharides through modified glycosylphosphatidylinositol (GPI) anchors and/or glycosyl esters of glutamic acid (9, 18).Because the yeast adhesins share this common modular domain structure, we searched among known and putative yeast adhesins for sequences with high β-aggregation potential. We have found that many of these proteins share amyloid-forming sequences and amyloid-like behavior on activation. 相似文献
7.
Bo Zhang Mohammad Pirmoradian Alexey Chernobrovkin Roman A. Zubarev 《Molecular & cellular proteomics : MCP》2014,13(11):3211-3223
Based on conventional data-dependent acquisition strategy of shotgun proteomics, we present a new workflow DeMix, which significantly increases the efficiency of peptide identification for in-depth shotgun analysis of complex proteomes. Capitalizing on the high resolution and mass accuracy of Orbitrap-based tandem mass spectrometry, we developed a simple deconvolution method of “cloning” chimeric tandem spectra for cofragmented peptides. Additional to a database search, a simple rescoring scheme utilizes mass accuracy and converts the unwanted cofragmenting events into a surprising advantage of multiplexing. With the combination of cloning and rescoring, we obtained on average nine peptide-spectrum matches per second on a Q-Exactive workbench, whereas the actual MS/MS acquisition rate was close to seven spectra per second. This efficiency boost to 1.24 identified peptides per MS/MS spectrum enabled analysis of over 5000 human proteins in single-dimensional LC-MS/MS shotgun experiments with an only two-hour gradient. These findings suggest a change in the dominant “one MS/MS spectrum - one peptide” paradigm for data acquisition and analysis in shotgun data-dependent proteomics. DeMix also demonstrated higher robustness than conventional approaches in terms of lower variation among the results of consecutive LC-MS/MS runs.Shotgun proteomics analysis based on a combination of high performance liquid chromatography and tandem mass spectrometry (MS/MS) (1) has achieved remarkable speed and efficiency (2–7). In a single four-hour long high performance liquid chromatography-MS/MS run, over 40,000 peptides and 5000 proteins can be identified using a high-resolution Orbitrap mass spectrometer with data-dependent acquisition (DDA)1 (2, 3). However, in a typical LC-MS analysis of unfractionated human cell lysate, over 100,000 individual peptide isotopic patterns can be detected (4), which corresponds to simultaneous elution of hundreds of peptides. With this complexity, a mass spectrometer needs to achieve ≥25 Hz MS/MS acquisition rate to fully sample all the detectable peptides, and ≥17 Hz to cover reasonably abundant ones (4). Although this acquisition rate is reachable by modern time-of-flight (TOF) instruments, the reported DDA identification results do not encompass all expected peptides. Recently, the next-generation Orbitrap instrument, working at 20 Hz MS/MS acquisition rate, demonstrated nearly full profiling of yeast proteome using an 80 min gradient, which opened the way for comprehensive analysis of human proteome in a time efficient manner (5).During the high performance liquid chromatography-MS/MS DDA analysis of complex samples, high density of co-eluting peptides results in a high probability for two or more peptides to overlap within an MS/MS isolation window. With the commonly used ±1.0–2.0 Th isolation windows, most MS/MS spectra are chimeric (4, 8–10), with cofragmenting precursors being naturally multiplexed. However, as has been discussed previously (9, 10), the cofragmentation events are currently ignored in most of the conventional analysis workflows. According to the prevailing assumption of “one MS/MS spectrum–one peptide,” chimeric MS/MS spectra are generally unwelcome in DDA, because the product ions from different precursors may interfere with the assignment of MS/MS fragment identities, increasing the rate of false discoveries in database search (8, 9). In some studies, the precursor isolation width was set as narrow as ±0.35 Th to prevent unwanted ions from being coselected, fragmented or detected (4, 5).On the contrary, multiplexing by cofragmentation is considered to be one of the solid advantages in data-independent acquisition (DIA) (10–13). In several commonly used DIA methods, the precursor ion selection windows are set much wider than in DDA: from 25 Th as in SWATH (12), to extremely broad range as in AIF (13). In order to use the benefit of MS/MS multiplexing in DDA, several approaches have been proposed to deconvolute chimeric MS/MS spectra. In “alternative peptide identification” method implemented in Percolator (14), a machine learning algorithm reranks and rescores peptide-spectrum matches (PSMs) obtained from one or more MS/MS search engines. But the deconvolution in Percolator is limited to cofragmented peptides with masses differing from the target peptide by the tolerance of the database search, which can be as narrow as a few ppm. The “active demultiplexing” method proposed by Ledvina et al. (15) actively separates MS/MS data from several precursors using masses of complementary fragments. However, higher-energy collisional dissociation often produces MS/MS spectra with too few complementary pairs for reliable peptide identification. The “MixDB” method introduces a sophisticated new search engine, also with a machine learning algorithm (9). And the “second peptide identification” method implemented in Andromeda/MaxQuant workflow (16) submits the same dataset to the search engine several times based on the list of chromatographic peptide features, subtracting assigned MS/MS peaks after each identification round. This approach is similar to the ProbIDTree search engine that also performed iterative identification while removing assigned peaks after each round of identification (17).One important factor for spectral deconvolution that has not been fully utilized in most conventional workflows is the excellent mass accuracy achievable with modern high-resolution mass spectrometry (18). An Orbitrap Fourier-transform mass spectrometer can provide mass accuracy in the range of hundreds of ppb (parts per billion) for mass peaks with high signal-to-noise (S/N) ratio (19). However, the mass error of peaks with lower S/N ratios can be significantly higher and exceed 1 ppm. Despite this dependence of the mass accuracy from the S/N level, most MS and MS/MS search engines only allow users to set hard cut-off values for the mass error tolerances. Moreover, some search engines do not provide the option of choosing a relative error tolerance for MS/MS fragments. Such negligent treatment of mass accuracy reduces the analytical power of high accuracy experiments (18).Identification results coming from different MS/MS search engines are sometimes not consistent because of different statistical assumptions used in scoring PSMs. Introduction of tools integrating the results of different search engines (14, 20, 21) makes the data interpretation even more complex and opaque for the user. The opposite trend—simplification of MS/MS data interpretation—is therefore a welcome development. For example, an extremely straightforward algorithm recently proposed by Wenger et al. (22) demonstrated a surprisingly high performance in peptide identification, even though it is only marginally more complex than simply counting the number of matches of theoretical fragment peaks in high resolution MS/MS, without any a priori statistical assumption.In order to take advantage of natural multiplexing of MS/MS spectra in DDA, as well as properly utilize high accuracy of Orbitrap-based mass spectrometry, we developed a simple and robust data analysis workflow DeMix. It is presented in Fig. 1 as an expansion of the conventional workflow. Principles of some of the processes used by the workflow are borrowed from other approaches, including the custom-made mass peak centroiding (20), chromatographic feature detection (19, 20), and two-pass database search with the first limited pass to provide a “software lock mass” for mass scale recalibration (23).Open in a separate windowFig. 1.An overview of the DeMix workflow that expands the conventional workflow, shown by the dashed line. Processes are colored in purple for TOPP, red for search engine (Morpheus/Mascot/MS-GF+), and blue for in-house programs.In DeMix workflow, the deconvolution of chimeric MS/MS spectra consists of simply “cloning” an MS/MS spectrum if a potential cofragmented peptide is detected. The list of candidate peptide precursors is generated from chromatographic feature detection, as in the MaxQuant/Andromeda workflow (16, 19), but using The OpenMS Proteomics Pipeline (TOPP) (20, 24). During the cloning, the precursor is replaced by the new candidate, but no changes in the MS/MS fragment list are made, and therefore the cloned MS/MS spectra remain chimeric. Processing such spectra requires a search engine tolerant to the presence of unassigned peaks, as such peaks are always expected when multiple precursors cofragment. Thus, we chose Morpheus (22) as a search engine. Based on the original search algorithm, we implement a reformed scoring scheme: Morpheus-AS (advanced scoring). It inherits all the basic principles from Morpheus but deeper utilizes the high mass accuracy of the data. This kind of database search removes the necessity of spectral processing for physical separation of MS/MS data into multiple subspectra (15), or consecutive subtraction of peaks (16, 17).Despite the fact that DeMix workflow is largely a combination of known approaches, it provides remarkable improvement compared with the state-of-the-art. On our Orbitrap Q-Exactive workbench, testing on a benchmark dataset of two-hour single-dimension LC-MS/MS experiments from HeLa cell lysate, we identified on average 1.24 peptide per MS/MS spectrum, breaking the “one MS/MS spectrum–one peptide” paradigm on the level of whole data set. At 1% false discovery rate (FDR), we obtained on average nine PSMs per second (at the actual acquisition rate of ca. seven MS/MS spectra per second), and detected 40 human proteins per minute. 相似文献
8.
9.
James D. Jontes E. Michael Ostap Thomas D. Pollard Ronald A. Milligan 《The Journal of cell biology》1998,141(1):155-162
The Acanthamoeba castellanii myosin-Is were the first unconventional myosins to be discovered, and the myosin-I class has since been found to be one of the more diverse and abundant classes of the myosin superfamily. We used two-dimensional (2D) crystallization on phospholipid monolayers and negative stain electron microscopy to calculate a projection map of a “classical” myosin-I, Acanthamoeba myosin-IB (MIB), at ∼18 Å resolution. Interpretation of the projection map suggests that the MIB molecules sit upright on the membrane. We also used cryoelectron microscopy and helical image analysis to determine the three-dimensional structure of actin filaments decorated with unphosphorylated (inactive) MIB. The catalytic domain is similar to that of other myosins, whereas the large carboxy-terminal tail domain differs greatly from brush border myosin-I (BBM-I), another member of the myosin-I class. These differences may be relevant to the distinct cellular functions of these two types of myosin-I. The catalytic domain of MIB also attaches to F-actin at a significantly different angle, ∼10°, than BBM-I. Finally, there is evidence that the tails of adjacent MIB molecules interact in both the 2D crystal and in the decorated actin filaments.The myosin superfamily consists of at least 12 distinct classes that vary both in the sequence of their conserved myosin catalytic domains as well as in their unique tails (Mooseker and Cheney, 1995; Sellers and Goodson, 1995). For many years the only known myosins were the double-headed, filament-forming myosins found in muscle (conventional myosins or myosins-II). The remaining classes of myosin have been termed “unconventional myosins” to differentiate them from the myosins-II. Probably the most thoroughly studied class of unconventional myosins is the myosin-I class. These small, single-headed myosins bind to membrane lipids through a basic domain in their tail (for review see Pollard et al., 1991; Mooseker and Cheney, 1995). The first unconventional myosin (and first myosin-I) was isolated from Acanthamoeba castellanii (Pollard and Korn, 1973
a,b), and was purified on the basis of its K+, EDTA, and actin-activated MgATPase activities. However, this myosin was unusual in that it had a single heavy chain of ∼140 kD, in contrast to the two ∼200-kD heavy chains of myosin-II (Pollard and Korn, 1973
a).Three isoforms of the classical Acanthamoeba myosins-I are now known: myosins-IA, -IB, and -IC (Maruta and Korn, 1977a
,b; Maruta et al., 1979). Each of the isoforms consists of a conserved myosin catalytic domain, a binding site for one or two light chains, a basic domain, a GPA1(Q) domain (rich in glycine, proline and alanine [glutamine]), and an scr-homology domain-3 (SH3) domain (Pollard et al., 1991; Mooseker and Cheney, 1995). These myosins-I can associate with membranes or with actin filaments through their tail domains. An electrostatic association of myosin-I with anionic phospholipids and with base-stripped membranes has been shown to occur (Adams and Pollard, 1989; Miyata et al., 1989; Hayden et al., 1990), and this interaction has been mapped to the basic domain (Doberstein and Pollard, 1992). Interestingly, these myosins also contain a second, ATP-insensitive actin binding site (Lynch et al., 1986) enabling them to mediate actin–actin movements (Albanesi et al., 1985; Fujisaki et al., 1985). In myosin-IA (Lynch et al., 1986) and myosin-IC (Doberstein and Pollard, 1992), this binding site was localized to the GPA domain.
Acanthamoeba myosins-I have maximal steady-state actin-activated ATPase rates of ∼10–20 s−1 (Pollard and Korn, 1973
b; Albanesi et al., 1983), and an unusual triphasic dependence upon actin concentration (Pollard and Korn, 1973
b; Albanesi et al., 1983). This triphasic activation is due to the actin cross-linking ability imparted by the ATP-insensitive actin binding site on the tail (Albanesi et al., 1985). Analysis of the individual steps in the ATPase cycle by transient kinetics revealed that the mechanism of myosin-IA is similar to slow skeletal muscle myosin, whereas myosin-IB (MIB) is similar to fast skeletal muscle myosin (Ostap and Pollard, 1996). The in vitro motility of myosin-I has also been well characterized (Zot et al., 1992). The maximal rate of filament sliding is ∼0.2 μm s−1. Interestingly, this rate is ∼10–50× slower than the rates observed for skeletal muscle myosin, even though the ATPase rates are comparable.MIB consists of a 125-kD heavy chain and a single 27-kD light chain (Maruta et al., 1979; Jung et al., 1987). This isoform is primarily associated with the plasma membrane as well as vacuolar membranes (Baines et al., 1992). MIB appears to be associated with the plasma membrane at sites of phagocytosis and was concentrated at the tips of pseudopodia (Baines et al., 1992). This localization suggests that MIB may be involved in membrane dynamics at the cell surface. MIB is regulated by heavy chain phosphorylation of serine 411 (Brzeska et al., 1989, 1990), which is located at the actin-binding site (Rayment et al., 1993). Similar to the myosin-I isoforms in Acanthamoeba, heavy chain phosphorylation results in >20-fold activation of the actin-activated myosin-I ATPase activity (Albanesi et al., 1983). This activation is not the result of changes in the binding of myosin-I to F-actin (Albanesi et al., 1983; Ostap and Pollard, 1996). The transient kinetic studies of Ostap and Pollard (1996) suggest that phosphorylation regulates the rate-limiting phosphate release step, the transition from weakly bound intermediates in rapid equilibrium with actin to strongly bound states, capable of sustaining force.Despite the extensive analysis of ameboid myosin-I biochemical properties and in vivo function, there is little structural information on these myosins. The only detailed structural information available for the myosins-I comes from recent electron microscopy studies on brush border myosin-I (BBM-I) (Jontes et al., 1995; Jontes and Milligan, 1997a
,b; Whittaker and Milligan, 1997), a structurally distinct myosin-I subtype. Therefore, we investigated the structure of a “classical,” ameboid-type myosin, Acanthamoeba MIB using electron microscopy. First, electron micrographs of negatively stained two-dimensional (2D) crystals were used to generate a projection map of MIB at ∼18 Å resolution. In addition, we used cryoelectron microscopy and helical image analysis to produce a moderate resolution three-dimensional (3D) map (30 Å) of actin filaments decorated with MIB. These studies enabled us to compare the structure of MIB with BBM-I. The comparison of MIB with BBM-I reveals marked structural differences in the tail domains of these two proteins; MIB appears to have a much shorter “lever arm” and a more compact tail, whereas most of the BBM-I mass is composed of an extended light chain–binding domain (LCBD). In addition, the MIB catalytic domain appears to be slightly tilted compared to BBM-I, with respect to the F-actin axis. Our structural results suggest that these two types of myosin-I may have distinct intracellular functions. 相似文献
10.
Xiaowen Liu Yakov Sirotkin Yufeng Shen Gordon Anderson Yihsuan S. Tsai Ying S. Ting David R. Goodlett Richard D. Smith Vineet Bafna Pavel A. Pevzner 《Molecular & cellular proteomics : MCP》2012,11(6)
In the last two years, because of advances in protein separation and mass spectrometry, top-down mass spectrometry moved from analyzing single proteins to analyzing complex samples and identifying hundreds and even thousands of proteins. However, computational tools for database search of top-down spectra against protein databases are still in their infancy. We describe MS-Align+, a fast algorithm for top-down protein identification based on spectral alignment that enables searches for unexpected post-translational modifications. We also propose a method for evaluating statistical significance of top-down protein identifications and further benchmark various software tools on two top-down data sets from Saccharomyces cerevisiae and Salmonella typhimurium. We demonstrate that MS-Align+ significantly increases the number of identified spectra as compared with MASCOT and OMSSA on both data sets. Although MS-Align+ and ProSightPC have similar performance on the Salmonella typhimurium data set, MS-Align+ outperforms ProSightPC on the (more complex) Saccharomyces cerevisiae data set.In the past two decades, proteomics was dominated by bottom-up mass spectrometry that analyzes digested peptides rather than intact proteins. Bottom-up approaches, although powerful, do have limitations in analyzing protein species, e.g. various proteolytic forms of the same protein or various protein isoforms resulting from alternative splicing. Top-down mass spectrometry focuses on analyzing intact proteins and large peptides (1–10) and has advantages in localizing multiple post-translational modifications (PTMs)1 in a coordinated fashion (e.g. combinatorial PTM code) and identifying multiple protein species (e.g. proteolytically processed protein species) (11). Until recently, most top-down studies were limited to single purified proteins (12–15). Top-down studies of protein mixtures were restricted by difficulties in separating and fragmenting intact proteins and a shortage of robust computational tools.In the last two years, because of advances in protein separation and top-down instrumentation, top-down mass spectrometry moved from analyzing single proteins to analyzing complex samples containing hundreds and even thousands of proteins (16–21). Because algorithms for interpreting top-down spectra are still in their infancy, many recent developments include computational innovations in protein identification.Because top-down spectra are complex, the first step in top-down spectral interpretation is usually spectral deconvolution, which converts a complex top-down spectrum to a list of monoisotopic masses (a deconvolved spectrum). Every protein (possibly with modifications) can be scored against a top-down deconvoluted spectrum, resulting in a Protein-Spectrum-Match (PrSM). The top-down protein identification problem is finding a protein in a database with the highest scoring PrSM for a top-down spectrum and further output the PrSM if it is statistically significant. There are several software tools for top-down protein identification (Software Identification of unexpected modifications Proteogenomics search against 6-frame translation Speed Estimation of statistical significance ProSightPC +/−a + Fast/Slowb + PIITA +/− − Fast − UStag + + Fast − MS-TopDown + − Slow − MS-Align+ + + Fast +