首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Chromosome‐centric Human Proteome Project aims at identifying and characterizing protein products encoded from all human protein‐coding genes. As of early 2017, 19 837 protein‐coding genes have been annotated in the neXtProt database including 2691 missing proteins that have never been identified by mass spectrometry. Missing proteins may be low abundant in many cell types or expressed only in a few cell types in human body such as sperms in testis. In this study, we performed expression proteomics of two near‐haploid cell types such as HAP1 and KBM‐7 to hunt for missing proteins. Proteomes from the two haploid cell lines were analyzed on an LTQ Orbitrap Velos, producing a total of 200 raw mass spectrometry files. After applying 1% false discovery rates at both levels of peptide‐spectrum matches and proteins, more than 10 000 proteins were identified from HAP1 and KBM‐7, resulting in the identification of nine missing proteins. Next, unmatched spectra were searched against protein databases translated in three frames from noncoding RNAs derived from RNA‐Seq data, resulting in six novel protein‐coding regions after careful manual inspection. This study demonstrates that expression proteomics coupled to proteogenomic analysis can be employed to identify many annotated and unannotated missing proteins.  相似文献   

2.
The genome sequencing of H37Rv strain of Mycobacterium tuberculosis was completed in 1998 followed by the whole genome sequencing of a clinical isolate, CDC1551 in 2002. Since then, the genomic sequences of a number of other strains have become available making it one of the better studied pathogenic bacterial species at the genomic level. However, annotation of its genome remains challenging because of high GC content and dissimilarity to other model prokaryotes. To this end, we carried out an in-depth proteogenomic analysis of the M. tuberculosis H37Rv strain using Fourier transform mass spectrometry with high resolution at both MS and tandem MS levels. In all, we identified 3176 proteins from Mycobacterium tuberculosis representing ~80% of its total predicted gene count. In addition to protein database search, we carried out a genome database search, which led to identification of ~250 novel peptides. Based on these novel genome search-specific peptides, we discovered 41 novel protein coding genes in the H37Rv genome. Using peptide evidence and alternative gene prediction tools, we also corrected 79 gene models. Finally, mass spectrometric data from N terminus-derived peptides confirmed 727 existing annotations for translational start sites while correcting those for 33 proteins. We report creation of a high confidence set of protein coding regions in Mycobacterium tuberculosis genome obtained by high resolution tandem mass-spectrometry at both precursor and fragment detection steps for the first time. This proteogenomic approach should be generally applicable to other organisms whose genomes have already been sequenced for obtaining a more accurate catalogue of protein-coding genes.  相似文献   

3.
4.
5.
6.
Protonated peptides derived from proline‐rich proteins (PRP) are often difficult to sequence by standard collision‐induced dissociation (CID) mass spectrometry (MS) due to preferential amide bond cleavage N‐terminal to proline. In connection with bovine spongiform encephalopathy regulations, proteolytic products derived from the PRP collagen have been suggested as markers for contamination of animal feedstuffs with processed animal protein (Fernandez Ocaña, M. et al., Analyst 2004, 129, 111–115). Herein, we report the identification of these marker peptides using the strategy of C‐terminal sequencing by CID MS from their sodium and lithium adducts. Upon fragmentation a new cationized peptide was produced that is one C‐terminal amino acid shorter in length. This dissociation pathway allowed for the facile identification of the C‐terminal residue by matrix‐assisted laser desorption/ionization tandem time‐of‐flight mass spectrometry. Each newly formed cationized peptide was further fragmented by up to seven stages of electrospray ionization ion trap MS. Proline‐rich C‐terminal sequence tags were established which permitted successful database identification of collagen alpha type I proteins.  相似文献   

7.
Mass spectrometry‐based proteomics starts with identifications of peptides and proteins, which provide the bases for forming the next‐level hypotheses whose “validations” are often employed for forming even higher level hypotheses and so forth. Scientifically meaningful conclusions are thus attainable only if the number of falsely identified peptides/proteins is accurately controlled. For this reason, RAId continued to be developed in the past decade. RAId employs rigorous statistics for peptides/proteins identification, hence assigning accurate P‐values/E‐values that can be used confidently to control the number of falsely identified peptides and proteins. The RAId web service is a versatile tool built to identify peptides and proteins from tandem mass spectrometry data. Not only recognizing various spectra file formats, the web service also allows four peptide scoring functions and choice of three statistical methods for assigning P‐values/E‐values to identified peptides. Users may upload their own protein database or use one of the available knowledge integrated organismal databases that contain annotated information such as single amino acid polymorphisms, post‐translational modifications, and their disease associations. The web service also provides a friendly interface to display, sort using different criteria, and download the identified peptides and proteins. RAId web service is freely available at https://www.ncbi.nlm.nih.gov/CBBresearch/Yu/raid  相似文献   

8.
Site‐specific chemical cross‐linking in combination with mass spectrometry analysis has emerged as a powerful proteomic approach for studying the three‐dimensional structure of protein complexes and in mapping protein–protein interactions (PPIs). Building on the success of MS analysis of in vitro cross‐linked proteins, which has been widely used to investigate specific interactions of bait proteins and their targets in various organisms, we report a workflow for in vivo chemical cross‐linking and MS analysis in a multicellular eukaryote. This approach optimizes the in vivo protein cross‐linking conditions in Arabidopsis thaliana, establishes a MudPIT procedure for the enrichment of cross‐linked peptides, and develops an integrated software program, exhaustive cross‐linked peptides identification tool (ECL), to identify the MS spectra of in planta chemical cross‐linked peptides. In total, two pairs of in vivo cross‐linked peptides of high confidence have been identified from two independent biological replicates. This work demarks the beginning of an alternative proteomic approach in the study of in vivo protein tertiary structure and PPIs in multicellular eukaryotes.  相似文献   

9.
Simultaneous sequencing, using a combination of mass spectrometry and Edman degradation, of three approximately 15-kDa variants of a cuticular protein extracted from the meal beetle Tenebrio molitor larva is demonstrated. The information obtained by matrix-assisted laser desorption ionization mass spectrometry (MALDI MS) time-course monitoring of enzymatic digests was found essential to identify the differences among the three variants and for alignment of the peptides in the sequence. To determine whether each individual insect larva contains all three protein variants, proteins extracted from single animals were separated by two-dimensional gel electrophoresis, electroeluted from the gel spots, and analyzed by MALDI MS. Molecular weights of the proteins present in each sample could be obtained, and mass spectrometric mapping of the peptides after digestion with trypsin gave additional information. The protein isoforms were found to be allelic variants.  相似文献   

10.
In this work, for the first time, a novel C60‐functionalized magnetic silica microsphere (designated C60‐f‐MS) was synthesized by radical polymerization of C60 molecules on the surface of magnetic silica microspheres. The resulting C60‐f‐MS microsphere has magnetite core and thin C60 modified silica shell, which endow them with useful magnetic responsivity and surface affinity toward low‐concentration peptides and proteins. As a result of their excellent magnetic property, the synthesized C60‐f‐MS microspheres can be easily separated from sample solution without ultracentrifuge. The C60‐f‐MS microspheres were successfully applied to the enrichment of low‐concentration peptides in tryptic protein digest and human urine via a MALDI‐TOF MS analysis. Moreover, they were demonstrated to have enrichment efficiency for low‐concentration proteins. Due to the novel materials maintaining excellent magnetic properties and admirable adsorption, the process of enrichment and desalting is very fast (only 5 min), convenient and efficient. As it has been demonstrated in the study, newly developed fullerene‐derivatized magnetic silica materials are superior to those already available in the market. The facile and low‐cost synthesis as well as the convenient and efficient enrichment process of the novel C60‐f‐MS microspheres makes it a promising candidate for isolation of low‐concentration peptides and proteins even in complex biological samples such as serum, plasma, and urine or cell lysate.  相似文献   

11.
In eukaryotes, mechanisms such as alternative splicing (AS) and alternative translation initiation (ATI) contribute to organismal protein diversity. Specifically, splicing factors play crucial roles in responses to environment and development cues; however, the underlying mechanisms are not well investigated in plants. Here, we report the parallel employment of short‐read RNA sequencing, single molecule long‐read sequencing and proteomic identification to unravel AS isoforms and previously unannotated proteins in response to abscisic acid (ABA) treatment. Combining the data from the two sequencing methods, approximately 83.4% of intron‐containing genes were alternatively spliced. Two AS types, which are referred to as alternative first exon (AFE) and alternative last exon (ALE), were more abundant than intron retention (IR); however, by contrast to AS events detected under normal conditions, differentially expressed AS isoforms were more likely to be translated. ABA extensively affects the AS pattern, indicated by the increasing number of non‐conventional splicing sites. This work also identified thousands of unannotated peptides and proteins by ATI based on mass spectrometry and a virtual peptide library deduced from both strands of coding regions within the Arabidopsis genome. The results enhance our understanding of AS and alternative translation mechanisms under normal conditions, and in response to ABA treatment.  相似文献   

12.
13.
Mass spectrometry is now an indispensable tool in the armamentarium of molecular biophysics, where it is used for tasks ranging from protein sequencing and mapping of post‐translational modifications to studies of higher order structure, conformational dynamics, and interactions of proteins with small molecule ligands and other biopolymers. This mini‐review highlights several popular mass spectrometry‐based tools that are now commonly used for structural studies of proteins beyond their covalent structure with a particular emphasis on hydrogen exchange and direct electrospray ionization mass spectrometry.  相似文献   

14.
We show a sensitive and straightforward off‐line nano‐LC‐MALDI‐MS/MS workflow that allowed the first comprehensive neuropeptidomic analysis of an insect disease vector. This approach was applied to identify neuropeptides in the brain of Rhodnius prolixus, a vector of Chagas disease. This work will contribute to the annotation of genes in the ongoing R. prolixus genome sequence project. Peptides were identified by de novo sequencing and comparisons to known neuropeptides from different organisms by database search. By these means, we were able to identify 42 novel neuropeptides from R. prolixus. The peptides were classified as extended FMRF‐amide‐related peptides, sulfakinins, myosuppressins, short neuropeptide F, long neuropeptide F, SIF‐amide‐related peptides, tachykinins, orcokinins, allatostatins, allatotropins, calcitonin‐like diuretic hormones, corazonin, and pyrokinin. Some of them were detected in multiple isoforms and/or truncated fragments. Interestingly, some of the R. prolixus peptides, as myosuppressin and sulfakinins, are unique in their characteristic C‐terminal domain among insect neuropeptides identified so far.  相似文献   

15.
Quantitative proteomics based on MS is useful for pointing out the differences in some food proteomes relevant to human nutrition. Stable isotope label‐free (SIF) techniques are suitable for comparing an unlimited number of samples by the use of relatively simple experimental workflows. We have developed an internal standard label‐free method based on the intensities of peptide precursor ions from MS/MS spectra, collected in data dependent runs, for the simultaneous qualitative characterization and relative quantification of storage proteins of Lupinus albus seeds in protein extracts of four lupin cultivars (cv Adam, Arés, Lucky, Multitalia). The use of an innovative microfluidic system, the HPLC‐Chip, coupled with a classical IT mass spectrometer, has allowed a complete qualitative characterization of all proteins. In particular, the homology search mode has permitted to identify single amino acid substitutions in the sequences of vicilins (β‐conglutin precursor and vicilin‐like protein). The MS/MS sequencing of substituted peptides confirms the high heterogeneity of vicilins according to the peculiar characteristics of the vicilin‐encoding gene family. Two suitable bioinformatics parameters were optimized for the differential analyses of the main bioactive proteins: the “normalized protein average of common reproducible peptides” (N‐ACRP) for γ‐conglutin, which is a homogeneous protein, and the “normalized protein mean peptide spectral intensity” (N‐MEAN) for the highly heterogenous class of the vicilins.  相似文献   

16.
17.
18.
19.
Post‐translationally modified peptides present in low concentrations are often not selected for CID, resulting in no sequence information for these peptides. We have developed a software POSTMan (POST‐translational Modification analysis) allowing post‐translationally modified peptides to be targeted for fragmentation. The software aligns LC‐MS runs (MS1 data) between individual runs or within a single run and isolates pairs of peptides which differ by a user defined mass difference (post‐translationally modified peptides). The method was validated for acetylated peptides and allowed an assessment of even the basal protein phosphorylation of phenylalanine hydroxylase (PHA) in intact cells.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号