Similar literature
Found 20 similar articles (search time: 15 ms)
1.
Protein quantification using data-independent acquisition methods such as SWATH-MS most commonly relies on spectral matching to a reference MS/MS assay library. To enable deep proteome coverage and efficient use of existing data, in silico approaches have been described to use archived or publicly available large reference spectral libraries for spectral matching. Because larger libraries increase the likelihood of false discoveries, new workflows are needed to ensure high confidence in protein matching under these conditions. We present a workflow that introduces a range of filters and thresholds aimed at increasing confidence that the resulting proteins are reliably detected and that their quantitation is consistent and reproducible. We demonstrated the workflow using extended libraries with SWATH data from human plasma samples and yeast-spiked human K562 cell lysate digest.

2.
This review provides a brief overview of the development of data-independent acquisition (DIA) mass spectrometry-based proteomics and selected DIA data analysis tools. Various DIA acquisition schemes for proteomics are summarized first, including Shotgun-CID, DIA, MSE, PAcIFIC, AIF, SWATH, MSX, SONAR, WiSIM, BoxCar, Scanning SWATH, diaPASEF, and PulseDIA, as well as the mass spectrometers enabling these methods. Next, the software tools for DIA data analysis are classified into three groups: library-based tools, library-free tools, and statistical validation tools. Approaches for generating spectral libraries are reviewed for six selected library-based DIA data analysis software tools tested by the authors: OpenSWATH, Spectronaut, Skyline, PeakView, DIA-NN, and EncyclopeDIA. An increasing number of library-free DIA data analysis tools have been developed, including DIA-Umpire, Group-DIA, PECAN, and PEAKS, which facilitate identification of novel proteoforms. The authors share their user experience of when to use DIA-MS and of several selected DIA data analysis software tools. Finally, state-of-the-art DIA mass spectrometry and software tools, and the authors' views of future directions, are summarized.

3.
Renal cell carcinoma (RCC) represents 2.2% of all cancer incidences; however, prognostic or predictive RCC biomarkers at the protein level are largely missing. To support proteomics research of localized and metastatic RCC, we introduce a new library of targeted mass spectrometry assays for accurate protein quantification in malignant and normal kidney tissue. Aliquots of 86 initially localized RCC, 75 metastatic RCC and 17 adjacent non-cancerous fresh frozen tissue lysates were trypsin digested, pooled, and fractionated using hydrophilic chromatography. The fractions were analyzed using LC-MS/MS on a QExactive HF-X mass spectrometer in data-dependent acquisition (DDA) mode. The resulting spectral library contains 77,817 peptides representing 7960 protein groups (FDR = 1%). Further, we confirm the applicability of this library on four RCC datasets measured in data-independent acquisition (DIA) mode, demonstrating specific quantification of a substantially increased part of the RCC proteome, depending on LC-MS/MS instrumentation. The impact of the sample specificity of the library on the results of targeted DIA data extraction was demonstrated by parallel analyses of two datasets with two pan-human libraries. The new RCC-specific library has the potential to contribute to a better understanding of RCC development at the molecular level, leading to new diagnostic and therapeutic targets.

4.
Data-independent acquisition (DIA/SWATH) MS is a primary strategy in quantitative proteomics. diaPASEF is a recent adaptation that uses trapped ion mobility spectrometry (TIMS) to improve selectivity and sensitivity. Complex DIA spectra are typically analyzed with reference to spectral libraries. The best-established method for generating libraries uses offline fractionation to increase depth of coverage. More recently, strategies for spectral library generation based on gas phase fractionation (GPF) have been introduced, in which a representative sample is injected serially using narrow DIA windows that cover different mass ranges of the complete precursor space; these performed comparably to deep offline fractionation-based libraries. We investigated whether an analogous GPF-based approach that accounts for the ion mobility (IM) dimension is useful for the analysis of diaPASEF data. We developed a rapid library generation approach using an IM-GPF acquisition scheme in the m/z versus 1/K0 space that requires seven injections of a representative sample, and compared it with libraries generated by direct deconvolution-based analysis of diaPASEF data or by deep offline fractionation. We found that library generation by IM-GPF outperformed direct library generation from diaPASEF and had performance approaching that of the deep library. This establishes the IM-GPF scheme as a pragmatic approach to rapid library generation for analysis of diaPASEF data.

5.
The characterization of peptides presented by human leukocyte antigen (HLA) class I molecules is crucial for understanding immune processes, biomarker discovery, and the development of novel immunotherapies or vaccines. Mass spectrometry allows the direct identification of thousands of HLA-bound peptides from cell lines, blood, or tissue. In recent years, data-independent acquisition (DIA) mass spectrometry methods have evolved, promising to increase reproducibility and sensitivity over classical data-dependent acquisition (DDA) workflows. Here, we describe a DIA setup on the Q Exactive mass spectrometer, optimized for the unique properties of HLA class I peptides. The methodology enables sensitive and highly reproducible characterization of HLA peptidomes from individual cell lines. From up to 16 DDA analyses of 100 million human cells, more than 10 000 peptides could be confidently identified, serving as the basis for the generation of spectral libraries. This knowledge enabled the subsequent interrogation of DIA data, leading to the identification of peptide sets with >90% overlap between replicate samples, a prerequisite for the comparative study of closely related specimens. Furthermore, >3000 peptides could be identified from just one million cells after DIA analysis using a library generated from 300 million cells. The reduction in sample quantity and the high reproducibility of DIA-based HLA peptidome analysis should facilitate personalized medicine applications.

6.
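Data-independent acquisition (DIA) is a mass spectrometry acquisition technique that has developed rapidly in proteomics in recent years. By fragmenting all precursor ions within an isolation window without bias to acquire MS/MS spectra, it can in theory achieve deep coverage of protein samples while offering high throughput, high reproducibility, and high sensitivity. Existing DIA acquisition methods fall into three categories: full-window fragmentation methods, sequential isolation-window fragmentation methods, and four-dimensional DIA acquisition methods (4D-DIA). Depending on the characteristics of the DIA data, the main data analysis methods fall into four categories: spectral library search, direct protein sequence database search, pseudo-MS/MS spectrum identification, and de novo sequencing. The resulting peptide identifications require confidence assessment, which comprises two steps (rescoring with machine learning methods and false discovery rate estimation for the reported result set) and provides quality control of the analysis results. This article organizes and reviews DIA acquisition methods, data analysis methods and software, and methods for assessing the confidence of identification results, and looks ahead to future directions.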
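
The false discovery rate estimation step named in this abstract can be illustrated with a minimal target-decoy sketch in Python. The abstract does not say which estimator the reviewed tools use; the function name `qvalues_target_decoy`, the `(score, is_decoy)` input layout, and the simple decoys/targets ratio are all assumptions made for illustration, and ties between scores are not treated specially.

```python
from typing import List, Tuple


def qvalues_target_decoy(psms: List[Tuple[float, bool]]) -> List[float]:
    """Return one q-value per input match.

    Each match is (score, is_decoy); a higher score means a better match.
    FDR at a score threshold is estimated as decoys / targets above it
    (an illustrative choice, not taken from the abstract).
    """
    # Indices of matches ordered from best to worst score.
    order = sorted(range(len(psms)), key=lambda i: psms[i][0], reverse=True)

    fdrs = []
    targets = 0
    decoys = 0
    for i in order:
        if psms[i][1]:
            decoys += 1
        else:
            targets += 1
        # Guard against division by zero when only decoys have been seen.
        fdrs.append(decoys / max(targets, 1))

    # q-value: the smallest FDR at this threshold or any more permissive one.
    qvals = [0.0] * len(psms)
    running_min = float("inf")
    for rank in range(len(order) - 1, -1, -1):
        running_min = min(running_min, fdrs[rank])
        qvals[order[rank]] = running_min
    return qvals
```

Matches whose q-value falls below a chosen cutoff (1% is a common choice) would then be reported; the cutoff itself is a user decision and is not specified in the abstract.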

7.
Data-independent acquisition (DIA) of tandem mass spectrometry spectra has emerged as a promising technology to improve coverage and quantification of proteins in complex mixtures. The success of DIA experiments depends on the quality of the spectral libraries used for database searching. Frequently, these libraries need to be generated by labor- and time-intensive data-dependent acquisition (DDA) experiments. Recently, several algorithms have been published that allow the generation of theoretical libraries by efficient prediction of the retention time and intensity of the fragment ions. Sequential windowed acquisition of all theoretical fragment ion spectra mass spectrometry (SWATH-MS) is a DIA method that can be applied at an unprecedented speed, but its fragmentation spectra are of lower quality than data acquired on Orbitrap instruments. To reliably generate theoretical libraries that can be used in SWATH experiments, we developed deep-learning for SWATH analysis (dpSWATH) to improve the sensitivity and specificity of data generated by Q-TOF mass spectrometers. The theoretical library built by dpSWATH allowed us to increase the identification rate of proteins compared to traditional or library-free methods. Based on our analysis, we conclude that dpSWATH is a better prediction framework for SWATH-MS measurements than other algorithms based on Orbitrap data.

8.
Data-independent acquisition (DIA) generates comprehensive yet complex mass spectrometric data, which imposes the use of data-dependent acquisition (DDA) libraries for deep peptide-centric detection. Here, it is shown that DIA can be redeemed from this dependency by combining predicted fragment intensities and retention times with narrow window DIA. This eliminates variation in library building and omits stochastic sampling, finally making the DIA workflow fully deterministic. Especially for clinical proteomics, this has the potential to facilitate inter-laboratory comparison.

9.
Advances in liquid chromatography-mass spectrometry have facilitated the incorporation of proteomic studies into many experimental workflows in biology. Data-independent acquisition platforms, such as sequential window acquisition of all theoretical mass spectra (SWATH-MS), offer several advantages for label-free quantitative assessment of complex proteomes over data-dependent acquisition (DDA) approaches. However, SWATH data interpretation requires spectral libraries as a detailed reference resource. The guinea pig (Cavia porcellus) is an excellent experimental model for translation to many aspects of human physiology and disease, yet there is limited experimental information regarding its proteome. To overcome this knowledge gap, a comprehensive spectral library of the guinea pig proteome is generated. Homogenates and tryptic digests are prepared from 16 tissues and subjected to >200 DDA runs. Analysis of >250 000 peptide-spectrum matches resulted in a library of 73 594 peptides from 7666 proteins. Library validation is provided by i) analyzing externally derived SWATH files (https://doi.org/10.1016/j.jprot.2018.03.023) and comparing peptide intensity quantifications; ii) merging externally derived data into the base library. This furnishes the research community with a comprehensive proteomic resource that will facilitate future molecular-phenotypic studies using (re-engaging) the guinea pig as an experimental model of relevance to human biology. The spectral library and raw data are freely accessible in the MassIVE repository (MSV000083199).

10.
Post-translational modifications (PTMs) dynamically regulate proteins and biological pathways, typically through the combined effects of multiple PTMs. Lysine residues are targeted for various PTMs, including malonylation and succinylation. However, PTMs offer specific challenges to mass spectrometry-based proteomics during data acquisition and processing. Thus, novel and innovative workflows using data-independent acquisition (DIA) ensure confident PTM identification, precise site localization, and accurate and robust label-free quantification. In this study, we present a powerful approach that combines antibody-based enrichment with comprehensive DIA acquisitions and spectral library-free data processing using directDIA (Spectronaut). Identical DIA data can be used to generate spectral libraries and comprehensively identify and quantify PTMs, reducing the amount of enriched sample and acquisition time needed, while offering a fully automated workflow. We analyzed brains from wild-type and Sirtuin 5 (SIRT5)-knock-out mice, and discovered and quantified 466 malonylated and 2211 succinylated peptides. SIRT5 regulation remodeled the acylomes by targeting 164 malonylated and 578 succinylated sites. Affected pathways included carbohydrate and lipid metabolisms, synaptic vesicle cycle, and neurodegenerative diseases. We found 48 common SIRT5-regulated malonylation and succinylation sites, suggesting potential PTM crosstalk. This innovative and efficient workflow offers deeper insights into the mouse brain lysine malonylome and succinylome.

11.
Data-independent acquisition (DIA) proteomics techniques have matured enormously in recent years, thanks to multiple technical developments in, for example, instrumentation and data analysis approaches. However, many improvements are still possible for DIA data with respect to the FAIR (Findability, Accessibility, Interoperability and Reusability) data principles. These include more tailored data sharing practices and open data standards, since public databases and data standards for proteomics were mostly designed with DDA data in mind. Here we first describe the current state of the art in the context of FAIR data for proteomics in general, and for DIA approaches in particular. To improve the current situation for DIA data, we make the following recommendations for the future: (i) develop an open data standard for spectral libraries; (ii) make mandatory the availability of the spectral libraries used in DIA experiments in ProteomeXchange resources; (iii) improve the support for DIA data in the data standards developed by the Proteomics Standards Initiative; and (iv) improve the support for DIA datasets in ProteomeXchange resources, including more tailored metadata requirements.

12.
A quadrupole time-of-flight mass spectrometer coupled with trapped ion mobility spectrometry (timsTOF) and operated in parallel accumulation-serial fragmentation (PASEF) mode has recently emerged as a platform capable of providing four-dimensional (4D) features comprising elution time, collision cross section (CCS), mass-to-charge ratio, and intensity of peptides. The PASEF mode provides ∼100% ion sampling efficiency in both data-dependent acquisition (DDA) and data-independent acquisition (DIA) modes without sacrificing sensitivity. In addition, targeted measurements using PASEF-integrated parallel reaction monitoring (PRM) mode have also been described. However, only a limited number of studies have used timsTOF for the analysis of clinical samples. Although Orbitrap mass spectrometers have been used for biomarker discovery from cerebrospinal fluid (CSF) in a variety of neurological diseases, these Orbitrap-derived datasets cannot readily be applied for driving experiments on timsTOF mass spectrometers. We generated a catalog of peptides and proteins in human CSF in DDA mode on a timsTOF mass spectrometer and used these data to build a spectral library. This strategy allowed us to use elution times and ion mobility values from the spectral library to design PRM experiments for quantifying previously discovered biomarkers from CSF samples in Alzheimer's disease. When the same samples were analyzed using a DIA approach combined with a spectral library search, a higher number of proteins were identified than in a library-free approach. Overall, we have established a spectral library of CSF as a resource and demonstrated its utility for PRM and DIA studies, which should facilitate studies of neurological disorders.

13.
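Data-independent acquisition (DIA) is a high-throughput, unbiased mass spectrometry data acquisition method with good quantitative reproducibility and good sensitivity to low-abundance proteins, and in recent years it has become one of the preferred methods for large-cohort proteomics studies. Because the MS/MS spectra produced by DIA are chimeric and contain fragment ion information from multiple peptides, protein identification and quantification are more difficult. Current DIA data analysis methods fall into two broad classes: peptide-centric and spectrum-centric. Peptide-centric analysis offers more sensitive identification and more accurate quantification, and has become the mainstream approach to DIA data analysis. Its workflow comprises four key steps: building a spectral library, extracting chromatographic peak groups, feature scoring, and quality control of results. This article reviews the peptide-centric DIA data analysis workflow, introduces data analysis software based on this workflow and related comparative evaluation work, summarizes existing algorithmic improvements, and finally looks ahead to future directions.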

14.
Data-independent acquisition (DIA) is an emerging technology for quantitative proteomics. Current DIA focuses on the identification and quantitation of fragment ions that are generated from multiple peptides contained in the same selection window of several to tens of m/z. An alternative approach is WiSIM-DIA, which combines conventional DIA with wide-SIM (wide selected-ion monitoring) windows to partition the precursor m/z space and produce high-quality precursor ion chromatograms. However, WiSIM-DIA has been underexplored; it remains unclear whether it is a viable alternative to DIA. We demonstrate that WiSIM-DIA quantified more than 24 000 unique peptides over five orders of magnitude in a single 2 h analysis of a neuronal synapse-enriched fraction, compared to 31 000 in DIA. There is a strong correlation between the abundance values of peptides quantified in both the DIA and WiSIM-DIA datasets. Interestingly, the S/N ratios of these peptides are not correlated. We further show that peptide identification directly from DIA spectra identified >2000 proteins, which included unique peptides not found in spectral libraries generated by DDA.

15.
To address the increasing need for detecting and validating protein biomarkers in clinical specimens, mass spectrometry (MS)-based targeted proteomic techniques, including selected reaction monitoring (SRM), parallel reaction monitoring (PRM), and massively parallel data-independent acquisition (DIA), have been developed. For optimal performance, they require the fragment ion spectra of targeted peptides as prior knowledge. In this report, we describe an MS pipeline and spectral resource to support targeted proteomics studies of human tissue samples. To build the spectral resource, we integrated common open-source MS computational tools to assemble a freely accessible computational workflow based on Docker. We then applied the workflow to generate DPHL, a comprehensive DIA pan-human library, from 1096 data-dependent acquisition (DDA) MS raw files for 16 types of cancer samples. This extensive spectral resource was then applied to a proteomic study of 17 prostate cancer (PCa) patients. Thereafter, PRM validation was applied to a larger study of 57 PCa patients, and the differential expression of three proteins in prostate tumor was validated. As a second application, the DPHL spectral resource was applied to a study consisting of plasma samples from 19 diffuse large B cell lymphoma (DLBCL) patients and 18 healthy control subjects. Differentially expressed proteins between DLBCL patients and healthy control subjects were detected by DIA-MS and confirmed by PRM. These data demonstrate that DPHL supports DIA and PRM MS pipelines for robust protein biomarker discovery. DPHL is freely accessible at https://www.iprox.org/page/project.html?id=IPX0001400000.

16.
For data-independent acquisition by means of sequential window acquisition of all theoretical fragment ion spectra (SWATH), a reference library of data-dependent acquisition (DDA) runs is typically used to correlate the quantitative data from the fragment ion spectra with peptide identifications. The quality and coverage of such a reference library are therefore essential when processing SWATH data. In general, library sizes can be increased by reducing the impact of DDA precursor selection with replicate runs or fractionation. However, these strategies can affect the match between the library and the SWATH measurement, and thus larger library sizes do not necessarily correspond to improved SWATH quantification. Here, three fractionation strategies to increase local library size were compared to standard library building using replicate DDA injection: protein SDS-PAGE fractionation, peptide high-pH RP-HPLC fractionation, and MS-acquisition gas phase fractionation. The impact of these libraries on SWATH performance was evaluated in terms of the number of extracted peptides and proteins, the match quality of the peptides, and the extraction reproducibility of the transitions. These analyses were conducted using the hydrophilic proteome of differentiating human embryonic stem cells. Our results show that SWATH quantitative results and interpretations are affected by the choice of fractionation technique. Data are available via ProteomeXchange with identifier PXD006190.

17.
Quantitative proteomics methods have emerged as powerful tools for measuring protein expression changes at the proteome level. Using MS-based approaches, it is now possible to routinely quantify thousands of proteins. However, prefractionation of the samples at the protein or peptide level is usually necessary to go deep into the proteome, increasing both MS analysis time and technical variability. Recently, a new MS acquisition method named SWATH was introduced with the potential to provide good coverage of the proteome as well as good measurement precision without prior sample fractionation. In contrast to shotgun-based MS, however, a library containing experimentally acquired spectra is necessary for the bioinformatics analysis of SWATH data. In this study, spectral libraries are built for two widely used models for studying crop ripening and animal embryogenesis, Solanum lycopersicum (tomato) and Drosophila melanogaster, respectively. The spectral libraries comprise fragments for 5197 and 6040 proteins for S. lycopersicum and D. melanogaster, respectively, and allow reproducible quantification of thousands of peptides per MS analysis. The spectral libraries and all MS data are available in the MassIVE repository with the dataset identifiers MSV000081074 and MSV000081075 and in the PRIDE repository with the dataset identifiers PXD006493 and PXD006495.

18.
Proteogenomics is based on the use of customized genome or RNA sequencing databases for interrogation of shotgun proteomics data in search of proteome-level evidence of genome variations or RNA editing. In this work, the products of adenosine-to-inosine RNA editing in human and murine brain proteomes are identified using publicly available brain proteome LC-MS/MS datasets and an RNA editome database compiled from several sources. After filtering of false-positive results, 20 and 37 sites of editing in proteins belonging to 14 and 32 genes are identified for murine and human brain proteomes, respectively. Eight sites of editing identified with high spectral counts overlapped between human and mouse brain samples. Some of these sites, such as those in α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid (AMPA) glutamate receptors, CYFIP2, and coatomer alpha, have been previously reported using orthogonal methods. Also, differential editing between neurons and microglia is demonstrated in this work for some of the proteins from primary murine brain cell cultures. Because many edited sites are still not characterized functionally at the protein level, the results provide a necessary background for their further analysis in normal and diseased cells and tissues using targeted proteomic approaches.

19.
Data-independent acquisition (DIA) approaches, such as SWATH®-MS, are showing great potential to reliably quantify significant numbers of peptides and proteins in an unbiased manner. These developments have enhanced interest in developing a single DIA method that integrates qualitative and quantitative analysis, eliminating the need for a prebuilt library of peptide spectra, which are created through data-dependent acquisition methods or obtained from public repositories. Here, we introduce a new DIA approach, referred to as "SWATH-ID," which was developed to allow peptide identification as well as quantitation. The SWATH-ID method is composed of small Q1 windows, achieving better selectivity and thus significantly improving high-confidence peptide extractions from data files. Furthermore, the SWATH-ID approach transmits precursor ions without fragmentation as well as their fragments within the same SWATH acquisition period. This provides a single scan that includes all precursor ions within the isolation window as well as a record of all of their fragment ions, substantially negating the need for a survey scan. In this way, all precursors present in a small Q1 window are associated with their fragment ions, improving the identification specificity and providing a more comprehensive and in-depth view of protein and peptide species in complex samples.

20.
Artemisia annua is well known for biosynthesizing the antimalarial drug artemisinin. Here, a global proteomic profiling of A. annua is conducted, identifying a total of 13 403 proteins based on the genome sequence annotation database. Furthermore, a spectral library is generated to perform quantitative proteomic analysis using data-independent acquisition mass spectrometry. Specifically, proteins from two chemotypes that produce high (HAP) and low (LAP) artemisinin content, respectively, are comprehensively quantified and compared. Applying p-value and fold-change criteria, 182 proteins are found whose abundance differs significantly between the high artemisinin content plants (HAPs) and the low artemisinin content plants (LAPs). Data are available via ProteomeXchange with identifier PXD015547. Overall, this study globally identifies the proteome of A. annua and quantitatively compares the targeted sub-proteomes of the two cultivars, HAP and LAP, providing systematic information on the metabolic pathways of A. annua.
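
The selection by p-value and fold change described in this abstract can be sketched in Python as follows. The abstract does not give the cutoffs or the statistical test; the function name `select_differential`, the input layout, and the default thresholds (p < 0.05 and at least a two-fold change in either direction) are assumptions chosen only to make the example concrete.

```python
import math
from typing import Dict, List, Tuple


def select_differential(
    stats: Dict[str, Tuple[float, float]],
    max_p: float = 0.05,            # assumed cutoff, not from the abstract
    min_abs_log2_fc: float = 1.0,   # assumed cutoff (two-fold), not from the abstract
) -> List[str]:
    """Return protein IDs passing both the p-value and fold-change criteria.

    `stats` maps a protein ID to (fold_change, p_value), where fold_change
    is the HAP/LAP abundance ratio and must be positive.
    """
    selected = []
    for protein, (fold_change, p_value) in stats.items():
        if fold_change <= 0:
            # A non-positive ratio has no log2 value; skip it.
            continue
        if p_value < max_p and abs(math.log2(fold_change)) >= min_abs_log2_fc:
            selected.append(protein)
    return sorted(selected)
```

Using the absolute log2 fold change treats increases and decreases symmetrically, so a protein at 0.4 times the LAP level passes the same test as one at 2.5 times.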

