Similar articles
20 similar articles found (search time: 531 ms)
1.

Background

In a single proteomic project, tandem mass spectrometers can produce hundreds of millions of tandem mass spectra. However, the majority of these spectra are of poor quality, and searching them for peptides wastes time. Quality assessment before database search is therefore very useful in the pipeline of protein identification via tandem mass spectra, especially for reducing search time and the number of false identifications. Most existing methods for quality assessment are supervised machine learning methods based on a number of features that describe the quality of tandem mass spectra. These methods need training datasets in which the quality of every spectrum is known, and such datasets are usually unavailable for new data.

Results

This study proposes an unsupervised machine learning method for quality assessment of tandem mass spectra that requires no training dataset. The proposed method estimates the conditional probabilities of spectra being high quality from the quality assessments based on individual features. The probabilities are estimated by solving a constrained optimization problem. An efficient algorithm is developed to solve this problem and is proved to be convergent. Experimental results on two datasets show that searching only the tandem spectra judged to be of high quality by the proposed method saves about 56% and 62% of database searching time, respectively, while losing only a small number of high-quality spectra.

Conclusions

The results indicate that the proposed method performs well in the quality assessment of tandem mass spectra and that the way the conditional probabilities are estimated is effective.

2.

Background

Sequence database searching has been the dominant method for peptide identification: a large number of peptide spectra generated from LC/MS/MS experiments are searched, using a search engine, against theoretical fragmentation spectra derived from a protein sequence database or a spectral library. Selecting trustworthy peptide spectrum matches (PSMs) remains a challenge.

Results

A novel scoring method named FC-Ranker is developed to assign a nonnegative weight to each target PSM based on the possibility of its being correct. Particularly, the scores of PSMs are updated by using a fuzzy SVM classification model and a fuzzy silhouette index iteratively. Trustworthy PSMs will be assigned high scores when the algorithm stops.

Conclusions

Our experimental studies show that FC-Ranker outperforms other post-database search algorithms over a variety of datasets, and it can be extended to solve a general classification problem with uncertain labels.

3.
Yang Runmin, Zhu Daming. BMC Genomics, 2018, 19(7):666-39

Background

Database search has been the main approach for proteoform identification by top-down tandem mass spectrometry. However, when the target proteoform that produced the spectrum contains post-translational modifications (PTMs) and/or mutations, it is quite time consuming to align a query spectrum against all protein sequences without any PTMs and mutations in a large database. Consequently, it is essential to develop efficient and sensitive filtering algorithms for speeding up database search.

Results

In this paper, we propose a spectrum graph matching (SGM) based protein sequence filtering method for top-down mass spectral identification. It uses the subspectra of a query spectrum to generate spectrum graphs and searches them against a protein database to report the best candidates. Whereas the sequence tag and gapped tag approaches need a preprocessing step to extract and select tags, the SGM filtering method circumvents this step, thus simplifying data processing. We evaluated the filtration efficiency of the SGM filtering method with various parameter settings on an Escherichia coli top-down mass spectrometry data set and compared the performance of the SGM filtering method with that of two tag-based filtering methods on a data set of MCF-7 cells.
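The general notion of a spectrum graph can be sketched as follows. This is a generic illustration and not the SGM method itself: the function names, the 0.01 Da tolerance and the shortened residue table are assumptions made for the example. Peaks become nodes, and two peaks are joined by an edge when their mass difference matches an amino acid residue mass.

```python
from itertools import combinations

# Monoisotopic residue masses (Da) for a few amino acids only;
# a complete table would list all 20 residues.
RESIDUE_MASS = {
    "G": 57.02146,
    "A": 71.03711,
    "S": 87.03203,
    "V": 99.06841,
    "L": 113.08406,
}


def build_spectrum_graph(peak_masses, tolerance=0.01):
    """Return edges (lighter_mass, heavier_mass, residue) between peaks whose
    mass difference matches a residue mass within `tolerance` Da."""
    edges = []
    for low, high in combinations(sorted(peak_masses), 2):
        diff = high - low
        for residue, mass in RESIDUE_MASS.items():
            if abs(diff - mass) <= tolerance:
                edges.append((low, high, residue))
    return edges


def longest_tag(edges):
    """Longest residue string spelled by a path in the graph. The graph is
    acyclic because every edge points from a lighter to a heavier peak."""
    best = {}  # node mass -> longest tag ending at that node
    for low, high, residue in sorted(edges, key=lambda e: e[1]):
        candidate = best.get(low, "") + residue
        if len(candidate) > len(best.get(high, "")):
            best[high] = candidate
    return max(best.values(), key=len, default="")
```

For peaks at 100.0, 157.02146, 228.05857 and 327.12698 Da, the graph has three edges and the longest path spells the tag GAV.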

Conclusions

Experimental results on the data sets show that the SGM filtering method achieves high sensitivity in protein sequence filtration. When coupled with a spectral alignment algorithm, the SGM filtering method significantly increases the number of identified proteoform-spectrum matches compared with the tag-based methods in top-down mass spectrometry data analysis.

4.
5.

Introduction

Persons living with HIV (PLWH) are at higher risk for cardiovascular disease (CVD) events than uninfected persons. Current risk-stratification methods to define PLWH at highest risk for CVD events are lacking.

Methods

Using tandem flow injection mass spectrometry, we quantified plasma levels of 60 metabolites in 24 matched pairs of PLWH [1:1 with and without known coronary artery disease (CAD)]. Metabolite levels were reduced to interpretable factors using principal components analysis.
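To illustrate the dimension-reduction step, the sketch below extracts the first principal component of a small sample-by-metabolite table by power iteration. It is a minimal illustration and not the study's analysis: the function name and the toy layout (rows are samples, columns are metabolites) are assumptions, and a real analysis would likely rely on an established statistics package and retain several components.

```python
def first_principal_component(rows, iterations=500):
    """Leading PCA loading vector and per-sample scores via power iteration.

    `rows` is a list of samples, each a list of metabolite levels.
    """
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    # Sample covariance matrix (p x p).
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(p)]
           for a in range(p)]
    vec = [1.0] * p  # starting vector; assumed not orthogonal to the answer
    for _ in range(iterations):
        nxt = [sum(cov[a][b] * vec[b] for b in range(p)) for a in range(p)]
        norm = sum(v * v for v in nxt) ** 0.5
        vec = [v / norm for v in nxt]
    # Project each centered sample onto the loading vector.
    scores = [sum(c[j] * vec[j] for j in range(p)) for c in centered]
    return vec, scores
```

The loading vector shows which metabolites move together, which is what makes a factor interpretable; the scores are the per-sample factor values that can then be compared between cases and controls.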

Results

Factors derived from short-chain dicarboxylacylcarnitines (SCDA) (p = 0.08) and glutamine/valine (p = 0.003) were elevated in CAD cases compared to controls.

Conclusion

SCDAs and glutamine/valine may be valuable markers of cardiovascular risk among persons living with HIV in the future, pending validation in larger cohorts.

6.

Introduction

Data sharing is being increasingly required by journals and has been heralded as a solution to the ‘replication crisis’.

Objectives

(i) Review data sharing policies of journals publishing the most metabolomics papers associated with open data and (ii) compare these journals’ policies to those that publish the most metabolomics papers.

Methods

A PubMed search was used to identify metabolomics papers. Metabolomics data repositories were manually searched for linked publications.

Results

Journals that support data sharing are not necessarily those with the most papers associated with open metabolomics data.

Conclusion

Further efforts are required to improve data sharing in metabolomics.

7.

Introduction

Metabolite identification in biological samples using Nuclear Magnetic Resonance (NMR) spectra is a challenging task due to the complexity of the biological matrices.

Objectives

This paper introduces a new, automated computational scheme for the identification of metabolites in 1D 1H NMR spectra based on the Human Metabolome Database.

Methods

The methodological scheme comprises the sequential application of preprocessing, data reduction, metabolite screening and combination selection.

Results

The proposed scheme has been tested on the 1D 1H NMR spectra of: (a) an amino acid mixture, (b) a serum sample spiked with the amino acid mixture, (c) 20 blood serum samples, (d) 20 human amniotic fluid samples, and (e) 160 serum samples from a publicly available database. The scheme was compared against widely used software tools and exhibited good performance in terms of correct assignment of the metabolites.

Conclusions

This new, robust scheme automatically identifies peak resonances in 1H-NMR spectra with high accuracy and less human intervention, and has a wide range of applications in metabolic profiling.

8.

Introduction

Concerning NMR-based metabolomics, 1D spectra processing often requires an expert eye for disentangling the intertwined peaks.

Objectives

The objective of NMRProcFlow is to assist the expert in this task as effectively as possible without requiring programming skills.

Methods

NMRProcFlow was developed to be a graphical and interactive 1D NMR (1H & 13C) spectra processing tool.

Results

NMRProcFlow (http://nmrprocflow.org), dedicated to metabolic fingerprinting and targeted metabolomics, covers all spectra processing steps including baseline correction, chemical shift calibration and alignment.

Conclusion

Biologists and NMR spectroscopists can easily interact and develop synergies by visualizing the NMR spectra along with their corresponding experimental-factor levels, thus setting a bridge between experimental design and subsequent statistical analyses.

9.

Introduction

Atherosclerotic diseases are the leading cause of death worldwide. Biomarkers of atherosclerosis are required to monitor and prevent disease progression. While mass spectrometry is a promising technique to search for such biomarkers, its clinical application is hampered by the laborious processes for sample preparation and analysis.

Methods

We developed a rapid method to detect plasma metabolites by probe electrospray ionization mass spectrometry (PESI-MS), which employs an ambient ionization technique enabling atmospheric pressure rapid mass spectrometry. To create an automatic diagnosis system of atherosclerotic disorders, we applied machine learning techniques to the obtained spectra.

Results

Using our system, we successfully discriminated between rabbits with and without dyslipidemia. The causes of dyslipidemia (genetic lipoprotein receptor deficiency or dietary cholesterol overload) were also distinguishable by this method. Furthermore, after induction of atherosclerosis in rabbits with a cholesterol-rich diet, we were able to detect dynamic changes in plasma metabolites. The major metabolites detected by PESI-MS included cholesterol sulfate and a phospholipid (PE18:0/20:4), which are promising new biomarkers of atherosclerosis.

Conclusion

We developed a remarkably fast and easy method to detect potential new biomarkers of atherosclerosis in plasma using PESI-MS.

10.
11.

Introduction

Onion (Allium cepa) represents one of the most important horticultural crops and is used as food, spice and medicinal plant almost worldwide. Onion bulbs accumulate a broad range of primary and secondary metabolites which impact nutritional, sensory and technological properties.

Objectives

To complement existing analytical methods targeting individual compound classes, this work aimed to develop and validate an analytical workflow for comprehensive metabolite profiling of onion bulbs.

Method

Metabolite profiling was performed by liquid chromatography coupled with electrospray ionization quadrupole time-of-flight mass spectrometry (LC/ESI-QTOFMS). For annotation of metabolites, accurate-mass tandem mass spectrometry experiments were carried out.

Results

On the basis of LC/ESI-QTOFMS and two chromatographic methods an analytical workflow was developed which facilitates profiling of polar and semi-polar onion metabolites including fructooligosaccharides, proteinogenic amino acids, peptides, S-substituted cysteine conjugates, flavonoids and saponins. To minimize enzymatic conversion of S-alk(en)ylcysteine sulfoxides, a sample preparation and extraction protocol for fresh onions was developed comprising cryohomogenization and a low-temperature quenching step. A total of 123 metabolites were annotated and characterized by chromatographic and tandem mass spectral data. For validation, recovery rates and matrix effects were determined for 15 model compounds. Repeatability and linearity were assessed for more than 80 endogenous metabolites.

Conclusion

As exemplarily demonstrated by comparative metabolic analysis of six onion cultivars the established analytical workflow in combination with targeted and non-targeted data analysis strategies can be successfully applied for comprehensive metabolite profiling of onion bulbs.

12.

Background

Liquid chromatography combined with tandem mass spectrometry is an important tool in proteomics for peptide identification. Liquid chromatography temporally separates the peptides in a sample. The peptides that elute one after another are analyzed via tandem mass spectrometry by measuring the mass-to-charge ratio of a peptide and its fragments. De novo peptide sequencing is the problem of reconstructing the amino acid sequence of a peptide from this measurement data. Past de novo sequencing algorithms consider solely the mass spectrum of the fragments when reconstructing a sequence.

Results

We propose to additionally exploit the information obtained from liquid chromatography. We study the problem of computing a sequence that is not only in accordance with the experimental mass spectrum, but also with the chromatographic retention time. We consider three models for predicting the retention time and develop algorithms for de novo sequencing for each model.
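The idea of requiring agreement with the retention time can be sketched with a simple additive prediction model. The abstract does not name its three models, so the additive form, the coefficient values and the function names below are all assumptions chosen for illustration, and the sketch only filters ready-made candidates rather than performing de novo sequencing.

```python
# Hypothetical per-residue retention coefficients (arbitrary units);
# real values would be fitted to calibration data.
RT_COEFF = {"G": 0.0, "A": 1.0, "V": 4.0, "L": 9.0, "K": -2.0}


def predict_rt(sequence):
    """Additive model: retention time is the sum of residue coefficients."""
    return sum(RT_COEFF[residue] for residue in sequence)


def filter_by_rt(candidates, observed_rt, tolerance=1.0):
    """Keep candidate sequences whose predicted retention time lies within
    `tolerance` of the observed retention time."""
    return [seq for seq in candidates
            if abs(predict_rt(seq) - observed_rt) <= tolerance]
```

With an observed retention time of 13.5, the candidates GAVL, GAKL and AAVK have predicted times of 14, 8 and 4, so only GAVL survives. An additive model of this kind cannot separate sequences with the same amino acid composition; a position-dependent model would be needed for that.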

Conclusions

Based on an evaluation for two prediction models on experimental data from synthesized peptides we conclude that the identification rates are improved by exploiting the chromatographic information. In our evaluation, we compare our algorithms using the retention time information with algorithms using the same scoring model, but not the retention time.

13.

Introduction

Due to its proximity to the brain, cerebrospinal fluid (CSF) could be a medium of choice for the discovery of biomarkers of neurological and psychiatric diseases using untargeted analytical approaches.

Objectives

This study explored the CSF lipidome in order to generate a robust mass spectral database using an untargeted lipidomic approach.

Methods

Cerebrospinal fluid samples from 45 individuals were analyzed by liquid chromatography coupled to high-resolution mass spectrometry (LC-HRMS). A dedicated data processing workflow was implemented using XCMS software and adapted filters to select reliable features. In addition, an automatic annotation using an in silico lipid database and several MS/MS experiments were performed to identify CSF lipid species.

Results

Using this complete workflow, 771 analytically relevant monoisotopic lipid species, corresponding to 550 unique lipids that represent five major lipid families (i.e., free fatty acids, sphingolipids, glycerophospholipids, glycerolipids, and sterol lipids), were detected and annotated. In addition, MS/MS experiments improved the annotation of 304 lipid species. Thanks to LC-HRMS, it was possible to discriminate between isobaric and also isomeric lipid species; interestingly, our study showed that isobaric ions represent about 50% of the total annotated lipid species in human CSF.

Conclusion

This work provides an extensive LC-HRMS database of the human CSF lipidome, which constitutes a relevant foundation for future studies aimed at finding biomarkers of neurological disorders.

14.

Introduction

Data processing is one of the biggest problems in metabolomics, given the high number of samples analyzed and the need for multiple software packages for each step of the processing workflow.

Objectives

To merge the steps required for metabolomics data processing into a single platform.

Methods

KniMet is a workflow for the processing of mass spectrometry-metabolomics data based on the KNIME Analytics platform.

Results

The approach includes key steps to follow in metabolomics data processing: feature filtering, missing value imputation, normalization, batch correction and annotation.
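Three of the listed steps can be sketched on a small intensity table as follows. This is a plain Python illustration and not KniMet or KNIME code: the function names, the 50% missingness cutoff, half-minimum imputation and total-sum normalization are assumptions picked as common simple choices. Rows are samples, columns are features, and `None` marks a missing value.

```python
def filter_features(table, max_missing=0.5):
    """Drop features (columns) missing in more than `max_missing` of samples."""
    n_samples = len(table)
    n_features = len(table[0])
    keep = [j for j in range(n_features)
            if sum(row[j] is None for row in table) / n_samples <= max_missing]
    return [[row[j] for j in keep] for row in table]


def impute_half_minimum(table):
    """Replace each missing value with half the smallest observed value of
    the same feature (assumes every feature has at least one observation)."""
    columns = list(zip(*table))
    fills = [min(v for v in col if v is not None) / 2 for col in columns]
    return [[fills[j] if v is None else v for j, v in enumerate(row)]
            for row in table]


def normalize_total_sum(table):
    """Scale each sample (row) so that its intensities sum to 1."""
    return [[v / sum(row) for v in row] for row in table]
```

The functions are meant to be chained in the order filter, impute, normalize, so that imputation never sees a feature with no observed values.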

Conclusion

KniMet provides the user with a local, modular and customizable workflow for the processing of both GC–MS and LC–MS open profiling data.

15.
16.

Introduction

Tandem mass spectrometry (MS/MS) has been widely used for identifying metabolites in many areas. However, computationally identifying metabolites from MS/MS data is challenging because the fragmentation rules, which determine the precedence of chemical bond dissociation, are unknown. Although this problem has been tackled in different ways, the lack of computational tools to flexibly represent adjacent structures of chemical bonds remains a long-term bottleneck for studying fragmentation rules.

Objectives

This study aimed to develop computational methods for investigating fragmentation rules by analyzing annotated MS/MS data.

Methods

We implemented a computational platform, MIDAS-G, for investigating fragmentation rules. MIDAS-G processes a metabolite as a simple graph and uses graph grammars to recognize specific chemical bonds and their adjacent structures. We can apply MIDAS-G to investigate fragmentation rules by adjusting bond weights in the scoring model of the metabolite identification tool and comparing metabolite identification performances.

Results

We used MIDAS-G to investigate four bond types on real annotated MS/MS data in experiments. The experimental results matched data collected from wet labs and literature. The effectiveness of MIDAS-G was confirmed.

Conclusion

We developed a computational platform for investigating fragmentation rules of tandem mass spectrometry. This platform is freely available for download.

17.

Introduction

Untargeted metabolomics of cord blood indicated that antiretroviral therapy to HIV-infected mothers (HIV-ART) did not compromise the exposed neonates with regard to the stress of neonatal hypoglycaemia at birth. However, identified biomarkers reflected stress in their energy metabolism, raising concern over developmental risks in some newborns exposed to ART.

Objectives

This study addresses the concern over HIV-ART-induced metabolic perturbations by expanding the metabolomics study to the amino acid profiles in cord blood collected at birth from newborns either exposed or unexposed to HIV-ART in utero.

Methods

Amino acid profiles derived from liquid chromatographic triple quadrupole spectra of cord blood from neonates exposed and unexposed to HIV-ART (cohort 1) were investigated using a metabolomics approach. Amino acid data, generated by ultra performance liquid chromatography–tandem mass spectrometry from similar cases (cohort 2), were included for comparison.

Results

Multivariate and supporting statistics indicated differentiation between the exposed and unexposed neonates in both cohorts, caused by a general decrease or downregulation of amino acid concentrations in the cord blood samples from the exposed cases. Specifically, significant upregulation of aspartic acid in both cohorts and downregulation of arginine, and of threonine, tryptophan and lysine in cohorts 1 and 2, respectively, were observed.

Conclusions

The benefits of ART for HIV-infected pregnant women are well established. However, the amino acid profile of cord blood, obtained from the two independent cohorts, adds to observed metabolic risks of in utero HIV-ART-exposed newborns. These risks could potentially have adverse consequences for the future health of some exposed infants.

18.

Introduction

Untargeted and targeted analyses are two classes of metabolic study. Both strategies have been advanced by high resolution mass spectrometers coupled with chromatography, which have the advantages of high mass sensitivity and accuracy. State-of-the-art methods for mass spectrometric data sets do not always quantify metabolites of interest in a targeted assay efficiently and accurately.

Objectives

TarMet can quantify targeted metabolites as well as their isotopologues through a reactive and user-friendly graphical user interface.

Methods

TarMet accepts vendor-neutral data files (NetCDF, mzXML and mzML) as inputs. Then it extracts ion chromatograms, detects peak position and bounds and confirms the metabolites via the isotope patterns. It can integrate peak areas for all isotopologues automatically.
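The chromatogram extraction, peak bounding and area integration steps can be sketched as follows. This is an illustrative outline and not TarMet's implementation: the in-memory scan layout, the function names, the 0.01 m/z tolerance and the threshold-based bounds are assumptions, and reading NetCDF, mzXML or mzML files is left out.

```python
def extract_ion_chromatogram(scans, target_mz, tol=0.01):
    """Sum, per scan, the intensity of peaks within `tol` of `target_mz`.

    `scans` is a list of (retention_time, [(mz, intensity), ...]) tuples.
    """
    return [(rt, sum(i for mz, i in peaks if abs(mz - target_mz) <= tol))
            for rt, peaks in scans]


def peak_bounds(chromatogram, threshold=0.0):
    """Indices (left, apex, right): the most intense point and the outermost
    contiguous points around it whose intensity exceeds `threshold`."""
    intensities = [i for _, i in chromatogram]
    apex = max(range(len(intensities)), key=intensities.__getitem__)
    left = apex
    while left > 0 and intensities[left - 1] > threshold:
        left -= 1
    right = apex
    while right < len(intensities) - 1 and intensities[right + 1] > threshold:
        right += 1
    return left, apex, right


def peak_area(chromatogram, left, right):
    """Trapezoidal area under the chromatogram between two indices."""
    pts = chromatogram[left:right + 1]
    return sum((t1 - t0) * (y0 + y1) / 2
               for (t0, y0), (t1, y1) in zip(pts, pts[1:]))
```

Running the same three functions at the m/z of each expected isotopologue would give one area per isotopologue, which is the quantity a tracer analysis compares.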

Results

TarMet detects more isotopologues and quantifies them better than state-of-the-art methods, and it handles isotope tracer assays well.

Conclusion

TarMet is a better tool for targeted metabolic and stable isotope tracer analyses.

19.

Introduction

Experiments in metabolomics rely on the identification and quantification of metabolites in complex biological mixtures. This remains one of the major challenges in NMR/mass spectrometry analysis of metabolic profiles. These capabilities are essential if metabolomics is to become a general approach for testing a priori formulated hypotheses on the basis of exhaustive metabolome characterization, rather than an exploratory tool dealing with unknown metabolic features.

Objectives

In this article we propose a method, named ASICS, based on a strong statistical theory, that automatically handles metabolite identification and quantification in proton NMR spectra.

Methods

A statistical linear model is built to explain a complex spectrum using a library containing pure metabolite spectra. This model can handle local or global chemical shift variations due to experimental conditions using a warping function. A statistical lasso-type estimator identifies and quantifies the metabolites in the complex spectrum. This estimator shows good statistical properties and handles peak overlapping issues.
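The core of such a model can be sketched as a non-negative lasso fitted by coordinate descent. This is a generic stand-in and not the ASICS estimator: the function name, the penalty value and the non-negativity constraint are assumptions, and the warping step for chemical shift variations is omitted. Each library spectrum and the mixture are intensity lists on a common chemical shift grid.

```python
def nonnegative_lasso(library, mixture, penalty=0.01, sweeps=200):
    """Coordinate descent for
        minimise 0.5 * ||mixture - sum_j b_j * library[j]||^2 + penalty * sum_j b_j
        subject to b_j >= 0.

    `library` is a list of pure reference spectra, each the same length as
    `mixture`. Returns the list of coefficients b_j.
    """
    coeffs = [0.0] * len(library)
    residual = list(mixture)  # mixture minus the current fitted spectrum
    for _ in range(sweeps):
        for j, spectrum in enumerate(library):
            norm_sq = sum(s * s for s in spectrum)
            if norm_sq == 0.0:
                continue
            # Inner product of spectrum j with the residual that excludes
            # spectrum j's own current contribution.
            rho = sum(s * r for s, r in zip(spectrum, residual)) + coeffs[j] * norm_sq
            new = max(0.0, (rho - penalty) / norm_sq)
            delta = new - coeffs[j]
            if delta:
                residual = [r - delta * s for r, s in zip(residual, spectrum)]
                coeffs[j] = new
    return coeffs
```

A coefficient driven exactly to zero means the metabolite is reported as absent, while a positive coefficient is its estimated relative quantity; this is how a single estimator can cover both identification and quantification, including when library peaks overlap.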

Results

The performance of the method was investigated on known mixtures (such as synthetic urine) and on plasma datasets from duck and human. Results show noteworthy performance, exceeding that of existing methods.

Conclusion

ASICS is a completely automated procedure to identify and quantify metabolites in 1H NMR spectra of biological mixtures. It will empower NMR-based metabolomics by helping experts obtain metabolic profiles quickly and accurately.

20.

Introduction

Poultry is one of the most consumed meats in the world, and its related industry is always looking for ways to improve animal welfare and productivity. It is therefore essential to understand the metabolic response of the chicken to new feed formulas, various supplements, infections and treatments.

Objectives

As a basis for future research investigating the impact of diet and infections on chicken metabolism, we established a high-resolution proton nuclear magnetic resonance (NMR)-based metabolic atlas of the healthy chicken (Gallus gallus).

Methods

Metabolic extractions were performed prior to 1H-NMR and 2D NMR spectra acquisition on twelve biological matrices: liver, kidney, spleen, plasma, egg yolk and white, colon, caecum, faecal water, ileum, pectoral muscle and brain of 6 chickens. Metabolic profiles were then exhaustively characterized.

Results

Nearly 80 metabolites were identified. A cross-comparison of these matrices was performed to determine metabolic variations between and within each section, and it highlighted that only eight core metabolites were systematically found in every matrix.

Conclusion

This work constitutes a database for future NMR-based metabolomic investigations in relation to avian production and health.
