首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 375 毫秒
1.

Introduction

Data processing is one of the biggest problems in metabolomics, given the high number of samples analyzed and the need of multiple software packages for each step of the processing workflow.

Objectives

Merge in the same platform the steps required for metabolomics data processing.

Methods

KniMet is a workflow for the processing of mass spectrometry-metabolomics data based on the KNIME Analytics platform.

Results

The approach includes key steps to follow in metabolomics data processing: feature filtering, missing value imputation, normalization, batch correction and annotation.

Conclusion

KniMet provides the user with a local, modular and customizable workflow for the processing of both GC–MS and LC–MS open profiling data.
  相似文献   

2.

Introduction

Data sharing is being increasingly required by journals and has been heralded as a solution to the ‘replication crisis’.

Objectives

(i) Review data sharing policies of journals publishing the most metabolomics papers associated with open data and (ii) compare these journals’ policies to those that publish the most metabolomics papers.

Methods

A PubMed search was used to identify metabolomics papers. Metabolomics data repositories were manually searched for linked publications.

Results

Journals that support data sharing are not necessarily those with the most papers associated to open metabolomics data.

Conclusion

Further efforts are required to improve data sharing in metabolomics.
  相似文献   

3.

Introduction

Liquid chromatography-mass spectrometry (LC-MS) is a commonly used technique in untargeted metabolomics owing to broad coverage of metabolites, high sensitivity and simple sample preparation. However, data generated from multiple batches are affected by measurement errors inherent to alterations in signal intensity, drift in mass accuracy and retention times between samples both within and between batches. These measurement errors reduce repeatability and reproducibility and may thus decrease the power to detect biological responses and obscure interpretation.

Objective

Our aim was to develop procedures to address and correct for within- and between-batch variability in processing multiple-batch untargeted LC-MS metabolomics data to increase their quality.

Methods

Algorithms were developed for: (i) alignment and merging of features that are systematically misaligned between batches, through aggregating feature presence/missingness on batch level and combining similar features orthogonally present between batches; and (ii) within-batch drift correction using a cluster-based approach that allows multiple drift patterns within batch. Furthermore, a heuristic criterion was developed for the feature-wise choice of reference-based or population-based between-batch normalisation.

Results

In authentic data, between-batch alignment resulted in picking 15 % more features and deconvoluting 15 % of features previously erroneously aligned. Within-batch correction provided a decrease in median quality control feature coefficient of variation from 20.5 to 15.1 %. Algorithms are open source and available as an R package (‘batchCorr’).

Conclusions

The developed procedures provide unbiased measures of improved data quality, with implications for improved data analysis. Although developed for LC-MS based metabolomics, these methods are generic and can be applied to other data suffering from similar limitations.
  相似文献   

4.

Background

Untargeted mass spectrometry (MS)-based metabolomics data often contain missing values that reduce statistical power and can introduce bias in biomedical studies. However, a systematic assessment of the various sources of missing values and strategies to handle these data has received little attention. Missing data can occur systematically, e.g. from run day-dependent effects due to limits of detection (LOD); or it can be random as, for instance, a consequence of sample preparation.

Methods

We investigated patterns of missing data in an MS-based metabolomics experiment of serum samples from the German KORA F4 cohort (n?=?1750). We then evaluated 31 imputation methods in a simulation framework and biologically validated the results by applying all imputation approaches to real metabolomics data. We examined the ability of each method to reconstruct biochemical pathways from data-driven correlation networks, and the ability of the method to increase statistical power while preserving the strength of established metabolic quantitative trait loci.

Results

Run day-dependent LOD-based missing data accounts for most missing values in the metabolomics dataset. Although multiple imputation by chained equations performed well in many scenarios, it is computationally and statistically challenging. K-nearest neighbors (KNN) imputation on observations with variable pre-selection showed robust performance across all evaluation schemes and is computationally more tractable.

Conclusion

Missing data in untargeted MS-based metabolomics data occur for various reasons. Based on our results, we recommend that KNN-based imputation is performed on observations with variable pre-selection since it showed robust results in all evaluation schemes.
  相似文献   

5.

Introduction

Improving feed utilization in cattle is required to reduce input costs, increase production, and ultimately improve sustainability of the beef cattle industry. Characterizing metabolic differences between efficient and non-efficient animals will allow stakeholders to identify more efficient cattle during backgrounding.

Objectives

This study used an untargeted metabolomics approach to determine differences in serum metabolites between animals of low and high residual feed intake.

Methods

Residual feed intake was determined for 50 purebred Angus steers and 29 steers were selected for the study steers based on low versus high feed efficiency. Blood samples were collected from steers and analyzed using untargeted metabolomics via mass spectrometry. Metabolite data was analyzed using Metaboanalyst, visualized using orthogonal partial least squares discriminant analysis, and p-values derived from permutation testing. Non-esterified fatty acids, urea nitrogen, and glucose were measured using commercially available calorimetric assay kits. Differences in metabolites measured were grouped by residual feed intake was measured using one-way analysis of variance in SAS 9.4.

Results

Four metabolites were found to be associated with differences in feed efficiency. No differences were found in other serum metabolites, including serum urea nitrogen, non-esterified fatty acids, and glucose.

Conclusions

Four metabolites that differed between low and high residual feed intake have important functions related to nutrient utilization, among other functions, in cattle. This information will allow identification of more efficient steers during backgrounding.
  相似文献   

6.

Introduction

Untargeted metabolomics workflows include numerous points where variance and systematic errors can be introduced. Due to the diversity of the lipidome, manual peak picking and quantitation using molecule specific internal standards is unrealistic, and therefore quality peak picking algorithms and further feature processing and normalization algorithms are important. Subsequent normalization, data filtering, statistical analysis, and biological interpretation are simplified when quality data acquisition and feature processing are employed.

Objectives

Metrics for QC are important throughout the workflow. The robust workflow presented here provides techniques to ensure that QC checks are implemented throughout sample preparation, data acquisition, pre-processing, and analysis.

Methods

The untargeted lipidomics workflow includes sample standardization prior to acquisition, blocks of QC standards and blanks run at systematic intervals between randomized blocks of experimental data, blank feature filtering (BFF) to remove features not originating from the sample, and QC analysis of data acquisition and processing.

Results

The workflow was successfully applied to mouse liver samples, which were investigated to discern lipidomic changes throughout the development of nonalcoholic fatty liver disease (NAFLD). The workflow, including a novel filtering method, BFF, allows improved confidence in results and conclusions for lipidomic applications.

Conclusion

Using a mouse model developed for the study of the transition of NAFLD from an early stage known as simple steatosis, to the later stage, nonalcoholic steatohepatitis, in combination with our novel workflow, we have identified phosphatidylcholines, phosphatidylethanolamines, and triacylglycerols that may contribute to disease onset and/or progression.
  相似文献   

7.

Introduction

Processing delays after blood collection is a common pre-analytical condition in large epidemiologic studies. It is critical to evaluate the suitability of blood samples with processing delays for metabolomics analysis as it is a potential source of variation that could attenuate associations between metabolites and disease outcomes.

Objectives

We aimed to evaluate the reproducibility of metabolites over extended processing delays up to 48 h. We also aimed to test the reproducibility of the metabolomics platform.

Methods

Blood samples were collected from 18 healthy volunteers. Blood was stored in the refrigerator and processed for plasma at 0, 15, 30, and 48 h after collection. Plasma samples were metabolically profiled using an untargeted, ultrahigh performance liquid chromatography–tandem mass spectrometry (UPLC–MS/MS) platform. Reproducibility of 1012 metabolites over processing delays and reproducibility of the platform were determined by intraclass correlation coefficients (ICCs) with variance components estimated from mixed-effects models.

Results

The majority of metabolites (approximately 70% of 1012) were highly reproducible (ICCs?≥?0.75) over 15-, 30- or 48-h processing delays. Nucleotides, energy-related metabolites, peptides, and carbohydrates were most affected by processing delays. The platform was highly reproducible with a median technical ICC of 0.84 (interquartile range 0.68–0.93).

Conclusion

Most metabolites measured by the UPLC–MS/MS platform show acceptable reproducibility up to 48-h processing delays. Metabolites of certain pathways need to be interpreted cautiously in relation to outcomes in epidemiologic studies with prolonged processing delays.
  相似文献   

8.

Introduction

Metabolomics is a well-established tool in systems biology, especially in the top–down approach. Metabolomics experiments often results in discovery studies that provide intriguing biological hypotheses but rarely offer mechanistic explanation of such findings. In this light, the interpretation of metabolomics data can be boosted by deploying systems biology approaches.

Objectives

This review aims to provide an overview of systems biology approaches that are relevant to metabolomics and to discuss some successful applications of these methods.

Methods

We review the most recent applications of systems biology tools in the field of metabolomics, such as network inference and analysis, metabolic modelling and pathways analysis.

Results

We offer an ample overview of systems biology tools that can be applied to address metabolomics problems. The characteristics and application results of these tools are discussed also in a comparative manner.

Conclusions

Systems biology-enhanced analysis of metabolomics data can provide insights into the molecular mechanisms originating the observed metabolic profiles and enhance the scientific impact of metabolomics studies.
  相似文献   

9.

Introduction

Oxygen is essential for metabolic processes and in the absence thereof alternative metabolic pathways are required for energy production, as seen in marine invertebrates like abalone. Even though hypoxia has been responsible for significant losses to the aquaculture industry, the overall metabolic adaptations of abalone in response to environmental hypoxia are as yet, not fully elucidated.

Objective

To use a multiplatform metabolomics approach to characterize the metabolic changes associated with energy production in abalone (Haliotis midae) when exposed to environmental hypoxia.

Methods

Metabolomics analysis of abalone adductor and foot muscle, left and right gill, hemolymph, and epipodial tissue samples were conducted using a multiplatform approach, which included untargeted NMR spectroscopy, untargeted and targeted LC–MS spectrometry, and untargeted and semi-targeted GC-MS spectrometric analyses.

Results

Increased levels of anaerobic end-products specific to marine animals were found which include alanopine, strombine, tauropine and octopine. These were accompanied by elevated lactate, succinate and arginine, of which the latter is a product of phosphoarginine breakdown in abalone. Primarily amino acid metabolism was affected, with carbohydrate and lipid metabolism assisting with anaerobic energy production to a lesser extent. Different tissues showed varied metabolic responses to hypoxia, with the largest metabolic changes in the adductor muscle.

Conclusions

From this investigation, it becomes evident that abalone have well-developed (yet understudied) metabolic mechanisms for surviving hypoxic periods. Furthermore, metabolomics serves as a powerful tool for investigating the altered metabolic processes in abalone.
  相似文献   

10.
11.

Introduction

Concerning NMR-based metabolomics, 1D spectra processing often requires an expert eye for disentangling the intertwined peaks.

Objectives

The objective of NMRProcFlow is to assist the expert in this task in the best way without requirement of programming skills.

Methods

NMRProcFlow was developed to be a graphical and interactive 1D NMR (1H & 13C) spectra processing tool.

Results

NMRProcFlow (http://nmrprocflow.org), dedicated to metabolic fingerprinting and targeted metabolomics, covers all spectra processing steps including baseline correction, chemical shift calibration and alignment.

Conclusion

Biologists and NMR spectroscopists can easily interact and develop synergies by visualizing the NMR spectra along with their corresponding experimental-factor levels, thus setting a bridge between experimental design and subsequent statistical analyses.
  相似文献   

12.

Introduction

Natural products from culture collections have enormous impact in advancing discovery programs for metabolites of biotechnological importance. These discovery efforts rely on the metabolomic characterization of strain collections.

Objective

Many emerging approaches compare metabolomic profiles of such collections, but few enable the analysis and prioritization of thousands of samples from diverse organisms while delivering chemistry specific read outs.

Method

In this work we utilize untargeted LC–MS/MS based metabolomics together with molecular networking to inventory the chemistries associated with 1000 marine microorganisms.

Result

This approach annotated 76 molecular families (a spectral match rate of 28 %), including clinically and biotechnologically important molecules such as valinomycin, actinomycin D, and desferrioxamine E. Targeting a molecular family produced primarily by one microorganism led to the isolation and structure elucidation of two new molecules designated maridric acids A and B.

Conclusion

Molecular networking guided exploration of large culture collections allows for rapid dereplication of know molecules and can highlight producers of uniques metabolites. These methods, together with large culture collections and growing databases, allow for data driven strain prioritization with a focus on novel chemistries.
  相似文献   

13.

Introduction

Mass spectrometry is the current technique of choice in studying drug metabolism. High-resolution mass spectrometry in combination with MS/MS gas-phase experiments has the potential to contribute to rapid advances in this field. However, the data emerging from such fragmentation spectral files pose challenges to downstream analysis, given their complexity and size.

Objectives

This study aims to detect and visualize antihypertensive drug metabolites in untargeted metabolomics experiments based on the spectral similarity of their fragmentation spectra. Furthermore, spectral clusters of endogenous metabolites were also examined.

Methods

Here we apply a molecular networking approach to seek drugs and their metabolites, in fragmentation spectra from urine derived from a cohort of 26 patients on antihypertensive therapy. The mass spectrometry data was collected on a Thermo Q-Exactive coupled to pHILIC chromatography using data dependent analysis (DDA) MS/MS gas-phase experiments.

Results

In total, 165 separate drug metabolites were found and structurally annotated (17 by spectral matching and 122 by classification based on a clustered fragmentation pattern). The clusters could be traced to 13 drugs including the known antihypertensives verapamil, losartan and amlodipine. The molecular networking approach also generated clusters of endogenous metabolites, including carnitine derivatives, and conjugates containing glutamine, glutamate and trigonelline.

Conclusions

The approach offers unprecedented capability in the untargeted identification of drugs and their metabolites at the population level and has great potential to contribute to understanding stratified responses to drugs where differences in drug metabolism may determine treatment outcome.
  相似文献   

14.

Introduction

Although cultured cells are nowadays regularly analyzed by metabolomics technologies, some issues in study setup and data processing are still not resolved to complete satisfaction: a suitable harvesting method for adherent cells, a fast and robust method for data normalization, and the proof that metabolite levels can be normalized to cell number.

Objectives

We intended to develop a fast method for normalization of cell culture metabolomics samples, to analyze how metabolite levels correlate with cell numbers, and to elucidate the impact of the kind of harvesting on measured metabolite profiles.

Methods

We cultured four different human cell lines and used them to develop a fluorescence-based method for DNA quantification. Further, we assessed the correlation between metabolite levels and cell numbers and focused on the impact of the harvesting method (scraping or trypsinization) on the metabolite profile.

Results

We developed a fast, sensitive and robust fluorescence-based method for DNA quantification showing excellent linear correlation between fluorescence intensities and cell numbers for all cell lines. Furthermore, 82–97 % of the measured intracellular metabolites displayed linear correlation between metabolite concentrations and cell numbers. We observed differences in amino acids, biogenic amines, and lipid levels between trypsinized and scraped cells.

Conclusion

We offer a fast, robust, and validated normalization method for cell culture metabolomics samples and demonstrate the eligibility of the normalization of metabolomics data to the cell number. We show a cell line and metabolite-specific impact of the harvesting method on metabolite concentrations.
  相似文献   

15.

Introduction

Untargeted metabolomics studies for biomarker discovery often have hundreds to thousands of human samples. Data acquisition of large-scale samples has to be divided into several batches and may span from months to as long as several years. The signal drift of metabolites during data acquisition (intra- and inter-batch) is unavoidable and is a major confounding factor for large-scale metabolomics studies.

Objectives

We aim to develop a data normalization method to reduce unwanted variations and integrate multiple batches in large-scale metabolomics studies prior to statistical analyses.

Methods

We developed a machine learning algorithm-based method, support vector regression (SVR), for large-scale metabolomics data normalization and integration. An R package named MetNormalizer was developed and provided for data processing using SVR normalization.

Results

After SVR normalization, the portion of metabolite ion peaks with relative standard deviations (RSDs) less than 30 % increased to more than 90 % of the total peaks, which is much better than other common normalization methods. The reduction of unwanted analytical variations helps to improve the performance of multivariate statistical analyses, both unsupervised and supervised, in terms of classification and prediction accuracy so that subtle metabolic changes in epidemiological studies can be detected.

Conclusion

SVR normalization can effectively remove the unwanted intra- and inter-batch variations, and is much better than other common normalization methods.
  相似文献   

16.

Introduction

Hypoxia commonly occurs in cancers and is highly related with the occurrence, development and metastasis of cancer. Treatment of triple negative breast cancer remains challenge. Knowledge about the metabolic status of triple negative breast cancer cell lines in hypoxia is valuable for the understanding of molecular mechanisms of this tumor subtype to develop effective therapeutics.

Objectives

Comprehensively characterize the metabolic profiles of triple negative breast cancer cell line MDA-MB-231 in normoxia and hypoxia and the pathways involved in metabolic changes in hypoxia.

Methods

Differences in metabolic profiles affected pathways of MDA-MB-231 cells in normoxia and hypoxia were characterized using GC–MS based untargeted and stable isotope assisted metabolomic techniques.

Results

Thirty-three metabolites were significantly changed in hypoxia and nine pathways were involved. Hypoxia increased glycolysis, inhibited TCA cycle, pentose phosphate pathway and pyruvate carboxylation, while increased glutaminolysis in MDA-MB-231 cells.

Conclusion

The current results provide metabolic differences of MDA-MB-231 cells in normoxia and hypoxia conditions as well as the involved metabolic pathways, demonstrating the power of combined use of untargeted and stable isotope-assisted metabolomic methods in comprehensive metabolomic analysis.
  相似文献   

17.

Introduction

For pediatric diseases like childhood leukemia, a short latency period points to in-utero exposures as potentially important risk factors. Untargeted metabolomics of small molecules in archived newborn dried blood spots (DBS) offers an avenue for discovering early-life exposures that contribute to disease risks.

Objectives

The purpose of this study was to develop a quantitative method for untargeted analysis of archived newborn DBS for use in an epidemiological study (California Childhood Leukemia Study, CCLS).

Methods

Using experimental DBS from the blood of an adult volunteer, we optimized extraction of small molecules and integrated measurement of potassium as a proxy for blood hematocrit. We then applied this extraction method to 4.7-mm punches from 106 control DBS samples from the CCLS. Sample extracts were analyzed with liquid chromatography—high resolution mass spectrometry (LC-HRMS) and an untargeted workflow was used to screen for metabolites that discriminate population characteristics such as sex, ethnicity, and birth weight.

Results

Thousands of small molecules were measured in extracts of archived DBS. Normalizing for potassium levels removed variability related to varying hematocrit across DBS punches. Of the roughly 1000 prevalent small molecules that were tested, multivariate linear regression detected significant associations with ethnicity (three metabolites) and birth weight (15 metabolites) after adjusting for multiple testing.

Conclusions

This untargeted workflow can be used for analysis of small molecules in archived DBS to discover novel biomarkers, to provide insights into the initiation and progression of diseases, and to provide guidance for disease prevention.
  相似文献   

18.

Introduction

The shelf-life of fresh-cut lettuce after storage is limited by several factors that affect its quality and lead to consumer rejection. Different metabolic events occur after cutting as an abiotic stress response.

Objectives

This study aims to explore the metabolome of iceberg lettuce and to understand the changes related to storage time and genetics applying an untargeted metabolomics approach.

Methods

Two cultivars with different browning susceptibility, fast-browning (FB) and slow-browning (SB), were analyzed by UPLC-ESI-QTOF-MS just after cutting (d0) and after five days of storage (d5). Extraction, metabolic profiling, and data-pretreatment procedures were optimized to obtain a robust and reliable data set.

Results

Preliminary principal component analysis and hierarchical cluster analysis of the full dataset [around 8551 extracted, aligned and filtered molecular features (MFs)] showed a clear separation between the different samples (FB-d0, FB-d5, SB-d0, and SB-d5), highlighting a clear storage time-dependent effect. After statistical analysis applying Student’s t test, 536 MFs were detected as significantly different between d0 and d5 of storage in FB and 633 in SB. Some of them (221) were common to both cultivars. Out of these significant compounds, 22 were tentatively identified by matching their molecular formulae with those previously reported in the literature. Five families of metabolites were detected: amino acids, phenolic compounds, sesquiterpene lactones, fatty acids, and lysophospholipids. All compounds showed a clear trend to decrease at d5 except phenolic compounds that increased after storage.

Conclusion

The untargeted metabolomics analysis is a powerful tool for characterizing the changes on lettuce metabolome associated with cultivar and especially with storage time. Some families of compounds affected by storage time were reported to be closely related to quality loss.
  相似文献   

19.

Background

Until recently, plant metabolomics have provided a deep understanding on the metabolic regulation in individual plants as experimental units. The application of these techniques to agricultural systems subjected to more complex interactions is a step towards the implementation of translational metabolomics in crop breeding.

Aim of Review

We present here a review paper discussing advances in the knowledge reached in the last years derived from the application of metabolomic techniques that evolved from biomarker discovery to improve crop yield and quality.

Key Scientific Concepts of Review

Translational metabolomics applied to crop breeding programs.
  相似文献   

20.

Introduction

Climate change is a major concern for the scientific community, demanding novel information about the effects of environmental stressors on living organisms. Metabolic profiling is required for achieving the most extensive possible range of compounds and their concentration changes on stressed conditions.

Objectives

Individuals of the crustacean species Daphnia magna were exposed to three different abiotic factors linked to global climate change: high salinity, high temperature levels and hypoxia. Advanced chemometric tools were used to characterize the metabolites affected by the exposure.

Method

An exploratory analysis of gas chromatography-mass spectrometry (GCMS) data was performed to discriminate between control and exposed daphnid samples. Due to the complexity of these GCMS data sets, a comprehensive untargeted analysis of the full scan data was performed using multivariate curve resolution-alternating least squares (MCR-ALS) method. This approach enabled to resolve most of the metabolite signals from interference peaks caused by derivatization reactions. Metabolites with significant changes in their peak areas were tentatively identified and the involved metabolic pathways explored.

Results

D. magna metabolic biomarkers are proposed for the considered physical factors. Metabolites related with energy metabolic pathways including some amino acids, carbohydrates, organic acids and nucleosides were identified as potential biomarkers of the investigated treatments.

Conclusions

The proposed untargeted GCMS metabolomics strategy and multivariate data analysis tools were useful to investigate D. magna metabolome under environmental stressed conditions.
  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号