首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
Current quantitative metabolomic research in brain tissue is challenged by several analytical issues. To compare data of metabolite pattern, ratios of individual metabolite concentrations and composed classifiers characterizing a distinct state, standardized workup conditions, and extraction medium are crucial. Differences in physicochemical properties of individual compounds and compound classes such as polarity determine extraction yields and, thus, ratios of compounds with varying properties. Also, variations in suppressive effects related to coextracted matrix components affect standards or references and their concentration-dependent responses.The selection of a common tissue extraction protocol is an ill-posed problem because it can be regarded as a multiple objective decision depending on factors such as sample handling practicability, measurement precision, control of matrix effects, and relevance of the chemical assay. This study systematically evaluates the impact of extraction solvents and the impact of the complex brain tissue on measured metabolite levels, taking into account ionization efficiency as well as challenges encountered in the trace-level quantification of the analytes in brain matrices. In comparison with previous studies that relied on nontargeted platforms, consequently emphasizing the global behavior of the metabolomic fingerprint, here we focus on several series of metabolites spanning over extensive polarity, concentration, and molecular mass ranges.  相似文献   

2.
We report the release of mzIdentML, an exchange standard for peptide and protein identification data, designed by the Proteomics Standards Initiative. The format was developed by the Proteomics Standards Initiative in collaboration with instrument and software vendors, and the developers of the major open-source projects in proteomics. Software implementations have been developed to enable conversion from most popular proprietary and open-source formats, and mzIdentML will soon be supported by the major public repositories. These developments enable proteomics scientists to start working with the standard for exchanging and publishing data sets in support of publications and they provide a stable platform for bioinformatics groups and commercial software vendors to work with a single file format for identification data.  相似文献   

3.

Background  

Mass spectrometry (MS) coupled with online separation methods is commonly applied for differential and quantitative profiling of biological samples in metabolomic as well as proteomic research. Such approaches are used for systems biology, functional genomics, and biomarker discovery, among others. An ongoing challenge of these molecular profiling approaches, however, is the development of better data processing methods. Here we introduce a new generation of a popular open-source data processing toolbox, MZmine 2.  相似文献   

4.
Saliva is a readily accessible and informative biofluid, making it ideal for the early detection of a wide range of diseases including cardiovascular, renal, and autoimmune diseases, viral and bacterial infections and, importantly, cancers. Saliva-based diagnostics, particularly those based on metabolomics technology, are emerging and offer a promising clinical strategy, characterizing the association between salivary analytes and a particular disease. Here, we conducted a comprehensive metabolite analysis of saliva samples obtained from 215 individuals (69 oral, 18 pancreatic and 30 breast cancer patients, 11 periodontal disease patients and 87 healthy controls) using capillary electrophoresis time-of-flight mass spectrometry (CE-TOF-MS). We identified 57 principal metabolites that can be used to accurately predict the probability of being affected by each individual disease. Although small but significant correlations were found between the known patient characteristics and the quantified metabolites, the profiles manifested relatively higher concentrations of most of the metabolites detected in all three cancers in comparison with those in people with periodontal disease and control subjects. This suggests that cancer-specific signatures are embedded in saliva metabolites. Multiple logistic regression models yielded high area under the receiver-operating characteristic curves (AUCs) to discriminate healthy controls from each disease. The AUCs were 0.865 for oral cancer, 0.973 for breast cancer, 0.993 for pancreatic cancer, and 0.969 for periodontal diseases. The accuracy of the models was also high, with cross-validation AUCs of 0.810, 0.881, 0.994, and 0.954, respectively. Quantitative information for these 57 metabolites and their combinations enable us to predict disease susceptibility. These metabolites are promising biomarkers for medical screening.  相似文献   

5.
6.
A "one-pot" alternative method for processing proteins and isolating peptide mixtures from bacterial samples is presented for liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis and data reduction. The conventional in-solution digestion of the protein contents of bacteria is compared to a small disposable filter unit placed inside a centrifuge vial for processing and digestion of bacterial proteins. Each processing stage allows filtration of excess reactants and unwanted byproduct while retaining the proteins. Upon addition of trypsin, the peptide mixture solution is passed through the filter while retaining the trypsin enzyme. The peptide mixture is then analyzed by LC-MS/MS with an in-house BACid algorithm for a comparison of the experimental unique peptides to a constructed proteome database of bacterial genus, specie, and strain entries. The concentration of bacteria was varied from 10 × 10(7) to 3.3 × 10(3) cfu/mL for analysis of the effect of concentration on the ability of the sample processing, LC-MS/MS, and data analysis methods to identify bacteria. The protein processing method and dilution procedure result in reliable identification of pure suspensions and mixtures at high and low bacterial concentrations.  相似文献   

7.
Missing values in mass spectrometry metabolomic datasets occur widely and can originate from a number of sources, including for both technical and biological reasons. Currently, little is known about these data, i.e. about their distributions across datasets, the need (or not) to consider them in the data processing pipeline, and most importantly, the optimal way of assigning them values prior to univariate or multivariate data analysis. Here, we address all of these issues using direct infusion Fourier transform ion cyclotron resonance mass spectrometry data. We have shown that missing data are widespread, accounting for ca. 20% of data and affecting up to 80% of all variables, and that they do not occur randomly but rather as a function of signal intensity and mass-to-charge ratio. We have demonstrated that missing data estimation algorithms have a major effect on the outcome of data analysis when comparing the differences between biological sample groups, including by t test, ANOVA and principal component analysis. Furthermore, results varied significantly across the eight algorithms that we assessed for their ability to impute known, but labelled as missing, entries. Based on all of our findings we identified the k-nearest neighbour imputation method (KNN) as the optimal missing value estimation approach for our direct infusion mass spectrometry datasets. However, we believe the wider significance of this study is that it highlights the importance of missing metabolite levels in the data processing pipeline and offers an approach to identify optimal ways of treating missing data in metabolomics experiments.  相似文献   

8.

Missing values in mass spectrometry metabolomic datasets occur widely and can originate from a number of sources, including for both technical and biological reasons. Currently, little is known about these data, i.e. about their distributions across datasets, the need (or not) to consider them in the data processing pipeline, and most importantly, the optimal way of assigning them values prior to univariate or multivariate data analysis. Here, we address all of these issues using direct infusion Fourier transform ion cyclotron resonance mass spectrometry data. We have shown that missing data are widespread, accounting for ca. 20% of data and affecting up to 80% of all variables, and that they do not occur randomly but rather as a function of signal intensity and mass-to-charge ratio. We have demonstrated that missing data estimation algorithms have a major effect on the outcome of data analysis when comparing the differences between biological sample groups, including by t test, ANOVA and principal component analysis. Furthermore, results varied significantly across the eight algorithms that we assessed for their ability to impute known, but labelled as missing, entries. Based on all of our findings we identified the k-nearest neighbour imputation method (KNN) as the optimal missing value estimation approach for our direct infusion mass spectrometry datasets. However, we believe the wider significance of this study is that it highlights the importance of missing metabolite levels in the data processing pipeline and offers an approach to identify optimal ways of treating missing data in metabolomics experiments.

  相似文献   

9.
Curation and interpretation of protein databank-search results by human experts are key aspects of MS-based proteomic data acquisition. These tasks are often overlooked due to the vast amount of data to inspect. We have developed myProMS, a web server designed to ease search results validation and interpretation by improving data organization, mining and sharing between MS specialists and biologists during MS-based collaborative projects. A demo is accessible at http://bioinfo.curie.fr/myproms.  相似文献   

10.
Mass spectrometry (MS)-based shotgun proteomics allows protein identifications even in complex biological samples. Protein abundances can then be estimated from the counts of tandem MS (MS/MS) spectra attributable to each protein, provided one accounts for differential MS detectability of contributing peptides. We developed a method, APEX, which calculates Absolute Protein EXpression levels based upon learned correction factors, MS/MS spectral counts and each protein's probability of correct identification. This protocol describes APEX-based calculations in three parts. (i) Using training data, peptide sequences and their sequence properties, a model is built to estimate MS detectability (O(i)) for any given protein. (ii) Absolute protein abundances are calculated from spectral counts, identification probabilities and the learned O(i)-values. (iii) Simple statistics allow calculation of differential expression in two distinct biological samples, i.e., measuring relative protein abundances. APEX-based protein abundances span 3-4 orders of magnitude and are applicable to mixtures of 100s to 1,000s of proteins.  相似文献   

11.
The accurate mass and time (AMT) tag strategy has been recognized as a powerful tool for high-throughput analysis in liquid chromatography–mass spectrometry (LC–MS)-based proteomics. Due to the complexity of the human proteome, this strategy requires highly accurate mass measurements for confident identifications. We have developed a method of building a reference map that allows relaxed criteria for mass errors yet delivers high confidence for peptide identifications. The samples used for generating the peptide database were produced by collecting cysteine-containing peptides from T47D cells and then fractionating the peptides using strong cationic exchange chromatography (SCX). LC–tandem mass spectrometry (MS/MS) data from the SCX fractions were combined to create a comprehensive reference map. After the reference map was built, it was possible to skip the SCX step in further proteomic analyses. We found that the reference-driven identification increases the overall throughput and proteomic coverage by identifying peptides with low intensity or complex interference. The use of the reference map also facilitates the quantitation process by allowing extraction of peptide intensities of interest and incorporating models of theoretical isotope distribution.  相似文献   

12.
The revival of interest in cancer cell metabolism in recent years has prompted the need for quantitative analytical platforms for studying metabolites from in vivo sources. We implemented a quantitative polar metabolomics profiling platform using selected reaction monitoring with a 5500 QTRAP hybrid triple quadrupole mass spectrometer that covers all major metabolic pathways. The platform uses hydrophilic interaction liquid chromatography with positive/negative ion switching to analyze 258 metabolites (289 Q1/Q3 transitions) from a single 15-min liquid chromatography-mass spectrometry acquisition with a 3-ms dwell time and a 1.55-s duty cycle time. Previous platforms use more than one experiment to profile this number of metabolites from different ionization modes. The platform is compatible with polar metabolites from any biological source, including fresh tissues, cancer cells, bodily fluids and formalin-fixed paraffin-embedded tumor tissue. Relative quantification can be achieved without using internal standards, and integrated peak areas based on total ion current can be used for statistical analyses and pathway analyses across biological sample conditions. The procedure takes ~12 h from metabolite extraction to peak integration for a data set containing 15 total samples (~6 h for a single sample).  相似文献   

13.
In many metabolomics studies, NMR spectra are divided into bins of fixed width. This spectral quantification technique, known as uniform binning, is used to reduce the number of variables for pattern recognition techniques and to mitigate effects from variations in peak positions; however, shifts in peaks near the boundaries can cause dramatic quantitative changes in adjacent bins due to non-overlapping boundaries. Here we describe a new Gaussian binning method that incorporates overlapping bins to minimize these effects. A Gaussian kernel weights the signal contribution relative to distance from bin center, and the overlap between bins is controlled by the kernel standard deviation. Sensitivity to peak shift was assessed for a series of test spectra where the offset frequency was incremented in 0.5 Hz steps. For a 4 Hz shift within a bin width of 24 Hz, the error for uniform binning increased by 150%, while the error for Gaussian binning increased by 50%. Further, using a urinary metabolomics data set (from a toxicity study) and principal component analysis (PCA), we showed that the information content in the quantified features was equivalent for Gaussian and uniform binning methods. The separation between groups in the PCA scores plot, measured by the J 2 quality metric, is as good or better for Gaussian binning versus uniform binning. The Gaussian method is shown to be robust in regards to peak shift, while still retaining the information needed by classification and multivariate statistical techniques for NMR-metabolomics data.  相似文献   

14.
Warp2D is a novel time alignment approach, which uses the overlapping peak volume of the reference and sample peak lists to correct misleading peak shifts. Here, we present an easy-to-use web interface for high-throughput Warp2D batch processing time alignment service using the Dutch Life Science Grid, reducing processing time from days to hours. This service provides the warping function, the sample chromatogram peak list with adjusted retention times and normalized quality scores based on the sum of overlapping peak volume of all peaks. Heat maps before and after time alignment are created from the arithmetic mean of the sum of overlapping peak area rearranged with hierarchical clustering, allowing the quality control of the time alignment procedure. Taverna workflow and command line tool are provided for remote processing of local user data. AVAILABILITY: online data processing service is available at http://www.nbpp.nl/warp2d.html. Taverna workflow is available at myExperiment with title '2D Time Alignment-Webservice and Workflow' at http://www.myexperiment.org/workflows/1283.html. Command line tool is available at http://www.nbpp.nl/Warp2D_commandline.zip. CONTACT: p.l.horvatovich@rug.nl SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.  相似文献   

15.
16.
Microbial cell factories have been extensively engineered to produce free fatty acids (FFAs) as key components of crucial nutrients, soaps, industrial chemicals, and fuels. However, our ability to control the composition of microbially synthesized FFAs is still limited, particularly, for producing medium-chain fatty acids (MCFAs). This is mainly due to the lack of high-throughput approaches for FFA analysis to engineer enzymes with desirable product specificity. Here we report a mass spectrometry (MS)-based method for rapid profiling of MCFAs in Saccharomyces cerevisiae by using membrane lipids as a proxy. In particular, matrix-assisted laser desorption/ionization time-of-flight (MALDI-ToF) MS was used to detect shorter acyl chain phosphatidylcholines from membrane lipids and a higher m/z peak ratio at 730 and 758 was used as an indication for improved MCFA production. This colony-based method can be performed at a rate of ~2 s per sample, representing a substantial improvement over gas chromatography-MS (typically >30 min per sample) as the gold standard method for FFA detection. To demonstrate the power of this method, we performed site-saturation mutagenesis of the yeast fatty acid synthase and identified nine missense mutations that resulted in improved MCFA production relative to the wild-type strain. Colony-based MALDI-ToF MS screening provides an effective approach for engineering microbial fatty acid compositions in a high-throughput manner.  相似文献   

17.
MOTIVATION: High-resolution mass spectrometers generate large data files that are complex, noisy and require extensive processing to extract the optimal data from raw spectra. This processing is readily achieved in software and is often embedded in manufacturers' instrument control and data processing environments. However, the speed of this data processing is such that it is usually performed off-line, post data acquisition. We have been exploring strategies that would allow real-time advanced processing of mass spectrometric data, making use of the reconfigurable computing paradigm, which exploits the flexibility and versatility of Field Programmable Gate Arrays (FPGAs). This approach has emerged as a powerful solution for speeding up time-critical algorithms. We describe here a reconfigurable computing solution for processing raw mass spectrometric data generated by MALDI-ToF instruments. The hardware-implemented algorithms for de-noising, baseline correction, peak identification and deisotoping, running on a Xilinx Virtex 2 FPGA at 180 MHz, generate a mass fingerprint over 100 times faster than an equivalent algorithm written in C, running on a Dual 3 GHz Xeon workstation.  相似文献   

18.
The application of mass spectrometry imaging (MS imaging) is rapidly growing with a constantly increasing number of different instrumental systems and software tools. The data format imzML was developed to allow the flexible and efficient exchange of MS imaging data between different instruments and data analysis software. imzML data is divided in two files which are linked by a universally unique identifier (UUID). Experimental details are stored in an XML file which is based on the HUPO-PSI format mzML. Information is provided in the form of a 'controlled vocabulary' (CV) in order to unequivocally describe the parameters and to avoid redundancy in nomenclature. Mass spectral data are stored in a binary file in order to allow efficient storage. imzML is supported by a growing number of software tools. Users will be no longer limited to proprietary software, but are able to use the processing software best suited for a specific question or application. MS imaging data from different instruments can be converted to imzML and displayed with identical parameters in one software package for easier comparison. All technical details necessary to implement imzML and additional background information is available at www.imzml.org.  相似文献   

19.
A simultaneous quantitative assay method for urinary oxysterols and bile acids using GC–MS was developed to investigate the mechanism of liver toxicity induced by drugs or chemicals. Sample preparations were optimized by exploring various extraction solvents, derivatization reagents, and hydrolysis methods to achieve reliable and maximum sensitivity for these two different compound classes. As a result, satisfactory accuracy, precision, and sensitivity were obtained in the validation. The method was then applied to quantify urinary oxysterols and bile acids produced from liver toxicity induced by atorvastatin (250 mg/kg/day). From the results, increases in bile acid levels and decreases in the concentration ratio between cholic acid and chenodeoxycholic acid, which are the distinguishing phenomena observed in serum or bile for liver toxicity, were also observed in urine. Additionally, the mechanism of liver toxicity was investigated with the urinary concentration ratio of product to precursor in the metabolic pathway from cholesterol to bile acids. The results indicated that enzyme activities related to the production and degradation of bile acids, not oxysterols, were significantly changed from liver toxicity. Thus, it was concluded that urinary levels of oxysterols and bile acids could be useful tools for checking liver toxicity and investigating its mechanism.  相似文献   

20.
Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry analysis following tryptic digestion of polyacrylamide gel pieces is a common technique used to identify proteins. This approach is rapid, sensitive, and user friendly, and is becoming widely available to scientists in a variety of biological fields. Here we introduce a simple and effective strategy called "mass processing" where the list of masses generated from a mass spectrometer undergoes two stages of data reduction before identification. Mass processing improves the ability to identify in-gel tryptic-digested proteins by reducing the number of nonsample masses submitted to protein identification database search engines. Our results demonstrate that mass processing improves the statistical score and rank of putative protein identifications, especially with low-quantity samples, thus increasing the ability to confidently identify proteins with mass spectrometry data.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号