Grant Details

Grant Number: 5U01CA235507-02
Primary Investigator: Du, Xiuxia
Organization: University of North Carolina Charlotte
Project Title: Cross-Platform and Graphical Software Tool for Adaptive LC/MS and GC/MS Metabolomics Data Preprocessing
Fiscal Year: 2019


Abstract

Data preprocessing is critical to the success of any MS-based untargeted metabolomics study because it is the first informatics step in making sense of the data. Despite the enormous contributions that existing software tools have made to metabolomics, errors in compound identification and relative quantitation still plague the field. The problem is growing more serious as the sensitivity of LC/MS and GC/MS platforms continues to increase.

Preprocessing involves peak detection; peak grouping and annotation (for LC/MS data) or spectral deconvolution (for GC/MS data); and peak alignment. Existing software tools invariably yield an immense number of false positive and false negative peaks, produce inaccurate peak groups, misalign detected peaks, and extract inaccurate relative quantitation of metabolites. These errors can propagate downstream into spurious or missing compound identifications and lead to misleading interpretations of the metabolome. Furthermore, existing tools require users to specify a large number of parameters. Unfortunately, general users usually do not understand how to optimize these parameters, and maximizing one aspect (e.g., sensitivity) often degrades another (e.g., specificity).

We will address these challenges by developing more accurate algorithms that improve the rigor and reproducibility of data preprocessing. The proposed algorithms will be implemented in Java and integrated with the widely used MZmine 2, making the software cross-platform and user-friendly, with rich visualization capabilities. In addition, the implementation will be optimized for memory efficiency and computing speed to allow large-scale data preprocessing. The software will be tested extensively in close collaboration with metabolomics core facilities and users around the world.



Publications

Gestational exposure to organophosphate ester flame retardants and risk of childhood obesity in the environmental influences on child health outcomes consortium.
Authors: Peterson A.K., Alexeeff S.E., Ames J.L., Feng J., Yoshida C., Avalos L.A., Barrett E.S., Bastain T.M., Bennett D.H., Buckley J.P., et al.
Source: Environment International, 2024 Nov; 193, p. 109071.
EPub date: 2024-10-17.
PMID: 39437621

Maternal Serum Metabolomics in Mid-Pregnancy Identifies Lipid Pathways as a Key Link to Offspring Obesity in Early Childhood.
Authors: Francis E.C., Kechris K., Johnson R.K., Rawal S., Pathmasiri W., Rushing B.R., Du X., Jansson T., Dabelea D., Sumner S.J., et al.
Source: International Journal of Molecular Sciences, 2024-07-11; 25(14).
EPub date: 2024-07-11.
PMID: 39062861

Untargeted metabolomics reveal signatures of a healthy lifestyle.
Authors: Pathmasiri W., Rushing B.R., McRitchie S., Choudhari M., Du X., Smirnov A., Pelleigrini M., Thompson M.J., Sakaguchi C.A., Nieman D.C., et al.
Source: Scientific Reports, 2024-06-13; 14(1), p. 13630.
EPub date: 2024-06-13.
PMID: 38871777

Reproducible mass spectrometry data processing and compound annotation in MZmine 3.
Authors: Heuckeroth S., Damiani T., Smirnov A., Mokshyna O., Brungs C., Korf A., Smith J.D., Stincone P., Dreolin N., Nothias L.F., et al.
Source: Nature Protocols, 2024-05-20.
EPub date: 2024-05-20.
PMID: 38769143

Comparison of maternal venous blood metabolomics collected as dried blood spots, dried blood microsamplers, and plasma for integrative environmental health research.
Authors: Petrick L., Guan H., Page G.P., Dolios G., Niedzwiecki M.M., Wright R.O., Wright R.J., program collaborators for Environmental Influences on Child Health Outcomes.
Source: Environment International, 2024 May; 187, p. 108663.
EPub date: 2024-04-16.
PMID: 38657407

Current Practices in LC-MS Untargeted Metabolomics: A Scoping Review on the Use of Pooled Quality Control Samples.
Authors: Broeckling C.D., Beger R.D., Cheng L.L., Cumeras R., Cuthbertson D.J., Dasari S., Davis W.C., Dunn W.B., Evans A.M., Fernández-Ochoa A., et al.
Source: Analytical Chemistry, 2023-12-26; 95(51), p. 18645-18654.
EPub date: 2023-12-06.
PMID: 38055671

Recent advances in mass spectrometry-based computational metabolomics.
Authors: Ebbels T.M.D., van der Hooft J.J.J., Chatelaine H., Broeckling C., Zamboni N., Hassoun S., Mathé E.A.
Source: Current Opinion in Chemical Biology, 2023 Jun; 74, p. 102288.
EPub date: 2023-03-24.
PMID: 36966702

Integrative analysis of multimodal mass spectrometry data in MZmine 3.
Authors: Schmid R., Heuckeroth S., Korf A., Smirnov A., Myers O., Dyrlund T.S., Bushuiev R., Murray K.J., Hoffmann N., Lu M., et al.
Source: Nature Biotechnology, 2023 Apr; 41(4), p. 447-449.
PMID: 36859716

Memory-Efficient Searching of Gas-Chromatography Mass Spectra Accelerated by Prescreening.
Authors: Smirnov A., Liao Y., Du X.
Source: Metabolites, 2022-05-29; 12(6).
EPub date: 2022-05-29.
PMID: 35736424

ADAP-KDB: A Spectral Knowledgebase for Tracking and Prioritizing Unknown GC-MS Spectra in the NIH's Metabolomics Data Repository.
Authors: Smirnov A., Liao Y., Fahy E., Subramaniam S., Du X.
Source: Analytical Chemistry, 2021-09-14; 93(36), p. 12213-12220.
EPub date: 2021-08-29.
PMID: 34455770

A Practical Guide to Metabolomics Software Development.
Authors: Chang H.Y., Colby S.M., Du X., Gomez J.D., Helf M.J., Kechris K., Kirkpatrick C.R., Li S., Patti G.J., Renslow R.S., et al.
Source: Analytical Chemistry, 2021-02-02; 93(4), p. 1912-1923.
EPub date: 2021-01-19.
PMID: 33467846

Auto-deconvolution and molecular networking of gas chromatography-mass spectrometry data.
Authors: Aksenov A.A., Laponogov I., Zhang Z., Doran S.L.F., Belluomo I., Veselkov D., Bittremieux W., Nothias L.F., Nothias-Esposito M., Maloney K.N., et al.
Source: Nature Biotechnology, 2021 Feb; 39(2), p. 169-173.
EPub date: 2020-11-09.
PMID: 33169034

Metabolomics Data Preprocessing Using ADAP and MZmine 2.
Authors: Du X., Smirnov A., Pluskal T., Jia W., Sumner S.
Source: Methods in Molecular Biology (Clifton, N.J.), 2020; 2104, p. 25-48.
PMID: 31953811

The metaRbolomics Toolbox in Bioconductor and beyond.
Authors: Stanstrup J., Broeckling C.D., Helmus R., Hoffmann N., Mathé E., Naake T., Nicolotti L., Peters K., Rainer J., Salek R.M., et al.
Source: Metabolites, 2019-09-23; 9(10).
EPub date: 2019-09-23.
PMID: 31548506

ADAP-GC 4.0: Application of Clustering-Assisted Multivariate Curve Resolution to Spectral Deconvolution of Gas Chromatography-Mass Spectrometry Metabolomics Data.
Authors: Smirnov A., Qiu Y., Jia W., Walker D.I., Jones D.P., Du X.
Source: Analytical Chemistry, 2019-07-16; 91(14), p. 9069-9077.
EPub date: 2019-07-05.
PMID: 31274283


