Skip to main content
An official website of the United States government
Grant Details

Grant Number: 5U01CA235510-04 Interpret this number
Primary Investigator: Weinstein, John
Organization: University Of Tx Md Anderson Can Ctr
Project Title: Computational Tools for Analysis and Visualization of Quality Control Issues in Metabolomic Data
Fiscal Year: 2021


Abstract

* * * Abstract * * * In omic studies of all types (e.g., genomic, transcriptomic, proteomic, metabolomic), technical batch effects pose a fundamental challenge to quality control and reproducibility. The possibilities for serious error are greatly magnified in metabolomics, however, due to a range of possible platform, operator, instrument, and environmental factors that can cause batch (or trend) effects. Hence, there is a need for routine surveillance and correction of batch effects within and across metabolomics laboratories and technological platforms. Accordingly, we propose here to develop the MetaBatch algorithms, computational tool, and web portal. For development of MetaBatch, we will leverage our experience in developing MBatch, a tool that became indispensible for quality-control of data in all 33 projects of The Cancer Genome Atlas (TCGA) program. Our first aim is to translate the successful quality control model from TCGA to metabolomics by customizing and extending the MBatch pipeline for detection, quantitation, diagnosis, interpretation, and correction of batch and trend effects. The second aim is to develop and incorporate innovative metabolomics-specific algorithms, including major visualization resources such as our interactive Next-Generation Clustered Heat Maps. The third aim is to distribute MetaBatch to the research community as open-source software and in cloud-based and Galaxy versions. The fourth aim is to provide plug-in capability for integration of MetaBatch with other metabolomic resources, prominently including Metabolomics Workbench (in collaboration with Dr. Shankar Subramaniam) and others developed within the Common Fund Metabolomics Program. Our fifth aim is to promote MetaBatch actively and interact extensively with other Consortium members and the metabolomics research community. With active support from MD Anderson Faculty and Academic Development, we will provide documentation, tutorials, videos, demonstrations, and training to accelerate use and to solicit feedback on limitations, possible improvements, and additional modules that would be useful in real-world workflows. We bring a variety of assets to the project, including: the MBatch resource as a starting point for software development; multidisciplinary expertise in bioinformatics, biostatistics, software engineering, biology, and clinical medicine; PIs with a combined 21 years of experience in molecular profiling studies of clinical disease (in a consortial context); international leadership in batch effects analysis; a software engineering team with a track record of producing high-end, highly visual bioinformatics packages and websites; a team of 20 Analysts whose expertise can be called on; extensive computing resources, including one of the most powerful academically based machines in the world; strong institutional support; and close working relationships with first-class basic, translational, and clinical researchers throughout MD Anderson, one of the foremost cancer centers in the country. Our bottom-line mission will be to aid the research community's effort to improve rigor and reproducibility in metabolomics for scientific understanding and to alleviate disease. !



Publications


None

Back to Top