Skip to main content
An official website of the United States government
Grant Details

Grant Number: 5R01CA262296-05 Interpret this number
Primary Investigator: Cui, Yan
Organization: University Of Tennessee Health Sci Ctr
Project Title: Algorithm-Based Prevention and Reduction of Differences in Cancer Outcomes Arising From Data Imbalance
Fiscal Year: 2025


Abstract

A long-term cumulative data imbalance exists in biomedical research and clinical studies. This severe data imbalance is set to produce differences in cancer outcomes as data-driven, algorithm-based biomedical research and clinical decisions become increasingly common. Differences in cancer outcomes arising from data imbalance may affect all cancer types. The overall objective of this work is to obtain key knowledge and create open resources to establish a new paradigm for machine learning with clinical omics data. Guided by strong preliminary data, we will pursue two specific aims to 1) Discover from cancer clinical omics data and genotype-phenotype data: under what conditions and to what extent the transfer learning scheme improves machine learning model performance across population groups; 2) Create an open resource system for robust machine learning to prevent or reduce differences in cancer outcomes arising from the biomedical data imbalance. The approach is innovative because it represents a substantive departure from the status quo by shifting the paradigm from mixture learning and independent learning schemes to a transfer learning scheme. The proposed research is significant because it is expected to establish a new paradigm for robust machine learning and to provide an open resource system to facilitate the paradigm shift.



Publications

Digital pathways connecting social and biological factors to health outcomes and equity.
Authors: Cui Y. .
Source: Npj Digital Medicine, 2025-03-20 00:00:00.0; 8(1), p. 172.
EPub date: 2025-03-20 00:00:00.0.
PMID: 40113922
Related Citations

Optimizing clinico-genomic disease prediction across ancestries: a machine learning strategy with Pareto improvement.
Authors: Gao Y. , Cui Y. .
Source: Genome Medicine, 2024-06-04 00:00:00.0; 16(1), p. 76.
EPub date: 2024-06-04 00:00:00.0.
PMID: 38835075
Related Citations

Addressing the Challenge of Biomedical Data Inequality: An Artificial Intelligence Perspective.
Authors: Gao Y. , Sharma T. , Cui Y. .
Source: Annual Review Of Biomedical Data Science, 2023-04-27 00:00:00.0; , .
EPub date: 2023-04-27 00:00:00.0.
PMID: 37104653
Related Citations

Clinical time-to-event prediction enhanced by incorporating compatible related outcomes.
Authors: Gao Y. , Cui Y. .
Source: Plos Digital Health, 2022; 1(5), .
EPub date: 2022-05-26 00:00:00.0.
PMID: 35757279
Related Citations

Malignant transformation in human colorectal mucosa as monitored by distribution of laminin, a basement membrane glycoprotein.
Authors: Kellokumpu I. , Ekblom P. , Scheinin T.M. , Andersson L.C. .
Source: Acta Pathologica, Microbiologica, Et Immunologica Scandinavica. Section A, Pathology, 1985 Sep; 93(5), p. 285-91.
PMID: 4050437
Related Citations



Back to Top