Grant Details

Grant Number: 5R01CA249096-03
Primary Investigator: Li, Yi
Organization: University of Michigan at Ann Arbor
Project Title: New Statistical Methods for Modelling Cancer Outcomes
Fiscal Year: 2023


Abstract

Lung cancer is one of the most common causes of mortality worldwide. Radiomic features have been shown to provide prognostic value in predicting lung cancer outcomes. Quantitative imaging features, often in dauntingly large numbers, are extracted from tumor regions. However, not all of these extracted features are useful for tumor characterization, and feature selection is key to achieving the best performance. We plan to develop feasible statistical methods to select relevant features and conduct feature learning, i.e., discovery of the representations needed for feature detection from the raw data. On the molecular level, expression and genetic variation of some known genes, such as KDM4 genes, have been linked to lung cancer prognosis, though little is known about the roles of epigenetic modifications. Even fewer studies have investigated the impact of the interplay between DNA methylation and coexisting chronic obstructive pulmonary disease (COPD; a major clinical risk factor) on lung cancer risk. Statistically, drawing inference when the number of predictors (the clinical indicators and the methylation sites) exceeds the sample size in regression settings, e.g., generalized linear models, Cox proportional hazards models, and censored quantile regression models, is very challenging. We plan to establish a new framework for drawing inferences based on these complicated models. Growing evidence suggests that cancer can be better understood through mutated or dysregulated pathways or networks rather than individual DNA mutations, and that the mechanism of lung cancer involves the interplay of cellular heterogeneity and a myriad of dysfunctional molecular and genetic networks. We plan to develop new models to analyze these large-scale network/pathway data and investigate how their dynamic network structures can be predicted from DNA mutations.
Leveraging the rich Boston Lung Cancer Survival Cohort database with 11,164 lung cancer cases, we expect that our new statistical methods will help identify novel biomarkers linked to lung cancer. Our promising preliminary results indicate the feasibility of the proposed work, which provides a solid radiomic and molecular basis for prediction of lung cancer outcomes. Core methods will be distributed in open-source, freely available software, naturally leading to implementable procedures for researchers and practitioners.



Publications

Residual Volume and Total Lung Capacity at Diagnosis Predict Overall Survival in Non-Small Cell Lung Cancer Patients.
Authors: Zhai T., Li Y., Brown R., Lanuti M., Gainor J.F., Christiani D.C.
Source: Cancer Medicine, 2025 May; 14(10), p. e70962.
PMID: 40371871

Assessing the prognostic utility of clinical and radiomic features for COVID-19 patients admitted to ICU: challenges and lessons learned.
Authors: Sun Y., Salerno S., Pan Z., Yang E., Sujimongkol C., Song J., Wang X., Han P., Zeng D., Kang J., et al.
Source: Harvard Data Science Review, 2024 Winter; 6(1).
EPub date: 2024-01-31.
PMID: 38974963

A two-phase epigenome-wide four-way gene-smoking interaction study of overall survival for early-stage non-small cell lung cancer.
Authors: Chen L., Wang X., Xie N., Zhang Z., Xu X., Xue M., Yang Y., Liu L., Su L., Bjaanæs M., et al.
Source: Molecular Oncology, 2024-12-04.
EPub date: 2024-12-04.
PMID: 39630602

Automated Interstitial Lung Abnormality Probability Prediction at CT: A Stepwise Machine Learning Approach in the Boston Lung Cancer Study.
Authors: Hata A., Aoyagi K., Hino T., Kawagishi M., Wada N., Song J., Wang X., Valtchinov V.I., Nishino M., Muraguchi Y., et al.
Source: Radiology, 2024 Sep; 312(3), p. e233435.
PMID: 39225600

Bayesian Inference for High Dimensional Cox Models with Gaussian and Diffused-Gamma Priors: A Case Study of Mortality in COVID-19 Patients Admitted to the ICU.
Authors: Song J., Guha S., Li Y.
Source: Statistics in Biosciences, 2024 Apr; 16(1), p. 221-249.
EPub date: 2023-11-04.
PMID: 38651050

Penalized deep partially linear Cox models with application to CT scans of lung cancer patients.
Authors: Sun Y., Kang J., Haridas C., Mayne N., Potter A., Yang C.F., Christiani D.C., Li Y.
Source: Biometrics, 2024-01-29; 80(1).
PMID: 38412302

What evidence is needed to inform postoperative opioid consumption guidelines? A cohort study of the Michigan Surgical Quality Collaborative.
Authors: Song J., Li Y., Waljee J.F., Gunaseelan V., Brummett C.M., Englesbe M.J., Bicket M.C.
Source: Regional Anesthesia and Pain Medicine, 2024-01-11; 49(1), p. 23-29.
EPub date: 2024-01-11.
PMID: 37247946

Incidence and severity of pulmonary embolism in COVID-19 infection: Ancestral, Alpha, Delta, and Omicron variants.
Authors: Wada N., Li Y., Gagne S., Hino T., Valtchinov V.I., Gay E., Nishino M., Hammer M.M., Madore B., Guttmann C.R.G., et al.
Source: Medicine, 2023-12-01; 102(48), p. e36417.
PMID: 38050198

Debiased lasso for stratified Cox models with application to the national kidney transplant data.
Authors: Xia L., Nan B., Li Y.
Source: The Annals of Applied Statistics, 2023 Dec; 17(4), p. 3550-3569.
EPub date: 2023-10-30.
PMID: 38106966

WNT9A Affects Late-Onset Acute Respiratory Distress Syndrome and 28-Day Survival: Evidence from a Three-Step Multiomics Study.
Authors: Chen J., Tang J., Nie M., Li Y., Wurfel M.M., Meyer N.J., Wei Y., Zhao Y., Frank A.J., Thompson B.T., et al.
Source: American Journal of Respiratory Cell and Molecular Biology, 2023 Aug; 69(2), p. 220-229.
PMID: 37094100

Simultaneous selection and inference for varying coefficients with zero regions: a soft-thresholding approach.
Authors: Yang Y., Pan Z., Kang J., Brummett C., Li Y.
Source: Biometrics, 2023-07-17.
EPub date: 2023-07-17.
PMID: 37459178

Statistical Inference for Cox Proportional Hazards Models with a Diverging Number of Covariates.
Authors: Xia L., Nan B., Li Y.
Source: Scandinavian Journal of Statistics, Theory and Applications, 2023 Jun; 50(2), p. 550-571.
EPub date: 2022-04-25.
PMID: 37408772

Use of machine learning to assess the prognostic utility of radiomic features for in-hospital COVID-19 mortality.
Authors: Sun Y., Salerno S., He X., Pan Z., Yang E., Sujimongkol C., Song J., Wang X., Han P., Kang J., et al.
Source: Scientific Reports, 2023-05-05; 13(1), p. 7318.
EPub date: 2023-05-05.
PMID: 37147440

Prediagnosis Smoking Cessation and Overall Survival Among Patients With Non-Small Cell Lung Cancer.
Authors: Wang X., Romero-Gutierrez C.W., Kothari J., Shafer A., Li Y., Christiani D.C.
Source: JAMA Network Open, 2023-05-01; 6(5), p. e2311966.
EPub date: 2023-05-01.
PMID: 37145597

Asynchronous and error-prone longitudinal data analysis via functional calibration.
Authors: Chang X., Li Y., Li Y.
Source: Biometrics, 2023-04-12.
EPub date: 2023-04-12.
PMID: 37042741

Traction Bronchiectasis/Bronchiolectasis in Interstitial Lung Abnormality: Follow-up in the COPDGene.
Authors: Hata A., Hino T., Li Y., Johkoh T., Christiani D.C., Lynch D.A., Cho M.H., Silverman E.K., Hunninghake G.M., Hatabu H., et al.
Source: American Journal of Respiratory and Critical Care Medicine, 2023-03-10.
EPub date: 2023-03-10.
PMID: 36898128

High-Dimensional Survival Analysis: Methods and Applications.
Authors: Salerno S., Li Y.
Source: Annual Review of Statistics and Its Application, 2023 Mar; 10(1), p. 25-49.
EPub date: 2022-10-06.
PMID: 36968638

Individualized risk assessment of preoperative opioid use by interpretable neural network regression.
Authors: Sun Y., Kang J., Brummett C., Li Y.
Source: The Annals of Applied Statistics, 2023 Mar; 17(1), p. 434-453.
EPub date: 2023-01-24.
PMID: 37006707

OWL: an optimized and independently validated machine learning prediction model for lung cancer screening based on the UK Biobank, PLCO, and NLST populations.
Authors: Pan Z., Zhang R., Shen S., Lin Y., Zhang L., Wang X., Ye Q., Wang X., Chen J., Zhao Y., et al.
Source: eBioMedicine, 2023 Feb; 88, p. 104443.
EPub date: 2023-01-24.
PMID: 36701900

Inference for High-Dimensional Censored Quantile Regression.
Authors: Fei Z., Zheng Q., Hong H.G., Li Y.
Source: Journal of the American Statistical Association, 2023; 118(542), p. 898-912.
EPub date: 2021-08-20.
PMID: 37309513

Serial laboratory biomarkers are associated with ICU outcomes in patients hospitalized with COVID-19.
Authors: Wang X., White E., Giacona F., Khurana A., Li Y., Christiani D.C., Alladina J.W.
Source: PLOS ONE, 2023; 18(11), p. e0293842.
EPub date: 2023-11-07.
PMID: 37934759

A trans-omics assessment of gene-gene interaction in early-stage NSCLC.
Authors: Chen J., Song Y., Li Y., Wei Y., Shen S., Zhao Y., You D., Su L., Bjaanaes M.M., Karlsson A., et al.
Source: Molecular Oncology, 2022-11-21.
EPub date: 2022-11-21.
PMID: 36408734

Prior Knowledge Guided Ultra-high Dimensional Variable Screening with Application to Neuroimaging Data.
Authors: He J., Kang J.
Source: Statistica Sinica, 2022 Oct; 32(4), p. 2095-2117.
PMID: 36052338

Sex disparities in lung cancer survival rates based on screening status.
Authors: Rodriguez Alvarez A.A., Yuming S., Kothari J., Digumarthy S.R., Byrne N.M., Li Y., Christiani D.C.
Source: Lung Cancer (Amsterdam, Netherlands), 2022 Sep; 171, p. 115-120.
EPub date: 2022-08-01.
PMID: 35939954

A Large-Scale Genome-Wide Gene-Gene Interaction Study of Lung Cancer Susceptibility in Europeans With a Trans-Ethnic Validation in Asians.
Authors: Zhang R., Shen S., Wei Y., Zhu Y., Li Y., Chen J., Guan J., Pan Z., Wang Y., Zhu M., et al.
Source: Journal of Thoracic Oncology: Official Publication of the International Association for the Study of Lung Cancer, 2022 Aug; 17(8), p. 974-990.
EPub date: 2022-04-30.
PMID: 35500836

Spirometry at diagnosis and overall survival in non-small cell lung cancer patients.
Authors: Zhai T., Li Y., Brown R., Lanuti M., Gainor J.F., Christiani D.C.
Source: Cancer Medicine, 2022-05-12.
EPub date: 2022-05-12.
PMID: 35545892

APOLLO: An accurate and independently validated prediction model of lower-grade gliomas overall survival and a comparative study of model performance.
Authors: Chen J., Shen S., Li Y., Fan J., Xiong S., Xu J., Zhu C., Lin L., Dong X., Duan W., et al.
Source: eBioMedicine, 2022 May; 79, p. 104007.
EPub date: 2022-04-15.
PMID: 35436725

Epigenome-wide three-way interaction study identifies a complex pattern between TRIM27, KIAA0226, and smoking associated with overall survival of early-stage NSCLC.
Authors: Ji X., Lin L., Fan J., Li Y., Wei Y., Shen S., Su L., Shafer A., Bjaanaes M.M., Karlsson A., et al.
Source: Molecular Oncology, 2021-12-21.
EPub date: 2021-12-21.
PMID: 34932879

Rejoinder to discussions of "distributional independent component analysis for diverse neuroimaging modalities".
Authors: Wu B., Pal S., Kang J., Guo Y.
Source: Biometrics, 2021-11-15.
EPub date: 2021-11-15.
PMID: 34780668

Debiased lasso for generalized linear models with a diverging number of covariates.
Authors: Xia L., Nan B., Li Y.
Source: Biometrics, 2021-10-25.
EPub date: 2021-10-25.
PMID: 34693983

Distributional independent component analysis for diverse neuroimaging modalities.
Authors: Wu B., Pal S., Kang J., Guo Y.
Source: Biometrics, 2021-10-25.
EPub date: 2021-10-25.
PMID: 34694629

Stratified Cox models with time-varying effects for national kidney transplant patients: A new blockwise steepest ascent method.
Authors: He K., Zhu J., Kang J., Li Y.
Source: Biometrics, 2021-04-18.
EPub date: 2021-04-18.
PMID: 33870494

Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach.
Authors: Fei Z., Li Y.
Source: Journal of Machine Learning Research: JMLR, 2021; 22.
PMID: 34531706

Comprehensive evaluation of COVID-19 patient short- and long-term outcomes: Disparities in healthcare utilization and post-hospitalization outcomes.
Authors: Salerno S., Sun Y., Morris E.L., He X., Li Y., Pan Z., Han P., Kang J., Sjoding M.W., Li Y.
Source: PLOS ONE, 2021; 16(10), p. e0258278.
EPub date: 2021-10-06.
PMID: 34614008


