Skip to main content

COVID-19 Resources

What people with cancer should know:

Guidance for cancer researchers:

Get the latest public health information from CDC:

Get the latest research information from NIH:

Grant Details

Grant Number: 5R01CA079934-05 Interpret this number
Primary Investigator: D'Agostino, Ralph
Organization: Wake Forest University Health Sciences
Project Title: Methods for Epidemiologic Data with Missing Values
Fiscal Year: 2003


The long term objective of the research in this resubmission of a FIRST Award proposal is to provide investigators with new methodology for handling missing data in research applications. The aims of this FIRST Award will be 1) to develop and extend statistical methodology for propensity score estimation when predictors contain missing data and 2) to apply this methodology to a variety of applied data sets. To address these aims several goals are proposed. These include: developing and extending methodology for estimating propensity scores when predictors contain missing data; developing methodology that allows predictors to contain missing data that may not be "missing at random"; developing diagnostics to assess the validity and fit for competing models; developing user friendly software for implementing this methodology; and applying these methods to four real data applications. These applications include: 1) using data from the Framingham Heart Study to estimate risk appraisal functions for predicting such outcomes as cardiovascular disease or stroke for individuals with missing risk factors; 2) using data from a diabetes registry consisting of over 120,000 members, provided by the Division of Research at Kaiser Permanente, Northern California, to develop techniques that aid in identifying diabetic persons who are at high risk for developing diabetic related complications in the presence of missing risk factor information; 3) using data from the Postmenopausal Estrogen/Progestin Intervention (PEPI) clinical trial to fit models to estimate propensity scores which represent the probability of medication adherence conditional on predictors that may contain missing data, and then use these propensity scores to find adjusted estimates of the effects of hormone therapy on cardiovascular disease risk factors, bone mineral density, and other symptoms and; 4) using data from the Genetic Epidemiology of Adenomatous Polyps study to fit models estimating the relationship of specific genes to outcomes considering the presence of missing risk factors. The results from this research will make important contributions to medical, epidemiological and statistical research. Methodological and applied publications are anticipated as statistical methodology concerning missing values will be developed and extended. In addition, substantive medical and epidemiological questions will be answered using this new methodology on the applied data sets provided.


Propensity scores in cardiovascular research.
Authors: D'Agostino R.B. .
Source: Circulation, 2007-05-01; 115(17), p. 2340-3.
PMID: 17470708
Related Citations

Assessing the effect of batterer program completion on reassault using propensity scores.
Authors: Jones A.S. , D'Agostino R.B. , Gondolf E.W. , Heckert A. .
Source: Journal of interpersonal violence, 2004 Sep; 19(9), p. 1002-20.
PMID: 15296614
Related Citations

Self-monitoring of blood glucose levels and glycemic control: the Northern California Kaiser Permanente Diabetes registry.
Authors: Karter A.J. , Ackerson L.M. , Darbinian J.A. , D'Agostino R.B. , Ferrara A. , Liu J. , Selby J.V. .
Source: The American journal of medicine, 2001 Jul; 111(1), p. 1-9.
PMID: 11448654
Related Citations

Debate: The slippery slope of surrogate outcomes.
Authors: D'Agostino R.B. .
Source: Current controlled trials in cardiovascular medicine, 2000; 1(2), p. 76-78.
PMID: 11714414
Related Citations

Back to Top