Skip to main content

COVID-19 Resources

What people with cancer should know:

Guidance for cancer researchers:

Get the latest public health information from CDC:

Get the latest research information from NIH:

Grant Details

Grant Number: 4R01CA169043-04 Interpret this number
Primary Investigator: Guan, Yongtao
Organization: University Of Miami Coral Gables
Project Title: New Statistical Methods to Handle Spatial Uncertainty in Cancer Risk Estimation
Fiscal Year: 2016


DESCRIPTION (provided by applicant): The primary goal of this project is to develop novel statistical methods to handle spatial uncertainty in the event locations when conducting cancer risk estimation. We consider two different types of spatial uncertainty, specifically, 1) coarsenin due to the practice of releasing location information at an area level but not the point level and 2) geocoding error resulting from the use of geographic information systems software to convert residential addresses to geographic coordinates (i.e. longitudes and latitudes). Cancer epidemiologists can extract data from many different sources such as census, statewide health surveys, tumor registries and population-based case-control studies, and each source may yield data with different types of spatial uncertainty. Analytic methods are usually adversely affected by the presence of spatial uncertainty, resulting in biased parameter estimates, inflated standard errors, and reduced statistical power to detect spatial clustering and trends. To address these challenges, we propose a set of highly versatile estimation procedures to account for the spatial uncertainty and to efficiently combine data obtained from multiple sources. These procedures are based upon established theories on estimating equations and as such they can be easily implemented in practice. Compared with existing methods, the proposed methods are novel because 1) they permit the inclusion of individual-level risk factors for subjects with spatially uncertain locations, 2) the proposed intensity model admits a flexible semiparametric form and hence removes potentially restrictive assumptions such as the population density being constant over small geographic areas, and 3) they explicitly account for spatial correlation in the disease locations in both parameter estimation and statistical inference. In the substantive applications, we propose to supplement population-based case-control data with tumor registry data, census data and statewide health survey data. To the best of our knowledge, such an approach would be the first in the field and unparalleled. We will implement our proposed methods in a free, user-friendly R package. Our package will provide much- needed tools for more objective investigations of cancer risk factors by accounting for spatial uncertainty in the event locations. It will allow researchers to take advantage of the full spectrum of available data and use the data more effectively to reduce the burden of disease.


Spatiotemporal calibration and resolution refinement of output from deterministic models.
Authors: Gilani O. , McKay L.A. , Gregoire T.G. , Guan Y. , Leaderer B.P. , Holford T.R. .
Source: Statistics in medicine, 2016-06-30; 35(14), p. 2422-40.
EPub date: 2016-01-21.
PMID: 26790617
Related Citations

Disease risk estimation by combining case-control data with aggregated information on the population at risk.
Authors: Chang X. , Waagepetersen R. , Yu H. , Ma X. , Holford T.R. , Wang R. , Guan Y. .
Source: Biometrics, 2015 Mar; 71(1), p. 114-121.
EPub date: 2014-10-28.
PMID: 25351292
Related Citations

Functional Principal Component Analysis of Spatio-Temporal Point Processes with Applications in Disease Surveillance.
Authors: Li Y. , Guan Y. .
Source: Journal of the American Statistical Association, 2014-08-01; 109(507), p. 1205-1215.
PMID: 25368436
Related Citations

A new estimation approach for combining epidemiological data from multiple sources.
Authors: Huang H. , Ma X. , Waagepetersen R. , Holford T.R. , Wang R. , Risch H. , Mueller L. , Guan Y. .
Source: Journal of the American Statistical Association, 2014-01-01; 109(505), p. 11-23.
PMID: 24683281
Related Citations

Back to Top