Grant Details
| Grant Number: |
5R01CA085848-13 Interpret this number |
| Primary Investigator: |
Davidian, Marie |
| Organization: |
North Carolina State University Raleigh |
| Project Title: |
Flexible Statistical Methods for Biomedical Data |
| Fiscal Year: |
2013 |
Abstract
DESCRIPTION (provided by applicant): An ongoing challenge in health sciences research is the development of statistical models used to study relationships between subject characteristics and interventions and disease onset, recurrence, and progression and other health outcomes. This enterprise has become considerably more complex as new technologies, the quest to discover new biomarkers, and improved resources for handling vast repositories of data have led to the collection of high-dimensional information, and there has been extensive research on formal methods for identifying important prognostic variables to include in a model to be used, e.g., to assess population risk. The objective of the first two specific aims of this renewal application is to develop new methods for such variable selection in model-building. Many key health status variables collected in studies of chronic disease, e.g., blood pressure or serum biomarker levels, are imprecise measurements of a "true" quantity relevant to understanding risk, such as long-term blood pressure. The first aim is to develop methods for variable selection when some such covariates are subject to such measurement error. Linear and generalized linear models for independent data and their mixed-effects counterparts for longitudinal and other clustered data are widely used, but these parametric models may not be sufficiently flexible to approximate the complex relationships involved. The second aim is to extend advances made in our previous project period toward new methods for simultaneous parameter estimation and variable selection in more flexible semiparametric such models to develop new techniques that allow for arbitrary numbers of both parametric and nonparametric covariate effects, general outcome variables (e.g., continuous, binary), and adaptive identification of such effects. A key objective in many stud- ies is to elucidate the association between features of longitudinal profiles of biomarkers or other continuous measures and a primary health outcome using so-called joint models. Standard joint models represent the subject-specific profiles via a mixed-effects model, e.g., as straight lines with random subject-specific intercepts and slopes, whose random parameters are included as covariates in a model for the primary outcome. In some settings, interest may focus on the association between outcome and not only features such as slopes but also intra-subject variation in the longitudinal measure. The third aim is to develop new methods for joint models involving both random intra-subject mean and variance parameters, exploiting techniques developed in the previous project period. Many longitudinal measures are censored due to limits of quantification of the assay used in their determination, and longitudinal analysis must take this into appropriate account. Our fourth aim focuses on development of new methods for mixed-effects models that address not only this issue but draw on work in previous project periods to relax the usual normality assumption on random effects and yield an estimate of their density, providing the analyst with a tool for exploring underlying features of the population.
Publications
Assessing gene-environment interactions for common and rare variants with binary traits using gene-trait similarity regression.
Authors: Zhao G.
, Marceau R.
, Zhang D.
, Tzeng J.Y.
.
Source: Genetics, 2015 Mar; 199(3), p. 695-710.
PMID: 25585620
Related Citations
Studying gene and gene-environment effects of uncommon and common variants on continuous traits: a marker-set approach using gene-trait similarity regression.
Authors: Tzeng J.Y.
, Zhang D.
, Pongpanich M.
, Smith C.
, McCarthy M.I.
, Sale M.M.
, Worrall B.B.
, Hsu F.C.
, Thomas D.C.
, Sullivan P.F.
.
Source: American Journal Of Human Genetics, 2011-08-12 00:00:00.0; 89(2), p. 277-88.
PMID: 21835306
Related Citations
Power and sample size calculation for log-rank test with a time lag in treatment effect.
Authors: Zhang D.
, Quan H.
.
Source: Statistics In Medicine, 2009-02-28 00:00:00.0; 28(5), p. 864-79.
PMID: 19152230
Related Citations
Haplotype-based association analysis via variance-components score test.
Authors: Tzeng J.Y.
, Zhang D.
.
Source: American Journal Of Human Genetics, 2007 Nov; 81(5), p. 927-38.
PMID: 17924336
Related Citations
Two-stage functional mixed models for evaluating the effect of longitudinal covariate profiles on a scalar outcome.
Authors: Zhang D.
, Lin X.
, Sowers M.
.
Source: Biometrics, 2007 Jun; 63(2), p. 351-62.
PMID: 17688488
Related Citations
Generalized linear mixed models with varying coefficients for longitudinal data.
Authors: Zhang D.
.
Source: Biometrics, 2004 Mar; 60(1), p. 8-15.
PMID: 15032768
Related Citations
A semiparametric likelihood approach to joint modeling of longitudinal and time-to-event data.
Authors: Song X.
, Davidian M.
, Tsiatis A.A.
.
Source: Biometrics, 2002 Dec; 58(4), p. 742-53.
PMID: 12495128
Related Citations
Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials.
Authors: Lunceford J.K.
, Davidian M.
, Tsiatis A.A.
.
Source: Biometrics, 2002 Mar; 58(1), p. 48-57.
PMID: 11890326
Related Citations
Linear mixed models with flexible distributions of random effects for longitudinal data.
Authors: Zhang D.
, Davidian M.
.
Source: Biometrics, 2001 Sep; 57(3), p. 795-802.
PMID: 11550930
Related Citations