Skip to main content
Grant Details

Grant Number: 1R21CA157219-01A1 Interpret this number
Primary Investigator: Li, Yi
Organization: Dana-Farber Cancer Inst
Project Title: Integrated Analysis of High Throughput Cancer Genomic Data
Fiscal Year: 2011


Abstract

DESCRIPTION (provided by applicant): The primary goal of this project is to develop a novel, integrated approach for the analysis of high-throughput cancer genomic data. We plan to develop new variable selection methods for 1) class discovery, that is we propose to determine subgroups of the specified cancer to better understand the underlying cancer biology and 2) predictive gene signatures, that is we propose to determine a subset of genes which are predictive for patients' clinical phenotypes, including survival and response to therapy. Specifically, we will develop a new method for variable selection in clustering. Clustering plays a critical role in the analysis of genomic cancer data. For example, based on the gene expression profiles, important cluster distinctions can be found among a set of tissue samples, which may reflect categories of diseases, mutation status, or different responses to a given therapy. Second, we will develop a new penalized-likelihood method for variable selection in regression which utilizes group information to select groups of correlated genes that share the same biological pathway. The developed methodology will be useful for identifying important gene signatures that may lead to more effective personalized treatment in any health studies where survival time or response to therapy is of interest. PUBLIC HEALTH RELEVANCE: Our project aims to develop a new class of variable selection methods for analyzing high-throughput cancer genomic data. Compared to existing methods, the proposed methods will lead to more powerful methods of class discovery for identifying cancer sub-types and more accurate prediction of patients' survival and response to therapy. The developed methodology will be useful for identifying important gene signatures that may lead to more effective personalized treatment in any health studies where survival time or response to therapy is of interest.



Publications

A semiparametrically efficient estimator of the time-varying effects for survival data with time-dependent treatment.
Authors: Lin H. , Fei Z. , Li Y. .
Source: Scandinavian journal of statistics, theory and applications, 2016 Sep; 43(3), p. 649-663.
EPub date: 2015-11-09.
PMID: 27647948
Related Citations

Selection of latent variables for multiple mixed-outcome models.
Authors: Zhou L. , Lin H. , Song X. , Li Y.I. .
Source: Scandinavian journal of statistics, theory and applications, 2014 Dec; 41(4), p. 1064-1082.
EPub date: 2014-04-02.
PMID: 27642219
Related Citations

Discrete mixture modeling to address genetic heterogeneity in time-to-event regression.
Authors: Eng K.H. , Hanlon B.M. .
Source: Bioinformatics (Oxford, England), 2014-06-15; 30(12), p. 1690-7.
EPub date: 2014-02-14.
PMID: 24532723
Related Citations

Ultrahigh dimensional time course feature selection.
Authors: Xu P. , Zhu L. , Li Y. .
Source: Biometrics, 2014 Jun; 70(2), p. 356-65.
EPub date: 2014-01-19.
PMID: 24571586
Related Citations

The Dantzig Selector for Censored Linear Regression Models.
Authors: Li Y. , Dicker L. , Zhao S.D. .
Source: Statistica Sinica, 2014-01-01; 24(1), p. 251-2568.
PMID: 24478569
Related Citations




Back to Top