Skip to main content

COVID-19 is an emerging, rapidly evolving situation.

What people with cancer should know:

Guidance for cancer researchers:

Get the latest public health information from CDC:

Get the latest research information from NIH:

Grant Details

Grant Number: 5R03CA182984-02 Interpret this number
Primary Investigator: Ma, Shuangge
Organization: Yale University
Project Title: Development of Integrated Analysis Methods and Applications to Tcga Data
Fiscal Year: 2015


DESCRIPTION (provided by applicant): In cancer research, profiling studies have been extensively conducted, measuring genome-wide gene expression levels, DNA modifications, epigenetic regulation, and post-transcriptional regulations. Many studies are "one-dimensional" and restricted to one type of genomic measurement. More recently, "multi-dimensional" studies are becoming more popular. In such studies, the same samples are profiled on multiple layers of genomic activities. A representative example is The Cancer Genome Atlas (TCGA). Multi-dimensional studies offer a unique opportunity to more comprehensively describe the etiology and prognosis of cancer. In the literature, much effort has been devoted to modeling the interconnections among different regulations. In contrast, there are relatively few studies conducting integrated analysis and modeling the associations between multiple types of genomic measurements and cancer outcomes. The existing integrated analysis methods also have serious limitations, which may lead to suboptimal or even biased results. Our goal is to more effectively describe cancer etiology and prognosis by analyzing multi-dimensional genomic data. Motivated by the limitations of existing studies, our first objective is to develop novel statistical methods, effectively integrate multi-dimensional genomic measurements, and establish their associations with cancer outcomes. Such an objective differs significantly from those of published studies. The proposed methods will have significant advantages. They will assume different biological working models, allowing for a direct comparison of these models. They will be applicable to a large number of datasets, can accommodate the joint effects of a large number of markers, and adopt efficient statistical techniques. The second objective is to apply these methods and analyze TCGA data on multiple types of cancers. The specific aims are to (Aim 1) Develop novel statistical methods to integrate multiple types of genomic measurements for cancer outcomes. Three different methods will be developed under different data generating models; (Aim 2) Develop user- friendly software and project website. Analyze TCGA data on multiple types of cancers, particularly including cancers of breast, ovary and prostate and lymphoma. Such data have measurements on gene expression, copy number variation, methylation, microRNA and others available. With the cost of sequencing falling fast, it will soon become a routine practice to profile multi- dimensional genomic characterizations of samples. This study will deliver a new analysis strategy and a set of novel statistical methods. These methods will integrate multiple types of genomic measurements for cancer outcomes and complement the existing methods. The analysis of TCGA data will provide valuable insights into multiple cancers and serve as prototype for future applications.


Analysis of cancer gene expression data with an assisted robust marker identification approach.
Authors: Chai H. , Shi X. , Zhang Q. , Zhao Q. , Huang Y. , Ma S. .
Source: Genetic epidemiology, 2017 12; 41(8), p. 779-789.
EPub date: 2017-09-14.
PMID: 28913902
Related Citations

Identification of Prognostic Genes and Pathways in Lung Adenocarcinoma Using a Bayesian Approach.
Authors: Jiang Y. , Huang Y. , Du Y. , Zhao Y. , Ren J. , Ma S. , Wu C. .
Source: Cancer informatics, 2017; 16, p. 1176935116684825.
EPub date: 2020-12-10.
PMID: 33354107
Related Citations

Integrating multidimensional omics data for cancer outcome.
Authors: Zhu R. , Zhao Q. , Zhao H. , Ma S. .
Source: Biostatistics (Oxford, England), 2016 10; 17(4), p. 605-18.
EPub date: 2016-03-14.
PMID: 26980320
Related Citations

Integrated analysis of multidimensional omics data on cutaneous melanoma prognosis.
Authors: Jiang Y. , Shi X. , Zhao Q. , Krauthammer M. , Rothberg B.E. , Ma S. .
Source: Genomics, 2016 06; 107(6), p. 223-30.
EPub date: 2016-04-30.
PMID: 27141884
Related Citations

Deciphering the associations between gene expression and copy number alteration using a sparse double Laplacian shrinkage approach.
Authors: Shi X. , Zhao Q. , Huang J. , Xie Y. , Ma S. .
Source: Bioinformatics (Oxford, England), 2015-12-15; 31(24), p. 3977-83.
EPub date: 2015-09-03.
PMID: 26342102
Related Citations

Measures for the degree of overlap of gene signatures and applications to TCGA.
Authors: Shi X. , Yi H. , Ma S. .
Source: Briefings in bioinformatics, 2015 Sep; 16(5), p. 735-44.
EPub date: 2014-12-31.
PMID: 25552438
Related Citations

A selective review of robust variable selection with applications in bioinformatics.
Authors: Wu C. , Ma S. .
Source: Briefings in bioinformatics, 2015 Sep; 16(5), p. 873-83.
EPub date: 2014-12-05.
PMID: 25479793
Related Citations

Combining multidimensional genomic measurements for predicting cancer prognosis: observations from TCGA.
Authors: Zhao Q. , Shi X. , Xie Y. , Huang J. , Shia B. , Ma S. .
Source: Briefings in bioinformatics, 2015 Mar; 16(2), p. 291-303.
EPub date: 2014-03-13.
PMID: 24632304
Related Citations

Integrative Analysis of "-Omics" Data Using Penalty Functions.
Authors: Zhao Q. , Shi X. , Huang J. , Liu J. , Li Y. , Ma S. .
Source: Wiley interdisciplinary reviews. Computational statistics, 2015 Jan-Feb; 7(1), p. 99-108.
PMID: 25691921
Related Citations

Integrative analysis of gene-environment interactions under a multi-response partially linear varying coefficient model.
Authors: Wu C. , Cui Y. , Ma S. .
Source: Statistics in medicine, 2014-12-10; 33(28), p. 4988-98.
EPub date: 2014-08-21.
PMID: 25146388
Related Citations

Identifying gene-environment and gene-gene interactions using a progressive penalization approach.
Authors: Zhu R. , Zhao H. , Ma S. .
Source: Genetic epidemiology, 2014 May; 38(4), p. 353-68.
EPub date: 2014-04-10.
PMID: 24723356
Related Citations

Risk factors for etiology and prognosis of mantle cell lymphoma.
Authors: Wang Y. , Ma S. .
Source: Expert review of hematology, 2014 Apr; 7(2), p. 233-43.
EPub date: 2014-02-22.
PMID: 24559208
Related Citations

Back to Top