Skip to main content
Grant Details

Grant Number: 5R01CA189532-04 Interpret this number
Primary Investigator: Hsu, Li
Organization: Fred Hutchinson Cancer Research Center
Project Title: Methods for Integrating Functional Data Into Complex Disease Genetic Analyses
Fiscal Year: 2018
Back to top


 DESCRIPTION (provided by applicant): Recent developments in The Human Genome Project and breakthroughs in different types of high throughput technologies have changed how researchers approach complex diseases by moving toward cross- disciplinary studies, collecting data on all facets of disease. The objective of this application is to develop efficient statistica and computational approaches to integrating genetics, genomics and epidemiologic data for understanding the interplay of genetics and environment in complex diseases, with the long-term goal of devising personalized strategies to prevent and treat these diseases. Genome-wide association studies have identified thousands of trait associated genetic variants, and provided valuable insights into the genetic architecture of these traits. However, most variants identified so far confer relatively small increments in risk, and explain only a small proportion o heritability, leading many to question how the remaining 'missing' heritability can be explained. This application addresses this 'missing' heritability from several aspects: rare variant association analysis, gene-environment interaction, and heritability estimation beyond additive genetic effects. Accordingly, we propose the following specific aims. Aim 1 is to develop methods for integrating functional information into rare variants association analysis. To achieve this goal, Aim 1 includes developing databases of tissue-specific functional annotation and constructing regulatory expression networks (eQTL) from public data generated from large collaborative projects such as the Encyclopedia of DNA Elements and the Genotype Tissue Expression. The theoretical properties of the rare variants analysis will also be studied to devise powerful tests in consideration of genomic features such as linkage disequilibrium and sparse signals. Aim 2 is to develop methods for rare variants gene-environment interaction (GxE) that incorporates functional information. Efficient and versatile screening strategies will also be developed for genome-wide discovery of GxE. Even though this aim is focused on GxE, the methods are also applicable to gene-gene interaction (GxG). Aim 3 is to develop methods for estimating heritability that incorporates GxE and GxG to understand the complex interplay between genetic susceptibility and environment The proposed work is motivated by a large consortium on colorectal cancer, which has over 40,000 participants from well-characterized studies with detailed data on both environmental risk factors and GWAS and whole genome sequencing data. The developed methods will be applied to the consortium to gain new insights in colorectal cancer and demonstrate the feasibility of the methods. Since the methods are applicable to other complex diseases and traits, R-based open source software will be developed and submitted to the Comprehensive R Archive Network for broad dissemination.

Back to top


Learning-based biomarker-assisted rules for optimized clinical benefit under a risk constraint.
Authors: Wang Y. , Zhao Y.Q. , Zheng Y. .
Source: Biometrics, 2019-12-13; , .
EPub date: 2019-12-13.
PMID: 31833561
Related Citations

Joint analysis of single-cell and bulk tissue sequencing data to infer intratumor heterogeneity.
Authors: Sun W. , Jin C. , Gelfond J.A. , Chen M.H. , Ibrahim J.G. .
Source: Biometrics, 2019-12-07; , .
EPub date: 2019-12-07.
PMID: 31813161
Related Citations

Gaussian process regression for survival time prediction with genome-wide gene expression.
Authors: Molstad A.J. , Hsu L. , Sun W. .
Source: Biostatistics (Oxford, England), 2019-07-11; , .
EPub date: 2019-07-11.
PMID: 31292609
Related Citations

Genetic variant predictors of gene expression provide new insight into risk of colorectal cancer.
Authors: Bien S.A. , Su Y.R. , Conti D.V. , Harrison T.A. , Qu C. , Guo X. , Lu Y. , Albanes D. , Auer P.L. , Banbury B.L. , et al. .
Source: Human genetics, 2019 Apr; 138(4), p. 307-326.
EPub date: 2019-02-28.
PMID: 30820706
Related Citations

Adjusted time-varying population attributable hazard in case-control studies.
Authors: Zhao W. , Zheng J. , Chen Y.Q. , Hsu L. .
Source: Statistical methods in medical research, 2020 01; 29(1), p. 243-257.
EPub date: 2019-02-25.
PMID: 30799773
Related Citations

Diagnostics for Pleiotropy in Mendelian Randomization Studies: Global and Individual Tests for Direct Effects.
Authors: Dai J.Y. , Peters U. , Wang X. , Kocarnik J. , Chang-Claude J. , Slattery M.L. , Chan A. , Lemire M. , Berndt S.I. , Casey G. , et al. .
Source: American journal of epidemiology, 2018-12-01; 187(12), p. 2672-2680.
PMID: 30188971
Related Citations

Joint skeleton estimation of multiple directed acyclic graphs for heterogeneous population.
Authors: Liu J. , Sun W. , Liu Y. .
Source: Biometrics, 2019 03; 75(1), p. 36-47.
EPub date: 2018-08-06.
PMID: 30081434
Related Citations

A new method for constructing tumor specific gene co-expression networks based on samples with tumor purity heterogeneity.
Authors: Petralia F. , Wang L. , Peng J. , Yan A. , Zhu J. , Wang P. .
Source: Bioinformatics (Oxford, England), 2018-07-01; 34(13), p. i528-i536.
PMID: 29949994
Related Citations

Mendelian randomisation study of age at menarche and age at menopause and the risk of colorectal cancer.
Authors: Neumeyer S. , Banbury B.L. , Arndt V. , Berndt S.I. , Bezieau S. , Bien S.A. , Buchanan D.D. , Butterbach K. , Caan B.J. , Campbell P.T. , et al. .
Source: British journal of cancer, 2018 06; 118(12), p. 1639-1647.
EPub date: 2018-05-24.
PMID: 29795306
Related Citations

A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics.
Authors: Su Y.R. , Di C. , Bien S. , Huang L. , Dong X. , Abecasis G. , Berndt S. , Bezieau S. , Brenner H. , Caan B. , et al. .
Source: American journal of human genetics, 2018-05-03; 102(5), p. 904-919.
PMID: 29727690
Related Citations

The association between copy number aberration, DNA methylation and gene expression in tumor samples.
Authors: Sun W. , Bunn P. , Jin C. , Little P. , Zhabotynsky V. , Perou C.M. , Hayes D.N. , Chen M. , Lin D.Y. .
Source: Nucleic acids research, 2018-04-06; 46(6), p. 3009-3018.
PMID: 29529299
Related Citations

Joint Analysis of Strain and Parent-of-Origin Effects for Recombinant Inbred Intercrosses Generated from Multiparent Populations with the Collaborative Cross as an Example.
Authors: Liu Y. , Xiong S. , Sun W. , Zou F. .
Source: G3 (Bethesda, Md.), 2018-02-02; 8(2), p. 599-605.
EPub date: 2018-02-02.
PMID: 29255115
Related Citations

On Estimation of the Hazard Function from Population-based Case-Control Studies.
Authors: Hsu L. , Gorfine M. , Zucker D.M. .
Source: Journal of the American Statistical Association, 2018; 113(522), p. 560-570.
EPub date: 2018-06-12.
PMID: 30906082
Related Citations

Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases.
Authors: McAllister K. , Mechanic L.E. , Amos C. , Aschard H. , Blair I.A. , Chatterjee N. , Conti D. , Gauderman W.J. , Hsu L. , Hutter C.M. , et al. .
Source: American journal of epidemiology, 2017-10-01; 186(7), p. 753-761.
PMID: 28978193
Related Citations

Update on the State of the Science for Analytical Methods for Gene-Environment Interactions.
Authors: Gauderman W.J. , Mukherjee B. , Aschard H. , Hsu L. , Lewinger J.P. , Patel C.J. , Witte J.S. , Amos C. , Tai C.G. , Conti D. , et al. .
Source: American journal of epidemiology, 2017-10-01; 186(7), p. 762-770.
PMID: 28978192
Related Citations

Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions.
Authors: Ritchie M.D. , Davis J.R. , Aschard H. , Battle A. , Conti D. , Du M. , Eskin E. , Fallin M.D. , Hsu L. , Kraft P. , et al. .
Source: American journal of epidemiology, 2017-10-01; 186(7), p. 771-777.
PMID: 28978191
Related Citations

A new method to study the change of miRNA-mRNA interactions due to environmental exposures.
Authors: Petralia F. , Aushev V.N. , Gopalakrishnan K. , Kappil M. , W Khin N. , Chen J. , Teitelbaum S.L. , Wang P. .
Source: Bioinformatics (Oxford, England), 2017-07-15; 33(14), p. i199-i207.
PMID: 28881990
Related Citations

Enrichment of colorectal cancer associations in functional regions: Insight for using epigenomics data in the analysis of whole genome sequence-imputed GWAS data.
Authors: Bien S.A. , Auer P.L. , Harrison T.A. , Qu C. , Connolly C.M. , Greenside P.G. , Chen S. , Berndt S.I. , Bézieau S. , Kang H.M. , et al. .
Source: PloS one, 2017; 12(11), p. e0186518.
EPub date: 2017-11-21.
PMID: 29161273
Related Citations

Heritability Estimation using a Regularized Regression Approach (HERRA): Applicable to continuous, dichotomous or age-at-onset outcome.
Authors: Gorfine M. , Berndt S.I. , Chang-Claude J. , Hoffmeister M. , Le Marchand L. , Potter J. , Slattery M.L. , Keret N. , Peters U. , Hsu L. .
Source: PloS one, 2017; 12(8), p. e0181269.
EPub date: 2017-08-16.
PMID: 28813438
Related Citations

Multivariate association analysis with somatic mutation data.
Authors: He Q. , Liu Y. , Peters U. , Hsu L. .
Source: Biometrics, 2018 03; 74(1), p. 176-184.
EPub date: 2017-07-19.
PMID: 28722765
Related Citations

Quantifying the Genetic Correlation between Multiple Cancer Types.
Authors: Lindström S. , Finucane H. , Bulik-Sullivan B. , Schumacher F.R. , Amos C.I. , Hung R.J. , Rand K. , Gruber S.B. , Conti D. , Permuth J.B. , et al. .
Source: Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology, 2017 09; 26(9), p. 1427-1435.
EPub date: 2017-06-21.
PMID: 28637796
Related Citations

Hypothesis testing in functional linear models.
Authors: Su Y.R. , Di C.Z. , Hsu L. .
Source: Biometrics, 2017 06; 73(2), p. 551-561.
EPub date: 2017-03-10.
PMID: 28295175
Related Citations

On estimation of time-dependent attributable fraction from population-based case-control studies.
Authors: Zhao W. , Chen Y.Q. , Hsu L. .
Source: Biometrics, 2017 09; 73(3), p. 866-875.
EPub date: 2017-01-18.
PMID: 28099992
Related Citations

A unified powerful set-based test for sequencing data analysis of GxE interactions.
Authors: Su Y.R. , Di C.Z. , Hsu L. , Genetics and Epidemiology of Colorectal Cancer Consortium .
Source: Biostatistics (Oxford, England), 2017 01; 18(1), p. 119-131.
EPub date: 2016-07-28.
PMID: 27474101
Related Citations

A fully nonparametric estimator of the marginal survival function based on case-control clustered age-at-onset data.
Authors: Gorfine M. , Bordo N. , Hsu L. .
Source: Biostatistics (Oxford, England), 2017 01; 18(1), p. 76-90.
EPub date: 2016-07-19.
PMID: 27436674
Related Citations

Back to Top