Grant Details

Grant Number: 5U24CA180996-06
Primary Investigator: Morgan, Martin
Organization: Roswell Park Cancer Institute Corp
Project Title: Cancer Genomics: Integrative and Scalable Solutions in R / Bioconductor
Fiscal Year: 2018


DESCRIPTION (provided by applicant): This proposal develops scalable R / Bioconductor software infrastructure and data resources to integrate complex, heterogeneous, and large cancer genomic experiments. The falling cost of genomic assays facilitates collection of multiple data types (e.g., gene and transcript expression, structural variation, copy number, methylation, and microRNA data) from a set of clinical specimens. Furthermore, substantial resources are now available from large consortium activities like The Cancer Genome Atlas (TCGA). Existing analysis pipelines focus on the treatment of a specific data type, leaving a critical need for tools for integrative analysis of multiple genomic assays for locally generated or publicly available data. R / Bioconductor has historically provided standardized genomic data structures and annotations that have enjoyed widespread adoption in the cancer genomics research community. This proposal adapts R / Bioconductor to meet the increasing conceptual and computational complexity of multi-assay cancer genomic experiments. We begin by developing software containers for coordinated representation, manipulation, and transformation of heterogeneous derived data from multiple cancer genomic assays. These containers are then extended to manage very large primary data resources. To facilitate integration of local experimental results with major public cancer genomics experiment data sets and annotations, we re-package public resources and provide software and cloud-based facilities for easy and fast programmatic access from within R / Bioconductor. This greatly simplifies cancer genomic analysis tasks that otherwise require significant, error-prone individual efforts. Finally, we provide software infrastructure to enable high-throughput computation using parallel and iterative approaches.
The ability to manipulate multi-assay cancer genomic experiments and to understand individual experimental results in the context of public experiments and annotations, together with facilities for improved high-throughput computational performance in a well-established computing environment, greatly enhances opportunities for analysis and comprehension of large multi-assay cancer genomic experiments.



Consensus on Molecular Subtypes of High-Grade Serous Ovarian Carcinoma.
Authors: Chen G.M., Kannan L., Geistlinger L., Kofia V., Safikhani Z., Gendoo D.M.A., Parmigiani G., Birrer M., Haibe-Kains B., Waldron L.
Source: Clinical Cancer Research: An Official Journal of the American Association for Cancer Research, 2018-10-15; 24(20), p. 5037-5047.
EPub date: 2018-07-03.
PMID: 30084834

Continuity of transcriptomes among colorectal cancer subtypes based on meta-analysis.
Authors: Ma S., Ogino S., Parsana P., Nishihara R., Qian Z., Shen J., Mima K., Masugi Y., Cao Y., Nowak J.A., et al.
Source: Genome Biology, 2018-09-25; 19(1), p. 142.
EPub date: 2018-09-25.
PMID: 30253799

Data and Statistical Methods To Analyze the Human Microbiome.
Authors: Waldron L.
Source: mSystems, 2018 Mar-Apr; 3(2).
EPub date: 2018-03-13.
PMID: 29556541

Software for the Integration of Multiomics Experiments in Bioconductor.
Authors: Ramos M., Schiffer L., Re A., Azhar R., Basunia A., Rodriguez C., Chan T., Chapman P., Davis S.R., Gomez-Cabrero D., et al.
Source: Cancer Research, 2017-11-01; 77(21), p. e39-e42.
PMID: 29092936

Accessible, curated metagenomic data through ExperimentHub.
Authors: Pasolli E., Schiffer L., Manghi P., Renson A., Obenchain V., Truong D.T., Beghini F., Malik F., Ramos M., Dowd J.B., et al.
Source: Nature Methods, 2017-10-31; 14(11), p. 1023-1024.
PMID: 29088129

Joint Bounding of Peaks Across Samples Improves Differential Analysis in Mass Spectrometry-Based Metabolomics.
Authors: Myint L., Kleensang A., Zhao L., Hartung T., Hansen K.D.
Source: Analytical Chemistry, 2017-03-21; 89(6), p. 3517-3523.
EPub date: 2017-03-07.
PMID: 28221771

Preprocessing, normalization and integration of the Illumina HumanMethylationEPIC array with minfi.
Authors: Fortin J.P., Triche T.J., Hansen K.D.
Source: Bioinformatics (Oxford, England), 2017-02-15; 33(4), p. 558-560.
PMID: 28035024

Expression Quantitative Trait loci (QTL) in tumor adjacent normal breast tissue and breast tumor tissue.
Authors: Quiroz-Zárate A., Harshfield B.J., Hu R., Knoblauch N., Beck A.H., Hankinson S.E., Carey V., Tamimi R.M., Hunter D.J., Quackenbush J., et al.
Source: PLoS One, 2017; 12(2), p. e0170181.
EPub date: 2017-02-02.
PMID: 28152060

The Doppelgänger Effect: Hidden Duplicates in Databases of Transcriptome Profiles.
Authors: Waldron L., Riester M., Ramos M., Parmigiani G., Birrer M.
Source: Journal of the National Cancer Institute, 2016 Nov; 108(11).
PMID: 27381624

Racial/Ethnic Disparities in Genomic Sequencing.
Authors: Spratt D.E., Chan T., Waldron L., Speers C., Feng F.Y., Ogunwobi O.O., Osborne J.R.
Source: JAMA Oncology, 2016-08-01; 2(8), p. 1070-4.
PMID: 27366979

Public data and open source tools for multi-assay genomic investigation of disease.
Authors: Kannan L., Ramos M., Re A., El-Hachem N., Safikhani Z., Gendoo D.M., Davis S., Gomez-Cabrero D., Castelo R., Hansen K.D., et al.
Source: Briefings in Bioinformatics, 2016 Jul; 17(4), p. 603-15.
PMID: 26463000

Genomic Annotation Resources in R/Bioconductor.
Authors: Carlson M.R., Pagès H., Arora S., Obenchain V., Morgan M.
Source: Methods in Molecular Biology (Clifton, N.J.), 2016; 1418, p. 67-90.
PMID: 27008010
