We aim to develop novel statistical methods to address some of the major problems facing cancer genetic epidemiologists in the post-GWAS era and to illustrate their use for discovery of novel biology in various colorectal cancer (CRC) studies. These methods leverage prior biological knowledge to inform integrative genomics analyses (Project 1), use phylogenetic information to infer gene function as inputs to our epidemiologic modeling projects (Project 2), model the role of the microbiome and the exposome in cancer risk (Project 3), and exploit intra-tumor heterogeneity to learn about somatic tumor evolution and how this process is modified by the internal environment (Project 4). These four projects will be supported by an administrative core and three shared resource cores on functional annotation, high performance computing, and software development. The entire program is motivated by an overall objective of providing tools for evaluating the impact of potential preventive or therapeutic interventions based on modifiable risk factors. Specifically, the aims of the overall program are (1) to develop statistical analysis methods to integrate multiple types of omics data that describe both constitutional and acquired genomic variation as well as measures of the external and internal environment into comprehensive risk prediction models, leveraging external information; (2) to apply these methods to various studies of CRC etiology and prognosis to uncover novel associations and to develop predictive models that would have translational significance for possible primary, secondary, and tertiary interventions; and (3) to establish an infrastructure (administrative, bioinformatic, computational, software) to support the various research projects and facilitate making our methods accessible to the broader scientific community. This will be achieved by a combination of theoretical developments, simulation studies closely keyed to real data projects, applications to several studies of CRC, and distribution of software for use by outside investigators. Beyond applications to colorectal cancer, our methods will be broadly applicable to other cancer types and many other chronic diseases.

Public Health Relevance

OVERALL PROGRAM NARRATIVE The overall goal of the proposed research program is the integration of different types of information to build comprehensive statistical models for cancer causes and prognosis. Motivated by various studies of colorectal cancer, we will develop novel statistical methods that use prior biological knowledge to inform integrative genomics analyses, that use phylogenetic information to infer gene function, that incorporate high-dimensional data on the internal environment (the microbiome and the exposome), and that model tumor evolution from data on somatic changes within and between tumors. Beyond colon cancer, our methods will be broadly applicable to other cancer types and many other chronic diseases.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Program Projects (P01)
Project #
1P01CA196569-01A1
Application #
9072852
Study Section
Special Emphasis Panel (ZCA1-RPRB-B (J1))
Program Officer
Rotunno, Melissa
Project Start
2016-07-01
Project End
2021-06-30
Budget Start
2016-07-01
Budget End
2017-06-30
Support Year
1
Fiscal Year
2016
Total Cost
$2,549,526
Indirect Cost
$994,518
Name
University of Southern California
Department
Public Health & Prev Medicine
Type
Schools of Medicine
DUNS #
072933393
City
Los Angeles
State
CA
Country
United States
Zip Code
90032
Ryser, Marc D; Min, Byung-Hoon; Siegmund, Kimberly D et al. (2018) Spatial mutation patterns as markers of early colorectal tumor cell mobility. Proc Natl Acad Sci U S A 115:5774-5779
Liu, Jie; Liang, Gangning; Siegmund, Kimberly D et al. (2018) Data integration by multi-tuning parameter elastic net regression. BMC Bioinformatics 19:369
Moss, Lilit C; Gauderman, William J; Lewinger, Juan Pablo et al. (2018) Using Bayes model averaging to leverage both gene main effects and G?×? E interactions to identify genomic regions in genome-wide association studies. Genet Epidemiol :
Mi, Huaiyu; Huang, Xiaosong; Muruganujan, Anushya et al. (2017) PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements. Nucleic Acids Res 45:D183-D189
Gref, Anna; Merid, Simon K; Gruzieva, Olena et al. (2017) Genome-Wide Interaction Analysis of Air Pollution Exposure and Childhood Asthma with Functional Follow-up. Am J Respir Crit Care Med 195:1373-1383
Thomas, Duncan C (2017) What Does ""Precision Medicine"" Have to Say About Prevention? Epidemiology 28:479-483
Pereira, Miguel; Thompson, John R; Weichenberger, Christian X et al. (2017) Inclusion of biological knowledge in a Bayesian shrinkage model for joint estimation of SNP effects. Genet Epidemiol 41:320-331
Chopra, Sameer; Liu, Jie; Alemozaffar, Mehrdad et al. (2017) Improving needle biopsy accuracy in small renal mass using tumor-specific DNA methylation markers. Oncotarget 8:5439-5448
Croteau-Chonka, Damien C; Qiu, Weiliang; Martinez, Fernando D et al. (2017) Gene Expression Profiling in Blood Provides Reproducible Molecular Insights into Asthma Control. Am J Respir Crit Care Med 195:179-188
McAllister, Kimberly; Mechanic, Leah E; Amos, Christopher et al. (2017) Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 186:753-761

Showing the most recent 10 out of 28 publications