Cancer is a disease of the genome, caused by disruptions in a person's DNA. Orders of magnitude decreases in price and increases in sequencing throughput enabled sequencing of hundreds of genomes. This will launch a new phase bf Precision Medicine, where molecular markers can guide therapies tailored to patients. The genomics revolution is now systematically characterizing every somatic change in every tumor for large cohorts (>300 patients). Despite some successes, predicting cancer outcomes based on molecular signatures remains a major challenge. This proposal aims at obliterating several key roadblocks stymieing progress. First, the raw sequence data is not particularly well-suited for use in developing predictive models. Therefore, gene- and pathway-level evidence will be derived from CGHub to significantly increase the utility of the information for biomedical discovery. The information will be collected in a Social Graph technology framework like Facebook to scale to billions of interconnected objects called the Biomedical Evidence Graph (BMEG). Second, the datasets are so large they are impractical to move around on the internet. Thus, an environment will be created within which researchers can move their code to the vast amounts of data within the BMEG. Third, prediction challenges will be created based on cancer genomics datasets and patient outcomes. While there have been a few successes in predicting outcomes, current approaches suffer from reproducibility and robustness when applied to unseen data. This activity will reach a broad community of algorithm developers, promote transparency and sharing of bioinformatics code, and create a strong network effect to crowd-source the development of the best models for biological discovery. The system constructed will be focused around the investigation of cancer outcomes but the entire pipeline will be of general utility for any number of genome-based projects including investigating any number of disease, stem cell properties, model organisms, and genome-wide association studies.

Public Health Relevance

While we are accumulating vast amounts of information on cancer cells, we are still searching in the dark for clues about predicting treatment strategies. It is of paramount importance to accelerate computational discovery. The creation of the BMEG will catalyze community participation to uncover novel relationships to elucidate new fundamental biology on oncogenesis and therapeutic directions for treating this disease.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA180778-03
Application #
8840914
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Li, Jerry
Project Start
2013-06-01
Project End
2016-05-31
Budget Start
2015-06-01
Budget End
2016-05-31
Support Year
3
Fiscal Year
2015
Total Cost
Indirect Cost
Name
University of California Santa Cruz
Department
Engineering (All Types)
Type
Biomed Engr/Col Engr/Engr Sta
DUNS #
125084723
City
Santa Cruz
State
CA
Country
United States
Zip Code
95064
Ellrott, Kyle; Bailey, Matthew H; Saksena, Gordon et al. (2018) Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines. Cell Syst 6:271-281.e7
Lee, Anna Y; Ewing, Adam D; Ellrott, Kyle et al. (2018) Combining accurate tumor genome simulation with crowdsourcing to benchmark somatic structural variant detection. Genome Biol 19:188
Cooper, Christopher I; Yao, Delia; Sendorek, Dorota H et al. (2018) Valection: design optimization for validation and verification studies. BMC Bioinformatics 19:339
Graim, Kiley; Liu, Tiffany Ting; Achrol, Achal S et al. (2017) Revealing cancer subtypes with higher-order correlations applied to imaging and omics data. BMC Med Genomics 10:20
Gönen, Mehmet; Weir, Barbara A; Cowley, Glenn S et al. (2017) A Community Challenge for Inferring Genetic Predictors of Gene Essentialities through Analysis of a Functional Screen of Cancer Cell Lines. Cell Syst 5:485-497.e3
Liu, Tiffany T; Achrol, Achal S; Mitchell, Lex A et al. (2017) Magnetic resonance perfusion image features uncover an angiogenic subgroup of glioblastoma patients with poor survival and better response to antiangiogenic treatment. Neuro Oncol 19:997-1007
Cordero, Pablo; Stuart, Joshua M (2017) TRACING CO-REGULATORY NETWORK DYNAMICS IN NOISY, SINGLE-CELL TRANSCRIPTOME TRAJECTORIES. Pac Symp Biocomput 22:576-587
Newton, Yulia; Novak, Adam M; Swatloski, Teresa et al. (2017) TumorMap: Exploring the Molecular Similarities of Cancer Samples in an Interactive Portal. Cancer Res 77:e111-e114
Farshidfar, Farshad; Zheng, Siyuan; Gingras, Marie-Claude et al. (2017) Integrative Genomic Analysis of Cholangiocarcinoma Identifies Distinct IDH-Mutant Molecular Profiles. Cell Rep 18:2780-2794
Sokolov, Artem; Carlin, Daniel E; Paull, Evan O et al. (2016) Pathway-Based Genomics Prediction using Generalized Elastic Net. PLoS Comput Biol 12:e1004790

Showing the most recent 10 out of 27 publications