Cancer is a disease of the genome, caused by disruptions in a person's DNA. Orders-of-magnitude decreases in price and increases in sequencing throughput have enabled the sequencing of hundreds of genomes. This will launch a new phase of "Precision Medicine," in which molecular markers guide therapies tailored to patients. The genomics revolution is now systematically characterizing every somatic change in every tumor for large cohorts (>300 patients). Despite some successes, predicting cancer outcomes from molecular signatures remains a major challenge. This proposal aims to remove several key roadblocks that impede progress. First, raw sequence data is not well suited to developing predictive models. Therefore, gene- and pathway-level evidence will be derived from the data in CGHub to significantly increase the utility of the information for biomedical discovery. This evidence will be collected in the Biomedical Evidence Graph (BMEG), built on social-graph technology like that used by Facebook so that it can scale to billions of interconnected objects. Second, the datasets are so large that they are impractical to move over the internet. Thus, an environment will be created in which researchers can move their code to the vast amounts of data within the BMEG. Third, prediction challenges will be created based on cancer genomics datasets and patient outcomes. While there have been a few successes in predicting outcomes, current approaches suffer from poor reproducibility and robustness when applied to unseen data. This activity will reach a broad community of algorithm developers, promote transparency and sharing of bioinformatics code, and create a strong network effect to crowd-source the development of the best models for biological discovery.
The system will focus on the investigation of cancer outcomes, but the entire pipeline will be of general utility for a wide range of genome-based projects, including the investigation of other diseases, stem cell properties, model organisms, and genome-wide association studies.
While we are accumulating vast amounts of information on cancer cells, we are still searching in the dark for clues for predicting treatment strategies. It is of paramount importance to accelerate computational discovery. The creation of the BMEG will catalyze community participation to uncover novel relationships that elucidate new fundamental biology of oncogenesis and point to therapeutic directions for treating this disease.