Our proposal for a Sage CCSB, 'Integrating cancer datasets for predictive model development and training', has as its central scientific theme the generation of a set of probabilistic causal models for a series of tumor types from numerous collaborators. By selecting sample sets with different clinical outcomes the resultant Sage models will have applications impacting cancer biology, early intervention and cancer treatments. The Sage CCSB leverages the extensive work done at Rosetta/Merck on predictive models in numerous disease areas which has been gifted to a new nonprofit medical research organization, 'Sage Bionehworks'. The Sage CCSB operational model contains a core platform of curated data, mathematical models and experienced investigators mentoring postdoctoral trainees/fellows. The data comes from collaborators and consists of DNA variation data, RNA expression data and clinical outcomes. The trainees will collate and annotate the genotypic, intermediate molecular phenotype and clinical end point data from at least five different tumor-type cohorts and develop models that can predict potential new cancer targets, markers for early detection, and clinical outcomes. They will do externships at other sites (CCSBs) where they will build additional models of their data and facilitate reciprocal exchange of ideas. The trainees will delineate specifications for tools that will make the access to these models more scalable. Validation of their hypotheses will be performed at the Fred Hutchinson Cancer Research Center and the Netherlands Cancer Institute. This post-doctoral program will provide a unique training and mentorship environment in cancer systems biology and facilitate interactions behween CCSBs and NCI.

Public Health Relevance

The massive generation of molecular information in oncology will not in itself change cancer death rates. This highlights the need to transition from archiving and binning facts to building predictive models of disease that help patients. Probabilistic causal models with curated data will allow early detection markers and directed therapies as well as predicting outcomes. The Sage CCSB will enable this distributed model building, while training scientists, building interface tools, and linking models between sites.

National Institute of Health (NIH)
National Cancer Institute (NCI)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZCA1-SRLB-C (J1))
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Sage Bionetworks
United States
Zip Code
Nikolova, Olga; Moser, Russell; Kemp, Christopher et al. (2017) Modeling gene-wise dependencies improves the identification of drug response biomarkers in cancer studies. Bioinformatics 33:1362-1369
Mikheev, Andrei M; Mikheeva, Svetlana A; Trister, Andrew D et al. (2015) Periostin is a novel therapeutic target that predicts and regulates glioma malignancy. Neuro Oncol 17:372-82
Dienstmann, Rodrigo; Jang, In Sock; Bot, Brian et al. (2015) Database of genomic biomarkers for cancer drugs and clinical targetability in solid tumors. Cancer Discov 5:118-23
Commo, F; Ferté, C; Soria, J C et al. (2015) Impact of centralization on aCGH-based genomic profiles for precision medicine in oncology. Ann Oncol 26:582-8
Chaibub Neto, Elias (2015) Speeding Up Non-Parametric Bootstrap Computations for Statistics Based on Sample Moments in Small/Moderate Sample Size Applications. PLoS One 10:e0131333
Guinney, Justin; Dienstmann, Rodrigo; Wang, Xin et al. (2015) The consensus molecular subtypes of colorectal cancer. Nat Med 21:1350-6
Jang, In Sock; Dienstmann, Rodrigo; Margolin, Adam A et al. (2015) Stepwise group sparse regression (SGSR): gene-set-based pharmacogenomic predictive models with stepwise selection of functional priors. Pac Symp Biocomput :32-43
Moser, Russell; Xu, Chang; Kao, Michael et al. (2014) Functional kinomics identifies candidate therapeutic targets in head and neck cancer. Clin Cancer Res 20:4274-88
Guinney, Justin; Ferté, Charles; Dry, Jonathan et al. (2014) Modeling RAS phenotype in colorectal cancer uncovers novel molecular traits of RAS dependency and improves prediction of response to targeted agents in patients. Clin Cancer Res 20:265-272
Ferté, Charles; Fernandez, Marianna; Hollebecque, Antoine et al. (2014) Tumor growth rate is an early indicator of antitumor drug activity in phase I clinical trials. Clin Cancer Res 20:246-52

Showing the most recent 10 out of 30 publications