Our proposal for a Sage CCSB, 'Integrating cancer datasets for predictive model development and training', has as its central scientific theme the generation of a set of probabilistic causal models for a series of tumor types from numerous collaborators. By selecting sample sets with different clinical outcomes the resultant Sage models will have applications impacting cancer biology, early intervention and cancer treatments. The Sage CCSB leverages the extensive work done at Rosetta/Merck on predictive models in numerous disease areas which has been gifted to a new nonprofit medical research organization, 'Sage Bionehworks'. The Sage CCSB operational model contains a core platform of curated data, mathematical models and experienced investigators mentoring postdoctoral trainees/fellows. The data comes from collaborators and consists of DNA variation data, RNA expression data and clinical outcomes. The trainees will collate and annotate the genotypic, intermediate molecular phenotype and clinical end point data from at least five different tumor-type cohorts and develop models that can predict potential new cancer targets, markers for early detection, and clinical outcomes. They will do externships at other sites (CCSBs) where they will build additional models of their data and facilitate reciprocal exchange of ideas. The trainees will delineate specifications for tools that will make the access to these models more scalable. Validation of their hypotheses will be performed at the Fred Hutchinson Cancer Research Center and the Netherlands Cancer Institute. This post-doctoral program will provide a unique training and mentorship environment in cancer systems biology and facilitate interactions behween CCSBs and NCI.

Public Health Relevance

The massive generation of molecular information in oncology will not in itself change cancer death rates. This highlights the need to transition from archiving and binning facts to building predictive models of disease that help patients. Probabilistic causal models with curated data will allow early detection markers and directed therapies as well as predicting outcomes. The Sage CCSB will enable this distributed model building, while training scientists, building interface tools, and linking models between sites.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Specialized Center--Cooperative Agreements (U54)
Project #
5U54CA149237-02
Application #
8340528
Study Section
Special Emphasis Panel (ZCA1-SRLB-C (J1))
Project Start
2011-05-01
Project End
2014-02-28
Budget Start
2011-05-01
Budget End
2012-02-29
Support Year
2
Fiscal Year
2011
Total Cost
$1,904,076
Indirect Cost
Name
Sage Bionetworks
Department
Type
DUNS #
830977117
City
Seattle
State
WA
Country
United States
Zip Code
98109
Nikolova, Olga; Moser, Russell; Kemp, Christopher et al. (2017) Modeling gene-wise dependencies improves the identification of drug response biomarkers in cancer studies. Bioinformatics 33:1362-1369
Mikheev, Andrei M; Mikheeva, Svetlana A; Trister, Andrew D et al. (2015) Periostin is a novel therapeutic target that predicts and regulates glioma malignancy. Neuro Oncol 17:372-82
Dienstmann, Rodrigo; Jang, In Sock; Bot, Brian et al. (2015) Database of genomic biomarkers for cancer drugs and clinical targetability in solid tumors. Cancer Discov 5:118-23
Commo, F; Ferté, C; Soria, J C et al. (2015) Impact of centralization on aCGH-based genomic profiles for precision medicine in oncology. Ann Oncol 26:582-8
Chaibub Neto, Elias (2015) Speeding Up Non-Parametric Bootstrap Computations for Statistics Based on Sample Moments in Small/Moderate Sample Size Applications. PLoS One 10:e0131333
Guinney, Justin; Dienstmann, Rodrigo; Wang, Xin et al. (2015) The consensus molecular subtypes of colorectal cancer. Nat Med 21:1350-6
Jang, In Sock; Dienstmann, Rodrigo; Margolin, Adam A et al. (2015) Stepwise group sparse regression (SGSR): gene-set-based pharmacogenomic predictive models with stepwise selection of functional priors. Pac Symp Biocomput :32-43
Gönen, Mehmet; Kaski, Samuel (2014) Kernelized Bayesian Matrix Factorization. IEEE Trans Pattern Anal Mach Intell 36:2047-60
Gönen, Mehmet (2014) Coupled dimensionality reduction and classification for supervised and semi-supervised multilabel learning. Pattern Recognit Lett 38:132-141
Gönen, Mehmet; Margolin, Adam A (2014) Drug susceptibility prediction against a panel of drugs using kernelized Bayesian multitask learning. Bioinformatics 30:i556-63

Showing the most recent 10 out of 30 publications