Many forms of biomolecular (e.g., gene expression, genetics, proteomics) and clinical (e.g., clinical biomarkers, drug targets and indications) data pertaining to many different diseases are now readily available from publicly- available data repositories and knowledge-bases. There is now an opportunity to integrate these data into a unified, globally coherent representation of human disease, or nosology. Such a nosology would express how diseases are related to one another across multiple molecular and clinical axes. In this competitive renewal, we are planning a major expansion for this project. We plan to capture data from newer public repositories with more types of molecular measurements. Inclusion of genetic and protein measurements will enable a richer modeling of diseases and disease similarity, beyond mRNA measurements. To help link the molecular changes seen in disease to genetic differences, we plan to incorporate Expression Quantitative Trait Loci (eQTLs) into our disease models, built from simultaneous genetic and expression measurements. To expand the utility of our nosology in personalized medicine, we plan to incorporate more quantitative epidemiological measurements on disease, and to model transitions between disease states using probabilistic relational modeling. We will compare our nosology with the well-known ICD-10 as well as ICD-11, under development. We will develop novel visualization methods for the complex of edges and nodes seen in nosologies. We also plan to test our nosology in two Driving Biological Projects, in small cell lung cancer and immunology and disease, specifically yielding novel diagnostics and therapeutics ready for clinical trials.

Public Health Relevance

In this competitive renewal, building from 36 publications in the first funding period, we plan to create a new disease classification based on clinical, molecular, and epidemiological data and knowledge, and to use this classification to identify novel diagnostics and drugs for small cell lung cancer and immunological disease.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
2R01GM079719-06
Application #
8246320
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Long, Rochelle M
Project Start
2006-09-30
Project End
2016-03-31
Budget Start
2012-04-01
Budget End
2013-03-31
Support Year
6
Fiscal Year
2012
Total Cost
$625,423
Indirect Cost
$229,729
Name
Stanford University
Department
Pediatrics
Type
Schools of Medicine
DUNS #
009214214
City
Stanford
State
CA
Country
United States
Zip Code
94305
Pessetto, Ziyan Y; Chen, Bin; Alturkmani, Hani et al. (2017) In silico and in vitro drug screening identifies new therapeutic approaches for Ewing sarcoma. Oncotarget 8:4079-4095
Hadley, Dexter; Pan, James; El-Sayed, Osama et al. (2017) Precision annotation of digital samples in NCBI's gene expression omnibus. Sci Data 4:170125
Chen, Bin; Wei, Wei; Ma, Li et al. (2017) Computational Discovery of Niclosamide Ethanolamine, a Repurposed Drug Candidate That Reduces Growth of Hepatocellular Carcinoma Cells In Vitro and in Mice by Inhibiting Cell Division Cycle 37 Signaling. Gastroenterology 152:2022-2036
Chen, Bin; Ma, Li; Paik, Hyojung et al. (2017) Reversal of cancer gene expression correlates with drug efficacy and reveals therapeutic targets. Nat Commun 8:16022
Schmajuk, Gabriela; Tonner, Chris; Trupin, Laura et al. (2017) Using health-system-wide data to understand hepatitis B virus prophylaxis and reactivation outcomes in patients receiving rituximab. Medicine (Baltimore) 96:e6528
Hughey, Jacob J; Hastie, Trevor; Butte, Atul J (2016) ZeitZeiger: supervised learning for high-dimensional data from an oscillatory system. Nucleic Acids Res 44:e80
Hughey, Jacob J; Butte, Atul J (2016) Differential Phasing between Circadian Clocks in the Brain and Peripheral Organs in Humans. J Biol Rhythms 31:588-597
Kodama, Keiichi; Zhao, Zhiyuan; Toda, Kyoko et al. (2016) Expression-Based Genome-Wide Association Study Links Vitamin D-Binding Protein With Autoantigenicity in Type 1 Diabetes. Diabetes 65:1341-9
Chen, B; Butte, A J (2016) Leveraging big data to transform target selection and drug discovery. Clin Pharmacol Ther 99:285-97
Paik, H; Chen, B; Sirota, M et al. (2016) Integrating Clinical Phenotype and Gene Expression Data to Prioritize Novel Drug Uses. CPT Pharmacometrics Syst Pharmacol 5:599-607

Showing the most recent 10 out of 77 publications