Recent cancer genome sequencing efforts have determined the complete protein coding regions for thousands of patients across tens of different cancer types. Initial analyses have revealed that cancer genomes can have numerous genetic alterations, but only a subset are thought to be important for cancer initiation or progression. Further, across patients, there is a high degree of mutational heterogeneity with very few genes altered in a high fraction of cases, and many infrequently altered genes, some of which are functionally important in cancer cells. These factors significantly complicate efforts to identify cancer-related genes. Our long-term goal is to identify cancer-related genes by analyzing the genomes of cohorts of individuals with a particular cancer. The key insight underlying our work is that molecular interactions and networks reveal important aspects of protein functioning, and thus provide an important context by which to tackle the mutational heterogeneity observed across cancers.
Our specific aims are: (1) To develop structure-based methods that uncover proteins enriched in somatic mutations in their interaction interfaces, as mutations in these sites are likely to affect protein functioning. (2) To develop network-based methods for de novo discovery of pathways that are mutated across patient samples, as mutations in cancers tend to target specific pathways?even if different genes within them are mutated in different individuals?and genes proximal in networks tend to be functionally related. (3) To develop metabolite- centric methods that use protein-small molecule networks in order to uncover mutated proteins that alter cellular metabolism, as reprogrammed metabolism is increasingly being recognized as a major adaptation of cancer cells. By pursuing these three complementary and tightly coupled aims?which exploit critical but often overlooked structural and network information?we will vastly advance the state-of-the-art in computational methods for analyzing cancer genomes. These analyses will deepen our understanding of cancer biology, and will ultimately lead to better patient stratification, refined prognostic tools, and novel therapeutics. .

Public Health Relevance

A major challenge in medicine is to better understand and treat human cancers. Recently, large-scale DNA sequencing efforts have allowed us to identify, for thousands of patients, the mutations acquired in their cancers. However, uncovering which of these mutations lead to cancer initiation and progression remains a difficult task. In this proposal, we aim to develop novel computational methods to uncover genes that play an important role in cancer development; application of these approaches not only will advance our understanding of basic cancer biology but also will be a great aid in identifying new drug targets.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA208148-02
Application #
9305972
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Li, Jerry
Project Start
2016-07-01
Project End
2021-06-30
Budget Start
2017-07-01
Budget End
2018-06-30
Support Year
2
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Princeton University
Department
Biostatistics & Other Math Sci
Type
Biomed Engr/Col Engr/Engr Sta
DUNS #
002484665
City
Princeton
State
NJ
Country
United States
Zip Code
08543
Przytycki, Pawel F; Singh, Mona (2017) Differential analysis between somatic mutation and germline variation profiles reveals cancer-related genes. Genome Med 9:79
Hristov, Borislav H; Singh, Mona (2017) Network-Based Coverage of Mutational Profiles Reveals Cancer Genes. Cell Syst 5:221-229.e4