Genetic heterogeneity is a common feature of many diseases, with different causal variants, or mutations, present in different individuals with the disease. Genetic heterogeneity complicates the identification of the genetic basis of disease, as any modest sized study will contain individuals with different causal genetic variants. One reason for this heterogeneity is that causal variants are present in groups of genes that interact in various cellular signaling and regulatory pathways. Genetic heterogeneity demands the testing of combinations of variants, rather than individual variants, for association with a disease. However, while individual variants can be tested exhaustively for association, combinations of variants cannot, as there are too many combinations to test, and the number of samples required for statistical significance would be astronomical. We propose to develop new computational and statistical approaches to identify combinations of variants that are associated with a disease. In contrast to existing approaches, we do not restrict attention to known pathways or groups of genes a priori. Rather, our algorithms utilize genome-scale interaction networks and combinational/statistical constraints to identify combinations of variants and rigorously assess their statistical significance. Further, we extend these approaches to find associations between combinations of variants and various clinical parameters such as survival time or response to treatment. We will apply these techniques to cancer genome sequencing projects including The Cancer Genome Atlas (TCGA), in collaboration with several biomedical research groups. Successful completion of the proposed research will facilitate the study of genetically heterogeneous diseases - and in particular cancer - using only a modest number of samples that is attainable with present DNA sequencing technologies.

Public Health Relevance

Identifying the inherited genetic differences associated with a disease and the acquired mutations that lead to cancer are major challenges in medicine. Next-generation DNA sequencing technologies enable measurement of these genetic variants, but interpreting the resulting data demands new computational and statistical approaches. This is particularly true as many diseases are heterogeneous, with many possible genetic causes. We will develop novel algorithms to aid in the discovery of disease-causing genetic variants that will enable better diagnostics and/or personalized treatments for various diseases.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
1R01HG007069-01
Application #
8417550
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Struewing, Jeffery P
Project Start
2013-09-01
Project End
2016-08-31
Budget Start
2013-09-01
Budget End
2014-08-31
Support Year
1
Fiscal Year
2013
Total Cost
$373,001
Indirect Cost
$143,462
Name
Brown University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
001785542
City
Providence
State
RI
Country
United States
Zip Code
02912
El-Kebir, Mohammed; Satas, Gryte; Raphael, Benjamin J (2018) Inferring parsimonious migration histories for metastatic cancers. Nat Genet 50:718-726
Oesper, Layla; Dantas, Simone; Raphael, Benjamin J (2017) Identifying simultaneous rearrangements in cancer genomes. Bioinformatics :
Cancer Genome Atlas Research Network. Electronic address: andrew_aguirre@dfci.harvard.edu; Cancer Genome Atlas Research Network (2017) Integrated Genomic Characterization of Pancreatic Ductal Adenocarcinoma. Cancer Cell 32:185-203.e13
Nakka, Priyanka; Archer, Natalie P; Xu, Heng et al. (2017) Novel Gene and Network Associations Found for Acute Lymphoblastic Leukemia Using Case-Control and Family-Based Studies in Multiethnic Populations. Cancer Epidemiol Biomarkers Prev 26:1531-1539
Leiserson, Mark D M; Reyna, Matthew A; Raphael, Benjamin J (2016) A weighted exact test for mutually exclusive mutations in cancer. Bioinformatics 32:i736-i745
El-Kebir, Mohammed; Satas, Gryte; Oesper, Layla et al. (2016) Inferring the Mutational History of a Tumor Using Multi-state Perfect Phylogeny Mixtures. Cell Syst 3:43-53
Nakka, Priyanka; Raphael, Benjamin J; Ramachandran, Sohini (2016) Gene and Network Analysis of Common Variants Reveals Novel Associations in Multiple Complex Diseases. Genetics 204:783-798
Leiserson, Mark D M; Gramazio, Connor C; Hu, Jason et al. (2015) MAGI: visualization and collaborative annotation of genomic aberrations. Nat Methods 12:483-4
Vandin, Fabio; Papoutsaki, Alexandra; Raphael, Benjamin J et al. (2015) Accurate computation of survival statistics in genome-wide studies. PLoS Comput Biol 11:e1004071
Leiserson, Mark D M; Vandin, Fabio; Wu, Hsin-Ta et al. (2015) Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes. Nat Genet 47:106-14

Showing the most recent 10 out of 18 publications