The goal of this proposed mentored research is to tie genetic variation to disease by analyzing regions that are intolerant to variation. Identifying regions that are intolerant to new variation can help localize regions of potential functional importance and biologic relevance. Large public population consortia are now accumulating datasets of sufficient size to detect regions subject to evolutionary selective pressures at an increasingly granular level. However, there remains a shortage of appropriate analytical tools that are built to specifically address important issues of disease heterogeneity across diverse populations. Despite the fact that clinical exome sequencing is increasingly used for improved diagnostic evaluation, many genetic disorders remain uncharacterized and diagnosis rates are still relatively low.
In Aim 1, I will develop methodology that localizes regions intolerant to variation and differential isoform expression associated with disease. Many genes display tissue dependent transcript isoforms indicating potential functional implications of different isoforms. I will characterize selective pressure across all isoforms using Bayesian techniques by looking at patterns of genetic constraint across large standing populations, predominantly leveraging public data sets on the order of hundreds of thousands of samples. Then I will leverage existing expression data to isolate key isoforms across different cell and tissue types that are associated with diseases of interest. Then by accounting for regional intolerance to variation, a joint transcriptomic variation?intolerance approach can be employed to improve disease association testing.
In Aim 2, I will analyze ancestry and cross species patterns of genetic intolerance to variation. The majority of genetic studies have focused on European populations, which ignores genetic and phenotypic diversity that can be leveraged to improve both targeted and overall diagnostic and clinical capabilities. I will test for ancestry and cross species patterns of genetic intolerance to variation and association with disease. Expanding to more populations will scale up the already large set of parameters being estimated; so, I will develop new statistical methods and software to improve optimization of parameter estimation for the Bayesian hierarchical models. I will isolate key ancestral populations with known differences in selective pressure to validate findings while then leveraging these new methods and population disease patterns further to isolate novel signals of ancestry-specific selective pressures. I will look for conserved regions across species to isolate essential exonic regions while also isolating unique regions in the context of human specific genetic variation and disease, such as neurodevelopmental disorders. During the training time for this proposed study I will focus on advancing my understanding of biologic mechanisms and clinical genetics to better inform the statistical genetics methods I develop. My mentorship and advisory committee consists of a strong multidisciplinary team of geneticists, pathologists, computational scientists, and biologists who will guide and collaborate with me to refine my work to improve variant interpretation and to advance precision medicine.

Public Health Relevance

Regions in the genome that are intolerant to new variation indicate potential functional importance and provide insights into genetic variation and disease. Despite the fact that clinical exome sequencing is increasingly used for diagnostic evaluation of patients with suspected Mendelian disorders and that rare variation has been shown to confer substantial disease risk in complex traits as well, a large percentage of Mendelian disorders remains unresolved and diagnostic rates of exome sequencing remains low. With the proposal of new statistical methods and user-friendly software implementations, this K01 aims to better account for ancestry and tissue-specific transcriptional intolerance to genetic variation in order to improve association mapping between genetic variation and disease and to ultimately advance precision medicine diagnostics and variant interpretation.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Scientist Development Award - Research & Training (K01)
Project #
1K01HG010498-01A1
Application #
10127009
Study Section
National Human Genome Research Institute Initial Review Group (GNOM)
Program Officer
Ramos, Erin
Project Start
2021-01-05
Project End
2024-12-31
Budget Start
2021-01-05
Budget End
2021-12-31
Support Year
1
Fiscal Year
2021
Total Cost
Indirect Cost
Name
Children's Hospital of Philadelphia
Department
Type
DUNS #
073757627
City
Philadelphia
State
PA
Country
United States
Zip Code
19146