The genome of each individual harbors millions of nucleotide variants, and a major challenge is to understand how these variants contribute to phenotypic variations in the population. We propose a combined computational and experimental framework for identifying non-coding variants that affect cellular and physiological traits, with the goal to establish computational models that can predict the probability of exhibiting a physiological trait from the sequences of non-coding genomic regions. This framework involves iterative refinement of model assumptions and parameters with experimentation. To develop the framework and validate the predictive models, we will focus on the disease Age-related Macular Degeneration (AMD), the leading cause of blindness among the elderly in the country. Previous studies have identified a number of sequence variants strongly associated with AMD. We will develop computational models to predict (or narrow down) the set of non-coding sequence variants that contribute to the disease phenotype. As experimental assessment, we will perform genome editing in patient-derived induced pluripotent stem cells (iPSC) to test the consequence of removing or introducing such sequence variants on molecular and cellular phenotypes in cell culture and in rodent models. While the proposed method is developed for AMD, the general approach is expected to apply to other genetic diseases.

Public Health Relevance

Many common human disorders are due to variations in the DNA sequence. With the rapid progress of DNA sequencing technologies, sequencing one's entire genome is becoming routine, and it is of paramount importance to understand how the sequence variants in each person's genome contribute to his/her unique phenotypic traits. In this project, we will develop a general strategy to establish computational models that can predict the probability of disease phenotype from one's genome sequence. We will develop this strategy by focusing on Age-related Macular Degeneration (AMD), the leading cause of blindness among the elderly in the country.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG008135-03
Application #
9294140
Study Section
Special Emphasis Panel (ZHG1)
Program Officer
Pazin, Michael J
Project Start
2015-08-24
Project End
2019-06-30
Budget Start
2017-07-01
Budget End
2019-06-30
Support Year
3
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Ludwig Institute for Cancer Research Ltd
Department
Type
DUNS #
627922248
City
La Jolla
State
CA
Country
United States
Zip Code
92093
Nguyen, Tri C; Zaleta-Rivera, Kathia; Huang, Xuerui et al. (2018) RNA, Action through Interactions. Trends Genet 34:867-882
Lu, Jia; Cao, Xiaoyi; Zhong, Sheng (2018) A likelihood approach to testing hypotheses on the co-evolution of epigenome and genome. PLoS Comput Biol 14:e1006673
Zheng, Alvin; Cao, Xiaoyi; Zhong, Sheng (2017) Pattern-based Search of Epigenomic Data Using GeNemo. J Vis Exp :
Yu, Miao; Ren, Bing (2017) The Three-Dimensional Organization of Mammalian Genomes. Annu Rev Cell Dev Biol 33:265-289
Xiao, Shu; Lu, Jia; Sridhar, Bharat et al. (2017) SMARCAD1 Contributes to the Regulation of Naive Pluripotency by Interacting with Histone Citrullination. Cell Rep 18:3117-3128
Diao, Yarui; Fang, Rongxin; Li, Bin et al. (2017) A tiling-deletion-based genetic screen for cis-regulatory element identification in mammalian cells. Nat Methods 14:629-635
Wang, Wenqiu; Gawlik, Katarzyna; Lopez, Joe et al. (2016) Genetic and environmental factors strongly influence risk, severity and progression of age-related macular degeneration. Signal Transduct Target Ther 1:16016
Zhang, Yongqing; Cao, Xiaoyi; Zhong, Sheng (2016) GeNemo: a search engine for web-based functional genomic data. Nucleic Acids Res 44:W122-7
Skowronska-Krawczyk, Dorota; Zhao, Ling; Zhu, Jie et al. (2015) P16INK4a Upregulation Mediated by SIX6 Defines Retinal Ganglion Cell Pathogenesis in Glaucoma. Mol Cell 59:931-40