The overall aim of the ENCODE project is to comprehensively identify functional elements in the human genome. Currently applicable high-throughput technologies, such as RNA-Seq, ChIP-Seq, and DNase-Seq, exploit patterns of marks to infer the role of specific sequences, but generally fall short of functionally interrogatig and thereby validating these predictions. To address this gap, we propose a novel paradigm for the massively parallel functional testing of candidate regulatory elements. In preliminary work, we have developed a system whereby sequence-based transcribed barcodes enable the extensive multiplexing of classic reporter assays, in vitro or in vivo. Here, we propose to adapt this approach for testing tens-of- thousands of human regulatory elements in single assays, and furthermore to shift these assays from an episomal to a chromosomal context.
Our specific aims are: (1) To develop high-throughput methods to clone, by capture or by synthesis, large numbers of candidate regulatory elements and to link them to transcribed, synthetic barcodes within complex populations of reporter vectors. (2) To test in parallel tens-of-thousands of candidate regulatory elements nominated by liver ChIP-Seq for in vitro and in vivo activity using HepG2 transfections and the hydrodynamic tail vein assay, with RNA-Seq of the synthetic barcodes serving as a single readout for the differential activity of distinct candidate regulatory elements. (3) To develop a similarly multiplexed lentiviral assay for regulatory element analysis that is chromosomally based and generically applicable to diverse cell and tissue types. We anticipate that these methods can be scaled for the efficient, in vivo functional testing of large numbers of candidate regulatory elements nominated by other technologies. Furthermore, our approach can easily be adopted by other researchers and used for many related goals, such as testing which regulatory elements work together, dissecting the fine-scale architecture of individual regulatory elements, and evaluating the performance of synthetic regulatory elements.

Public Health Relevance

As we enter an era of personalized medicine, a deep understanding of the human genome will be increasingly important to public health, contributing towards the unraveling of the genetic basis of human disease, as well as serving an increasing role in clinical diagnostics. Regulatory sequences in the human genome, that is, sequences that are functionally important but do not encode proteins, are clearly of fundamental importance but are nonetheless poorly understood. This project will develop novel technologies for the parallel validation of large numbers of candidate regulatory sequences, thereby furthering our understanding of their function.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG006768-02
Application #
8464386
Study Section
Special Emphasis Panel (ZHG1-HGR-M (J2))
Program Officer
Pazin, Michael J
Project Start
2012-04-24
Project End
2015-02-28
Budget Start
2013-03-01
Budget End
2014-02-28
Support Year
2
Fiscal Year
2013
Total Cost
$593,430
Indirect Cost
$115,408
Name
University of Washington
Department
Genetics
Type
Schools of Medicine
DUNS #
605799469
City
Seattle
State
WA
Country
United States
Zip Code
98195
VanderMeer, Julia E; Lozano, Reymundo; Sun, Miao et al. (2014) A novel ZRS mutation leads to preaxial polydactyly type 2 in a heterozygous form and Werner mesomelic syndrome in a homozygous form. Hum Mutat 35:945-8
Erwin, Genevieve D; Oksenberg, Nir; Truty, Rebecca M et al. (2014) Integrating diverse datasets improves developmental enhancer prediction. PLoS Comput Biol 10:e1003677
Birnbaum, Ramon Y; Patwardhan, Rupali P; Kim, Mee J et al. (2014) Systematic dissection of coding exons at single nucleotide resolution supports an additional role in cell-specific transcriptional regulation. PLoS Genet 10:e1004592
VanderMeer, Julia E; Smith, Robin P; Jones, Stacy L et al. (2014) Genome-wide identification of signaling center enhancers in the developing limb. Development 141:4194-8
Kim, Mee J; Oksenberg, Nir; Hoffmann, Thomas J et al. (2014) Functional characterization of SIM1-associated enhancers. Hum Mol Genet 23:1700-8
Oksenberg, N; Haliburton, G D E; Eckalbar, W L et al. (2014) Genome-wide distribution of Auts2 binding localizes with active neurodevelopmental genes. Transl Psychiatry 4:e431
Smith, Robin P; Eckalbar, Walter L; Morrissey, Kari M et al. (2014) Genome-wide discovery of drug-dependent human liver regulatory elements. PLoS Genet 10:e1004648
Booker, Betty M; Murphy, Karl K; Ahituv, Nadav (2013) Functional analysis of limb enhancers in the developing fin. Dev Genes Evol 223:395-9
Smith, Robin P; Taher, Leila; Patwardhan, Rupali P et al. (2013) Massively parallel decoding of mammalian regulatory sequences supports a flexible organizational model. Nat Genet 45:1021-8
Zhao, Jingjing; Shi, Hongbo; Ahituv, Nadav (2013) Classification of topological domains based on gene expression and regulation. Genome 56:415-23

Showing the most recent 10 out of 11 publications