Large-scale efforts are underway to systematically map transcription factors binding sites throughout the human genome. The ENCODE project has focused its initial attention on two cell lines, 1) K562 cells, a myeloid precursor cell line and 2) GM12878, a lymphoblastoid cell line, and our laboratory has mapped the binding sites of a large number of transcription factor expressed in these cells. To study their conservation and help provide functional information into these binding sites and to determine if these sites are occupied in vivo, we propose two types of studies. First, we will map the binding sites of at least 30 transcription factor orthologs that have been analyzed in the human ENCODE project in mouse MEL and CH12 cells which are analogous to K562 and GM12878 cells, respectively. Second, we will map the binding sites of Pol II and nine other factors in cells differentiated from human CD34+ cells and primary erythroid mouse cells. These studies will determine which transcription factor binding sites and gene targets are conserved in vertebrates and which are species-specific as well as determine the extent to which targets mapped in cultured cell lines reflect in vivo binding sites. The information from these studies will be deposited into public databases and is expected to be extremely valuable to the large mouse and human genetic communities.

Public Health Relevance

The ENCODE project has produced relatively large amounts of data on transcription factor binding and RNA expression in a limited number of human cell lines. We propose to extend these results by obtaining mouse cell lines at similar states of differentiation to human cell lines. We will then duplicate the experiments that have been done in human cells, and locate control elements based on sequence conservation and similarities in factor binding between the two species. We will also determine if elements identified in vitro are occupied in cells isolated from organisms.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
High Impact Research and Research Infrastructure Programs (RC2)
Project #
5RC2HG005602-02
Application #
8133494
Study Section
Special Emphasis Panel (ZHG1-HGR-M (O1))
Program Officer
Feingold, Elise A
Project Start
2009-09-30
Project End
2012-08-31
Budget Start
2010-09-01
Budget End
2012-08-31
Support Year
2
Fiscal Year
2010
Total Cost
$656,251
Indirect Cost
Name
Stanford University
Department
Genetics
Type
Schools of Medicine
DUNS #
009214214
City
Stanford
State
CA
Country
United States
Zip Code
94305
Sundaram, Vasavi; Cheng, Yong; Ma, Zhihai et al. (2014) Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res 24:1963-76
Pope, Benjamin D; Ryba, Tyrone; Dileep, Vishnu et al. (2014) Topologically associating domains are stable units of replication-timing regulation. Nature 515:402-5
Yue, Feng; Cheng, Yong; Breschi, Alessandra et al. (2014) A comparative encyclopedia of DNA elements in the mouse genome. Nature 515:355-64
Cheng, Yong; Ma, Zhihai; Kim, Bong-Hyun et al. (2014) Principles of regulatory information conservation between mouse and human. Nature 515:371-375
Wu, Jia Qian; Seay, Montrell; Schulz, Vincent P et al. (2012) Tcf7 is an important regulator of the switch of self-renewal and differentiation in a multipotential hematopoietic cell line. PLoS Genet 8:e1002565