Sequence analysis of the human genome has identified approximately25,000 protein-coding genes, but little is known about how most of these genes are regulated in different tissues and stages of development. A number of approaches attempt to identify gene regulatory elements on a genome-wide scale, but there is yet to be a proven method that accurately accomplishes this goal. Only with the development of better experimental technologies will the rapid identification of regulatory elements be possible. Mapping DNase hypersensitive (HS) sites has been the gold-standard method for identifying the location of promoters, enhancers, silencers, and insulators. While this method has been proven invaluable for identifying the location of active regulatory elements for individual genes, the labor-intensive nature of this technique has limited its application to only a small number of human genes. We have developed a novel protocol to generate a genome-wide library of gene regulatory sequences by cloning DNase HS sites. As a pilot, we generated a library of DNase HS sites from quiescent primary human CD4+ T cells and analyzed 5,600 of the resulting clones. Compared to sequences from randomly generated in silico libraries, sequences from these clones were found to map more frequently to regions of the genome known to contain regulatory elements, such as regions upstream of genes, within CpG islands, and in sequences that align between mouse and human. Validation of putative regulatory elements was achieved by repeated recovery of the same sequence (clustering), and by real-time PCR. To distinguish all valid DNase HS sites from background, we estimate it is necessary to sequence approximately 1million clones. To generate this number of sequences affordably and rapidly, we will employ massively parallel signature sequencing (MPSS), a bead-based technology capable of generating 1million sequence tags per run. Preliminary data show that this technology is readily adaptable to our cloning protocol and can efficiently capture DNase HS sites. MPSS will be used to sequence DNase HS libraries from a number of different cell types, includinghuman embryonic stem cells. Comparisons to comparative genomics will determine the degree to which human DNase HS sites are shared among different species. Functional characterization of a representative sample of DNase HS sites around interesting loci of the genome will identify how these regions of the genome are regulated. Characterizing different DNase HS sites libraries will allow for a better understanding of the chromatin differences that delineate tissue specificity, housekeeping function, cell activation, pluripotency, and early cell differentiation.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Career Transition Award (K22)
Project #
5K22HG003169-04
Application #
7595044
Study Section
Subcommittee G - Education (NCI)
Program Officer
Feingold, Elise A
Project Start
2006-06-01
Project End
2010-03-31
Budget Start
2009-04-01
Budget End
2010-03-31
Support Year
4
Fiscal Year
2009
Total Cost
$270,000
Indirect Cost
Name
Duke University
Department
Pediatrics
Type
Schools of Medicine
DUNS #
044387793
City
Durham
State
NC
Country
United States
Zip Code
27705
Balasubramani, Anand; Winstead, Colleen J; Turner, Henrietta et al. (2014) Deletion of a conserved cis-element in the Ifng locus highlights the role of acute histone acetylation in modulating inducible gene transcription. PLoS Genet 10:e1003969
Shibata, Yoichiro; Sheffield, Nathan C; Fedrigo, Olivier et al. (2012) Extensive evolutionary changes in regulatory element activity during human origins are associated with altered gene expression and positive selection. PLoS Genet 8:e1002789
Tsumagari, Koji; Chang, Shao-Chi; Lacey, Michelle et al. (2011) Gene expression during normal and FSHD myogenesis. BMC Med Genomics 4:67
Song, Lingyun; Crawford, Gregory E (2010) DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. Cold Spring Harb Protoc 2010:pdb.prot5384
Balasubramani, Anand; Shibata, Yoichiro; Crawford, Gregory E et al. (2010) Modular utilization of distal cis-regulatory elements controls Ifng gene expression in T cells activated by distinct stimuli. Immunity 33:35-47
Xu, Xueqing; Tsumagari, Koji; Sowden, Janet et al. (2009) DNaseI hypersensitivity at gene-poor, FSH dystrophy-linked 4q35.2. Nucleic Acids Res 37:7381-93
Boyle, Alan P; Guinney, Justin; Crawford, Gregory E et al. (2008) F-Seq: a feature density estimator for high-throughput sequence tags. Bioinformatics 24:2537-8
Boyle, Alan P; Davis, Sean; Shulha, Hennady P et al. (2008) High-resolution mapping and characterization of open chromatin across the genome. Cell 132:311-22
(2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447:799-816
Xi, Hualin; Shulha, Hennady P; Lin, Jane M et al. (2007) Identification and characterization of cell type-specific and ubiquitous chromatin regulatory structures in the human genome. PLoS Genet 3:e136

Showing the most recent 10 out of 11 publications