Mapping the genomic organization of modified histones and defining locations of DNA binding proteins is critical for deep understanding of the regulatory mechanisms governing cellular states. However, the primary methodology used for mapping DNA associated proteins ? chromatin immunoprecipitation followed by sequencing (ChIP-seq) ? has several major limitations: (i) standard ChIP-seq is unable to profile more than one epitope at a time in one sample; (ii) the immunoprecipitation step is inefficient, resulting in low signal to noise ratios; and (iii) ChIP-seq signal is effectively an average readout from large populations of cells due to input material requirements. Thus, our challenge is to simultaneously study the organization of multiple DNA binding proteins from a single sample. Our proposed project will develop a novel methodology termed `Whip- seq' that is designed to overcome these limitations, in part by avoiding immunoprecipitation altogether through the use of long specialized oligonucleotides and DNA barcoding. The Whip-seq method aspires to provide a simple, rapid, low-cost means to address many to date unanswerable questions in chromatin biology. In the experiments outlined in this proposal, we will: Develop the Whip-seq method in a stepwise manner in vitro (Aim1); implement a computational framework for data analysis (Aim 2); and establish the performance of the new assay using in vivo mouse ES cells (Aim 3). Whip-seq has the potential to transform the mapping of chromatin into a highly sensitive, cheap and robust process that can be carried out in any molecular biology laboratory. Whip-seq will provide a means to obtain an unprecedented view of the different combinatorial signatures of chromatin states and how these signatures may be dysregulated in disease, e.g., cancer. The high-resolution mapping, and the ease of implementation, may enable Whip-seq to serve as a diagnostic tool.

Public Health Relevance

Proteins that bind to DNA play vital roles in determining which genes are active in cells, and this in turn defines the functions of each individual cell. The current standard method for investigating the presence and location of proteins on DNA (ChIP-seq) is limited to mapping a single protein on the DNA, and has other inefficiencies. This project will develop a powerful new technique that will overcome these limitations, in particular to simultaneously map multiple proteins and provide a highly sensitive and cheap tool to answer questions such as how combinations of these proteins are organized in normal development or diseases.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Exploratory/Developmental Grants (R21)
Project #
Application #
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Pazin, Michael J
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California, San Diego
Internal Medicine/Medicine
Schools of Medicine
La Jolla
United States
Zip Code