Reconstructing the circuits that control how cells detect environmental triggers and adopt specific fates is a fundamental challenge across all areas of biology. Genomic research on circuitry has initially used observational approaches that infer regulation from correlations in molecular profiles, but cannot distinguish correlation from causation. Our Center has developed and successfully demonstrated an approach that uses single perturbations to determine the function of individual components. However, because interactions in circuits are non-linear, we cannot predict how the circuit will function simply by summing up these individual effects. What is needed is a massive combinatorial analysis: perturbing multiple genes simultaneously, with a compatible genomic readout. To take on this apparently intractable problem, we need to radically boost the type and scale of our experimental and analytic methods. Several advances from our groups and others provide an unprecedented framework for such massively parallel, high order combinatorial circuit analysis based on millions of experiments. First, the CRISPR/Cas9 system enables large-scale pooled, multi-locus gene perturbation in mammalian cells. Second, massively-parallel single cell genomics and proteomics, based on combinatorial bead barcoding and gel droplet microfluidics, allow global readouts from hundreds of thousands to millions of cells. Third, the mathematical theories of random matrices and compressive sensing justify substantial reduction in the sampling of an otherwise enormous combinatorial space under biological realistic and testable hypotheses. Here, we will develop a set of Massively Parallel Combinatorial Perturbation (MCPP) assays, as cost-effective methods to measure genomic profiles in individual cells commensurate with the scale required for high order combinatorial pooled perturbation screens (Aim 1). To analyze data generated with these methods, we will develop methods to: generate combinatorial genetic models from an under-sampled high-order combinatorial space, infer molecular mechanisms that explain the genetic models, and tackle the scale and noise of multiple types of single cell measurements (Aim 2). We will perform massive combinatorial perturbations and profiling to derive a genetic model of the transcriptional response to pathogens in dendritic cells, and then develop a dynamic molecular model that integrates the genetic model with high- resolution measurements of diverse molecular changes together with the RNA and protein life cycle (Aim 3). We will apply similar approaches to study cell fate transitions and maintenance in developing embryoid bodies, to build a combinatorial genetic model of how transcription and chromatin factors drive, stabilize or resist cell differentiation inan inherently heterogeneous population (Aim 4). Our studies will develop broadly-applicable methods for large-scale pooled combinatorial genetic perturbation with massive single cell genomic profiling of mammalian cells, and will generate the first genomic-scale quantitative combinatorial circuit models. We will share these approaches broadly with the community, enabling their application to diverse biological circuits.

Public Health Relevance

It is difficult to predict how therapies will act because in biological systems the 'whole is greater than the sum of its parts'. To be able to make such predictions, we will develop a new strategy that manipulates multiple genes at a time to build a mathematical model. This will eventually enable more rational therapeutics.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project with Complex Structure (RM1)
Project #
Application #
Study Section
Genome Research Review Committee (GNOM-G)
Program Officer
Felsenfeld, Adam
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Broad Institute, Inc.
Research Institutes
United States
Zip Code
Smargon, Aaron A; Cox, David B T; Pyzocha, Neena K et al. (2017) Cas13b Is a Type VI-B CRISPR-Associated RNA-Guided RNase Differentially Regulated by Accessory Proteins Csx27 and Csx28. Mol Cell 65:618-630.e7
Mertins, Philipp; Przybylski, Dariusz; Yosef, Nir et al. (2017) An Integrative Framework Reveals Signaling-to-Transcription Events in Toll-like Receptor Signaling. Cell Rep 19:2853-2866
Tanay, Amos; Regev, Aviv (2017) Scaling single-cell genomics from phenomenology to mechanism. Nature 541:331-338
Di Pierro, Michele; Cheng, Ryan R; Lieberman Aiden, Erez et al. (2017) De novo prediction of human chromosome structures: Epigenetic marking patterns encode genome architecture. Proc Natl Acad Sci U S A 114:12126-12131
Dudchenko, Olga; Batra, Sanjit S; Omer, Arina D et al. (2017) De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356:92-95
Shalek, Alex K; Benson, Mikael (2017) Single-cell analyses to tailor treatments. Sci Transl Med 9:
Cleary, Brian; Cong, Le; Cheung, Anthea et al. (2017) Efficient Generation of Transcriptomic Profiles by Random Composite Measurements. Cell 171:1424-1436.e18
Gierahn, Todd M; Wadsworth 2nd, Marc H; Hughes, Travis K et al. (2017) Seq-Well: portable, low-cost RNA sequencing of single cells at high throughput. Nat Methods 14:395-398
Phanstiel, Douglas H; Van Bortle, Kevin; Spacek, Damek et al. (2017) Static and Dynamic DNA Loops form AP-1-Bound Activation Hubs during Macrophage Development. Mol Cell 67:1037-1048.e6
Abudayyeh, Omar O; Gootenberg, Jonathan S; Konermann, Silvana et al. (2016) C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector. Science 353:aaf5573

Showing the most recent 10 out of 20 publications