Reconstructing the circuits that control how cells detect environmental triggers and adopt specific fates is a fundamental challenge across all areas of biology. Genomic research on circuitry has initially used observational approaches that infer regulation from correlations in molecular profiles, but cannot distinguish correlation from causation. Our Center has developed and successfully demonstrated an approach that uses single perturbations to determine the function of individual components. However, because interactions in circuits are non-linear, we cannot predict how the circuit will function simply by summing up these individual effects. What is needed is a massive combinatorial analysis: perturbing multiple genes simultaneously, with a compatible genomic readout. To take on this apparently intractable problem, we need to radically boost the type and scale of our experimental and analytic methods. Several advances from our groups and others provide an unprecedented framework for such massively parallel, high order combinatorial circuit analysis based on millions of experiments. First, the CRISPR/Cas9 system enables large-scale pooled, multi-locus gene perturbation in mammalian cells. Second, massively-parallel single cell genomics and proteomics, based on combinatorial bead barcoding and gel droplet microfluidics, allow global readouts from hundreds of thousands to millions of cells. Third, the mathematical theories of random matrices and compressive sensing justify substantial reduction in the sampling of an otherwise enormous combinatorial space under biological realistic and testable hypotheses. Here, we will develop a set of Massively Parallel Combinatorial Perturbation (MCPP) assays, as cost-effective methods to measure genomic profiles in individual cells commensurate with the scale required for high order combinatorial pooled perturbation screens (Aim 1). To analyze data generated with these methods, we will develop methods to: generate combinatorial genetic models from an under-sampled high-order combinatorial space, infer molecular mechanisms that explain the genetic models, and tackle the scale and noise of multiple types of single cell measurements (Aim 2). We will perform massive combinatorial perturbations and profiling to derive a genetic model of the transcriptional response to pathogens in dendritic cells, and then develop a dynamic molecular model that integrates the genetic model with high- resolution measurements of diverse molecular changes together with the RNA and protein life cycle (Aim 3). We will apply similar approaches to study cell fate transitions and maintenance in developing embryoid bodies, to build a combinatorial genetic model of how transcription and chromatin factors drive, stabilize or resist cell differentiation inan inherently heterogeneous population (Aim 4). Our studies will develop broadly-applicable methods for large-scale pooled combinatorial genetic perturbation with massive single cell genomic profiling of mammalian cells, and will generate the first genomic-scale quantitative combinatorial circuit models. We will share these approaches broadly with the community, enabling their application to diverse biological circuits.

Public Health Relevance

It is difficult to predict how therapies will act because in biological systems the 'whole is greater than the sum of its parts'. To be able to make such predictions, we will develop a new strategy that manipulates multiple genes at a time to build a mathematical model. This will eventually enable more rational therapeutics.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project with Complex Structure (RM1)
Project #
Application #
Study Section
National Human Genome Research Institute Initial Review Group (GNOM)
Program Officer
Felsenfeld, Adam
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Broad Institute, Inc.
United States
Zip Code
Martin-Gayo, Enrique; Cole, Michael B; Kolb, Kellie E et al. (2018) A Reproducibility-Based Computational Framework Identifies an Inducible, Enhanced Antiviral State in Dendritic Cells from HIV-1 Elite Controllers. Genome Biol 19:10
Donaghey, Julie; Thakurela, Sudhir; Charlton, Jocelyn et al. (2018) Genetic determinants and epigenetic effects of pioneer-factor occupancy. Nat Genet 50:250-258
Vian, Laura; P?kowska, Aleksandra; Rao, Suhas S P et al. (2018) The Energetics and Physiological Impact of Cohesin Extrusion. Cell 173:1165-1178.e20
Ordovas-Montanes, Jose; Dwyer, Daniel F; Nyquist, Sarah K et al. (2018) Allergic inflammatory memory in human respiratory epithelial progenitor cells. Nature 560:649-654
Mead, Benjamin E; Ordovas-Montanes, Jose; Braun, Alexandra P et al. (2018) Harnessing single-cell genomics to improve the physiological fidelity of organoid-derived cell types. BMC Biol 16:62
Tanay, Amos; Regev, Aviv (2017) Scaling single-cell genomics from phenomenology to mechanism. Nature 541:331-338
Smith, Zachary D; Shi, Jiantao; Gu, Hongcang et al. (2017) Epigenetic restriction of extraembryonic lineages mirrors the somatic transition to cancer. Nature 549:543-547
Xiong, Kan; Blainey, Paul C (2017) A Simple, Robust, and High Throughput Single Molecule Flow Stretching Assay Implementation for Studying Transport of Molecules Along DNA. J Vis Exp :
Cleary, Brian; Cong, Le; Cheung, Anthea et al. (2017) Efficient Generation of Transcriptomic Profiles by Random Composite Measurements. Cell 171:1424-1436.e18
Shalek, Alex K; Benson, Mikael (2017) Single-cell analyses to tailor treatments. Sci Transl Med 9:

Showing the most recent 10 out of 36 publications