A single stem or progenitor cell can give rise to a breathtaking diversity of differentiated cell types, but our understanding of how single cells choose their fate is limited. This is because each cell's fate decision is regulated by both molecular and environmental factors, and it is challenging to tease these effects apart. To this end, recent advances in the ability to sequence the molecular contents of a given cell (i.e., single-cell RNA-seq) represent a potentially transformative development. However, while these methods can be applied to hundreds or even thousands of cells, they return only a gene expression matrix, the equivalent of a hypothetical study that sequenced thousands of human genomes but recorded no information about each patient. What is missing is the 'metadata' of the cell: What is its regulatory and developmental state? Where was it located in situ? Who were its parents and siblings? To understand cellular decision making, we need to perform an integrated analysis of a cell's transcriptome, environment, and lineage, but unfortunately we lack the tools to directly measure these parameters simultaneously. To address this challenge, I hypothesize that the cellular 'metadata' is encoded in gene expression, and therefore can be inferred from single-cell RNA-seq datasets. Here, I propose to develop an integrated experimental and computational framework to simultaneously learn the transcriptome and 'metadata' from thousands of single cells. I will design strategies to analyze single-cell gene expression and learn a cell's regulatory state, pinpoint its environmental milieu, and reconstruct its lineage relationships. I will apply these methods to systematically decipher the regulation of cell fate during the development of the mammalian immune and nervous systems. If successful, this work will also provide a general and widely applicable strategy to study how the interaction between molecular and environmental factors governs cell behavior.

Public Health Relevance

The goal of my proposal is to develop an approach to simultaneously measure a single cell's transcriptome, spatial environment, and lineage relationships. I will apply these tools to study how progenitor cells in the immune and nervous systems integrate diverse cues to choose their terminal subtype and fate. Understanding the balance of factors that influence cellular differentiation will provide key insights into the proper diagnosis and treatment of developmental disorders.

National Institutes of Health (NIH)
National Human Genome Research Institute (NHGRI)
NIH Director’s New Innovator Awards (DP2)
Study Section
Special Emphasis Panel (ZRG1-MOSS-C (56)R)
Program Officer
Pazin, Michael J
New York Genome Center
New York
United States
Mayer, Christian; Hafemeister, Christoph; Bandler, Rachel C et al. (2018) Developmental diversification of cortical inhibitory interneurons. Nature 555:457-462
Konstantinides, Nikolaos; Kapuralin, Katarina; Fadil, Chaimaa et al. (2018) Phenotypic Convergence: Distinct Transcription Factors Regulate Common Terminal Features. Cell 174:622-635.e13
Butler, Andrew; Hoffman, Paul; Smibert, Peter et al. (2018) Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol 36:411-420
Stephenson, William; Donlin, Laura T; Butler, Andrew et al. (2018) Single-cell RNA-seq of rheumatoid arthritis synovial tissue using low-cost microfluidic instrumentation. Nat Commun 9:791
Villani, Alexandra-Chloé; Satija, Rahul; Reynolds, Gary et al. (2017) Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors. Science 356:
Stoeckius, Marlon; Hafemeister, Christoph; Stephenson, William et al. (2017) Simultaneous epitope and transcriptome measurement in single cells. Nat Methods 14:865-868
Velasco, Silvia; Ibrahim, Mahmoud M; Kakumanu, Akshay et al. (2017) A Multi-step Transcriptional and Chromatin State Cascade Underlies Motor Neuron Programming from Embryonic Stem Cells. Cell Stem Cell 20:205-217.e8
Breton, Gaëlle; Zheng, Shiwei; Valieris, Renan et al. (2016) Human dendritic cells (DCs) are derived from distinct circulating precursors that are precommitted to become CD1c+ or CD141+ DCs. J Exp Med 213:2861-2870