The recent development of single cell RNA-seq protocols enabled genomewide investigation of organismal systems at the cellular level, opening many new biological questions for study. Single cell resolution allows characterization of rare or unknown cell types, enables dissection of differentiation processes, and aids in decoding regulatory networks responsible for healthy and diseased states of cells. However, current single cell RNA-seq studies are limited by crucial gaps in existing computational methods. We have devised strategies to address three key limitations of current single cell RNA-seq analysis methods: (1) lack of models for isoform-specific expression, (2) inability to link gene expression differences with measurable changes in cell function, and (3) lack of methods for studying sequential progression of gene expression changes. To address the first shortcoming, we developed SingleSplice, an algorithm for identifying genes whose isoform ratios vary more than expected by chance across a set of single cells (Aim 1). We have also developed a novel microraft platform that allows culturing, functional characterization, isolation, and subsequent sequencing of single cells. Using data generated from this platform, we will perform supervised machine learning to identify genes linked to functional differences among cells (Aim 2). To address the third limitation, we will use locally linear embedding, a nonlinear dimensionality reduction technique, to identify trajectories of cells proceeding through sequential processes such as development and response to stimuli (Aim 3). We will apply our methods to our own data generated from microraft experiments, as well as publicly available single cell RNA-seq data from developing lung tissue and immune cells responding to immune stimulation. Using data from experiments in which spike-in transcripts are added at constant, known amounts to cells to mimic an alternative splicing change, we found that SingleSplice detects isoform switching with high sensitivity (73%) and specificity (93%). We used microrafts to sequence single cells from a pancreatic cancer cell line and found that this approach produced high-quality data comparable to that from the Fluidigm C1. The microraft technology also enabled us to sequence RNA from pancreatic cancer cells after gemcitabine treatment and measure the proliferation of the cells, identifying both drug resistant cells that divide and cels that do not proliferate, giving a dataset with matched functional and transcriptomic measurements. Preliminary investigation of a dataset in which dendritic cells were stimulated with bacterial lipopolysaccharide (LPS) shows that locally linear embedding (LLE) can order cells according to the length of time they have been exposed to LPS.

Public Health Relevance

Until very recently, studies have relied on aggregate measurements taken from bulk samples of cells, but in many important health-related applications, the information lost by lumping together thousands of cells poses a real barrier to understanding. Neurological conditions, cancer, and stem cell therapies are among the many areas that call for single cell RNA-seq methods. We propose to develop broadly applicable computational methods that will eliminate key limitations of current analyses and enable groundbreaking research in many domains important for public health.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Predoctoral Individual National Research Service Award (F31)
Project #
5F31HG008912-02
Application #
9285609
Study Section
Special Emphasis Panel (ZRG1-F08-B (20)L)
Program Officer
Gatlin, Christine L
Project Start
2016-06-01
Project End
2018-05-31
Budget Start
2017-06-01
Budget End
2018-05-31
Support Year
2
Fiscal Year
2017
Total Cost
$33,259
Indirect Cost
Name
University of North Carolina Chapel Hill
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
608195277
City
Chapel Hill
State
NC
Country
United States
Zip Code
27599
Liu, Ziqing; Wang, Li; Welch, Joshua D et al. (2017) Single-cell transcriptomics reconstructs fate conversion from fibroblast to cardiomyocyte. Nature 551:100-104
Welch, Joshua D; Hartemink, Alexander J; Prins, Jan F (2017) MATCHER: manifold alignment reveals correspondence between single cell transcriptome and epigenome dynamics. Genome Biol 18:138
Welch, Joshua D; Williams, Lindsay A; DiSalvo, Matthew et al. (2016) Selective single cell isolation for genomics using microraft arrays. Nucleic Acids Res 44:8292-301
Welch, Joshua D; Hartemink, Alexander J; Prins, Jan F (2016) SLICER: inferring branched, nonlinear cellular trajectories from single cell RNA-seq data. Genome Biol 17:106
Welch, Joshua D; Hu, Yin; Prins, Jan F (2016) Robust detection of alternative splicing in a population of single cells. Nucleic Acids Res 44:e73