Computational methods for modeling lineage-specific gene regulation

Pritchard, Jonathan

Abstract

Complex sensory and signaling pathways ultimately converge on the DNA in the form of complexes of interacting regulatory RNAs and proteins that bind specific gene-proximal and distal regulatory elements to active and repress genes resulting in the establishment and maintenance of cell-type specific transcriptional responses. These regulatory elements have static and dynamic components - the former encoded in the genomic sequence itself as combinations of transcription factor binding site sequence motifs and the latter a consequence of locus and cell-type specific chromatin accessibility and epigenomic modifications as well as the dynamic linking of distal regulatory elements to target genes. In addition, transcriptional and post-transcriptional feedback and control of the levels of regulatory proteins and RNAs also play a key role in dynamic cell-type specific gene regulation. One of the central challenges in modern genomics is to learn these rules by which the genome and epigenome encode regulatory information. How does a single genome sequence encode the information for exquisitely precise, and yet highly distinctive programs of gene regulation for different cell types, and at different time points? What are the key sequence determinants, and grammatical rules that determine the function of any given DNA sequence under any particular set of conditions? In this project we seek to use public data from The Roadmap Epigenomics project and The ENCODE project to make progress on these questions. We propose to build methods for interpreting shared and distinctive regulatory features, including especially transcription factor binding, across related cell types. In parallel, we will implement novel methods for identifying high-resolution, context-specific dynamic regulatory elements, decipher their underlying regulatory sequence grammars and learn predictive, integrative models of transcriptional regulation to decipher the effects of heterogeneous regulatory components on lineage-specific gene expression dynamics and provide regulatory annotations for large collections of curated and disease-associated gene sets.

Public Health Relevance

The purpose of this project is to develop powerful statistical methods to study cell-type specific gene regulation, and apply these to publicly available data from The Roadmap Epigenomics Project, The ENCODE Project and several publicly available gene expression compendia. We propose new methods for joint inference of regulatory elements across diverse cell types, deciphering lineage-specific regulatory sequence grammars encoded in these elements and learning integrative transcriptional regulation programs. Our models will serve as a resource to provide comprehensive context-specific regulatory annotations for disease-associated gene sets and co-expression modules.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Environmental Health Sciences (NIEHS)
Type: Research Project (R01)
Project #: 1R01ES025009-01
Application #: 8815902
Study Section: Special Emphasis Panel (ZRG1-IMST-R (51))
Program Officer: Chadwick, Lisa

Project Start: 2014-09-05
Project End: 2016-08-31
Budget Start: 2014-09-05
Budget End: 2015-08-31
Support Year: 1
Fiscal Year: 2014
Total Cost: $318,763
Indirect Cost: $118,763

Institution

Name: Stanford University
Department: Genetics
Type: Schools of Medicine
DUNS #: 009214214

City: Stanford
State: CA
Country: United States
Zip Code: 94305

Related projects


NIH 2016 R01 ES	Computational methods for modeling lineage-specific gene regulation Pritchard, Jonathan K. / Stanford University	$150,000
NIH 2015 R01 ES	Computational methods for modeling lineage-specific gene regulation Pritchard, Jonathan K. / Stanford University
NIH 2014 R01 ES	Computational methods for modeling lineage-specific gene regulation Pritchard, Jonathan K. / Stanford University	$318,763

Publications

Sparvoli, Daniela; Richardson, Elisabeth; Osakada, Hiroko et al. (2018) Remodeling the Specificity of an Endosomal CORVET Tether Underlies Formation of Regulated Secretory Vesicles in the Ciliate Tetrahymena thermophila. Curr Biol 28:697-710.e13

Koh, Pang Wei; Pierson, Emma; Kundaje, Anshul (2017) Denoising genome-wide histone ChIP-seq with convolutional neural networks. Bioinformatics 33:i225-i233

Lan, Xun; Pritchard, Jonathan K (2016) Coregulation of tandem duplicate genes slows evolution of subfunctionalization in mammals. Science 352:1009-13

Field, Yair; Boyle, Evan A; Telis, Natalie et al. (2016) Detection of human adaptation during the past 2000 years. Science 354:760-764

Kontur, Cassandra; Kumar, Santosh; Lan, Xun et al. (2016) Whole Genome Sequencing Identifies a Novel Factor Required for Secretory Granule Maturation in Tetrahymena thermophila. G3 (Bethesda) 6:2505-16

Grubert, Fabian; Zaugg, Judith B; Kasowski, Maya et al. (2015) Genetic Control of Chromatin States in Humans Involves Local and Distal Chromosomal Interactions. Cell 162:1051-65

Comments

Be the first to comment on Jonathan Pritchard's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: