We propose to develop and apply a comprehensive set of computational and experimental methods in order to fully characterize the regulation of mammalian gene expression at the sequence level. At the core of our approach is an information- theoretic framework for sensitive and highly specific identification of DNA and RNA regulatory elements from large-scale gene expression data and genomic sequence information. We will develop and apply a non-alignment based approach based on network-level conservation in order to identify comprehensive catalogues of regulatory elements conserved between pairs of mammalian genomes. These high-confidence predictions will then be used in order to identify distal regulatory elements composed of clusters of transcription factor binding sites. A Bayesian network learning algorithm will be employed to learn the context-dependent and combinatorial rules by which the discovered elements function to affect gene expression-both within local promoters/3'UTRs and through distal regulatory modules such as enhancers and silencers. We propose a versatile approach based on microarray profiling of phage- display selections in order to rapidly and efficiently identify the protein trans factors that specifically interact with the hundreds of novel DNA and RNA regulatory elements we expect to identify. The proposed research will significantly advance the rate and scale at which regulatory networks are characterized-both in humans, but also across a range of other complex genomes of biomedical and industrial importance.

Public Health Relevance

The proposed research will yield tools that enable biologists to understand the regulatory code that orchestrates gene expression patterns in the human genome. The research is focused on aberrations of gene expression that accompany human cancers. As such, it promises to significantly advance our basic understanding of the cancer phenotype, with potentially important implications for therapy.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003219-06
Application #
7870465
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Good, Peter J
Project Start
2004-09-01
Project End
2012-05-31
Budget Start
2010-06-01
Budget End
2011-05-31
Support Year
6
Fiscal Year
2010
Total Cost
$390,025
Indirect Cost
Name
Princeton University
Department
Biochemistry
Type
Schools of Arts and Sciences
DUNS #
002484665
City
Princeton
State
NJ
Country
United States
Zip Code
08544
Alarcón, Claudio R; Goodarzi, Hani; Lee, Hyeseung et al. (2015) HNRNPA2B1 Is a Mediator of m(6)A-Dependent Nuclear RNA Processing Events. Cell 162:1299-308
Oikonomou, Panos; Goodarzi, Hani; Tavazoie, Saeed (2014) Systematic identification of regulatory elements in conserved 3' UTRs of human transcripts. Cell Rep 7:281-92
Goodarzi, Hani; Zhang, Steven; Buss, Colin G et al. (2014) Metastasis-suppressor transcript destabilization through TARBP2 binding of mRNA hairpins. Nature 513:256-60
Chiu, Isaac M; Morimoto, Emiko T A; Goodarzi, Hani et al. (2013) A neurodegeneration-specific gene-expression signature of acutely isolated microglia from an amyotrophic lateral sclerosis mouse model. Cell Rep 4:385-401
Goodarzi, Hani; Najafabadi, Hamed S; Oikonomou, Panos et al. (2012) Systematic discovery of structural elements governing stability of mammalian messenger RNAs. Nature 485:264-8
Birsoy, Kivanç; Berry, Ryan; Wang, Tim et al. (2011) Analysis of gene networks in white adipose tissue development reveals a role for ETS2 in adipogenesis. Development 138:4709-19
Lieber, Daniel S; Elemento, Olivier; Tavazoie, Saeed (2010) Large-scale discovery and characterization of protein regulatory motifs in eukaryotes. PLoS One 5:e14444
Goodarzi, Hani; Elemento, Olivier; Tavazoie, Saeed (2009) Revealing global regulatory perturbations across human cancers. Mol Cell 36:900-11
Freckleton, Gordon; Lippman, Soyeon I; Broach, James R et al. (2009) Microarray profiling of phage-display selections for rapid mapping of transcription factor-DNA interactions. PLoS Genet 5:e1000449
Legesse-Miller, Aster; Elemento, Olivier; Pfau, Sarah J et al. (2009) let-7 Overexpression leads to an increased fraction of cells in G2/M, direct down-regulation of Cdc34, and stabilization of Wee1 kinase in primary fibroblasts. J Biol Chem 284:6605-9

Showing the most recent 10 out of 13 publications