We propose to develop and apply a comprehensive set of computational and experimental methods in order to fully characterize the regulation of mammalian gene expression at the sequence level. At the core of our approach is an information- theoretic framework for sensitive and highly specific identification of DNA and RNA regulatory elements from large-scale gene expression data and genomic sequence information. We will develop and apply a non-alignment based approach based on network-level conservation in order to identify comprehensive catalogues of regulatory elements conserved between pairs of mammalian genomes. These high-confidence predictions will then be used in order to identify distal regulatory elements composed of clusters of transcription factor binding sites. A Bayesian network learning algorithm will be employed to learn the context-dependent and combinatorial rules by which the discovered elements function to affect gene expression-both within local promoters/3'UTRs and through distal regulatory modules such as enhancers and silencers. We propose a versatile approach based on microarray profiling of phage- display selections in order to rapidly and efficiently identify the protein trans factors that specifically interact with the hundreds of novel DNA and RNA regulatory elements we expect to identify. The proposed research will significantly advance the rate and scale at which regulatory networks are characterized-both in humans, but also across a range of other complex genomes of biomedical and industrial importance.

Public Health Relevance

The proposed research will yield tools that enable biologists to understand the regulatory code that orchestrates gene expression patterns in the human genome. The research is focused on aberrations of gene expression that accompany human cancers. As such, it promises to significantly advance our basic understanding of the cancer phenotype, with potentially important implications for therapy.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project (R01)
Project #
Application #
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Pazin, Michael J
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Columbia University (N.Y.)
Schools of Medicine
New York
United States
Zip Code
Alarcón, Claudio R; Goodarzi, Hani; Lee, Hyeseung et al. (2015) HNRNPA2B1 Is a Mediator of m(6)A-Dependent Nuclear RNA Processing Events. Cell 162:1299-308
Oikonomou, Panos; Goodarzi, Hani; Tavazoie, Saeed (2014) Systematic identification of regulatory elements in conserved 3' UTRs of human transcripts. Cell Rep 7:281-92
Goodarzi, Hani; Zhang, Steven; Buss, Colin G et al. (2014) Metastasis-suppressor transcript destabilization through TARBP2 binding of mRNA hairpins. Nature 513:256-60
Chiu, Isaac M; Morimoto, Emiko T A; Goodarzi, Hani et al. (2013) A neurodegeneration-specific gene-expression signature of acutely isolated microglia from an amyotrophic lateral sclerosis mouse model. Cell Rep 4:385-401
Goodarzi, Hani; Najafabadi, Hamed S; Oikonomou, Panos et al. (2012) Systematic discovery of structural elements governing stability of mammalian messenger RNAs. Nature 485:264-8
Birsoy, Kivanç; Berry, Ryan; Wang, Tim et al. (2011) Analysis of gene networks in white adipose tissue development reveals a role for ETS2 in adipogenesis. Development 138:4709-19
Lieber, Daniel S; Elemento, Olivier; Tavazoie, Saeed (2010) Large-scale discovery and characterization of protein regulatory motifs in eukaryotes. PLoS One 5:e14444
Goodarzi, Hani; Elemento, Olivier; Tavazoie, Saeed (2009) Revealing global regulatory perturbations across human cancers. Mol Cell 36:900-11
Freckleton, Gordon; Lippman, Soyeon I; Broach, James R et al. (2009) Microarray profiling of phage-display selections for rapid mapping of transcription factor-DNA interactions. PLoS Genet 5:e1000449
Legesse-Miller, Aster; Elemento, Olivier; Pfau, Sarah J et al. (2009) let-7 Overexpression leads to an increased fraction of cells in G2/M, direct down-regulation of Cdc34, and stabilization of Wee1 kinase in primary fibroblasts. J Biol Chem 284:6605-9

Showing the most recent 10 out of 13 publications