To realize the promise of the human genome project, we need not only the parts list of all the genes, but also a comprehensive understanding of how they function together. Along with genes, our genome contains all the signals necessary for controlling gene expression in response to environmental and developmental stimuli. These regulatory processes are governed by short sequence motifs, responsible for modulating gene usage at every level. Despite their prevalence, regulatory motifs have been particularly challenging to identify, due to their short length and the varying distances at which they can act. Given their extraordinary importance, their systematic understanding still remains one of the major challenges of modern biology. In the proposed work, we use comparative genomics of multiple mammals to systematically identify and characterize regulatory motifs in the human genome based on their evolutionary conservation. We have pioneered a new powerful approach for de novo motif discovery by using genome-wide conservation, and successfully applied it in four yeast genomes, twelve fly genomes, and human promoters and 3'-UTRs. Here we expand this methodology to undertake motif discovery across the entire human genome: (1) we develop methods that use dozens of mammalian species for motif discovery and characterization;(2) we identify significant motif combinations and grammars and reveal their functional roles;and (3) we discover functional regions of motif clustering and study motif role in specifying enhancer function. The proposed work is timely, given that NHGRI's sequencing efforts now encompass more than 30 mammalian genomes, specifically for understanding the human. Moreover, large-scale systematic experimentation is providing the functional information necessary to inform and validate our findings. By revealing the underlying sequence patterns that govern gene usage, we complement these ongoing efforts and provide access to the concrete building blocks of human gene regulation. This will enable researchers world-wide to link new genes in pathways by their co-regulation, elucidate the role of non- coding SNPs in regulatory diseases, and lead to new tests and therapeutics for modern medicine. A global map of regulatory motifs constitutes a necessary knowledge infrastructure towards a comprehensive understanding of regulation, development, and disease.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG004037-08
Application #
8731665
Study Section
Special Emphasis Panel (ZRG1-GGG-D (90))
Program Officer
Pazin, Michael J
Project Start
2007-09-28
Project End
2015-10-31
Budget Start
2014-09-01
Budget End
2015-08-31
Support Year
8
Fiscal Year
2014
Total Cost
$406,271
Indirect Cost
$163,721
Name
Massachusetts Institute of Technology
Department
None
Type
Organized Research Units
DUNS #
001425594
City
Cambridge
State
MA
Country
United States
Zip Code
02142
Loughran, Gary; Chou, Ming-Yuan; Ivanov, Ivaylo P et al. (2014) Evidence of efficient stop codon readthrough in four mammalian genes. Nucleic Acids Res 42:8928-38
Lee, Mark N; Ye, Chun; Villani, Alexandra-Chloé et al. (2014) Common genetic variants modulate pathogen-sensing responses in human dendritic cells. Science 343:1246980
Slattery, Matthew; Ma, Lijia; Spokony, Rebecca F et al. (2014) Diverse patterns of genomic targeting by transcriptional regulators in Drosophila melanogaster. Genome Res 24:1224-35
Washietl, Stefan; Kellis, Manolis; Garber, Manuel (2014) Evolutionary dynamics and tissue specificity of human long noncoding RNAs in six mammals. Genome Res 24:616-28
Kheradpour, Pouya; Kellis, Manolis (2014) Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments. Nucleic Acids Res 42:2976-87
Feizi, Soheil; Marbach, Daniel; Médard, Muriel et al. (2013) Network deconvolution as a general method to distinguish direct dependencies in networks. Nat Biotechnol 31:726-33
Meuleman, Wouter; Peric-Hupkes, Daan; Kind, Jop et al. (2013) Constitutive nuclear lamina-genome interactions are highly conserved and associated with A/T-rich sequence. Genome Res 23:270-80
Kheradpour, Pouya; Ernst, Jason; Melnikov, Alexandre et al. (2013) Systematic dissection of regulatory motifs in 2000 predicted human enhancers using a massively parallel reporter assay. Genome Res 23:800-11
Ernst, Jason; Kellis, Manolis (2013) Interplay between chromatin state, regulator binding, and regulatory motifs in six human cell types. Genome Res 23:1142-54
Ward, Lucas D; Kellis, Manolis (2012) HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants. Nucleic Acids Res 40:D930-4

Showing the most recent 10 out of 38 publications