Project Description Recent investments in data collection have produced rich catalogs of information about genomic function, human variation, and mammalian evolution, but improved computational tools are needed to integrate and interpret these catalogs, in order to permit the acquisition of new biological knowledge and advances in medicine. Here we outline an ambitious project to develop computational methods that will integrate publicly available data catalogs to provide deep new insights about the evolution and function of sequences in the human genome. Our proposal focuses in particular on noncoding sequences, which remain the most sparsely annotated and poorly understood regions of the genome. The proposal addresses three fundamental and closely related problems: (1) inference of human demography, to provide improved """"""""null models"""""""" for statistical genetics and reveal local signatures of gene flow, natural selection, and other phenomena;(2) detection of natural selection on interspersed noncoding sequences such as transcription factor binding sites, to provide information about their function and the evolutionary processes that have shaped them;and (3) genome-wide prediction of """"""""functional potential"""""""" based on integrated data sets, to identify new functional elements and prioritize candidate disease loci for experimental follow-up. Our proposal includes innovative statistical modeling, new algorithms for inference, the development of freely available software tools and browser resources, and detailed analyses of the latest genomic data sets. To our knowledge this will be the most comprehensive effort yet undertaken to integrate comparative, population, and functional genomic data in addressing fundamental questions about the function and evolution of sequences in the human genome. Our software and the results of our data analysis will be freely available to the research community.

Public Health Relevance

We propose to develop new computational and statistical methods, and new publicly avail- able resources, to integrate catalogs of information about genomic function, human variation, and mammalian evolution. This work will help to convert the rich data that has been collected in recent years into knowledge about the biology of genomes and advances in medicine.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM102192-01A1
Application #
8458272
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Krasnewich, Donna M
Project Start
2013-01-01
Project End
2017-12-31
Budget Start
2013-01-01
Budget End
2013-12-31
Support Year
1
Fiscal Year
2013
Total Cost
$348,647
Indirect Cost
$121,377
Name
Cornell University
Department
Biostatistics & Other Math Sci
Type
Schools of Earth Sciences/Natur
DUNS #
872612445
City
Ithaca
State
NY
Country
United States
Zip Code
14850
Kuhlwilm, Martin; Gronau, Ilan; Hubisz, Melissa J et al. (2016) Ancient gene flow from early modern humans into Eastern Neanderthals. Nature 530:429-33
Campagna, Leonardo; Gronau, Ilan; Silveira, Luís Fábio et al. (2015) Distinguishing noise from signal in patterns of genomic divergence in a highly polymorphic avian radiation. Mol Ecol 24:4238-51
Grenier, Jennifer K; Arguello, J Roman; Moreira, Margarida Cardoso et al. (2015) Global diversity lines - a five-continent reference panel of sequenced Drosophila melanogaster strains. G3 (Bethesda) 5:593-603
Gulko, Brad; Hubisz, Melissa J; Gronau, Ilan et al. (2015) A method for calculating probabilities of fitness consequences for point mutations across the human genome. Nat Genet 47:276-83
Siepel, Adam; Arbiza, Leonardo (2014) Cis-regulatory elements and human evolution. Curr Opin Genet Dev 29:81-9
Mohammed, Jaaved; Siepel, Adam; Lai, Eric C (2014) Diverse modes of evolutionary emergence and flux of conserved microRNA clusters. RNA 20:1850-63
Freedman, Adam H; Gronau, Ilan; Schweizer, Rena M et al. (2014) Genome sequencing highlights the dynamic early history of dogs. PLoS Genet 10:e1004016
Rasmussen, Matthew D; Hubisz, Melissa J; Gronau, Ilan et al. (2014) Genome-wide inference of ancestral recombination graphs. PLoS Genet 10:e1004342
Arbiza, Leonardo; Gottipati, Srikanth; Siepel, Adam et al. (2014) Contrasting X-linked and autosomal diversity across 14 human populations. Am J Hum Genet 94:827-44
Mohammed, Jaaved; Bortolamiol-Becet, Diane; Flynt, Alex S et al. (2014) Adaptive evolution of testis-specific, recently evolved, clustered miRNAs in Drosophila. RNA 20:1195-209

Showing the most recent 10 out of 15 publications