Increasingly persuasive evidence suggests genomic variants driving derived features in humans and among primates are enriched in regulatory elements, but the vast majority of these evolutionarily relevant variants have yet to be discovered or characterized. This is unfortunate, as among the approximately 35 million single nucleotide substitutions (SNPs), 5 million insertions or deletions (indels), and 90 megabases of structural variants where the human and chimpanzee genomes differ are countless variants associated with development, function, or disease. Identifying evolutionarily relevant genetic variants, as well as those implicated in disease or function, can be guided by the analysis of species differences in intermediate molecular phenotypes (e.g., transcriptomic and epigenomic signatures), which are most likely the primary effects determined by genomic variation. In this proposal, we propose to perform primate comparative functional genomics to uncover genetic variants explaining lineage-specific phenotypes affecting the human and non-human primate (NHP) brain, an organ exhibiting pronounced molecular and functional differences between species. To do so, in our first aim we will develop a taxonomy of gene expression and open chromatin across primates, applying single nucleus RNA-seq and single nucleus ATAC-seq to study the mid- fetal and neonatal (late fetal and early infancy) development of the post-mortem human and NHP brain, as well as brain organoid co-cultures containing cells differentiated from multiple primate stem cells and fibroblasts and lymphoblastoid cell lines. In our second aim, we will complement this atlas of species differences in gene expression and open chromatin by cataloguing SNPs, indels, and large, complex structural variants in multiple primate species. This will allow us to differentiate between lineage-specific (i.e., human versus chimpanzee and macaque) and Hominidae-specific (i.e., human and chimpanzee versus macaque) genomic variants. Finally, in our third aim we will integrate and functionally validate, using the Massively Parallel Reporter Assay, CRISPR/Cas9 genome editing, human induced pluripotent stem cells, and mouse models of neural development, key regulatory elements and de novo genes identified through these experiments. Through these aims, we will identify and functionally validate genomic variants and patterns of gene expression and open chromatin potentially driving derived phenotypes in the human and non-human brain and consequently plausibly associated with human cognition, social behaviour, and neuropsychiatric disease.
Fixed genomic variants in specific primate lineages must at some level explain phenotypic differences observed between those lineages. While these phenotypes might include the three-fold increase in the size of the human brain compared to chimpanzee or the higher-level cognitive abilities (such as abstract thinking, syntactical-grammatical language, or episodic memory) of the human, at the most basic level these phenotypes might concern species differences in gene expression and epigenetic regulation. The proposed research will assess species differences in gene expression and epigenetic regulation and associate these differences with inter-species genomic variants, providing fundamental knowledge on the evolution, development, and function of the human and non-human primate brain.