Kruppel associated box zinc finger proteins (KRAB-ZFPs) have emerged as candidates that recognize ERVs. KRAB-ZFPs are rapidly evolving transcriptional repressors that emerged in in a common ancestor of coelacanth, birds, and tetrapods. They make up the largest family of transcription factors in mammals (estimated to be several hundred in mice and humans). Each species has its own unique repertoire of KRAB-ZFPs, with a small number shared with closely related species and a larger fraction specific to each species. Despite their abundance, little is known about their physiological functions. KRAB-ZFPs consist of an N-terminal KRAB domain that binds the co-repressor KAP1 and a variable number of C-terminal C2H2 zinc finger domains that mediate sequence-specific DNA binding. KAP1 directly interacts with the KRAB domain, which recruits the histone methyltransferase (HMT) SETDB1 and heterochromatin protein 1 (HP1) to initiate heterochromatic silencing. Several lines of evidence point to a role for the KRAB-ZFP family in ERV silencing. First, the number of C2H2 zinc finger genes in mammals correlates with the number of ERVs. Second, the KRAB-ZFP protein ZFP809 was isolated based on its ability to bind to the primer binding site for proline tRNA (PBSPro) of murine leukemia virus (MuLV). Third, deletion of the KRAB-ZFP co-repressors Trim28 or Setdb1 leads to activation of many ERVs. Thus we have begun a systematic interrogation of KRAB-ZFP function as a potential adaptive repression system against ERVs. We focused on ZFP809 as a likely ERV-suppressing KRAB-ZFP since it was originally identified as part of a repression complex that recognizes infectious MuLV via direct binding to the 18 nt Primer Binding Site for Proline (PBSpro) sequence. We hypothesized that ZFP809 might function in vivo to repress other ERVs that utilized the PBSpro. Using ChIP-seq of epitope tagged ZFP809 in ESCs and embryonic carcinoma (EC) cells, we determined that ZFP809 bound to several sub-classes of ERV elements via the PBSpro. We generated Zfp809 knockout mice to determine whether ZFP809 was required for VL30pro silencing. We found that Zfp809 knockout tissues displayed high levels of VL30pro elements and that the targeted elements display an epigenetic shift from repressive epigenetic marks (H3K9me3 and CpG methylation) to active marks (H3K9Ac and CpG hypo-methylation). ZFP809-mediated repression extended to a handful of genes that contained adjacent VL30pro integrations. Furthermore, using a combination of conditional alleles and rescue experiments, we determined that ZFP809 activity was required in development to initiate silencing, but not in somatic cells to maintain silencing. These studies provided the first demonstration for the in vivo requirement of a KRAB-ZFP in the recognition and silencing of ERVs. As a follow-up to our studies on ZFP809, we have begun a systematic analysis of KRAB-ZFPs using a medium throughput ChIP-seq screen and functional genomics of KRAB-ZFP clusters and individual KRAB-ZFP genes. Our ChIP-seq data demonstrates that the majority of recently evolved KRAB-ZFP genes interact with and repress distinct and partially overlapping ERV targets. This hypotheses is strongly supported by the distinct ERV reactivation phenotypes we observed in mouse ESC lines lacking one of five of the largest KRAB-ZFP gene clusters. Furthermore our preliminary evidence suggests that KRAB-ZFP cluster KO mice are viable, but have elevated rates of somatic retrotransposition of specific retrotransposon families, providing the first direct genetic link between KRAB-ZFP gene diversification and retrotranspsoson mobility. Although our data shows that many KRAB-ZFPs repress ERVs, we also found that more ancient KRAB-ZFPs that emerged in a human/mouse common ancestor do not bind and repress ERVs. One of these KRAB-ZFPs, ZFP568 plays an important role in silencing a key developmental gene that may have played a critical role in the onset of viviparity in mammals. Using ChIP-seq and biochemical assays, we determined that ZFP568 is a direct repressor of a placental specific isoform of the Igf2 gene called Igf2-P0. Insulin-like growth factor 2 (Igf2) is the major fetal growth hormone in mammals. We demonstrated that loss of Zfp568, which causes gastrulation failure, or mutation of the ZFP568 binding site at the Igf2-P0 promoter causes inappropriate Igf2-P0 activation. We also showed that this lethality could be rescued by deletion of Igf2. These data highlight the exquisite selectivity by which members of the KRAB-ZFP family repress their targets and identifies an additional layer of transcriptional control of a key growth factor regulating fetal and placental development. In an exciting follow-up to these studies, we determined that ZFP568 is highly conserved and under purifying selection in eutheria with the exception of human. Human ZNF568 allele variants have lost the ability to bind and repress Igf2-P0, which may have been driven by the loss of the Igf2-p0 transcript in human placenta. We solve the crystal structure of mouse ZFP568 zinc fingers bound to the Igf2-P0 binding site that reveals several non-canonical ZF-DNA contacts, highlighting the ability of individual ZFs to change confirmation depending upon ZF context and DNA structure. These structures also explain how mutations in human ZNF568 alleles disrupt Igf2-P0 interactions, which contain either deleted ZFs or mutations to key ZF-DNA contact residues. In sum our studies provide important insights into the evolutionary and structural dynamics of ZF-DNA interactions that play a key role in regulating mammalian development and evolution. In the past year, we have also begun to explore the ancestral gene of the KRAB-ZFP family, PRDM9. PRDM9 contains a DNA-bind zinc finger array and a KRAB domain, like other KZFPs, but it is unique in several respects. First, PRDM9 does not interact with KAP1. Second, PRDM9 also contains a histone methyltransferase domain that methylates histone H3 on both K4 and K36 to generate the K4me3 and K36me3 dual mark. Third, PRDM9 is exclusively expressed at a brief window of time during meiotic prophase, where its activity directs the programmed DNA double strand break machinery to initiate meiotic recombination. We began an in silico search for factors that may function downstream of PRDM9. We identified a factor, dubbed PACER that binds to the double marked H3K4me3 and H3K36me3 mark in vitro and at hotpots in vivo. We show that loss of Pacer in mice leads to complete male sterility, meiotic arrest and failed synapsis. Strikingly we determine that the positioning of DSBs are not altered in Pacer KOs, demonstrating that PACER is required not for the initiation but for the repair of PRDM9-induced DSBs. This is the first description of a factor necessary specifically for PRDM9 induced meiotic recombination. Finally, we demonstrate that Pacer co-evolved with Prdm9, with simultaneous appearance in jawless vertebrates and losses in birds and reptiles, suggesting the pair of genes function as obligate cofactors for hotspot determination.

Project Start
Project End
Budget Start
Budget End
Support Year
7
Fiscal Year
2019
Total Cost
Indirect Cost
City
State
Country
Zip Code
Honson, Drew D; Macfarlan, Todd S (2018) A lncRNA-like Role for LINE1s in Development. Dev Cell 46:132-134
Patel, Anamika; Yang, Peng; Tinkham, Matthew et al. (2018) DNA Conformation Induces Adaptable Binding by Tandem Zinc Finger Proteins. Cell 173:221-233.e12
Choi, Yong Jin; Lin, Chao-Po; Risso, Davide et al. (2017) Deficiency of microRNA miR-34a expands cell fate potential in pluripotent stem cells. Science 355:
Yang, Peng; Wang, Yixuan; Hoang, Don et al. (2017) A placental growth factor is silenced in mouse embryos by the zinc finger protein ZFP568. Science 356:757-759
Wolf, Gernot; Rebollo, Rita; Karimi, Mohammad M et al. (2017) On the role of H3.3 in retroviral silencing. Nature 548:E1-E3
Wasson, Jadiel A; Simon, Ashley K; Myrick, Dexter A et al. (2016) Maternally provided LSD1/KDM1A enables the maternal-to-zygotic transition and prevents defects that manifest postnatally. Elife 5:
Thompson, Peter J; Macfarlan, Todd S; Lorincz, Matthew C (2016) Long Terminal Repeats: From Parasitic Elements to Building Blocks of the Transcriptional Regulatory Repertoire. Mol Cell 62:766-76
Wang, Chaochen; Lee, Ji-Eun; Lai, Binbin et al. (2016) Enhancer priming by H3K4 methyltransferase MLL4 controls cell fate transition. Proc Natl Acad Sci U S A 113:11871-11876
Wang, Jianxun; Telese, Francesca; Tan, Yuliang et al. (2015) LSD1n is an H4K20 demethylase regulating memory formation via transcriptional elongation control. Nat Neurosci 18:1256-64
Yang, Peng; Wu, Warren; Macfarlan, Todd S (2015) Maternal histone variants and their chaperones promote paternal genome activation and boost somatic cell reprogramming. Bioessays 37:52-9

Showing the most recent 10 out of 20 publications