We have been developing tools and resources that make it possible to analyze a large number of genes in various experimental conditions. In our earlier work, we 1) constructed cDNA libraries from early mouse embryos and stem cells and generated a large number of expressed sequence tags (ESTs), 2) developed a glass-slide microarray platform containing in situ-synthesized 60-mer oligonucleotide probes representing approximately 44,000 unique mouse transcripts, 3) produced the web-based ANOVA-FDR software to provide user-friendly microarray data analysis, and 4) developed an algorithm and a fully-automated computational pipeline for transcript assembly from expressed sequences aligned to the mouse genome. In addition, we recently developed a comprehensive database and web browser of the binding sites of transcription factors (TFs) and cis-regulatory modules (CRMs) on the mouse genome. These resources and tools are now applied to the systematic analysis of gene regulatory networks in mouse embryonic stem cells. In our pilot project, we have demonstrated that it is possible to analyze and identify downstream target genes by monitoring the global gene expression patterns of mouse ES cell lines;when a gene encoding a specific TF (Pou5f1 or Oct4 in this case) is manipulated so that the gene can be overexpressed or repressed. To extend our strategy further, we generated 137 ES cell lines thus far, in each of which one of a total of 137 different TFs can be overexpressed in a tetracycline-inducible manner. We have been characterizing these ES cell lines as follows: (i) subcellular localization of Flag-tagged transcription factors by immunohistochemistry;(ii) induction levels of the manipulated transcription factors by quantitative RT-PCR, (iii) DNA microarray-based expression profiling before and after the induction of transcription factors;(iv) western blotting, and (v) karyotyping. Together, these results indicate that we have generated reliable TF-manipulable ES cell lines. We carried out detailed analyses of the first 50 ES cell lines and found that among the 50 TFs, Cdx2 provoked the most extensive transcriptome perturbation in ES cells, followed by Esx1, Sox9, Tcf3, Klf4, and Gata3. ChIP-Seq revealed that CDX2 binds to promoters of up-regulated target genes. By contrast, genes down-regulated by CDX2 did not show CDX2 binding, but were enriched with binding sites for POU5F1, SOX2, and NANOG. Genes with binding sites for these core TFs were also down-regulated by the induction of at least 15 other TFs, suggesting a common initial step for ES cell differentiation mediated by interference with the binding of core TFs to their target genes. Further analyses of additional TF-manipulable mouse ES cell lines demonstrated that indeed overexpression of a single TF is sufficient to initiate the differentiation of mouse ES cells into specific cell lineages.
Showing the most recent 10 out of 22 publications