We have been developing tools and resources that make it possible to analyze a large number of genes under various experimental conditions. In our earlier work, we 1) constructed cDNA libraries from early mouse embryos and stem cells and generated a large number of expressed sequence tags (ESTs), 2) developed a glass-slide microarray platform containing in situ-synthesized 60-mer oligonucleotide probes representing approximately 44,000 unique mouse transcripts, 3) produced the web-based ANOVA-FDR software to provide user-friendly microarray data analysis, and 4) developed an algorithm and a fully automated computational pipeline for transcript assembly from expressed sequences aligned to the mouse genome. In addition, we recently developed a comprehensive database and web browser of the binding sites of transcription factors (TFs) and cis-regulatory modules (CRMs) in the mouse genome. These resources and tools are now being applied to the systematic analysis of gene regulatory networks in mouse embryonic stem (ES) cells. In our pilot project, we demonstrated that downstream target genes can be identified by monitoring the global gene expression patterns of mouse ES cell lines in which a gene encoding a specific TF (Pou5f1, also known as Oct4, in this case) is manipulated so that it can be overexpressed or repressed. To extend this strategy, we have generated 137 ES cell lines thus far, in each of which one of 137 different TFs can be overexpressed in a tetracycline-inducible manner. We have been characterizing these ES cell lines by (i) subcellular localization of Flag-tagged TFs by immunohistochemistry; (ii) induction levels of the manipulated TFs by quantitative RT-PCR; (iii) DNA microarray-based expression profiling before and after TF induction; (iv) western blotting; and (v) karyotyping.
Together, these results indicate that we have generated reliable TF-manipulable ES cell lines. We carried out detailed analyses of the first 50 ES cell lines and found that, among the 50 TFs, Cdx2 provoked the most extensive transcriptome perturbation in ES cells, followed by Esx1, Sox9, Tcf3, Klf4, and Gata3. ChIP-Seq revealed that CDX2 binds to the promoters of up-regulated target genes. By contrast, genes down-regulated by CDX2 did not show CDX2 binding but were enriched in binding sites for POU5F1, SOX2, and NANOG. Genes with binding sites for these core TFs were also down-regulated by the induction of at least 15 other TFs, suggesting a common initial step of ES cell differentiation mediated by interference with the binding of core TFs to their target genes. Further analyses of additional TF-manipulable mouse ES cell lines demonstrated that overexpression of a single TF is indeed sufficient to initiate the differentiation of mouse ES cells into specific cell lineages. Loss-of-function studies are generally thought to be more useful than gain-of-function studies for delineating a gene network and revealing the function of a gene. Therefore, in addition to overexpressing TFs (a gain-of-function approach), we systematically repressed each of 100 TFs with shRNA and carried out global gene expression profiling in mouse ES cells. Unexpectedly, the repression of only a handful of TFs significantly affected the transcriptome, which changed along two trajectories: one upon repression of either Pou5f1 or Sox2, and the other upon repression of any of Esrrb, Sall4, Nanog, or Tcfap4. The data suggest that the trajectories of gene expression change are already preconfigured by the gene regulatory network and roughly correspond to extraembryonic and embryonic fates of cell differentiation, respectively.
These data also indicate the robustness of the pluripotency gene network, as the transient repression of most TFs did not alter the transcriptomes.

National Institutes of Health (NIH)
National Institute on Aging (NIA)
Investigator-Initiated Intramural Research Projects (ZIA)
Li, Hoi Ming; Hiroi, Toyoko; Zhang, Yongqing et al. (2016) TCRβ repertoire of CD4+ and CD8+ T cells is distinct in richness, distribution, and CDR3 amino acid composition. J Leukoc Biol 99:505-13
Yamamizu, Kohei; Sharov, Alexei A; Piao, Yulan et al. (2016) Generation and gene expression profiling of 48 transcription-factor-inducible mouse embryonic stem cell lines. Sci Rep 6:25667
Sharov, Alexei A (2016) Evolution of natural agents: preservation, advance, and emergence of functional information. Biosemiotics 9:103-129
Sharova, Lioudmila V; Sharov, Alexei A; Piao, Yulan et al. (2016) Emergence of undifferentiated colonies from mouse embryonic stem cells undergoing differentiation by retinoic acid treatment. In Vitro Cell Dev Biol Anim 52:616-24
Teratani-Ota, Yusuke; Yamamizu, Kohei; Piao, Yulan et al. (2016) Induction of specific neuron types by overexpression of single transcription factors. In Vitro Cell Dev Biol Anim :
Sharov, Alexei A (2016) Coenzyme world model of the origin of life. Biosystems 144:8-17
Akiyama, Tomohiko; Xin, Li; Oda, Mayumi et al. (2015) Transient bursts of Zscan4 expression are accompanied by the rapid derepression of heterochromatin in mouse embryonic stem cells. DNA Res 22:307-18
Sherman-Baust, Cheryl A; Kuhn, Elisabetta; Valle, Blanca L et al. (2014) A genetically engineered ovarian cancer mouse model based on fallopian tube transformation mimics human high-grade serous carcinoma development. J Pathol 233:228-37
Yamamizu, Kohei; Schlessinger, David; Ko, Minoru S H (2014) SOX9 accelerates ESC differentiation to three germ layer lineages by repressing SOX2 expression through P21 (WAF1/CIP1). Development 141:4254-66
Sharov, Alexei A (2014) Evolutionary constraints or opportunities? Biosystems 123:9-18
