The aim of this Program Project is to discover the function of most genes in the Dictyostelium genome. In Project III we will develop computational techniques to infer gene function and reconstruct gene networks from high-throughput phenotyping, transcriptional profiling, and chromatin footprinfing data, collected in Projects 1 and 11. Our hypothesis is that the increased precision and completeness of these new data sets, made possible by Next Generation sequencing, will allow us to infer powerful predictive models. First, we will design and implement PIPA, a high-throughput sequencing data analysis pipeline. PIPA will be component based and will integrate emerging tools from the community (R, Galaxy, bowtie, top-hat, etc.). It will provide a unified, easy-to-use web-based access to the Program's experimental data. Next, we will devise methods that will query PIPA and consider transcription, competitive growth and chromatin binding information to infer gene function. Integrative data mining to construct consensus gene network models will fuse these emerging hypotheses while considering available external data from other organisms. We will use consensus gene networks as scaffolds upon which we can predict gene function, propose additional experiments, and add layers of informafion from other experiments. We will also use the gene networks as background knowledge for experiment prioritization, the proposal of new mutant-based screens, and the development of new phenotype prediction models. Finally, we propose to implement the new methods within modern serverbased software architecture with visualization-rich interactive interfaces. The most significant aspect of this part of the project is the design of an infrastructure and interfaces that will make the entire planned data analytics transparent and operable by biologists with no computer science background. Our software will be freely available to the research community and well integrated with dictyBase, a primary Dictyostelium community resource.

Public Health Relevance

The lack of appropriate analytical methods reduces he utility of high-dimensional, genome-scale biological data. Using diverse, rich, high-quality phenotypic and transcriptional profiling data sets we will devise new computational methods to accurately infer gene function, helping us to better understand biological processes and equipping other researchers with improved means to analyze their own biomedical data.

National Institute of Health (NIH)
Eunice Kennedy Shriver National Institute of Child Health & Human Development (NICHD)
Research Program Projects (P01)
Project #
Application #
Study Section
Special Emphasis Panel (ZHD1-DSR-N)
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Baylor College of Medicine
United States
Zip Code
Cai, Huaqing; Katoh-Kurasawa, Mariko; Muramoto, Tetsuya et al. (2014) Nucleocytoplasmic shuttling of a GATA transcription factor functions as a development timer. Science 343:1249531
Zitnik, Marinka; Zupan, Bla┼ż (2014) Gene network inference by probabilistic scoring of relationships from a factorized model of interactions. Bioinformatics 30:i246-i254
Zitnik, Marinka; Zupan, Blaz (2014) Matrix factorization-based data fusion for gene function prediction in baker's yeast and slime mold. Pac Symp Biocomput :400-11
Weirauch, Matthew T; Yang, Ally; Albu, Mihai et al. (2014) Determination and inference of eukaryotic transcription factor sequence specificity. Cell 158:1431-43
Miranda, Edward Roshan; Rot, Gregor; Toplak, Marko et al. (2013) Transcriptional profiling of Dictyostelium with RNA sequencing. Methods Mol Biol 983:139-71
Miranda, Edward Roshan; Zhuchenko, Olga; Toplak, Marko et al. (2013) ABC transporters in Dictyostelium discoideum development. PLoS One 8:e70040
Nasser, Waleed; Santhanam, Balaji; Miranda, Edward Roshan et al. (2013) Bacterial discrimination by dictyostelid amoebae reveals the complexity of ancient interspecies interactions. Curr Biol 23:862-72
Huang, Eryong; Talukder, Shaheynoor; Hughes, Timothy R et al. (2011) BzpF is a CREB-like transcription factor that regulates spore maturation and stability in Dictyostelium. Dev Biol 358:137-46
Sucgang, Richard; Kuo, Alan; Tian, Xiangjun et al. (2011) Comparative genomics of the social amoebae Dictyostelium discoideum and Dictyostelium purpureum. Genome Biol 12:R20
Loomis, William F; Shaulsky, Gad (2011) Developmental changes in transcriptional profiles. Dev Growth Differ 53:567-75

Showing the most recent 10 out of 39 publications