The aim of this Program Project is to discover the function of most genes in the Dictyostelium genome. In Project III we will develop computational techniques to infer gene function and reconstruct gene networks from high-throughput phenotyping, transcriptional profiling, and chromatin footprinfing data, collected in Projects 1 and 11. Our hypothesis is that the increased precision and completeness of these new data sets, made possible by Next Generation sequencing, will allow us to infer powerful predictive models. First, we will design and implement PIPA, a high-throughput sequencing data analysis pipeline. PIPA will be component based and will integrate emerging tools from the community (R, Galaxy, bowtie, top-hat, etc.). It will provide a unified, easy-to-use web-based access to the Program's experimental data. Next, we will devise methods that will query PIPA and consider transcription, competitive growth and chromatin binding information to infer gene function. Integrative data mining to construct consensus gene network models will fuse these emerging hypotheses while considering available external data from other organisms. We will use consensus gene networks as scaffolds upon which we can predict gene function, propose additional experiments, and add layers of informafion from other experiments. We will also use the gene networks as background knowledge for experiment prioritization, the proposal of new mutant-based screens, and the development of new phenotype prediction models. Finally, we propose to implement the new methods within modern serverbased software architecture with visualization-rich interactive interfaces. The most significant aspect of this part of the project is the design of an infrastructure and interfaces that will make the entire planned data analytics transparent and operable by biologists with no computer science background. Our software will be freely available to the research community and well integrated with dictyBase, a primary Dictyostelium community resource.

Public Health Relevance

The lack of appropriate analytical methods reduces he utility of high-dimensional, genome-scale biological data. Using diverse, rich, high-quality phenotypic and transcriptional profiling data sets we will devise new computational methods to accurately infer gene function, helping us to better understand biological processes and equipping other researchers with improved means to analyze their own biomedical data.

Agency
National Institute of Health (NIH)
Institute
Eunice Kennedy Shriver National Institute of Child Health & Human Development (NICHD)
Type
Research Program Projects (P01)
Project #
4P01HD039691-15
Application #
9107887
Study Section
Special Emphasis Panel (ZHD1)
Project Start
Project End
Budget Start
2016-05-01
Budget End
2017-04-30
Support Year
15
Fiscal Year
2016
Total Cost
Indirect Cost
Name
Baylor College of Medicine
Department
Type
DUNS #
051113330
City
Houston
State
TX
Country
United States
Zip Code
77030
Dinh, Christopher; Farinholt, Timothy; Hirose, Shigenori et al. (2018) Lectins modulate the microbiota of social amoebae. Science 361:402-406
Hirose, Shigenori; Chen, Gong; Kuspa, Adam et al. (2017) The polymorphic proteins TgrB1 and TgrC1 function as a ligand-receptor pair in Dictyostelium allorecognition. J Cell Sci 130:4002-4012
Swatson, William S; Katoh-Kurasawa, Mariko; Shaulsky, Gad et al. (2017) Curcumin affects gene expression and reactive oxygen species via a PKA dependent mechanism in Dictyostelium discoideum. PLoS One 12:e0187562
Stajdohar, Miha; Rosengarten, Rafael D; Kokosar, Janez et al. (2017) dictyExpress: a web-based platform for sequence data management and analytics in Dictyostelium and beyond. BMC Bioinformatics 18:291
Rosengarten, Rafael D; Santhanam, Balaji; Kokosar, Janez et al. (2017) The Long Noncoding RNA Transcriptome of Dictyostelium discoideum Development. G3 (Bethesda) 7:387-398
Zhang, Xuezhi; Zhuchenko, Olga; Kuspa, Adam et al. (2016) Social amoebae trap and kill bacteria by casting DNA nets. Nat Commun 7:10938
Zitnik, Marinka; Zupan, Blaz (2016) COLLECTIVE PAIRWISE CLASSIFICATION FOR MULTI-WAY ANALYSIS OF DISEASE AND DRUG DATA. Pac Symp Biocomput 21:81-92
Katoh-Kurasawa, Mariko; Santhanam, Balaji; Shaulsky, Gad (2016) The GATA transcription factor gene gtaG is required for terminal differentiation in Dictyostelium. J Cell Sci :
Zitnik, Marinka; Zupan, Blaz (2016) Jumping across biomedical contexts using compressive data fusion. Bioinformatics 32:i90-i100
Chen, Xinlu; Köllner, Tobias G; Jia, Qidong et al. (2016) Terpene synthase genes in eukaryotes beyond plants and fungi: Occurrence in social amoebae. Proc Natl Acad Sci U S A 113:12132-12137

Showing the most recent 10 out of 64 publications