This proposal aims to bring the methods of mathematics and statistics to bear on understanding developmental gene control in the experimental system of sporulation in the bacterium Bacillus subtilis. This is an ideal system for such an interdisciplinary approach because sporulation is a relatively primitive developmental system and is highly accessible to the tools of genetics, biochemistry and cytology. The specific goals are to understand how three key DNA-binding proteins (master regulators) in the sporulation pathways recognize their targets, and to model the circuitry that governs the initiation of the program of sporulation gene expression. Biochemical and molecular genetic experiments will be designed and conducted to probe DNA sequence features responsible for protein binding so as to feed to the development of computational strategies. In turn computational predictions will be tested experimentally to provide feedbacks to improve computational models. Models will also be developed to understand how genes respond differently to the concentration of a key master regulator SpoOA, and to explain how and why the slow accumulation of SpoOA plays a key role in the sporulation initiation. Sporulation has the combined virtues of representing a significant biological problem in a system that is well suited to quantitative approaches. Hence, the progress made here in devising quantitative methods to describe the interactions among the molecular constituents of the sporulation system should serve as a model for computational approaches to more complicated biological systems. A distinctive feature of this proposal is that the theoretical work will be carried out hand-in-hand with experimentation so that quantitative modeling can be informed by rapid empirical feedback and, conversely, mathematical models can guide the design of experiments and data collection, and can also suggest the types of data that are most helpful for certain analyses and most complementary to existing data. The research will contribute to the larger field of systems biology, which attempts to describe quantitatively """"""""the behavior of complex biological organization and processes in terms of the molecular constituents"""""""" (Mark Kirschner). The proposed research is relevant to human health because disease typically involves perturbations to the complex web of interactions among cellular regulatory components and will be applicable to devising quantitative approaches to understanding biological organization in normal and abnormal cellular states. A thorough understanding of B. subtilis sporulation can also provide knowledge for combating bio-terrorism and diseases that are caused by similar bacteria, such as Bacillus anthracis.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM078990-04
Application #
7650273
Study Section
Special Emphasis Panel (ZGM1-CBCB-5 (BM))
Program Officer
Brazhnik, Paul
Project Start
2006-07-01
Project End
2012-06-30
Budget Start
2009-07-01
Budget End
2012-06-30
Support Year
4
Fiscal Year
2009
Total Cost
$318,488
Indirect Cost
Name
Harvard University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
082359691
City
Cambridge
State
MA
Country
United States
Zip Code
02138
Sun, Hanchang; Yuan, Yuan; Wu, Yibo et al. (2010) Tmod: toolbox of motif discovery. Bioinformatics 26:405-7
Zhang, Wei; Zhu, Jun; Schadt, Eric E et al. (2010) A Bayesian partition method for detecting pleiotropic and epistatic eQTL modules. PLoS Comput Biol 6:e1000642
Chowdhary, Rajesh; Bajic, Vladimir B; Dong, Difeng et al. (2010) Genome-wide analysis of regions similar to promoters of histone genes. BMC Syst Biol 4 Suppl 1:S4
Chastanet, Arnaud; Vitkup, Dennis; Yuan, Guo-Cheng et al. (2010) Broadly heterogeneous activation of the master regulator for sporulation in Bacillus subtilis. Proc Natl Acad Sci U S A 107:8486-91
Chowdhary, Rajesh; Zhang, Jinfeng; Liu, Jun S (2009) Bayesian inference of protein-protein interactions from biological literature. Bioinformatics 25:1536-42
Zou, Lihua; Sriswasdi, Sira; Ross, Brian et al. (2008) Systematic analysis of pleiotropy in C. elegans early embryogenesis. PLoS Comput Biol 4:e1000003
Yuan, Guo-Cheng; Liu, Jun S (2008) Genomic sequence is highly predictive of local nucleosome depletion. PLoS Comput Biol 4:e13
Gu, Jiajun; Liu, Jun S (2008) Bayesian biclustering of gene expression data. BMC Genomics 9 Suppl 1:S4
Fan, Xiaodan; Zhu, Jun; Schadt, Eric E et al. (2007) Statistical power of phylo-HMM for evolutionarily conserved element detection. BMC Bioinformatics 8:374
Yuan, Yuan; Guo, Lei; Shen, Lei et al. (2007) Predicting gene expression from sequence: a reexamination. PLoS Comput Biol 3:e243

Showing the most recent 10 out of 11 publications