Understanding regulatory networks controlling gene expression is one of the fundamental problems of modern biology. The proposed research focuses on the methods of locating regulatory elements in DNA. We have developed a new maximum likelihood method based on the physical DNA dependent binding probability of a transcription factor (TF) that correctly incorporates the protein concentration dependent saturation effect. The advantage of keeping the saturation effect is that the method automatically provides a score threshold for classifying candidate sites into binders and non-binders. Most conventional methods, based on the information score, merely provide a relative ordering of candidate sequences. The principled choice of a threshold is extremely useful for dealing with the highly variable sites typical of global regulatory factors. The simplest of our algorithms reduces to a one-class support vector machine. This classifier will be applied to detect large regulons in E. coli, as well as in phages, with special attention to targets of sigma factors. We also develop classifiers for regulatory targets that go beyond pure sequence analysis and combine it with information from additional sources, like microarray expression data or sequence similarity between phylogenetically closely related species. The proposed computational effort will be complemented by experiments verifying the predictions as well as providing in vitro and in vivo data needed to make predictions. Experimental efforts will involve a high throughput low stringency SELEX method applied to global transcriptional regulators from E. coli. It will also involve chromatin immunoprecipitation and beta- galactosidase assays performed in S. Cerevisiae that test the ability of bioinformatic algorithms to predict functionality of TF binding sites. A special feature of this proposal is the analysis of the effect of the rest of the promoter on the regulatory potential of a site. We apply the lessons learnt in simple organisms to the elucidation of distinctive specificity of different NFkB proteins involved in immunity, inflammation and cancer. Mutation, over-expression and amplification of genes encoding transcription factors play an important role in many diseases from diabetes to cancer. Understanding how a factor targets genes is crucial for discovering the pathways whose malfunction leads to the symptoms. This is an achievable goal, given the right way to analyze the plethora of genome-wide data available to us. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003470-02
Application #
7221303
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Good, Peter J
Project Start
2006-04-13
Project End
2009-03-31
Budget Start
2007-04-01
Budget End
2008-03-31
Support Year
2
Fiscal Year
2007
Total Cost
$299,796
Indirect Cost
Name
Rutgers University
Department
Physics
Type
Schools of Arts and Sciences
DUNS #
001912864
City
New Brunswick
State
NJ
Country
United States
Zip Code
08901
Nagaraj, Vijayalakshmi H; Mukhopadhyay, Swagatam; Dayarian, Adel et al. (2014) Breaking an epigenetic chromatin switch: curious features of hysteresis in Saccharomyces cerevisiae telomeric silencing. PLoS One 9:e113516
Dayarian, Adel; Sengupta, Anirvan M (2013) Titration and hysteresis in epigenetic chromatin silencing. Phys Biol 10:036005
Mukhopadhyay, Swagatam; Sengupta, Anirvan M (2013) The role of multiple marks in epigenetic silencing and the emergence of a stable bivalent chromatin state. PLoS Comput Biol 9:e1003121
Kulaeva, Olga I; Zheng, Guohui; Polikanov, Yury S et al. (2012) Internucleosomal interactions mediated by histone tails allow distant communication in chromatin. J Biol Chem 287:20248-57
Mukhopadhyay, Swagatam; Schedl, Paul; Studitsky, Vasily M et al. (2011) Theoretical analysis of the role of chromatin interactions in long-range action of enhancers and insulators. Proc Natl Acad Sci U S A 108:19919-24
McIsaac, R Scott; Huang, Kerwyn Casey; Sengupta, Anirvan et al. (2011) Does the potential for chaos constrain the embryonic cell-cycle oscillator? PLoS Comput Biol 7:e1002109
Mehta, Pankaj; Schwab, David J; Sengupta, Anirvan M (2011) Statistical Mechanics of Transcription-Factor Binding Site Discovery Using Hidden Markov Models. J Stat Phys 142:1187-1205
Gelfand, Brian; Mead, Janet; Bruning, Adrian et al. (2011) Regulated antisense transcription controls expression of cell-type-specific genes in yeast. Mol Cell Biol 31:1701-9
Mukhopadhyay, Swagatam; Nagaraj, Vijayalakshmi H; Sengupta, Anirvan M (2010) Locus dependence in epigenetic chromatin silencing. Biosystems 102:49-54
Dayarian, Adel; Michael, Todd P; Sengupta, Anirvan M (2010) SOPRA: Scaffolding algorithm for paired reads via statistical optimization. BMC Bioinformatics 11:345

Showing the most recent 10 out of 17 publications