It is a major challenge to extract useful biological knowledge from the large amounts of data that are currently being generated by genome sequencing projects and related technologies such as DNA microarrays. The surprisingly low number of protein-encoding genes found in the human genome unscores the importance of gene expression regulation as a determinant of organismal complexity. Failure of regulatory mechanisms plays a role in many human diseases. The general aim of our research is to further develop regression approaches as a new paradigm for the analysis of functional genomics data. Using simple models based on the molecular mechanisms that control transcription initiation, mRNA turnover, and chromatin remodeling we will analyze genomic sequences as well microarray data for mRNA expression and transcription factor binding. The research proposed here will build upon the success of REDUCE, our motif-based regression analysis tool for discovering cis-regulatory elements in non-coding DNA and inferring the activity of the regulatory factors binding to these elements. The fact that a single genome-wide mRNA expression pattern can be analyzed in isolation makes it possible to model the environmental condition dependence of regulatory processes.
Specific aims are to: (1) Increase the statistical power of REDUCE to detect degenerate motifs by incorporating algorithms based on suffix trees, position-specific scoring matrices, and gene-specific error estimation. We will also explore the use of comparative genomics to restrict the search for motifs to conserved regions; (2) Associate transcription factors with their functional target genes in S. cerevisiae through intergrated analysis of genomewide transcription factor binding data and a large library of mRNA expression data; (3) Uncover synergistic and competitive interactions between transcription factors through multivariate regression analysis of mRNA expression data in which such interaction are modeled explicitly. We will also analyze the possible context dependence of these interactions; (4) Characterize cis-regulatory modules in Drosophila and their target genes by combining hidden Markov modeling of clustered transcription factor binding sites in non-coding DNA with regression analysis of mRNA expression data. ? ?

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003008-04
Application #
7242590
Study Section
Genome Study Section (GNM)
Program Officer
Good, Peter J
Project Start
2004-08-13
Project End
2009-05-11
Budget Start
2007-06-01
Budget End
2009-05-11
Support Year
4
Fiscal Year
2007
Total Cost
$352,273
Indirect Cost
Name
Columbia University (N.Y.)
Department
Biology
Type
Other Domestic Higher Education
DUNS #
049179401
City
New York
State
NY
Country
United States
Zip Code
10027
Rastogi, Chaitanya; Rube, H Tomas; Kribelbauer, Judith F et al. (2018) Accurate and sensitive quantification of protein-DNA binding affinity. Proc Natl Acad Sci U S A 115:E3692-E3701
Rube, H Tomas; Rastogi, Chaitanya; Kribelbauer, Judith F et al. (2018) A unified approach for quantifying and interpreting DNA shape readout by transcription factors. Mol Syst Biol 14:e7902
Rao, Satyanarayan; Chiu, Tsu-Pei; Kribelbauer, Judith F et al. (2018) Systematic prediction of DNA shape changes due to CpG methylation explains epigenetic effects on protein-DNA binding. Epigenetics Chromatin 11:6
Zhang, Liyang; Martini, Gabriella D; Rube, H Tomas et al. (2018) SelexGLM differentiates androgen and glucocorticoid receptor DNA-binding preference over an extended binding site. Genome Res 28:111-121
Li, Jinsen; Sagendorf, Jared M; Chiu, Tsu-Pei et al. (2017) Expanding the repertoire of DNA shape features for genome-scale studies of transcription factor binding. Nucleic Acids Res 45:12877-12887
Sagendorf, Jared M; Berman, Helen M; Rohs, Remo (2017) DNAproDB: an interactive tool for structural analysis of DNA-protein complexes. Nucleic Acids Res 45:W89-W97
Kribelbauer, Judith F; Laptenko, Oleg; Chen, Siying et al. (2017) Quantitative Analysis of the DNA Methylation Sensitivity of Transcription Factor Complexes. Cell Rep 19:2383-2395
van Arensbergen, Joris; FitzPatrick, Vincent D; de Haas, Marcel et al. (2017) Genome-wide mapping of autonomous promoter activity in human cells. Nat Biotechnol 35:145-153
Bussemaker, Harmen J; Causton, Helen C; Fazlollahi, Mina et al. (2017) Network-based approaches that exploit inferred transcription factor activity to analyze the impact of genetic variation on gene expression. Curr Opin Syst Biol 2:98-102
Chiu, Tsu-Pei; Rao, Satyanarayan; Mann, Richard S et al. (2017) Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein-DNA binding. Nucleic Acids Res 45:12565-12576

Showing the most recent 10 out of 55 publications