This award is funded under the American Recovery and Reinvestment Act of 2009 (Public Law 111-5).

A current challenge for genetics and genomics research is discovering DNA sequences that alter the expression of genes operating during development and various physiological processes. This project will develop the statistical method of structural equation modeling to mine genome-wide DNA sequence and gene expression data to identify these connections. The research relies on methods developed in the area of computational Bayesian statistics to overcome computational difficulties posed by the scale and complexity of genomic data, which make extracting relevant biological information difficult.

This research will develop software for analyzing genomic data that will be used by researchers in the fields of medical, agricultural, and conservation genetics to discover important biological network relationships between DNA sequences and the expression of genes. The software will be used as a starting point for investigating why and how DNA sequences produce differences among individuals in both appearance and susceptibility to genetic diseases. The underlying statistical methods of the software will result in faster and more efficient computational network analysis for data collected at the scale of the entire genome.

Agency
National Science Foundation (NSF)
Institute
Division of Environmental Biology (DEB)
Type
Standard Grant (Standard)
Application #
0922432
Program Officer
Samuel M. Scheiner
Project Start
Project End
Budget Start
2009-09-01
Budget End
2012-08-31
Support Year
Fiscal Year
2009
Total Cost
$516,392
Indirect Cost
Name
Cornell Univ - State: Awds Made Prior May 2010
Department
Type
DUNS #
City
Ithica
State
NY
Country
United States
Zip Code
14850