We propose to improve methods for locus-specific resequencing using variation detection arrays (VDAs), also known as resequencing microarrays. High throughput genotyping technology provides a means to determine the diversity of the genomes of humans and other model organisms, and thereby more efficiently resolve the functional causes of disease. Recently microarray probe densities have increased significantly and costs have been reduced such that it has become economical to use VDAs for large scale SNP discovery. Recent improvements in gene mapping technologies have identified many disease-linked loci in humans, but efficient technologies for resequencing of these loci for large numbers of samples are needed because of low penetrance and complex disease etiologies in human populations. VDAs provide one such solution. We propose to improve the design of VDAs and the methods for automatic genotype base calling of VDA data. Our approach is based on modeling the physical DNA-DNA hybridization on the surface of the microarray and optimizing chip design for better performance under uniform hybridization conditions. Specifically, we propose to 1. Increase the call rate and accuracy of base calling on resequencing microarray data by using physical models that predict'hybridization results based on probe-target sequences, 2. Modify the probe sequence design of resequencing microarrays to achieve greater sensitivity and single nucleotide discrimination, 3. Assess the effects of our novel design strategy by hybridizing a diverse set of samples to a test microarray interrogating select ENCODE regions of the human genome, and 4. Demonstrate the advantages of our improved methods on a preliminary case study of a low penetrance modifier loci implicated in breast cancer susceptibility. This work aims to improve the effectiveness of resequencing technology, thereby improving the efficiency of variation detection, and ultimately leading to a better understanding of the molecular basis of complex human disease.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003880-03
Application #
7363618
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Ozenberger, Bradley
Project Start
2006-03-23
Project End
2009-01-31
Budget Start
2008-02-01
Budget End
2009-01-31
Support Year
3
Fiscal Year
2008
Total Cost
$296,144
Indirect Cost
Name
University of Massachusetts Amherst
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
153926712
City
Amherst
State
MA
Country
United States
Zip Code
01003
Bahl, Amit; Davis, Paul H; Behnke, Michael et al. (2010) A novel multifunctional oligonucleotide microarray for Toxoplasma gondii. BMC Genomics 11:603
Kwan, Alan L; Li, Linya; Kulp, David C et al. (2009) Improving gene-finding in Chlamydomonas reinhardtii:GreenGenie2. BMC Genomics 10:210
Jagalur, Manjunatha; Pal, Chris; Learned-Miller, Erik et al. (2007) Analyzing in situ gene expression in the mouse brain with image registration, feature extraction and block clustering. BMC Bioinformatics 8 Suppl 10:S5