Two forms of genetic variation are common and can be measured on a genomic scale using recent high throughput genotyping platforms: single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Unlike high throughput genotyping algorithms that are highly accurate, copy number estimates are very imprecise and tools for estimating copy number and inferring regions of CNV are still under development. My immediate scientific goals are to provide first generation algorithms for each of the following tiers of estimation problems: (i) By locus: estimate the raw copy number at each locus on the array and quantify the uncertainty, (ii) By sample: infer regions of CNV, and (iii) Between samples: assess the contribution of CNV to disease susceptibility. My long term goal is to establish an interdisciplinary research lab in biostatistics and human genetics that supports creative computational and statistical solutions to high throughput genomic data. This Award will facilitate the necessary training and skills to transition to independent research through formal coursework in statistical genetics and computational biology, leadership opportunities in structured career development activities, such as the GWAs@JohnsHopkins working group, new collaborations from multiple research institutes, and presentations at national conferences, including epidemiological (American Heart Association), methodological (Joint Statistical Meetings), and topical (e.g., a copy number variant workshop). A scientific advisory panel of internationally recognized experts will oversee my research. New technologies and applications for genomic research developed during the course of this Award will lead to exciting new opportunities for biostatistical research, as well as R01 funding opportunities that I will actively pursue. PUBLIC HEALTH REVELANCE - Genetic variation between individuals is common and has been linked to common diseases such as diabetes and cancer. I propose to develop statistical methods for new genome-scale technologies to identify genetic variants and to characterize their contribution to disease susceptibility.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Career Transition Award (K99)
Project #
1K99HG005015-01
Application #
7641202
Study Section
Ethical, Legal, Social Implications Review Committee (GNOM)
Program Officer
Brooks, Lisa
Project Start
2009-07-24
Project End
2010-06-10
Budget Start
2009-07-24
Budget End
2010-06-10
Support Year
1
Fiscal Year
2009
Total Cost
$90,000
Indirect Cost
Name
Johns Hopkins University
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
001910777
City
Baltimore
State
MD
Country
United States
Zip Code
21218
Scharpf, Robert B; Ruczinski, Ingo; Carvalho, Benilton et al. (2011) A multilevel model to address batch effects in copy number estimation using SNP arrays. Biostatistics 12:33-50
Scharpf, Robert B; Ruczinski, Ingo (2010) R classes and methods for SNP array data. Methods Mol Biol 593:67-79