Common diseases, such as bipolar disorder, asthma, heart disease, cancer, etc. are caused by a complex interplay among multiple genetic and environmental risk factors. Both common and rare genetic variants are expected to influence risk to these traits. Thus far, most research in nding disease susceptibility variants has focused, out of necessity, on the discovery of common susceptibility variants (i.e. variants with a population frequency of at least 5%). Genome-wide association studies have been very successful at nding common variants robustly associated with many complex traits. However, taken together, these variants only explain a small fraction of the estimated trait heritability. Recent advances in sequencing technologies have brought along substantial reductions in cost and in- creases in genomic throughput by more than three orders of magnitude. These developments have lead to an increasing number of sequencing studies being performed, including the 1000 Genomes Project, with the main goal to identify rare genetic variants. Therefore, for the first time, it is now possible to systematically assess the role rare variants may play in various complex traits. Existing methods for the detection of common susceptibility variants are not suitable for the detection of rare variants. We believe that there is a great need for new developments in statistical methodology for the analysis of rare variants, if we want to make the best use of the sequence data currently being generated. The proposed research intends to develop novel and efficient statistical approaches to address this need. The methods proposed here exploit information about the full frequency distributions of rare variants for cases and controls to achieve substantial increases in power over current methods, and can handle large genomic regions, possibly entire genomes. We plan to test our methods on data simulated under a comprehensive set of disease models, and then to apply them to real data on psychiatric diseases, for which common susceptibility variants are very hard to identify. The methods proposed here will be implemented into a software package, to be made available to the larger research community. We believe that this proposal has the strong potential to help in the current efforts to expand the search for causal genetic variants to the, until now, unexplored territory of rare variation. This next phase is key to advancing our understanding of the biological underpinnings of complex diseases, and ultimately essential to improving the public health.

Public Health Relevance

Recent advances in sequencing technologies allow for the first time in history the systematic assessment of the potential role rare variants may play in various complex diseases, such as bipolar disorder and asthma. We propose to develop powerful statistical methods toward this goal, and implement them into a software package, to be made available to the larger research community.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Small Research Grants (R03)
Project #
5R03HG005908-02
Application #
8127996
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Brooks, Lisa
Project Start
2010-08-16
Project End
2013-05-31
Budget Start
2011-06-01
Budget End
2013-05-31
Support Year
2
Fiscal Year
2011
Total Cost
$80,500
Indirect Cost
Name
Columbia University (N.Y.)
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
621889815
City
New York
State
NY
Country
United States
Zip Code
10032
Ionita-Laza, Iuliana; Lee, Seunggeun; Makarov, Vladimir et al. (2013) Family-based association tests for sequence data, and comparisons with population-based association tests. Eur J Hum Genet 21:1158-62
Ionita-Laza, Iuliana; Lee, Seunggeun; Makarov, Vlad et al. (2013) Sequence kernel association tests for the combined effect of rare and common variants. Am J Hum Genet 92:841-53
Ionita-Laza, Iuliana; Makarov, Vlad; ARRA Autism Sequencing Consortium et al. (2012) Scan-statistic approach identifies clusters of rare disease variants in LRP2, a gene linked and associated with autism spectrum disorders, in three datasets. Am J Hum Genet 90:1002-13
Ionita-Laza, Iuliana; Ottman, Ruth (2011) Study designs for identification of rare disease variants in complex diseases: the utility of family-based designs. Genetics 189:1061-8
Ionita-Laza, Iuliana; Buxbaum, Joseph D; Laird, Nan M et al. (2011) A new testing strategy to identify rare variants with either risk or protective effect on disease. PLoS Genet 7:e1001289
Ionita-Laza, Iuliana; Makarov, Vlad; Yoon, Seungtai et al. (2011) Finding disease variants in Mendelian disorders by using sequence data: methods and applications. Am J Hum Genet 89:701-12