We propose to develop novel statistical methods and software tools for disease association testing with rare variants, with particular application to autism. Although genome-wide association studies have led to the discovery of many common variants reproducibly associated with various complex traits, these variants have small effect sizes and overall explain only a small fraction of the total estimated trait heritability. Recent advances in next-generation sequencing technologies allow for the first time an objective assessment of the importance of rare variants in complex diseases. Over the past few years it has become clear from numerous empirical studies that rare variants are an important contributor to disease risk. This is especially compelling for psychiatric diseases, such as schizophrenia and autism, where common disease susceptibility variants have been more difficult to identify. Traditional association testing strategies that have worked well for common variants have low power for the analysis of rare variants, mostly due to the large number of such variants in any genetic region and their low frequency counts in datasets of realistic sizes. Therefore development of powerful methods for rare variant analysis is greatly needed in order to efficiently extract information from the many sequencing datasets currently being generated. In this application we propose novel methods for both population- and family-based designs to identify rare genetic variants that influence risk to complex diseases, with particular application to autism. In particular, we focus on methods development in the following areas: family-based testing strategies for rare variants, unified testing strategies to efficiently combine family-base and population-based studies, and refinement strategies to identify causal rare variants once an overall association at a gene- or region-level has been established. We will implement the new methods in a comprehensive software package to be made available to the scientific community. Furthermore we will apply these methods to whole-exome data from 1000 autism cases, 1000 matched controls, and 500 autism trios. We believe the proposed research is very timely and has the potential to be of great public health importance through direct application to autism, and more broadly to other complex diseases.

Public Health Relevance

Autism and other psychiatric diseases are major public health problems. The proposed statistical methodology with direct application to autism will help in the identification of genetic variants influencing autism risk, with important implications for public health.

National Institute of Health (NIH)
National Institute of Mental Health (NIMH)
Research Project (R01)
Project #
Application #
Study Section
Behavioral Genetics and Epidemiology Study Section (BGES)
Program Officer
Senthil, Geetha
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Columbia University (N.Y.)
Biostatistics & Other Math Sci
Schools of Public Health
New York
United States
Zip Code
Polubriaginof, Fernanda C G; Vanguri, Rami; Quinnies, Kayla et al. (2018) Disease Heritability Inferred from Familial Relationships Reported in Medical Records. Cell 173:1692-1704.e11
Backenroth, Daniel; He, Zihuai; Kiryluk, Krzysztof et al. (2018) FUN-LDA: A Latent Dirichlet Allocation Model for Predicting Tissue-Specific Functional Effects of Noncoding Variation: Methods and Applications. Am J Hum Genet 102:920-942
Liu, Yuwen; Liang, Yanyu; Cicek, A Ercument et al. (2018) A Statistical Framework for Mapping Risk Genes from De Novo Mutations in Whole-Genome-Sequencing Studies. Am J Hum Genet 102:1031-1047
Sanna-Cherchi, Simone; Khan, Kamal; Westland, Rik et al. (2017) Exome-wide Association Study Identifies GREB1L Mutations in Congenital Kidney Malformations. Am J Hum Genet 101:789-802
He, Zihuai; Xu, Bin; Lee, Seunggeun et al. (2017) Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotations and Meta-analysis of Noncoding Variation in Metabochip Data. Am J Hum Genet 101:340-352
Kiryluk, Krzysztof; Li, Yifu; Moldoveanu, Zina et al. (2017) GWAS for serum galactose-deficient IgA1 implicates critical genes of the O-glycosylation pathway. PLoS Genet 13:e1006609
Song, Xiaoyu; Li, Gen; Zhou, Zhenwei et al. (2017) QRank: a novel quantile regression tool for eQTL discovery. Bioinformatics 33:2123-2130
He, Zihuai; Lee, Seunggeun; Zhang, Min et al. (2017) Rare-variant association tests in longitudinal studies, with an application to the Multi-Ethnic Study of Atherosclerosis (MESA). Genet Epidemiol 41:801-810
Lim, Elaine T; Uddin, Mohammed; De Rubeis, Silvia et al. (2017) Rates, distribution and implications of postzygotic mosaic mutations in autism spectrum disorder. Nat Neurosci 20:1217-1224
Song, Xiaoyu; Ionita-Laza, Iuliana; Liu, Mengling et al. (2016) A General and Robust Framework for Secondary Traits Analysis. Genetics 202:1329-43

Showing the most recent 10 out of 34 publications