Genome-wide association studies (GWAS) are an effective tool for indentifying common genetic variants that contribute to disease and heritable traits. These studies use high-density oligoneculeotide arrays to assay hundreds of thousands of diallelic genetic markers in each individual. However, genome-wide association studies can also produce hundred of spurious disease-gene associations caused by genotyping error. This research will develop statistical and computational methods that use inter-marker correlation to substantially improve genotype accuracy. All existing methods for calling genotypes for large-scale data ignore the correlation between genetic markers. This correlation is highly informative, but exploiting inter-marker correlation is computationally difficult because it requires inference of the marker alleles inherited from a single parent (the haplotype phase). Recently, we have developed a novel method of haplotype phase inference for large-scale data sets of unrelated individuals that is orders of magnitude faster and more accurate than competing methods. The next step will be to improve haplotype phase inference and genotype calling by performing both tasks simultaneously. This will enable genotype uncertainty to be taken into account when inferring haplotype phase and inter-marker correlation to be taken into account when calling genotypes. Our methods will improve genotype accuracy, improve haplotype phase inference accuracy, decrease false positive associations due to genotyping error, and increase power to detect true genetic associations. We will extend these novel methods to call genotypes and phase haplotypes for parent-offspring trios where the additional relatedness information will lead to even larger gains in accuracy. The improved genotype accuracy and phased haplotypes from our methods will contribute to improved understanding of the genetic contribution to human disease. Our research will also address one of the main impediments to haplotypic analysis: the difficulty in interpreting analysis results. We will develop interactive methods for visualizing haplotype structure and haplotype-trait associations. These new data exploration methods will greatly simplify the task of identifying sequences of genetic variants that are associated with a trait.

Public Health Relevance

Heritable genetic variants contribute to many common diseases, such as cardiovascular disease and diabetes. This research will develop new methods and tools that improve the accuracy of genetic data and that improve our ability to identify genetic variants that increase risk of disease. These methods and tools will contribute to the prevention, diagnosis, and treatment of heritable diseases in the United States and throughout the world.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
1R01HG004960-01
Application #
7632327
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Brooks, Lisa
Project Start
2009-08-07
Project End
2010-06-30
Budget Start
2009-08-07
Budget End
2010-06-30
Support Year
1
Fiscal Year
2009
Total Cost
$144,218
Indirect Cost
Name
Auckland Uniservices Limited
Department
Type
DUNS #
590494795
City
Auckland
State
Country
New Zealand
Zip Code
1010
Browning, Brian L; Browning, Sharon R (2016) Genotype Imputation with Millions of Reference Samples. Am J Hum Genet 98:116-26
Browning, Sharon R; Browning, Brian L (2015) Accurate Non-parametric Estimation of Recent Effective Population Size from Segments of Identity by Descent. Am J Hum Genet 97:404-18
1000 Genomes Project Consortium; Auton, Adam; Brooks, Lisa D et al. (2015) A global reference for human genetic variation. Nature 526:68-74
Zhang, Qian S; Browning, Brian L; Browning, Sharon R (2015) Genome-wide haplotypic testing in a Finnish cohort identifies a novel association with low-density lipoprotein cholesterol. Eur J Hum Genet 23:672-7
Qian, Yu; Browning, Brian L; Browning, Sharon R (2014) Efficient clustering of identity-by-descent between multiple individuals. Bioinformatics 30:915-22
Yu, Z; Li, C F; Mkhikian, H et al. (2014) Family studies of type 1 diabetes reveal additive and epistatic effects between MGAT1 and three other polymorphisms. Genes Immun 15:218-23
Shirts, Brian H; Jacobson, Angela; Jarvik, Gail P et al. (2014) Large numbers of individuals are required to classify and define risk for rare variants in known cancer risk genes. Genet Med 16:529-34
Browning, Sharon R; Browning, Brian L (2013) Identity-by-descent-based heritability analysis in the Northern Finland Birth Cohort. Hum Genet 132:129-38
de Candia, Teresa R; Lee, S Hong; Yang, Jian et al. (2013) Additive genetic variation in schizophrenia risk is shared by populations of African and European descent. Am J Hum Genet 93:463-70
Browning, Brian L; Browning, Sharon R (2013) Detecting identity by descent and estimating genotype error rates in sequence data. Am J Hum Genet 93:840-51

Showing the most recent 10 out of 26 publications