Methods, software, and analyses of genomic data in multiplex oral cleft families

Ruczinski, Ingo

Abstract

Considerable genetic heterogeneity must be expected with any complex disease such as oral clefts, where rare variants could explain part of the so-called missing heritability. In extended families with multiple affected members, there is a high probability that several of these affected relatives carry the same rare, high-penetrance risk variant if such a variant is found in one affected individual. We recently developed a general framework for calculating rare nucleotide variant sharing probabilities when two or more affected subjects from an extended family are sequenced, and showed how information from multiple families can be combined by calculating a p-value as the sum of the probabilities of sharing events as extreme as or more extreme than the one observed. We also examined the impact of unknown relationships (i.e., cryptic relatedness) and proposed methods to approximate sharing probabilities based on empirical estimates of kinship between family members obtained from genome-wide marker data. We applied this method to whole exome sequence data from a study of 55 multiplex cleft families with apparent non-syndromic forms of oral clefts from four distinct populations, and identified a genome-wide significant rare variant in the gene ADAMTS9 shared by affected relatives in three Indian families. A more targeted analysis focused on 348 oral cleft candidate genes identified an additional potentially damaging SNV in CDH1 in a single family. In this application, we propose to extend this approach to rare DNA copy number variants, to implement an open-source software package for genomic array and sequencing data that is scalable and suitable for reproducible genomic research, and to use existing oral cleft data to identify novel, rare, high-penetrance genetic variants underlying oral cleft risk.
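The sharing probability and the multi-family p-value described in the abstract can be illustrated with a minimal sketch. This is not the authors' software. It assumes that exactly one founder introduces a single copy of the variant, that each founder is equally likely to be that founder, that the pedigree has no inbreeding, and that a carrier parent transmits the variant with probability 1/2. It also takes "equal or more extreme" to mean "no more probable than the observed pattern", which is an assumption. The function names and the dictionary encoding of a pedigree are hypothetical.

```python
from fractions import Fraction
from itertools import product


def sharing_probability(pedigree, affected):
    """P(all affected carry the variant | at least one affected carries it).

    pedigree maps each person to None (founder) or a (parent, parent) tuple,
    and must list parents before their children. Assumes one founder
    introduces one copy and that the pedigree is not inbred.
    """
    affected = set(affected)
    founders = [person for person, parents in pedigree.items() if parents is None]
    p_all = Fraction(0)
    p_any = Fraction(0)
    for founder in founders:
        # Each state is the set of carriers so far, with its probability.
        states = {frozenset([founder]): Fraction(1)}
        for child, parents in pedigree.items():
            if parents is None:
                continue
            new_states = {}
            for carriers, prob in states.items():
                if carriers & set(parents):
                    # A carrier parent transmits the variant with probability 1/2.
                    outcomes = ((carriers | {child}, prob / 2), (carriers, prob / 2))
                else:
                    outcomes = ((carriers, prob),)
                for state, p in outcomes:
                    new_states[state] = new_states.get(state, 0) + p
            states = new_states
        for carriers, prob in states.items():
            weight = prob / len(founders)
            if carriers >= affected:
                p_all += weight
            if carriers & affected:
                p_any += weight
    return p_all / p_any


def multi_family_p_value(share_probs, observed):
    """Sum the probabilities of all sharing patterns across families that are
    no more probable than the observed one (assumed definition of 'extreme')."""
    def pattern_prob(pattern):
        prob = Fraction(1)
        for p, shared in zip(share_probs, pattern):
            prob *= p if shared else 1 - p
        return prob

    p_obs = pattern_prob(observed)
    patterns = product([True, False], repeat=len(share_probs))
    return sum((q for q in map(pattern_prob, patterns) if q <= p_obs), Fraction(0))


# Hypothetical example: two affected first cousins.
cousins = {
    "grandfather": None, "grandmother": None, "spouse1": None, "spouse2": None,
    "parent1": ("grandfather", "grandmother"),
    "parent2": ("grandfather", "grandmother"),
    "cousin1": ("parent1", "spouse1"),
    "cousin2": ("parent2", "spouse2"),
}
p_cousins = sharing_probability(cousins, ["cousin1", "cousin2"])
print(p_cousins)  # 1/15

# Three such families in which the variant is shared by both cousins each time.
print(multi_family_p_value([p_cousins] * 3, [True, True, True]))  # 1/3375
```

Exact fractions are used so that the comparison between pattern probabilities is not affected by floating-point rounding. Enumerating all sharing patterns grows as 2 to the power of the number of families, so this sketch is only practical for a small number of families.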

Public Health Relevance

In extended families with multiple affected members, there is a high probability that several affected relatives carry the same rare, high-penetrance risk variant if such a variant is found in one affected individual. We recently developed a general framework for calculating rare nucleotide variant sharing probabilities when two or more affected subjects from an extended family are sequenced (either whole exome or whole genome), and showed how information from multiple families can be combined by calculating the sum of the probabilities of sharing events as extreme as or more extreme than the one observed. The goal of this application is to develop new methods to infer DNA copy number variants from sequencing and array data in extended multiplex families, to implement an open-source software package based on these new methods of assessing variant sharing for the analysis of genomic data in these families, and to use existing oral cleft data to identify novel, rare, high-penetrance genetic variants underlying oral cleft risk.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Institute of Dental & Craniofacial Research (NIDCR)
Type: Small Research Grants (R03)
Project #: 1R03DE025279-01
Application #: 8952405
Study Section: Special Emphasis Panel (ZDE1)
Program Officer: Harris, Emily L

Project Start: 2015-09-01
Project End: 2017-08-31
Budget Start: 2015-09-01
Budget End: 2016-08-31
Support Year: 1
Fiscal Year: 2015

Institution

Name: Johns Hopkins University
Department: Biostatistics & Other Math Sci
Type: Schools of Public Health
DUNS #: 001910777

City: Baltimore
State: MD
Country: United States
Zip Code: 21205

Related projects


NIH 2016 R03 DE	Methods, software, and analyses of genomic data in multiplex oral cleft families Ruczinski, Ingo / Johns Hopkins University
NIH 2015 R03 DE	Methods, software, and analyses of genomic data in multiplex oral cleft families Ruczinski, Ingo / Johns Hopkins University
