Neurodevelopmental disorders (NDDs) affect a considerable fraction of the population resulting in substantial economic and healthcare system burdens to society. Genetic aberrations, including copy number variations (CNVs) and sequence variants, are the most common etiology of NDDs. Understanding the mechanism of formation, developmental origin, and genomic architectural elements that predispose to such aberrations could improve clinical interpretation of diagnostics as well as genetic counseling of affected patients and families. Repetitive elements in the genome, and specifically those of the Alu family, have been found to mediate human CNVs, and we hypothesize that they could be under-recognized substrates for CNV formation. To explore this, we will computationally identify genes with significantly increased Alu content in introns and flanking genomic regions and compare this gene list to databases of human CNVs. De novo mutations, including those mediated by repetitive elements, that underlie NDDs are classically thought of as occurring in the germ line;however, mitotic cell divisions also represent a large target for mutagenic processes. Based on data obtained from families identified in our laboratory, we hypothesize that mutations during mitotic cell divisions in parents are a more frequent source of pathogenic alleles in their offspring than current detection suggests. We will prospectively screen a large cohort of family trios for unrecognized, low-level somatic mosaicism for genomic deletions to estimate the frequency. In recent years, genome-wide technologies have accelerated the identification of genomic variations underlying NDDs. However, clinical interpretation is made difficult due to the large number of variants detected in each individual. The classic method for determining detrimental alleles is based on incidence differences between patients and controls. Yet, because of recent human population expansion, most variation in an individual is rare and restricted among family lineages or clans, making distinction between rare and pathogenic variants challenging. We hypothesize that integration of multiple knowledge sources, including gene-specific, genome architecture, and population incidence data will result in more accurate, efficient interpretation of genome-wide diagnostics. To this end, we will identify potentially pathogenic alleles from both a large cohort of patients tested by chromosomal microarray as well as by performing exome sequencing of families with NDDs. We will then utilize bioinformatics and statistics to combine multiple information sources together to develop phenotype-specific "pathogenicity probability" for each variant. Such scores will also be used to investigate the extent to which the genetic load of individual variants do or do not contribute to human disease. Overall, this proposal aims to elucidate mechanisms of CNV formation, delineate the timing of mutagenic processes, and assess the deleteriousness of identified variants to improve interpretation of genomic diagnostics for patients with NDDs.

Public Health Relevance

The DNA of two unrelated people in the population differ in millions of places;for individuals with neurodevelopmental disorders, one or more of these differences may be the underlying cause of their disease. Identifying and prioritizing the changes that are most likely to be disease-causing among the millions of other benign variants is an important problem in genomic medicine. We will test the hypothesis that the most efficient method of prioritization uses multiple sources of information, such as information about the specific genes affected by the change, information about DNA structure in the region, and information about the frequency of the change in diseased and unaffected individuals.

Agency
National Institute of Health (NIH)
Institute
National Institute of Neurological Disorders and Stroke (NINDS)
Type
Predoctoral Individual National Research Service Award (F31)
Project #
1F31NS083159-01A1
Application #
8657741
Study Section
NST-2 Subcommittee (NST)
Program Officer
Gwinn, Katrina
Project Start
2013-09-25
Project End
2016-09-24
Budget Start
2013-09-25
Budget End
2014-09-24
Support Year
1
Fiscal Year
2013
Total Cost
$42,520
Indirect Cost
Name
Baylor College of Medicine
Department
Genetics
Type
Schools of Medicine
DUNS #
051113330
City
Houston
State
TX
Country
United States
Zip Code
77030
Campbell, Ian M; Yuan, Bo; Robberecht, Caroline et al. (2014) Parental somatic mosaicism is underrecognized and influences recurrence risk of genomic disorders. Am J Hum Genet 95:173-82
Campbell, Ian M; James, Regis A; Chen, Edward S et al. (2014) NetComm: a network analysis tool based on communicability. Bioinformatics 30:3387-9
Campbell, Ian M; Stewart, Jonathan R; James, Regis A et al. (2014) Parent of origin, mosaicism, and recurrence risk: probabilistic modeling explains the broken symmetry of transmission genetics. Am J Hum Genet 95:345-59
Boone, Philip M; Yuan, Bo; Campbell, Ian M et al. (2014) The Alu-rich genomic architecture of SPAST predisposes to diverse and functionally distinct disease-associated CNV alleles. Am J Hum Genet 95:143-61