Within the last decade, major progress has been made in understanding the genetic basis of Autism Spectrum Disorders (ASDs). Based on exome sequencing studies and microarray-based genotyping, it is recognized that the genetic architecture of ASD consists of rare mutations of large effect, including structural variants (SVs) and de novo point mutations that impact genes, as well as a significant contribution from common polygenic variation. However, a majority of the genetic risk for ASD remains unexplained. A proportion of the missing heritability of ASD could be attributable to genetic variation that remains inaccessible to today?s high throughput sequencing platforms including a majority of structural variants (SVs) and sequence variation that occurs within repetitive sequences in the genome. A systematic analysis of these novel classes of genetic variation could close a significant gap in our knowledge of ASD genetics. The development of new single- molecule sequencing platforms now enables direct sequencing of long DNA fragments (average read lengths >5,000 bp). These technologies have enable sequence assembly and variant calling within complex and repetitive regions of the genome and have dramatically increased the proportion of structural and tandem repeat (TR) variation that can be captured by whole genome sequencing (WGS). The application of long read WGS to ASD family samples could greatly expand knowledge of the genetic causes of autism. This study will investigate the contribution of complex genetic variants to risk for ASD using a combination of long-read and short-read technologies. (1) We will characterize global patterns of genetic variation in a primary sample of ASD families (N=373 cases, 127 sibling controls and their parents) by WGS using the Pacific Biosciences (SEQUEL) platform, and these data will be combined with an existing WGS dataset of Illumina short reads on the same samples. (2) We will investigate the genetic association of novel classes of SVs and TRs in genes and in regulatory elements that control gene expression, and novel findings will be replicated in Illumina WGS data on 2600 ASD families from the Simons Simplex Collection (SSC) (3) We will then investigate the influence of novel risk alleles on clinical phenotype in families and experimentally confirm the effects of mutations on gene function. Our proposed study will expand our knowledge of the genetic architecture of ASD and identify novel risk alleles and genetic mechanisms underlying disease risk.

Public Health Relevance

The underlying causes of autism consist, in part, of genetic variants that influence brain development. This study will expand our knowledge of the genetic and neurobiological basis of autism by applying a new sequencing technology with dramatically improved sensitivity to capture variation throughout the genome. New genetic findings will facilitate an improved understanding the genetic mechanisms that contribute to ASD.

Agency
National Institute of Health (NIH)
Institute
National Institute of Mental Health (NIMH)
Type
Research Project (R01)
Project #
5R01MH113715-04
Application #
9980506
Study Section
Behavioral Genetics and Epidemiology Study Section (BGES)
Program Officer
Gitik, Miri
Project Start
2017-09-06
Project End
2022-05-31
Budget Start
2020-06-01
Budget End
2021-05-31
Support Year
4
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of California, San Diego
Department
Psychiatry
Type
Schools of Medicine
DUNS #
804355790
City
La Jolla
State
CA
Country
United States
Zip Code
92093
Brandler, William M; Antaki, Danny; Gujral, Madhusudan et al. (2018) Paternally inherited cis-regulatory structural variants are associated with autism. Science 360:327-331
Antaki, Danny; Brandler, William M; Sebat, Jonathan (2018) SV2: accurate structural variation genotyping and de novo mutation detection from whole genomes. Bioinformatics 34:1774-1777