While often ignored in analysis, repetitive regions of the genome and their association with disease is becoming more apparent in recent years. Part of the resurgence of interest in these regions is the availability of new tech- nologies to sequence them and accurately map their location. Indeed, classes of transposable elements have been shown to be polymorphic in the population indicating both their continued activity in shaping our genomes and their propensity to be genetic drivers of phenotype. This has been especially true in neurologic disorders, where transposable elements are not only polymorphic but actively moving in somatic cells and has driven parts of projects such as the Brain Somatic Mosaicism Network. However, a major roadblock in identifying these ele- ments remains as their inherent repetitive nature makes them difficult to place on a genome. In this proposal, we will develop a technology to capture a set of actively moving transposable elements: L1Hs, AluYa5/8, AluYb8/9, and SVAs. These represent the vast majority of active transposable elements and thus will allow us to measure the genetic diversity of polymorphic insertions of these elements. After capture, we will use nanopore long-read sequencing to capture both the entire insertion as well as thousands of bases of surrounding sequence which will allow for accurate mapping of these elements to the genome. We will apply this new technology to a set of three diverse trios that have been well-studied and characterized to allow for follow-up analysis of the effect of polymorphic insertions of transposable elements.

Public Health Relevance

This project seeks to develop assays to capture and map repetitive elements in the human genome. Accurate mapping of these elements will allow for study into genetic mechanisms driven by actively moving transposable elements in the genome. Improved understanding of these regions will provide a better mechanism for interpretation of element function and suggest new targets to approach incorrect regulation leading to disease.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Exploratory/Developmental Grants (R21)
Project #
1R21HG011493-01
Application #
10105744
Study Section
Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer
Gilchrist, Daniel A
Project Start
2020-12-01
Project End
2022-11-30
Budget Start
2020-12-01
Budget End
2021-11-30
Support Year
1
Fiscal Year
2021
Total Cost
Indirect Cost
Name
University of Michigan Ann Arbor
Department
Biostatistics & Other Math Sci
Type
Schools of Medicine
DUNS #
073133571
City
Ann Arbor
State
MI
Country
United States
Zip Code
48109