There is a fundamental gap in our understanding of how germline variation in immunoglobulin (IG) heavy (IGH) and light chain (IGK; IGL) loci in the human population impacts the development of the functional antibody (Ab) response in health and disease. However, there is a growing appreciation that IG polymorphism contributes to variability in the Ab repertoire, indicating that the integration of IG genetic data has the potential to inform our understanding of Ab function in various clinical contexts. A critical barrier to progress has been that existing genomic resources for IG loci are lacking and poorly represent diversity found across human populations. IG regions are structurally complex, consisting of large segmental duplications, and are among the most polymorphic in the genome, with large copy number variants (CNVs), elevated nucleotide diversity, and population-specific haplotype variants. These complexities have long made IG loci difficult to study at the genomic and population level using standard high-throughput methods, with direct negative impacts on genetic disease association studies and more recently the analysis of expressed Ab repertoire data. As a result, our knowledge of human IG germline diversity (particularly in non-Caucasians) and its contribution to disease lags far behind that of other well studied immune loci. This highlights a direct need for publically available well- characterized IG haplotype references and accurate variant catalogues from diverse ethnic backgrounds to facilitate the design and integration of more accurate genotyping tools, analysis pipelines, and their interpretation. To meet this need, we have developed several robust approaches, which we will utilize here to establish critical community resources for the IG loci. We will first enumerate up to 16 novel IGH/K/L haplotype reference assemblies from an existing set of 8 fosmid libraries from individuals of African, Asian, and European descent. We will also use a novel multi-haplotype informed genotyping pipeline to profile IGH/K/L genetic variation in a cohort of 180 familial and unrelated individuals from these same three populations. This will represent the most comprehensive population survey of IG germline diversity, including descriptions of variable, diversity, joining, and constant gene variation, and locus-wide single nucleotide polymorphisms (SNPs) and CNVs, allowing for fine-scale assessment of variant imputation panels for disease association studies. Finally, to facilitate the utility of these data as long-term resources, all sequences, tools/methods, and analysis pipelines will be made publically available. We will work with established databases to ensure all sequences are deposited in both raw and annotated form. This will include the integration of assemblies into future releases of the human genome reference for use by the genomics community, as well as updates to existing germline gene/allele databases critical to expressed Ab repertoire analysis. This project establishes desperately needed genomic resources for the human IG loci, which will better serve the immunology community for years to come. These will stand as a foundation for future efforts to define the role of IG germline variation in Ab function, health, and disease.

Public Health Relevance

Genomic resources for the human immunoglobulin loci are lacking and poorly represent diversity among human populations, hindering our understanding of the role of antibody genetic variation in health and disease. This project will result in a set of genomic resources that will have transformative impacts on the fields of B cell immunology and immunogenetics, allowing for a better integration of antibody germline data at the genomic and population level. These will have direct applications in genetic association and antibody repertoire profiling studies, with implications for a range of immunological diseases and personalized medicine approaches.

Agency
National Institute of Health (NIH)
Institute
National Institute of Allergy and Infectious Diseases (NIAID)
Type
Resource-Related Research Projects (R24)
Project #
5R24AI138963-03
Application #
9955200
Study Section
Special Emphasis Panel (ZAI1)
Program Officer
Breen, Joseph J
Project Start
2018-07-23
Project End
2022-06-30
Budget Start
2020-07-01
Budget End
2021-06-30
Support Year
3
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of Louisville
Department
Biochemistry
Type
Schools of Medicine
DUNS #
057588857
City
Louisville
State
KY
Country
United States
Zip Code
40292