Integrated Exchange and Storage of Current- and Future-Generation Immunogenomic Data

Hollenbach, Jill; Mack, Steven

Abstract

The goal of the proposed work is to further develop and integrate our software for the analysis, collection, exchange and storage (ACES) of all current and future immunogenomic data. The Human Leukocyte Antigen (HLA) region on human chromosome 6p21 is the most medically important region of the human genome. In the most successful application of precision medicine to date, matching of HLA genotypes is required for bone marrow and solid organ transplantation. HLA molecules have functional interactions with Killer cell Immunoglobulin-like Receptor (KIR) molecules, also recognized to play critical roles in transplantation and disease. The extensive genetic variation of these immunogenomic loci in human populations also makes them model systems in health disparities research. However, because these immunogenomic data have been generated using a wide variety of methods and under different nomenclature systems, cross-study data compatibility has remained an important and debilitating limitation to the field. Recognizing the need to consolidate the data and standardize data analysis in such a broad field, we have developed our Push Immunogenomics to the Next Generation (PING) data-generation system, Toolkit for Immunogenomic Data Exchange and Storage (TIDES) data-management platform, and our Bridging ImmunoGenomic Data-Analysis Workflow Gaps (BIGDAWG) data-analysis pipeline for highly polymorphic genomic data. When coupled with our Genotype List Service, Histoimmunogenetic Markup Language and Minimum Information for Reporting Immunogenomic NGS Genotyping reporting data-exchange standard, our ACES systems will represent a significant advance toward the goal of completely integrated exchange and storage of immunogenomic data. To extend our work of the prior period, we will (1) further develop the TIDES platform as the hub that integrates these systems, services and standards, centralize consensus sequence as the primary data type in immunogenomic research and clinical practice, and develop a service for gene feature enumeration (GFE) that describes HLA and KIR polymorphism by acknowledging underlying gene structure. To facilitate analysis of these data for research and clinical applications, we will (2) extend our BIGDAWG pipeline to handle new data input formats, incorporate new analyses, and export new data formats. BIGDAWG will be adapted as TIDES' primary analysis partner, and will directly, seamlessly accept TIDES outputs. Our tools are designed to maximize the ongoing utility of immunogenomic data for clinical and basic research science. As leaders in the immunogenomics field, we are uniquely suited to address the critical unmet need for standardized, robust ACES of immunogenomic data. The capacity of the systems described here to take these complex, highly polymorphic and medically important data along a complete pipeline from data generation through high-level statistical analysis will be a significant milestone toward the goal of ?bench to bedside? precision medicine.

Public Health Relevance

The Human Leukocyte Antigen (HLA) and killer cell immunoglobulin-like receptor (KIR) genes display extremely high levels of variation compared to the rest of the human genome, and are central for understanding the genetic basis of disease and improving human health. The unified system of specialized software tools that we are building will standardize the analysis, collection, exchange and storage (ACES) of all current and future forms of data for the HLA and KIR genes, making these immunogenomic data easier to use for medical research, as well as for improving patient therapies.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Allergy and Infectious Diseases (NIAID)
Type: Research Project (R01)
Project #: 5R01AI128775-06
Application #: 9640391
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Bridges, Nancy D

Project Start: 2017-03-01
Project End: 2022-02-28
Budget Start: 2019-03-01
Budget End: 2020-02-29
Support Year: 6
Fiscal Year: 2019
Total Cost
Indirect Cost

Institution

Name: University of California San Francisco
Department: Neurology
Type: Schools of Medicine
DUNS #: 094878337

City: San Francisco
State: CA
Country: United States
Zip Code: 94118

Related projects


NIH 2021 R01 AI	Integrated Exchange and Storage of Current- and Future-Generation Immunogenomic Data Hollenbach, Jill Allison; Mack, Steven John / University of California San Francisco
NIH 2020 R01 AI	Integrated Exchange and Storage of Current- and Future-Generation Immunogenomic Data Hollenbach, Jill Allison; Mack, Steven John / University of California San Francisco
NIH 2020 R01 AI	COVID R PA-18-591 NOT-AI-20-031 Administrative Supplement to Integrated Exchange and Storage of Current- and Future-Generation Immunogenomic Data Hollenbach, Jill Allison; Mack, Steven John / University of California San Francisco
NIH 2019 R01 AI	Integrated Exchange and Storage of Current- and Future-Generation Immunogenomic Data Hollenbach, Jill Allison; Mack, Steven John / University of California San Francisco
NIH 2018 R01 AI	Integrated Exchange and Storage of Current- and Future-Generation Immunogenomic Data Hollenbach, Jill Allison; Mack, Steven John / University of California San Francisco
NIH 2017 R01 AI	Integrated Exchange and Storage of Current- and Future-Generation Immunogenomic Data Hollenbach, Jill Allison; Mack, Steven John / University of California San Francisco

Publications

Chang, Chia-Jung; Osoegawa, Kazutoyo; Milius, Robert P et al. (2018) Collection and storage of HLA NGS genotyping data for the 17th International HLA and Immunogenetics Workshop. Hum Immunol 79:77-86

Mack, Steven J; Udell, Julia; Cohen, Franziska et al. (2018) Correction: High resolution HLA analysis reveals independent class I haplotypes and amino-acid motifs protective for multiple sclerosis. Genes Immun :

Amorim, Leonardo M; Santos, Tiago H S; Hollenbach, Jill A et al. (2018) Cost-effective and fast KIR gene-content genotyping by multiplex melting curve analysis. HLA 92:384-391

Mack, Steven J; Udell, Julia; Cohen, Franziska et al. (2018) High resolution HLA analysis reveals independent class I haplotypes and amino-acid motifs protective for multiple sclerosis. Genes Immun :

Moore, Eugene; Grifoni, Alba; Weiskopf, Daniela et al. (2018) Sequence-based HLA-A, B, C, DP, DQ, and DR typing of 496 adults from San Diego, California, USA. Hum Immunol 79:821-822

Misra, Maneesh K; Damotte, Vincent; Hollenbach, Jill A (2018) Structure-based selection of human metabolite binding P4 pocket of DRB1*15:01 and DRB1*15:03, with implications for multiple sclerosis. Genes Immun :

Pappas, D J; Lizee, A; Paunic, V et al. (2018) Significant variation between SNP-based HLA imputations in diverse populations: the last mile is the hardest. Pharmacogenomics J 18:367-376

Grifoni, Alba; Weiskopf, Daniela; Lindestam Arlehamn, Cecilia S et al. (2018) Sequence-based HLA-A, B, C, DP, DQ, and DR typing of 714 adults from Colombo, Sri Lanka. Hum Immunol 79:87-88

Wagner, Ines; Schefzyk, Daniel; Pruschke, Jens et al. (2018) Allele-Level KIR Genotyping of More Than a Million Samples: Workflow, Algorithm, and Observations. Front Immunol 9:2843

Weiskopf, Daniela; Grifoni, Alba; Arlehamn, Cecilia S Lindestam et al. (2018) Sequence-based HLA-A, B, C, DP, DQ, and DR typing of 339 adults from Managua, Nicaragua. Hum Immunol 79:1-2

Showing the most recent 10 out of 16 publications

Comments

Be the first to comment on Jill Hollenbach's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: