The goal of the proposed work is to further develop and integrate our software for the analysis, collection, exchange and storage (ACES) of all current and future immunogenomic data. The Human Leukocyte Antigen (HLA) region on human chromosome 6p21 is the most medically important region of the human genome. In the most successful application of precision medicine to date, matching of HLA genotypes is required for bone marrow and solid organ transplantation. HLA molecules have functional interactions with Killer cell Immunoglobulin-like Receptor (KIR) molecules, also recognized to play critical roles in transplantation and disease. The extensive genetic variation of these immunogenomic loci in human populations also makes them model systems in health disparities research. However, because these immunogenomic data have been generated using a wide variety of methods and under different nomenclature systems, cross-study data compatibility has remained an important and debilitating limitation to the field. Recognizing the need to consolidate the data and standardize data analysis in such a broad field, we have developed our Push Immunogenomics to the Next Generation (PING) data-generation system, Toolkit for Immunogenomic Data Exchange and Storage (TIDES) data-management platform, and our Bridging ImmunoGenomic Data-Analysis Workflow Gaps (BIGDAWG) data-analysis pipeline for highly polymorphic genomic data. When coupled with our Genotype List Service, Histoimmunogenetic Markup Language and Minimum Information for Reporting Immunogenomic NGS Genotyping reporting data-exchange standard, our ACES systems will represent a significant advance toward the goal of completely integrated exchange and storage of immunogenomic data. To extend our work of the prior period, we will (1) further develop the TIDES platform as the hub that integrates these systems, services and standards, centralize consensus sequence as the primary data type in immunogenomic research and clinical practice, and develop a service for gene feature enumeration (GFE) that describes HLA and KIR polymorphism by acknowledging underlying gene structure. To facilitate analysis of these data for research and clinical applications, we will (2) extend our BIGDAWG pipeline to handle new data input formats, incorporate new analyses, and export new data formats. BIGDAWG will be adapted as TIDES' primary analysis partner, and will directly, seamlessly accept TIDES outputs. Our tools are designed to maximize the ongoing utility of immunogenomic data for clinical and basic research science. As leaders in the immunogenomics field, we are uniquely suited to address the critical unmet need for standardized, robust ACES of immunogenomic data. The capacity of the systems described here to take these complex, highly polymorphic and medically important data along a complete pipeline from data generation through high-level statistical analysis will be a significant milestone toward the goal of ?bench to bedside? precision medicine.
The Human Leukocyte Antigen (HLA) and killer cell immunoglobulin-like receptor (KIR) genes display extremely high levels of variation compared to the rest of the human genome, and are central for understanding the genetic basis of disease and improving human health. The unified system of specialized software tools that we are building will standardize the analysis, collection, exchange and storage (ACES) of all current and future forms of data for the HLA and KIR genes, making these immunogenomic data easier to use for medical research, as well as for improving patient therapies.
Showing the most recent 10 out of 16 publications