Computational tools for the analysis of high-throughput immunoglobulin sequencing

Kleinstein, Steven

Abstract

The ability of our immune system to respond effectively to pathogenic challenge or vaccination depends on a diverse repertoire of Immunoglobulin (Ig) receptors expressed by B lymphocytes. Each Ig receptor is unique, having been assembled during lymphocyte development by somatic recombination of gene segments. During the course of an immune response, B cell that initially bind antigen with low affinity through their Ig receptor are modified through cycles of somatic hypermutation (SHM) and affinity-dependent selection to produce high- affinity memory and plasma cells. This affinity maturation is a critical component of T cell dependent adaptive immune responses. It helps guard against rapidly mutating pathogens and underlies the basis for many vaccines. Large-scale characterization of B cell Ig repertoires is now feasible in humans. Driven by the dramatic improvements in high-throughput sequencing technologies, these data are opening up exciting avenues of inquiry. Features of the B cell repertoire, including polymorphisms, biased segment usage and diversity, can be correlated with clinically relevant outcomes, such as susceptibility to infection or vaccination response. These data can also contribute to basic understanding of B cells and adaptive immunity. In particular, the ability to estimate positive and negative selection from Ig mutation patterns has broad applications not only for understanding the immune response to pathogens, but is also critical to determining the role of somatic hypermutation in autoimmunity and B cell cancers. Although promising, repertoire-scale data also present fundamental challenges for analysis requiring the development of new techniques and the rethinking of existing methods that are not scalable to the millions of sequences being generated. This proposal describes novel approaches for the analysis of high-throughput Ig sequencing data sets enabled through a combination of bioinformatics and statistics method development, computational modeling and sequence data-mining. New ways to characterize repertoire properties will be developed that have the potential for use as biomarkers for disease risk, diagnosis and prognosis. Specifically, methods will be developed to:
(Aim 1) group sequences into clones and improve V(D)J segment assignment, thus allowing identification of somatic mutations, (Aim 2) model SHM mutability and substitution patterns so they can be quantified and compared across groups, thus providing insights into underlying mutation mechanisms, and (Aim 3) quantify selection and characterize clonal diversity, providing information on affinity maturation and response dynamics. These methods will be validated through a combination of simulation-based studies, as well as testing on new experimental gold-standard data sets from both human and murine systems. All of the methods will be made widely available through web interfaces and distribution of open-source code.

Public Health Relevance

This project will develop and validate computational methods to analyze large-scale immunoglobulin sequencing data sets that have become possible with the advent of next-generation sequencing technologies. Through quantitative characterization of the immunoglobulin repertoire, these methods will provide insights into the mechanisms underlying autoimmune disease, as well as biomarkers for susceptibility to infection or vaccination response. In addition, methods to identify and analyze mutation patterns will be used to investigate the role of somatic hypermutation in physiological and pathological immune responses.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Allergy and Infectious Diseases (NIAID)
Type: Research Project (R01)
Project #: 5R01AI104739-03
Application #: 9054793
Study Section: Genomics, Computational Biology and Technology Study Section (GCAT)
Program Officer: Gondre-Lewis, Timothy A

Project Start: 2014-04-15
Project End: 2018-03-31
Budget Start: 2016-04-01
Budget End: 2017-03-31
Support Year: 3
Fiscal Year: 2016
Total Cost
Indirect Cost

Institution

Name: Yale University
Department: Pathology
Type: Schools of Medicine
DUNS #: 043207562

City: New Haven
State: CT
Country: United States
Zip Code

Related projects


NIH 2021 R01 AI	Computational Tools for the Analysis of High-Throughput Immunoglobulin Sequencing Experiments Kleinstein, Steven H. / Yale University
NIH 2020 R01 AI	Computational Tools for the Analysis of High-Throughput Immunoglobulin Sequencing Experiments Kleinstein, Steven H. / Yale University
NIH 2020 R01 AI	Computational Tools for the Analysis of High-Throughput Immunoglobulin Sequencing Experiments Kleinstein, Steven H. / Yale University
NIH 2019 R01 AI	Computational Tools for the Analysis of High-Throughput Immunoglobulin Sequencing Experiments Kleinstein, Steven H. / Yale University
NIH 2017 R01 AI	Computational tools for the analysis of high-throughput immunoglobulin sequencing Kleinstein, Steven H. / Yale University	$407,503
NIH 2016 R01 AI	Computational tools for the analysis of high-throughput immunoglobulin sequencing Kleinstein, Steven H. / Yale University
NIH 2015 R01 AI	Computational tools for the analysis of high-throughput immunoglobulin sequencing Kleinstein, Steven H. / Yale University	$407,602
NIH 2014 R01 AI	Computational tools for the analysis of high-throughput immunoglobulin sequencing Kleinstein, Steven H. / Yale University

Publications

Rydyznski, Carolyn E; Cranert, Stacey A; Zhou, Julian Q et al. (2018) Affinity Maturation Is Impaired by Natural Killer Cell Suppression of Germinal Centers. Cell Rep 24:3367-3373.e4

Ohm-Laursen, Line; Meng, Hailong; Chen, Jessica et al. (2018) Local Clonal Diversification and Dissemination of B Lymphocytes in the Human Bronchial Mucosa. Front Immunol 9:1976

Vander Heiden, Jason Anthony; Marquez, Susanna; Marthandan, Nishanth et al. (2018) AIRR Community Standardized Representations for Annotated Immune Repertoires. Front Immunol 9:2206

Nouri, Nima; Kleinstein, Steven H (2018) A spectral clustering-based method for identifying clones from high-throughput B cell repertoire sequencing data. Bioinformatics 34:i341-i349

Bukhari, Syed Ahmad Chan; O'Connor, Martin J; Martínez-Romero, Marcos et al. (2018) The CAIRR Pipeline for Submitting Standards-Compliant B and T Cell Receptor Repertoire Sequencing Studies to the National Center for Biotechnology Information Repositories. Front Immunol 9:1877

Nouri, Nima; Kleinstein, Steven H (2018) Optimized Threshold Inference for Partitioning of Clones From High-Throughput B Cell Repertoire Sequencing Data. Front Immunol 9:1687

Laidlaw, Brian J; Lu, Yisi; Amezquita, Robert A et al. (2017) Interleukin-10 from CD4+ follicular regulatory T cells promotes the germinal center response. Sci Immunol 2:

Gupta, Namita T; Adams, Kristofor D; Briggs, Adrian W et al. (2017) Hierarchical Clustering Can Identify B Cell Clones with High Confidence in Ig Repertoire Sequencing Data. J Immunol 198:2489-2499

Breden, Felix; Luning Prak, Eline T; Peters, Bjoern et al. (2017) Reproducibility and Reuse of Adaptive Immune Receptor Repertoire Data. Front Immunol 8:1418

Rubelt, Florian; Busse, Christian E; Bukhari, Syed Ahmad Chan et al. (2017) Adaptive Immune Receptor Repertoire Community recommendations for sharing immune-repertoire sequencing data. Nat Immunol 18:1274-1278

Showing the most recent 10 out of 26 publications

Comments

Be the first to comment on Steven Kleinstein's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: