The Repertoire Dissimilarity Index as a method to compare lymphocyte receptor repertoires

Christopher R Bolen; Florian Rubelt; Jason A Vander Heiden; Mark M Davis

doi:10.1186/s12859-017-1556-5

The Repertoire Dissimilarity Index as a method to compare lymphocyte receptor repertoires

BMC Bioinformatics. 2017 Mar 7;18(1):155. doi: 10.1186/s12859-017-1556-5.

Authors

Christopher R Bolen^{1

2}, Florian Rubelt¹, Jason A Vander Heiden³, Mark M Davis^{4

5

6}

Affiliations

¹ Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, 94305, CA, USA.
² Genentech, Inc., 1 DNA Way, MS 93, South San Francisco, 94080, CA, USA.
³ Interdepartmental Program in Computational Biology and Bioinformatics, Department of Computational Biology & Bioinformatics, Yale University, New Haven, 06520, CT, USA.
⁴ Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, 94305, CA, USA. [email protected].
⁵ Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, 94305, CA, USA. [email protected].
⁶ Institute of Immunity, Department of Microbiology and Immunology, Transplantation and Infection, Stanford University School of Medicine, Stanford, 94305, CA, USA. [email protected].

Abstract

Background: The B and T cells of the human adaptive immune system leverage a highly diverse repertoire of antigen-specific receptors to protect the human body from pathogens. The sequencing and analysis of immune repertoires is emerging as an important tool to understand immune responses, whether beneficial or harmful (in the case of autoimmunity). However, methods for studying these repertoires, and for directly comparing different immune repertoires, are lacking.

Results: In this paper, we present a non-parametric method for directly comparing sequencing repertoires, with the goal of rigorously quantifying differences in V, D, and J gene segment utilization. This method, referred to as the Repertoire Dissimilarity Index (RDI), uses a bootstrapped subsampling approach to account for variance in sequencing depth, and, coupled with a data simulation approach, allows for direct quantification of the average variation between repertoires. We use the RDI method to recapitulate known differences in the formation of the CD4⁺ and CD8⁺ T cell repertoires, and further show that antigen-driven activation of naïve CD8⁺ T cells is more selective than in the CD4⁺ repertoire, resulting in a more specialized CD8⁺ memory repertoire.

Conclusions: We prove that the RDI method is an accurate and versatile method for comparisons of immune repertoires. The RDI method has been implemented as an R package, and is available for download through Bitbucket.

Keywords: Immunology; Nonparametric methods; Repertoire sequencing.

MeSH terms

Algorithms*
Base Sequence
CD4-Positive T-Lymphocytes / immunology
CD4-Positive T-Lymphocytes / metabolism*
CD8-Positive T-Lymphocytes / immunology
CD8-Positive T-Lymphocytes / metabolism*
Gene Rearrangement, T-Lymphocyte
Genetic Variation
Humans
Receptors, Antigen, T-Cell / chemistry*
Receptors, Antigen, T-Cell / genetics
Sequence Analysis, RNA / methods*
V(D)J Recombination

Substances

Receptors, Antigen, T-Cell

Abstract

MeSH terms

Substances

Grants and funding