Structurally Mapping Antibody Repertoires

Konrad Krawczyk; Sebastian Kelm; Aleksandr Kovaltsuk; Jacob D Galson; Dominic Kelly; Johannes Trück; Cristian Regep; Jinwoo Leem; Wing K Wong; Jaroslaw Nowak; James Snowden; Michael Wright; Laura Starkie; Anthony Scott-Tucker; Jiye Shi; Charlotte M Deane

doi:10.3389/fimmu.2018.01698

Structurally Mapping Antibody Repertoires

Front Immunol. 2018 Jul 23:9:1698. doi: 10.3389/fimmu.2018.01698. eCollection 2018.

Authors

Konrad Krawczyk¹, Sebastian Kelm², Aleksandr Kovaltsuk¹, Jacob D Galson³, Dominic Kelly³, Johannes Trück^{3

4}, Cristian Regep¹, Jinwoo Leem¹, Wing K Wong¹, Jaroslaw Nowak¹, James Snowden², Michael Wright², Laura Starkie², Anthony Scott-Tucker², Jiye Shi², Charlotte M Deane¹

Affiliations

¹ Department of Statistics, Oxford University, Oxford, United Kingdom.
² UCB Pharma, Slough, United Kingdom.
³ Division of Immunology, Children's Research Center, University Children's Hospital, Zurich, Switzerland.
⁴ Oxford Vaccine Group, University of Oxford, NIHR Oxford Biomedical Research Centre, Oxford, United Kingdom.

Abstract

Every human possesses millions of distinct antibodies. It is now possible to analyze this diversity via next-generation sequencing of immunoglobulin genes (Ig-seq). This technique produces large volume sequence snapshots of B-cell receptors that are indicative of the antibody repertoire. In this paper, we enrich these large-scale sequence datasets with structural information. Enriching a sequence with its structural data allows better approximation of many vital features, such as its binding site and specificity. Here, we describe the structural annotation of antibodies pipeline that maps the outputs of large Ig-seq experiments to known antibody structures. We demonstrate the viability of our protocol on five separate Ig-seq datasets covering ca. 35 m unique amino acid sequences from ca. 600 individuals. Despite the great theoretical diversity of antibodies, we find that the majority of sequences coming from such studies can be reliably mapped to an existing structure.

Keywords: B-cell receptor; antibody specificity; bioinformatics tools; next-generation sequencing; protein; structural homology.