5,121
Views
13
CrossRef citations to date
0
Altmetric
Report

Ab-Ligity: identifying sequence-dissimilar antibodies that bind to the same epitope

ORCID Icon, ORCID Icon, , ORCID Icon, , ORCID Icon, , & ORCID Icon show all
Article: 1873478 | Received 05 Oct 2020, Accepted 04 Jan 2021, Published online: 15 Jan 2021

Figures & data

Table 1. Number of positive and negative comparisons in the datasets, based on Ab-Ligity’s definition of similar epitopes

Table 2. Precision and recall on the full and CDRH3 ≤ 0.8 sets, using the selected paratope similarity thresholds for Ab-Ligity (0.1) and InterComp (0.6), based on Ab-Ligity’s definition of similar epitopes

Table 3. Precision and recall on the full set, using heavy chain and light chain only homology models and the predicted paratopes on the corresponding single domain

Figure 1. Analysis of anti-lysozyme (HEL) antibodies with dissimilar CDRH3 sequences and highly similar epitopes

(A) Structural superposition of three anti-lysozyme and one anti-TNFRSF4 antibodies co-crystallized with their antigens (lysozyme or TNFRSF4) in white. The three anti-lysozyme antibodies were HyHEL-26 (1NDM_BA_C), HyHEL-10 L-Y50F mutant (1J1O_HL_Y) and HyHEL-63 (1NBY_BA_C); and the anti-TNFRSF4 antibody is 3C8 (6OKM_HL_R). The antigens from the three anti-lysozyme antibody crystal structures are aligned. The legend shows the colors of the antibodies with their PDB codes followed by the heavy-light chain and antigen chain identifiers, separated by (‘_’). The CDRH3 sequences are displayed next to the respective antibody identifiers. Crystal paratope residues within the CDRs are in bold, and Parapred-predicted paratopes within the CDRs are underlined. The other five CDR sequences are listed in Supplementary Table S15. (B) Heatmaps of CDRH3 sequence identity and Ab-Ligity paratope similarity. The row and column labels correspond to the structures shown in (A). Ab-Ligity paratope similarity is calculated on the antibody model and predicted paratope as outlined in the Methods section. Pairs of antibodies with CDRH3 identity of >0.80 would have been considered similar by sequence-based metric. For Ab-Ligity, a similarity score of >0.1 suggests that the antibodies bind to highly similar epitopes.
Figure 1. Analysis of anti-lysozyme (HEL) antibodies with dissimilar CDRH3 sequences and highly similar epitopes

Figure 2. Analysis of two anti-gp120 antibodies. VRC01 antibody (PDB code and chain IDs: 4LSS_HL_G) is colored in blue; VRC07 (PDB code and chain IDs: 4OLU_HL_G) is shown in pink. The gp120 core antigen is displayed as white surfaces and the superimposed antibodies are in cartoons. The CDRH3 loops are in solid shades of the cartoon representation. The PDB code, heavy and light chain ID, and antigen chain ID are separated by (‘_’) and listed in the legend with the corresponding CDRH3 sequences. CDRH3 sequences are shown by aligning their IMGT positions and ‘-’ indicates a gap in the alignment according to the IMGT numbering scheme.Citation29 Crystal paratope residues within the CDRs are in bold, and Parapred-predicted paratopes within the CDRs are underlined. The other five CDR sequences are listed in Supplementary Table S16. The Ab-Ligity paratope similarity score for the pair is listed

Figure 2. Analysis of two anti-gp120 antibodies. VRC01 antibody (PDB code and chain IDs: 4LSS_HL_G) is colored in blue; VRC07 (PDB code and chain IDs: 4OLU_HL_G) is shown in pink. The gp120 core antigen is displayed as white surfaces and the superimposed antibodies are in cartoons. The CDRH3 loops are in solid shades of the cartoon representation. The PDB code, heavy and light chain ID, and antigen chain ID are separated by (‘_’) and listed in the legend with the corresponding CDRH3 sequences. CDRH3 sequences are shown by aligning their IMGT positions and ‘-’ indicates a gap in the alignment according to the IMGT numbering scheme.Citation29 Crystal paratope residues within the CDRs are in bold, and Parapred-predicted paratopes within the CDRs are underlined. The other five CDR sequences are listed in Supplementary Table S16. The Ab-Ligity paratope similarity score for the pair is listed

Figure 3. The Ab-Ligity workflow

(A) Binding site residues within 4.5 Å of the binding partner are tokenized as stated in . (B) All distances between tokenized points are calculated and hashed into 1.0 Å -wide distance bins for both the paratopes and epitopes. (C) Each pair of tokens is given a unique hash code. (D) A six-character hash code is generated for each triplet. (E) The hashes of a binding site are stored in a frequency table.
Figure 3. The Ab-Ligity workflow

Table 4. Residue groupings for tokenization

Supplemental material

Supplemental Material

Download MS Word (199.6 KB)