5,066
Views
4
CrossRef citations to date
0
Altmetric
Report

Development of in silico models to predict viscosity and mouse clearance using a comprehensive analytical data set collected on 83 scaffold-consistent monoclonal antibodies

ORCID Icon, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , & show all
Article: 2256745 | Received 29 May 2023, Accepted 05 Sep 2023, Published online: 12 Sep 2023

Figures & data

Table 1. Data collected on the antibody panel.

Figure 1. Pairwise Spearman correlations for a selected set of in vitro assays. See table S3 for the complete set of pairwise correlations.

A visual cross-correlation matrix for 20 different assays in which the strength of the correlation is proportional to the size of a circle in the matrix and the direction of the correlation is represented by color on a blue-white-red scale with dark blue = perfect positive correlation, dark red = perfect negative correlation, and white = no correlation. The matrix diagonal (self-correlation) is shown as large blue circles; most other squares show small circles, with a pocket of larger blue circles in assays related to charge (heparin chromatography, zeta potential, poly-D-Lys score, PEI score).
Figure 1. Pairwise Spearman correlations for a selected set of in vitro assays. See table S3 for the complete set of pairwise correlations.

Figure 2. Homology modeling pipeline accuracy measured across all 23 experimental crystal structures.

Nine panels with six colorful curves each that rise from left to right to demonstrate the accuracy of structure prediction for different antibody regions (6 CDRs, the full Fv, VH, and VL). The brown DeepAb curve is generally furthest left in each panel, indicating higher accuracy across all antibody regions.
Figure 2. Homology modeling pipeline accuracy measured across all 23 experimental crystal structures.

Figure 3. Deamidation (a) and isomerization (b) rates for each indicated motif. Fractions indicate the # of sites with >/ = 2% modification after 4 weeks at 40°C over total number of CDR sites with coverage in the peptide mapping data.

Two greyscale bar charts showing number of sites with either deamidation or isomerization by CDR sequence motif. Three to four tall bars are clustered to the left in both plots with the rest of the positions at baseline, indicating a small number of sequence motifs account for most of the chemical modifications observed.
Figure 3. Deamidation (a) and isomerization (b) rates for each indicated motif. Fractions indicate the # of sites with >/ = 2% modification after 4 weeks at 40°C over total number of CDR sites with coverage in the peptide mapping data.

Table 2. Spearman rank correlations of the top features in each category that achieve correlations with measured viscosity of magnitude 0.4. Structure-based feature names are as defined by the MOE software package. All p-values are <0.001.

Table 3. Median test metrics of viscosity binary classifiers. One hundred random stratified train-test splits were performed for various feature combinations. Median test metrics of logistic regression models trained with an elastic net regularization penalty applied are reported. HT = high throughput experimental features (AC-SINS, DLS, and PEG precipitation). Structure = 112 structure-based features derived from structural models. AUC = area under the receiver-operator curve. MCC = Matthews correlation coefficient. PLR = positive likelihood ratio (true positive rate/false positive rate). The confusion matrix for the best model (HT + Structure) is as follows: true positives = 40. True negatives = 23. False positives = 5. False negatives = 3. Here, ‘positive’ refers to a molecule with low viscosity (<15 cP @ 150 mg/ml).

Table 4. Spearman rank correlations of the top features in each category that achieve correlations with measured PK AUCt of magnitude 0.4. Structure-based feature names are as defined by the MOE software package. All p values are <0.003.

Table 5. Median test metrics of PK clearance binary classifiers. One hundred random stratified train-test splits were performed for various feature combinations. Median test metrics of logistic regression models trained with an elastic net regularization penalty applied are reported. Experimental = (heparin chromatography, DLS, zeta potential, ammonium sulfate precipitation, PEG precipitation, BVP, membrane Prep assay, poly-D-Lysine assay, PEI assay, HIC, DSF, SEPAX, and AC-SINS). Structure = 112 structure-based features derived from structural models. AUC = area under the receiver-operator curve. MCC = Matthews correlation coefficient. PLR = positive likelihood ratio (true positive rate/false positive rate). The confusion matrix for the best model (structure) is as follows: true positives = 3. True negatives = 49. False positives = 1. False negatives = 2. Here, ‘positive’ refers to a molecule with high clearance ( 3.9 × 10Citation6 h x ng/mL).

Supplemental material

Supplemental Material

Download Zip (6.8 MB)