Figures & data
Figure 1. Prediction performance (AUC) using different sliding window sizes for PDNA-62 and PDNA-224 over five-fold cross-validations..
![Figure 1. Prediction performance (AUC) using different sliding window sizes for PDNA-62 and PDNA-224 over five-fold cross-validations..](/cms/asset/8709f8dd-2093-42b7-a5eb-5c7ccdd2e4e2/tbeq_a_2122871_f0001_c.jpg)
Figure 2. An example of proposed 1-stage adjustment algorithm (a) and 2-stage adjustment algorithm (b).
![Figure 2. An example of proposed 1-stage adjustment algorithm (a) and 2-stage adjustment algorithm (b).](/cms/asset/639e9f3a-e228-4c7c-a07a-d1986cadcc2d/tbeq_a_2122871_f0002_c.jpg)
Table 1. Prediction performance of individual feature and different feature combinations on PDNA-62 and PDNA-224 over five-fold cross-validations.
Figure 3. Heatmap of of PDNA-62 (a) and PDNA-224 (b).
Note: The x-axis and y-axis denote the 20 amino acid types, and every element denotes a specific .
![Figure 3. Heatmap of PNRB−1 of PDNA-62 (a) and PDNA-224 (b).Note: The x-axis and y-axis denote the 20 amino acid types, and every element denotes a specific PNRB−1(x,y).](/cms/asset/7b4513ae-fb4b-4b08-9b8b-ccc8c6b29fe2/tbeq_a_2122871_f0003_c.jpg)
Table 2. Prediction performance of original LightGBM outputs, 1-stage adjustment, 2-stage adjustment and higher stage adjustment algorithm on PDNA-62 and PDNA-224 over five-fold cross-validations.
Figure 4. Threshold selection for 1-stage and 2-stage adjustment algorithm on PDNA-62 and PDNA-224 over five-fold cross-validations.
![Figure 4. Threshold selection for 1-stage and 2-stage adjustment algorithm on PDNA-62 and PDNA-224 over five-fold cross-validations.](/cms/asset/5d89b439-8897-4bdd-8183-abb35c7eb2f3/tbeq_a_2122871_f0004_c.jpg)
Table 3. Prediction performance of our proposed method and other state-of-art methods on PDNA-62 over five-fold cross-validations.
Table 4. Prediction performance of our proposed method and other state-of-art methods on PDNA-224 over five-fold cross-validations.
Figure 5. AUC comparison between our proposed method and other prediction methods on the independent testing set TS-72.
![Figure 5. AUC comparison between our proposed method and other prediction methods on the independent testing set TS-72.](/cms/asset/268eb0d0-955e-4a4e-afe8-7991ab902043/tbeq_a_2122871_f0005_c.jpg)
Figure 6. The prediction result of 3PVP_A of original LightGBM outputs (a), 1-stage adjustment algorithm (b) and 1-stage and 2-stage adjustment algorithm (c). The green spheres, yellow spheres and red spheres denote the true positive (TP) instances, false negative (FN) instances and false positive (FP) instances, respectively.
![Figure 6. The prediction result of 3PVP_A of original LightGBM outputs (a), 1-stage adjustment algorithm (b) and 1-stage and 2-stage adjustment algorithm (c). The green spheres, yellow spheres and red spheres denote the true positive (TP) instances, false negative (FN) instances and false positive (FP) instances, respectively.](/cms/asset/17e90197-653a-4c3b-a3ec-c52d82d2e579/tbeq_a_2122871_f0006_c.jpg)
Data availability statement
The data that support the findings of this study are openly available at https://github.com/tlsjz/DNAbinding.