Figures & data
Table 1. Definitions of reinforcement learning components pertaining to the LA and TA.
Figure 9. Histograms and probability distribution functions of arrangement period and interarrival time.
![Figure 9. Histograms and probability distribution functions of arrangement period and interarrival time.](/cms/asset/93f795fe-bb06-4a50-b583-45d491929276/tprs_a_1748247_f0009_oc.jpg)
Figure 13. Comparison of rearrangement results obtained using (a) the A3C algorithm and the record data, (b) the A3C and BLF algorithms and (c) the A3C and PSLAP algorithms.
![Figure 13. Comparison of rearrangement results obtained using (a) the A3C algorithm and the record data, (b) the A3C and BLF algorithms and (c) the A3C and PSLAP algorithms.](/cms/asset/fac4163b-1cd2-4743-aec9-cb948df855ae/tprs_a_1748247_f0013_oc.jpg)