3,495
Views
17
CrossRef citations to date
0
Altmetric
Articles

Spatial arrangement using deep reinforcement learning to minimise rearrangement in ship block stockyards

, ORCID Icon &
Pages 5062-5076 | Received 03 Mar 2019, Accepted 21 Mar 2020, Published online: 20 Apr 2020

Figures & data

Figure 1. Block arrangement problem and operational processes in shipyards.

Figure 1. Block arrangement problem and operational processes in shipyards.

Figure 2. Decision processes in block stockyards arrangement problem.

Figure 2. Decision processes in block stockyards arrangement problem.

Table 1. Definitions of reinforcement learning components pertaining to the LA and TA.

Figure 3. Examples of (a) a grid and (b) rearrangement problem.

Figure 3. Examples of (a) a grid and (b) rearrangement problem.

Figure 4. LA training process based on the A3C approach.

Figure 4. LA training process based on the A3C approach.

Figure 5. Geographical information and schematic drawing of the target stockyard.

Figure 5. Geographical information and schematic drawing of the target stockyard.

Figure 6. Episode length and reward during a training TA for 200,000 episodes.

Figure 6. Episode length and reward during a training TA for 200,000 episodes.

Figure 7. Failure rate and average number of moved blocks according to area occupancy ratio.

Figure 7. Failure rate and average number of moved blocks according to area occupancy ratio.

Figure 8. Examples of rearrangement process using the trained TA.

Figure 8. Examples of rearrangement process using the trained TA.

Figure 9. Histograms and probability distribution functions of arrangement period and interarrival time.

Figure 9. Histograms and probability distribution functions of arrangement period and interarrival time.

Figure 10. Episode length and reward during a training LA for 11,000 episodes.

Figure 10. Episode length and reward during a training LA for 11,000 episodes.

Figure 11. Example policy for the trained LA.

Figure 11. Example policy for the trained LA.

Figure 12. Schedule for a single block stockyard.

Figure 12. Schedule for a single block stockyard.

Figure 13. Comparison of rearrangement results obtained using (a) the A3C algorithm and the record data, (b) the A3C and BLF algorithms and (c) the A3C and PSLAP algorithms.

Figure 13. Comparison of rearrangement results obtained using (a) the A3C algorithm and the record data, (b) the A3C and BLF algorithms and (c) the A3C and PSLAP algorithms.

Table 2. Number of rearrangement for heuristic algorithms and A3C algorithm.