17,861
Views
70
CrossRef citations to date
0
Altmetric
Research Article

Deep reinforcement learning in production systems: a systematic literature review

ORCID Icon &
Pages 4316-4341 | Received 06 Apr 2021, Accepted 18 Aug 2021, Published online: 17 Sep 2021

Figures & data

Figure 1. Agent–environment interaction; Sutton and Barto (Citation2017).

Figure 1. Agent–environment interaction; Sutton and Barto (Citation2017).

Figure 2. Eight step approach to conduct an SLR.

Figure 2. Eight step approach to conduct an SLR.

Table 1. Taxonomy framework of the SLR.

Table 2. Defined keywords for the SLR.

Figure 3. Conducted review process.

Figure 3. Conducted review process.

Figure 4. Analysis of yearly deep RL publications, 2021 includes Jan./Feb. (a) Yearly and accumulated deep RL publications (b) Yearly deep and non-deep RL publications.

Figure 4. Analysis of yearly deep RL publications, 2021 includes Jan./Feb. (a) Yearly and accumulated deep RL publications (b) Yearly deep and non-deep RL publications.

Figure 5. Number of publications per outlet; 2010–2021.

Figure 5. Number of publications per outlet; 2010–2021.

Figure 6. Number of publications allocated to the production disciplines.

Figure 6. Number of publications allocated to the production disciplines.

Table 3. Summary of deep RL applications in process control.

Table 4. Summary of deep RL applications in production scheduling, dispatching, and (intra-) logistics.

Table 5. Summary of deep RL applications in assembly and robotics.

Table 6. Summary of deep RL applications in maintenance, energy management, and (process) design.

Table 7. Summary of deep RL applications in quality control and further applications.

Table 8. Summary of the key findings from the review analysis.

Figure 7. Quantitative analysis of applied algorithms and testing environments.

Figure 7. Quantitative analysis of applied algorithms and testing environments.