Research Article

Reinforcement learning based optimal decision making towards product lifecycle sustainability

Pages 1269–1296 | Received 04 Sep 2020, Accepted 31 Dec 2021, Published online: 31 Jan 2022

Figures & data

Table 1. Summary of the literature on the decision-making for product lifecycle management.

Figure 1. AI for product lifecycle management.

Figure 2. The simplified product lifecycle from a maintenance perspective.

Figure 3. Some RL scenarios.

Figure 4. Example 1: hidden failure rate for Product-X.

Figure 5. Example 1: the simulation model.

Figure 6. Example 1: optimal policy.

Figure 7. Example 1: maintenance crossover.

Figure 8. Example 1: reward validation.

Figure 9. Example 1: mean rewards.

Figure 10. Example 1: MTTF.

Figure 11. Example 2: hidden failure rate with energy cost.

Figure 12. Example 2: the simulation model.

Figure 13. Example 2: optimal policy.

Figure 14. Example 2: maintenance crossover.

Figure 15. Example 2: reward validation.

Figure 16. Example 2: mean rewards.

Figure 17. Example 2: MTTF.

Figure 18. Comparison of learned and simulated failure rate.

Figure 19. RL extends learning to simultaneously solve a decision problem. Given just a problem state x_t and a reward r_t for that state, the agent will try to learn how to maximise the profits or minimise the costs over time. This can be visualised as a Bayesian network augmented with action nodes, where at each point in time t the agent has to make a decision based on the history of observed states and rewards.

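The state–reward–action loop described in this caption can be sketched in a few lines. The sketch below is a minimal illustration only, assuming a toy maintain-or-run environment; the names and numbers (MaintenanceEnv, the costs and rewards) are hypothetical and not taken from the paper.

```python
# Minimal sketch of the loop in Figure 19: at each time step t the agent
# observes a state x_t, receives a reward r_t, and picks an action so that
# the cumulative reward (profit minus cost) is maximised over time.
# All names and numbers here are illustrative assumptions, not the
# authors' implementation.

class MaintenanceEnv:
    """Toy stand-in for the product/maintenance simulation."""

    def __init__(self, n_states: int = 10):
        self.n_states = n_states
        self.state = 0  # wear level, 0 = as new

    def step(self, action: int):
        """action 0 = keep operating, action 1 = maintain (reset wear)."""
        if action == 1:
            self.state = 0
            reward = -5.0   # maintenance cost
        elif self.state == self.n_states - 1:
            reward = -50.0  # product has failed: large penalty
        else:
            self.state += 1
            reward = 1.0    # operating profit for one period
        return self.state, reward


def run_episode(env: MaintenanceEnv, policy, horizon: int = 36) -> float:
    """Roll a policy forward for `horizon` months and sum the rewards."""
    total, x_t = 0.0, env.state
    for _ in range(horizon):
        a_t = policy(x_t)          # decision based on the observed state
        x_t, r_t = env.step(a_t)   # environment returns next state and reward
        total += r_t
    return total


# Example usage: maintain whenever the wear level reaches 8.
# run_episode(MaintenanceEnv(), policy=lambda x: int(x >= 8))
```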

Figure A1. RL overview.

Figure B1. Q-Learning using interaction.

Figure B2. Q-Learning using raw data.

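Figures B1 and B2 contrast Q-learning driven by live interaction with Q-learning applied to pre-recorded raw data. The sketch below shows the same tabular update rule used in both settings; it is an illustration under assumed interfaces (an environment with reset()/step(), and a list of logged (s, a, r, s') transitions), not the paper's code.

```python
import numpy as np


def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.95):
    """Tabular Q-learning update: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])


def q_learning_interactive(env, n_states, n_actions, episodes=500, eps=0.1):
    """Figure B1 setting: the agent generates its own experience by acting."""
    Q = np.zeros((n_states, n_actions))
    rng = np.random.default_rng(0)
    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            # epsilon-greedy exploration
            a = int(rng.integers(n_actions)) if rng.random() < eps else int(np.argmax(Q[s]))
            s_next, r, done = env.step(a)
            q_update(Q, s, a, r, s_next)
            s = s_next
    return Q


def q_learning_from_log(transitions, n_states, n_actions, sweeps=50):
    """Figure B2 setting: replay logged (s, a, r, s_next) transitions from raw data."""
    Q = np.zeros((n_states, n_actions))
    for _ in range(sweeps):
        for s, a, r, s_next in transitions:
            q_update(Q, s, a, r, s_next)
    return Q
```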

Figure C1. Months 0–9.

Figure C2. Months 10–19.

Figure C3. Months 20–29.

Figure C4. Months 30–35.
