4,012
Views
21
CrossRef citations to date
0
Altmetric
Original Articles

Integrated maintenance and spare part optimization for moving assets

ORCID Icon, &
Pages 230-245 | Received 13 Apr 2016, Accepted 20 Mar 2017, Published online: 21 Jun 2017

ABSTRACT

We consider an integrated maintenance and spare part optimization problem for a single critical component of a moving asset for which the degradation level is observable. Degradation is modeled as a function of the current operating mode, mostly dictated by the actual location of the moving asset. The spare part is stocked at the home base that the moving asset eventually visits. Alternatively, the spare part can be stocked on-board the moving asset to prevent costly expedited deliveries. The costs associated with spare part deliveries and part replacements depend on the operating mode. Our objective is to minimize the expected total discounted cost of spare part deliveries, part replacements, and inventory holding over an infinite planning horizon. We formulate the problem as a Markov decision process and characterize the structure of the optimal policy, which is shown to be a bi-threshold policy in each operating mode. Our numerical experiments show that the cost savings obtained by the integrated optimization of spare part inventory and part replacement decisions are significant. We also demonstrate the value of the integrated approach in a case study from the maritime sector.

1. Introduction

Maintenance and spare parts holding costs are known to be significant contributors to the overall operating costs for many moving assets, such as maritime assets (e.g., ships, vessels, submarines), aircraft, commercial vehicles (e.g., trucks, buses, trains), and military equipment (e.g., frigates, strike-fighters). For maritime assets, maintenance activities can constitute between 25 and 35% of the operating costs (Turan et al., Citation2009). For commercial airlines, maintenance accounts for around 10% of an airline's total costs (Lam, Citation1995). The total spare parts inventory in the aviation industry was estimated to be 45 billion U.S. dollars in 1995 (Flint, Citation1995). In addition to the significant costs associated with maintenance and spare parts inventory, unexpected downtimes can lead to a significant loss of revenue and can affect health, safety, and the environment. For instance, daily shipping operation of a tanker can yield as much as $20 500, which would be lost during downtime (United Nations Conference on Trade and Development, Citation2013). Downtime of naval ships can seriously affect national security. For such assets, maintenance and spare parts optimization is essential to decrease overall operating costs and to increase asset availability.

A component is called critical when the consequences associated with its failure are significant. In practice, such components are usually maintained by the so-called repair-by-replacement strategy; i.e., the component is removed from the asset and replaced by a new or as-good-as-new spare part. This reduces the maintenance execution time and the resulting downtime (Van Houtum and Kranenburg, Citation2015). In this setting, the availability of spare parts is crucial, as emergency shipments and the downtime associated with waiting for the spare parts may be very costly. Traditionally, maintenance and spare parts inventory decisions are made separately. However, a separate optimization may lead to poor solutions, as it ignores the interdependencies between the two decisions. In general, integrated optimization provides remarkable improvement in terms of cost and availability compared with separate optimizations (Kabir and Al-Olayan, Citation1996; van Horenbeek et al., Citation2013).

For many technical systems, it is possible to measure parameters of the critical components that reveal the actual degradation behavior (van Houtum and Kranenburg, Citation2015). This enables the prevention of failures by means of Condition-Based Maintenance (CBM); i.e., by executing a preventive replacement at the moment that a pre-specified degradation threshold is exceeded. In practice, measurements can be performed via periodic inspections or advanced sensor technologies. Physical degradation models investigate the quantitative relation between degradation, usage, and environment (see, e.g., Tinga (Citation2010) and Tinga and Janssen (Citation2013)). In this article, we consider a critical component of a moving asset for which the degradation level is observable and can be modeled as a function of the current operating mode. Operating modes represent the current location of the moving asset, how the component is used, and under what environmental conditions it operates. For many moving assets, the sequence and duration of operating modes are more realistically modeled as random variables (see, e.g., Alam and Al-Saggaf (Citation1986) and Çekyay and Özekici (Citation2015)).

Under the commonly used repair-by-replacement strategy, the spare part must be on-board the moving asset to allow replacement of the component. In practice, the spare part is usually available at a fixed location in the home base, which is taken to be a location that the moving asset is expected to eventually visit. Depending on the application, the home base can be a harbor, hangar, or garage. Typically, the cost associated with delivering a spare part is negligible when the moving asset is already in the home base. However, the cost of the delivery of a spare part can become significant when the asset is operating at a remote location. Such a delivery is made by means of an aircraft, helicopter, boat, dedicated vehicle, etc. To prevent expensive deliveries, the spare part can be stocked on-board the moving asset. A holding cost is incurred if the spare part is kept on-board the asset. This holding cost can become relatively high, due to several factors including the spare part's unit-value, the risk of obsolescence, and limited space availability on-board the asset.

The work reported in this article is motivated by our collaboration with Fugro Marine Services (FMS). Fugro is the world's largest integrator of geotechnical, survey, subsea, and geosciences services. A major part of its turnover relates to exploration of the seafloor. For these operations, survey vessels are employed. The availability of these vessels is of major importance, as the activities performed generate significant revenues. FMS is Fugro's in-house vessel management company. FMS manages maintenance-related activities on survey vessels and other types of vessels. We considered a critical component of one of these vessels in the case study presented in Section 6. FMS considered the component under consideration to be reliable, based on the specifications of the Original Equipment Manufacturer (OEM). Thus, the replacement of this component was planned to be performed on a time-based schedule and it was decided to not to hold the spare part on-board the vessel. However, the component failed before its preventive replacement time while the vessel was operating at a remote location. The expedited delivery of the spare part incured an approximate cost of €10 000. In addition, the downtime cost due to a 1-day delay in operations, which significantly varied over time, was within the range €20 000–€50 000 in this case. This situation could have been avoided if the spare part of this relatively cheap component was held on-board the vessel. On the other hand, it was neither realistic nor economic to always hold spare parts on-board a vessel. First of all, the available space on-board a vessel is usually very limited. Some of the critical components are very expensive, resulting in significant holding costs. This shows that there is a need for a structured approach to make the decisions on when to put a spare part on-board a vessel and when to replace the component just-in-time. Such an approach should take into account the randomness in the sequence and the duration of operating modes, as well as the relation among operating modes, component degradation, and cost parameters. To the best of our knowledge, this problem has not been studied in the literature.

We formulate the above-presented problem as a Markov decision process model, in which the objective is to minimize the expected total discounted cost. Given that many moving assets have very long service times, we develop an infinite-horizon model. We characterize the structure of the optimal policy, which is found to be a bi-threshold policy in each operating mode. Moreover, we provide insights regarding the performance of the optimal policy compared with four benchmark policies. In these benchmark policies, replacement decisions are optimized and spare part inventory decisions are pre-defined as follows: never keep a spare part on-board a vessel or always keep a spare part on-board a vessel. Our numerical experiments show that, compared with the considered benchmark policies, the cost savings obtained by the optimal policy can be significant.

This article is organized as follows. Section 2 presents the related literature. Section 3 formulates the problem as a Markov decision process model. Section 4 characterizes the structure of the optimal integrated policy. Section 5 provides numerical experiments and several insights regarding the performance of the optimal policy compared with four benchmark policies. Section 6 presents a case study for demonstration purposes. Section 7 draws some conclusions and suggests potential future research directions.

2. Literature review

This article relates to three main research streams, namely, CBM optimization for components subject to Markovian degradation, maintenance optimization of mission-based systems, and integrated maintenance and spare parts inventory optimization.

A large body of the CBM literature models degradation as a Markov process (Elwany et al., Citation2011). The earliest and basic replacement models for components subject to Markovian degradation can be found in Kolesar (Citation1966), Derman (Citation1970), and Kawai et al. (Citation2002). For these models, it is shown that under some monotonicity assumptions the optimal replacement policy that minimizes the expected total discounted cost in an infinite time horizon is a threshold-type policy. This means that there exists a threshold with respect to the degradation level above which the optimal decision is to replace the component and below which the optimal decision is to do nothing. Over the last decades, the structure of optimal replacement policies have been investigated in different problem settings (see, e.g., Makis and Jiang (Citation2003), Kurt and Kharoufeh (Citation2010); Elwany et al. (Citation2011)). Çekyay et al. (Citation2011) reviewed CBM models with Markovian degradation, for which threshold-type policies are optimal. They showed that these policies are not necessarily optimal in a variety of cases and the optimal policy may have a rather complex structure.

There is a vast literature on systems performing missions that are composed of different phases or stages. In the literature, these systems are called phased-mission systems or mission-based systems. These systems are analogous to those that are subject to different operating modes and mission environments. Many papers have focused on reliability and availability analysis for such systems (see, e.g., Esary and Ziehms (Citation1975), Kim and Park (Citation1994), Mura and Bondavalli (Citation1999), and Kharoufeh et al. (Citation2010)). To the best of our knowledge, maintenance and replacement models for mission-based systems and for systems subject to randomly varying environments are limited. Waldmann (Citation1983) investigated the structure of an optimal replacement policy for a system subject to stochastic degradation in a random environment. The author provided sufficient conditions to establish the optimality of threshold-type policies. Özekici (Citation1995) modeled the environment using a semi-Markov process and used the intrinsic aging concept for degradation. The intrinsic age represents the cumulative hazard accumulated over time during the operation of the component in the randomly varying environment. Under increasing failure rate distribution functions in all environments and reasonable cost structures, the author showed that the threshold-type intrinsic age replacement and repair policies are optimal. In a similar setting, Zhang et al. (Citation2013) and Çekyay and Özekici (Citation2015) provided extensions to deal with the multi-component case. Ulukus et al. (Citation2012) modeled the environment as a finite-state Markov process. They assumed that the system accumulates degradation at a linear rate whose value depends on the environment. They showed that there exists an optimal threshold-type replacement policy for each environment. Considering similar degradation dynamics, Flory et al. (Citation2015) extended the work of Ulukus et al. (Citation2012) to partially observable environments. Çekyay and Özekici (Citation2012) defined both the mission process and the degradation process as finite-state Markov processes where the generator of the degradation process depends on the phases of the mission. They discussed optimal repair and replacement problems and characterized the optimal policies under some monotonicity assumptions. In this article, our assumptions regarding mission and degradation processes are similar. The mission process (operating modes) is described as a finite-state Markov process. The degradation process of the component is defined by another finite-state Markov process and is modulated by the mission process. Different from the existing papers in this research stream, we investigate an integrated problem by taking the spare part inventory decision into account.

In the last decades, several integrated maintenance and spare parts inventory optimization models have been developed. Many papers in this research stream rely on the assumption that components' lifetime/reliability distributions are given. Armstrong and Atkins (Citation1996) considered a single component subject to age-based replacement. They investigated the optimal combination of replacement and ordering time that minimizes the total replacement and inventory costs. Armstrong and Atkins (Citation1998) provided several extensions to the previous work, generalizing the cost terms and the replenishment time of spare parts. Kabir and Al-Olayan (Citation1996) proposed a policy for integrated optimization of age-based replacement and spare parts ordering. They considered a number of identical components. Their simulation model reported a remarkable improvement on the total cost compared with separate optimization. To the best of our knowledge, only a few articles are available for components subject to CBM. Wang et al. (Citation2008) considered a number of identical components and developed a condition-based replacement and spare parts ordering policy. They modeled the degradation of components as discrete-time Markov chains. They used Monte Carlo simulation to evaluate the performance of the proposed policy. Wang et al. (Citation2009) combined Monte Carlo simulation and a genetic algorithm to determine the optimal parameters of the condition-based replacement and spare parts ordering policy. Elwany and Gebraeel (Citation2008) enabled the use of sensor information and proposed a method that dynamically updates replacement and ordering decisions based on the physical condition of the equipment. They highlighted the advantages of using the proposed methodology compared with that of Armstrong and Atkins (Citation1996). We refer the reader to van Horenbeek et al. (Citation2013) for a complete review of this research stream. Unlike our work, existing research does not investigate an integrated maintenance and inventory optimization problem from a moving asset perspective. To the best of our knowledge, none of these models incorporates the relation among operating modes, component degradation, and cost parameters.

3. Markov decision process formulation

In this section, we present the Markov decision process formulation of our integrated maintenance and spare part optimization problem.

Let I be a non-empty finite set representing different operating modes of the moving asset. Operating modes dictate the current location of the moving asset, how the component is used, and under which environmental conditions it operates. For example, operating modes can be defined as “wait in the home base,” “transit to location l,” “perform mission m in environment e,” etc. We assume that the operation process evolves as a Continuous-Time Markov Chain (CTMC), X ≡ {Xt, t ⩾ 0}, on the discrete state space I. Therefore, the duration of operating mode iI is assumed to be exponentially distributed with rate μi. The probability that the system jumps from operating mode iI to operating mode kI is denoted by Q(i, k). Generally, Q(i, i) = 0 for all iI, but technically one can allow Q(i, i) to be larger than zero. As further demonstrated in Section 6, the operation process of a real-life system can be well represented by such a Markov process.

The degradation of the component also evolves as a CTMC, Y ≡ {Yt, t ⩾ 0} on a discrete state space J ≡ {0, 1, …, F}, representing degradation levels from perfect (0) to failure state (F), where F > 1. Transitions between degradation levels are from j to j + 1 for , where The degradation process is modulated by the operation process; i.e., transition rate λij for the transition from j to j + 1 depends on the current operating mode iI. We note that the number of degradation levels and operating modes can be arbitrarily large. Therefore, this constitutes a fairly general degradation process.

We assume that at most one spare part can be stocked on-board the moving asset. This assumption is not unrealistic considering the limited space availability on-board the vessel. Moreover, the component lifetime is usually much higher than the time between two successive home base visits; i.e., in practice, it is not likely to observe two or more failures during one mission. We denote the number of spare parts on-board the vessel by uU, where U ≡ {0, 1}.

We define the state of the Markov decision process as (i, j, u) ∈ I × J × U, a realization of the joint process (X, Y, Z) where X denotes the current operating mode, Y is the current level of degradation, and Z is the number of spare parts on-board the vessel. We restrict our attention to a model where decisions are only made at transition instants. A decision maker observes the system at each decision epoch (i.e., each time instant at which a transition occurs) and makes decision a ∈ {0, 1, 2}, which can be one of the following:

0:

Do nothing

1:

Deliver a spare part

2:

Replace the part

The set of possible actions A(i, j, u) in state (i, j, u) ∈ I × J × U is defined as

At a decision epoch, action 0 (do nothing) can be chosen if the component has not failed. Action 1 (spare part delivery) is possible if there is no spare part on-board the vessel. Action 2 (part replacement) can be performed only if the spare part is already on-board the vessel.

We assume that a part replacement or a spare part delivery can be scheduled to occur immediately and instantaneously. Therefore, action 1 brings the system from state (i, j, 0) to state (i, j, 1) immediately. Similarly, action 2 brings the system from state (i, j, 1) to state (i, 0, 0) immediately. After each transition, a new action can be taken. That is, a spare part delivery can be immediately followed by a part replacement and vice versa. For example, if the component failed when the spare part was not on-board the vessel, a spare part delivery and a part replacement should take place consecutively, bringing the system from state (i, F, 0)  to state (i, F, 1)  and then from state (i, F, 1)  to state (i, 0, 0)  instantaneously. In this case, both actions take place at the same time instant.

The assumption of instantaneous replacements and deliveries is reasonable when the duration of these actions is much shorter than the mean component lifetime. In practice, even short downtimes can lead to significant costs. In this model, the downtime is translated into cost parameters. We distinguish a part replacement cost and a spare part delivery cost. At the time that a preventive replacement occurs in operating mode iI, the replacement cost cpri > 0 is charged. If the system is found to be failed, then a corrective replacement must be performed at a corrective replacement cost ccri, where ccricipr for iI. The replacement cost parameters cpri and ccri include the cost of labor and the cost of downtime due to preventive and corrective replacement actions, respectively.

For a spare part delivery that takes place when the system is in operating mode iI, a preventive spare part delivery cost cpdi ⩾ 0 is incurred. The cost of a preventive delivery includes the price of the spare part and the transportation cost. The transportation cost depends on the location of the moving asset; i.e., on the operating mode. We assume that a spare part is always available at the home base. By definition, the moving asset eventually visits the home base. When the moving asset is at its home base, the spare part can be directly put on board with a negligible transportation cost (regular delivery). If the moving asset is at a remote location at the moment a spare part is requested, the delivery should be made by means of an aircraft, helicopter, boat, dedicated vehicle, etc. (expedited delivery). We translate the difference between regular and expedited deliveries into cost terms, by differentiating the spare part delivery cost with respect to operating modes. In principle, preventive deliveries do not cause downtime if the component has not failed. If the component has failed, additional costs may be incurred, due to the cost of an emergency shipment and the cost of downtime while waiting for the spare part. To represent these additional costs, we introduce a corrective spare part delivery cost ccdi, where ccdicipd for iI. Moreover, holding the spare part on-board the moving asset incurs a holding cost h > 0 per time unit.

We apply uniformization to convert the continuous-time Markov decision problem into an equivalent discrete-time Markov decision problem. We add fictitious transitions from state to itself, ensuring that the total rate out of a state is equal for all states . The latter refers to the so-called uniformization rate (see Alagoz and Ayvaci (Citation2010)). We select a positive uniformization rate v such that where vij = μi + λij. In our Markov decision model, how long the system stays in a state depends on the action taken. Under uniformization, if action 0 is taken, the next transition (and decision epoch) is after an exponentially distributed time with mean 1/v in each state . In this case, one of the following transitions can occur at the subsequent transition instant: (i) the system can jump from degradation level to j + 1 with probability λij/v; (ii) the system can jump from operating mode iI to operating mode kI, ki with probability μiQ(i, k)/v; or (iii) the system can occupy the same state after transition with probability 1 − vij/v. If action 1 or 2 is taken, the transition to the subsequent state is instantaneous. Due to the latter, our formulation is slightly different from the standard Markov decision process formulation in Puterman (Citation1994), in which all transitions are as described for action 0. For illustration purposes, we depict an exemplary process in Appendix A.

We assume a continuous discount rate α > 0 so that any cost incurred at some future time t is discounted by a factor e− αt. Hence, the discount factor in the uniformized process can be defined as

The discounted holding cost of keeping a spare part on board during t time units is

If there is a spare part on-board the vessel and action 0 is taken, then the expected discounted holding cost between two decision epochs in the uniformized process is

The expected total discounted cost between two decision epochs in the uniformized process r(i, j, u, a) if action a is taken in state (i, j, u) is equal to where H(i, j, u, a) is the lump sum equivalent of the continuous cost and C(i, j, u, a) is the lump sum cost of action aA(i, j, u) in state (i, j, u):

We note that the uniformized process and the original process have the same lump sum cost under instantaneous transitions. Let V(i, j, u) be the value function representing the minimum expected total discounted cost using the optimal policy if the operating mode is i, the degradation level is j, and the number of spare parts on-board the vessel is u. The optimal replacement problem can be formulated by the following dynamic programming equation: (1) where operators Γ0, Γ1, and Γ2 refer to the actions of doing nothing, a spare part delivery, and a part replacement, respectively: (2) (3) (4)

Equation (Equation2) models possible future random events following the decision to do nothing. It consists of three types of transitions. The first one is the transition in operating modes, bringing the system from state (i, j, u) to (k, j, u) where i, kI, ik. The second one represents the component's degradation, bringing the system from state (i, j, u) to (i, j + 1, u) where The third one represents the transitions from state (i, j, u) to itself, which is a consequence of the uniformization procedure. Equation (Equation3) corresponds to the decision to deliver a spare part, in which case the system jumps immediately from state (i, j, 0)  to state (i, j, 1). Finally, Equation (Equation4) represents the replacement decision that resets the number of spare parts on-board the vessel and the degradation level to zero. Under Equations (Equation3) and (Equation4), at each decision epoch, a part replacement can immediately follow a spare part delivery and vice versa. We note that the state space of the model is discrete, the action space A(i, j, u) ∈ {0, 1, 2} is finite for each (i, j, u) ∈ I × J × U. The model satisfies the conditions of Theorem 6.2.10 in Puterman (Citation1994), which establishes the existence of an optimal deterministic stationary policy and the convergence of the value iteration algorithm to the optimal value. A value iteration algorithm that is designed to deal with immediate and instantaneous actions is given in Appendix B.

4. Optimal policy

In this section, we analyze the structure of the optimal policy. We start by stating three theorems that establish monotonicity results related to the component's degradation level. From this, we derive the structure of the optimal policy, which is an operating mode–dependent bi-threshold policy. The proofs of the theorems are given in Appendices C to E.

Theorem 1.

V(i, j, u) is non-decreasing in jJ for each iI and uU.

This theorem states the following. Consider two components in operating mode iI while uU spare parts are on-board the vessel. If the degradation level of the first component is higher than the second one, then the minimum expected total cost of the first component cannot be less than the minimum expected total cost of the second component.

Theorem 2.

V(i, j, u) is submodular in (j, u) ∈ J × U; that is,

In Theorem 2, we consider the difference between the two cases: (i) there is a spare part on-board the vessel and (ii) there is no spare part on-board the vessel. We show that in a certain operating mode, when a component's degradation increases, the increase in the minimum expected total cost in case (i) cannot be greater than that of case (ii). In other words, if the spare part is not on-board the vessel, the cost increase resulting from the component's degradation is higher compared with the case where the spare part is on-board the vessel.

Theorem 3.

In each iI, there exists an optimal part replacement threshold ψ(i) ⩽ F and an optimal spare part delivery threshold ξ(i) ⩽ F such that

This theorem shows that the optimal policy is an operating mode–dependent bi-threshold policy on the degradation level.

Remark 1.

The spare part delivery threshold can be greater than or equal to the part replacement threshold, which may seem counterintuitive in the sense that one would expect a spare part delivery to precede a part replacement in all operating modes. This is because whether a replacement decision is made or not depends on the number of spare parts on-board the vessel. If there is no spare part on-board the vessel at a certain degradation level and operating mode, the additional cost of delivering a spare part may not repay the risk of a failure. However, if there is already a spare part on-board the vessel for the same degradation level and operating mode, it may be optimal to replace the component. Appendix F provides an example supporting this remark.

5. Numerical experiments

In this section, we execute numerical experiments in order to test the performance of the optimal policy and to assess the value of integrated spare part and maintenance optimization. We compare the optimal policy to four benchmark policies, in which replacement decisions are not integrated with spare part optimization. These policies simplify spare part inventory decisions by “never” or “always” keeping the spare part stock on-board the vessel. Replacement decisions are optimized in all benchmark policies.

In the first benchmark policy, the spare part stockis never kept on-board the vessel. That is, a spare part delivery is always followed by a part replacement. In addition, preventive deliveries are not allowed when the moving asset is away from its home base. As a consequence, preventive part replacements can only occur at the home base. We refer to this policy as never spare part policy (NP). Denoting the home base by i = 0, spare part delivery and part replacement thresholds in NP are such that ξ(i) = ψ(i) = F for all iI\{0} and ξ(0) = ψ(0) ⩽ F. The second policy is similar to NP, the only difference being that preventive deliveries are allowed when the moving asset is away from the home base. We call this policy a never spare part policy with preventive deliveries (NPP). In NPP, the spare part delivery threshold becomes equal to the part replacement threshold in each operating mode; i.e., ξ(i) = ψ(i) ⩽ F for all iI. In the third policy, the spare part is put on-board the vessel whenever the asset is at its home base (if it is not already on-board). We refer to this policy as always spare part policy (AP). In AP, preventive deliveries in operating modes other than the home base are not allowed. That is, the spare part delivery threshold is zero at the home base and is equal to the failure threshold in all other operating modes; i.e., ξ(0) = 0 and ξ(i) = F for all iI\{0}. Finally, we relax AP to allow preventive deliveries outside the home base and call this policy as the always spare part policy with preventive deliveries (APP). As such, APP enables more than one preventive replacement in between home base visits. In APP, ξ(0) = 0 and ξ(i) ⩽ F for all iI\{0}. The setup of the experiments is described in Section 5.1 and the results are discussed in Section 5.2.

5.1. Setup

We consider a critical component with four operating states I ≡ {0, 1, 2, 3} where 0 represents the home base of the moving asset, 1 and 3 intermediate states, and 2 the mission state. Intermediate states may correspond to transit states from the home base to the mission site or vice versa. The operating mode transition probability matrix is defined as in , representing a deterministic cyclic sequence of operating modes.

Table 1. Operating mode transition probability matrix.

Three alternatives are considered regarding the operating mode transition rates (see ). Alternative 1 fits to a commercial setting similar to the real-life case that we present in Section 6. In that case, missions and home base visits have short durations. Alternative 2 reflects our observations from the defense industry (Navy ships) where mission durations and home base visits are long. To enrich our comparative analysis, we also consider Alternative 3, in which mission durations are very long compared with home base visits.

Table 2. Operating mode transition rates.

Depending on the component, degradation can be uniform in different operating modes, high during missions, or high during home base visits. For example, power on/off switching has a significant influence on degradation for some electronic components; in this case, degradation rates may be higher at the home base where the systems are often switched on/off. On the other hand, if the degradation is linearly related to operating time and the component mostly operates during missions, the degradation would be higher in missions (cf. the case study in Section 6). In these numerical experiments, we consider 10 degradation levels J ≡ {0, 1, …, 9} and different degradation profiles for each operating mode iI as given in .

Table 3. Degradation rates.

We take the same preventive replacement costs cpri = €1000 for all operating modes iI. The cost parameters considered for corrective replacement ccri, transportation ctri, and the spare part's price csp aregiven in . We note that the preventive spare part delivery cost consists of the cost of transportation and the spare part's price; i.e., cpdi = citr + csp in operating mode iI. The additional spare part delivery cost in the case of failure cadi is the difference between preventive and corrective deliveries, reflecting a possible downtime cost or the additional cost of an emergency shipment; i.e., ccdi = cipd + cadi in operating mode iI. Three alternatives are considered for the annual holding cost rate hra, 10% (low), 25% (medium), and 50% (high). The annual holding cost is calculated by multiplying the annual holding cost rate with the cost of the spare part; i.e., h = hracsp. The annual discount rate of money is taken as 2%, resulting in the continuous rate α = −ln (0.98). We note that the considered cost ranges are based on our observations from real-life cases that consist of both commercial (survey vessels) and defense equipment (the Navy ships) in the maritime sector. Overall, we obtain 36 × 2 = 1458 instances with the combination of different alternatives considered. The value iteration algorithm given in Appendix B was coded in C on a DELL computer with Intel Core i5-4300U processor (1.90 – 2.50 GHz) and 4.00 GB RAM.

Table 4. Cost parameters in thousand €.

5.2. Results

We assess the valueof the integrated approach by comparing its performance with the four benchmark policies in terms of average and maximum cost increase. The cost increase was calculated by CS = (CC*)/C*, where C is the expected total discounted cost under the benchmark policy and C* is the optimal cost obtained by our integrated maintenance and spare part optimization model. In addition, we calculated the percentage of instances for which the expected total discounted cost obtained under the benchmark policy coincides with the optimal cost (denoted by OC %).

Our numerical experiments show that the integrated approach has significant value. As shown in , the average cost increases under the benchmark policies NP, NPP, AP, and APP are 78, 20, 30, and 27%, respectively. The benchmark policies are not very likely to give the optimal cost. The percentage of instances for which the optimal cost coincides with the cost obtained by NP, NPP, AP, or APP is 5, 22, 2, or 12%, respectively. Not surprisingly, NP and AP are outperformed by NPP and APP, respectively. The average cost performance of NPP is found to be better than that of APP. In addition, the never spare part policies (NP and NPP) are more likely to give the optimal cost than the always spare part policies (AP and APP) for most of the cases. Nevertheless, the maximum cost increases in the never spare part policies are significantly higher compared with always spare part policies, meaning that the latter are usually more robust. In what follows, we summarize our observations regarding the performance of benchmark policies in different parameter settings.

Table 5. Cost increases under the benchmark policies compared to the optimal policy.

Observation 1.

Our numerical experiments show that APP is near-optimal when the spare part is cheap. NPP is well-performing if the spare part is expensive or the operating mode transition rates are high or transportation costs are low. For the above-mentioned cases, the associated benchmark policy could be preferred, due to its practical benefits (e.g., ease of implementation and communication of the spare part inventory decisions). For all other cases, the optimal policy is advisable, as the benchmark policies would lead to significant cost increases.

Observation 2.

When the spare part is never stocked on-board the vessel, relying on preventive replacements at the home base significantly deteriorates the cost performance. The cost performance of NP is significantly worse compared with NPP when the transportation costs are low, corrective replacement and corrective delivery costs are high, degradation is fast during missions, and the moving asset is mostly performing a mission. Moreover, the cost performance of NP is non-monotonous with respect to transportation costs. This can be explained as follows. When transportation costs are low, preventive deliveries are favorable away from the home base; i.e., we often observe that ξ(i) < F for iI\{0} in the optimal policy. Since NP is restricted to ξ(i) = F for iI\{0} (i.e., no preventive spare part deliveries away from the home base), it performs poorly for low transportation costs. On the other hand, when transportation costs are high, preventive deliveries away from the home base are much less favorable. Nevertheless, NP becomes more expensive since the spare part cannot be stocked on-board the vessel.

Observation 3.

When the spare part is put on-board the moving asset whenever it is at its home base, the cost saving resulting from preventive deliveries performed away from the home base is usually very limited. This can be seen by looking at the difference between AP and APP in terms of average cost increase (3%). Enabling more than one replacement in between home base visits leads to significant cost benefits only under extreme conditions (e.g., the combination of high corrective costs, low price of the spare part, long mission durations, and high degradation rates during missions).

Observation 4.

The cost performance of all benchmark policies is very sensitive to the structure of the degradation rates. The performance of never spare part policies deteriorates when a component's degradation is fast during missions. On the other hand, the always spare part policies perform relatively poorly when degradation is fast at the home base. For the uniform degradation rates that we consider, the component's lifetime is longer compared with other alternatives. This is shown to cause a very poor performance in always spare part policies.

Observation 5.

Different from the always spare part policies, the performance of the never spare part policies is very sensitive to changes in the operating mode transition rates. This is because, when the spare part is not stocked on-board the vessel, the total cost of spare part deliveries is highly dependent on the operation process. In particular, NPP performs very well when the operating mode transition rates are high; i.e., when the home base visits are frequent. These cases greatly improve the average performance of NPP.

Observation 6.

The cost performances of NPP, AP, and APP are almost insensitive with respect to corrective replacement and corrective delivery costs. This is due to the trade-off between preventive and corrective costs being taken into account while optimizing replacement thresholds ψ(i) for all iI. This does not hold for NP, as preventive deliveries and replacements cannot be performed away from the home base, by the definition ξ(i) = ψ(i) = F for all iI\{0} under NP.

Observation 7.

In our numerical experiments, the average computation time of the optimal policy (47 seconds) is comparable to those of AP (31.1 seconds) and APP (30.8 seconds). NP (1.8 seconds) and NPP (1.7 seconds) outperform the other policies in terms of computation time. This is because the state information that indicates the number of spare parts on-board the vessel is irrelevant in NP and NPP, resulting in smaller action and state spaces. The difference in terms of computation time will become significant for problems with larger state spaces or higher degradation/transition rates, due to the structure of the solution algorithm (cf. Appendix B). For cases where one of the benchmark policies is known to be well-performing (cf. Observation 1) and the optimal policy is computationally expensive, a near-optimal solution can be found using that benchmark policy.

6. Case study: Cooling fan of a survey vessel

In this section, real-life data acquired from FMS are utilized to demonstrate our integrated maintenance and spare part optimization model. As briefly introduced in Section 1, a major part of Fugro's business activity relates to exploration of the seafloor, for which survey vessels are used. These vessels use Diesel Electric Propulsion (DEP) systems that produce thrust and create movement. The DEP system consists of several subsystems. The centrifugal cooling fan of the frequency convertor is a part of the DEP system. The frequency converter generates heat as a consequence of its operation and needs to be cooled down by a cooling fan. As the failure of the cooling fan can lead to a fire, the DEP system needs to be stopped if the cooling fan fails, which results in downtime. The cooling fan can be easily replaced at all locations and a stock of spare parts can be kept either on-shore or on-board the vessel.

The daily reports of survey vessels contain information about the duration of different states, such as length of stay in a harbor, sailing to a location, performing operations, or waiting for suitable weather conditions. For the case of unsuitable weather conditions, the vessel sails to a location near the shore and waits there for improved weather conditions. When this happens the vessel sails back to its operation site. Otherwise, the vessel returns back to the harbor. There also exist consecutive missions; i.e., a mission can be followed by another mission with a transit phase in between. Based on the daily reports, we have categorized the operating modes as follows: “harbor,” “transit-to-mission,” “mission,” “transit-to-harbor,” and “weather.” We have analyzed a data set of 3 years regarding the sequence of these operating modes and their duration. As for the appropriateness of the Markovian model, we have also tested whether the durations of operation modes are well represented by exponential distributions. For all operating modes, except for the transit-to-mission state, the exponential distribution assumption is not rejected by the Kolmogorov–Smirnov test at the 95% confidence level. The corresponding transition rates and the transition probability matrix are given in . Based on the failures experienced by FMS, the failure of the ball bearing is found to be the main reason for the failure of the cooling fan. According to the OEM, the service life of the bearing system mainly depends on the thermal load on the bearing; i.e., the thermal fatigue. The OEM provides the L10 fatigue life of the bearing, which is the time at which 10% of the bearings can be expected to have failed. The OEM states that the L10 fatigue life is approximately 40 000 operating hours at an ambient temperature of 40°C. Based on this information, FMS treats cooling fans as reliable and holds no spare parts on-board a vessel. The current policy employed by FMS corresponds to a time-based block replacement policy. The replacement is done during dry-dock maintenance, which is mandatory every 5 years. Corrective maintenance is performed if the cooling fan has failed before dry-dock maintenance.

Table 6. Transition rates and the transition probability matrix.

Over the last 3 years, FMS has observed significantly shorter service lives. The earliest failures have occurred within the range of 2000–5000 operating hours. In the pursuit of better representing the expected failure times of the ball bearing, we used the fatigue life prediction model presented in Harris and Barnsby (Citation2001): (5) where Lna is the adjusted bearing fatigue life and a1, a2, and a3 are life adjustment factors for the selected reliability level, material, and operating conditions, respectively. As the vessel under consideration always operates in the same geographical region, environmental conditions were assumed to be similar for all the different operating modes. After applying Equation (Equation5) (based on the qualitative information obtained from FMS), the adjusted fatigue life of the ball bearing was found to be 4300 operating hours. The adjusted fatigue life is consistent with the failures that FMS has observed.

The degradationprocess was modeled based on our service life estimation in Equation (Equation5) and considering how the cooling fan is used in different operating modes. We note that the cooling fan is in use only when the bow thruster is operating. This is required to make the vessel more maneuverable in the mission, harbor, and weather states. Using expert knowledge, we have assessed the operating time of the cooling fan in different operating modes. Assuming that the degradation is linearly correlated with the operating time, the cooling fan is expected to degrade 17 times faster during missions than during harbor or weather states. In transit states, the degradation of the cooling fan is assumed to be negligible, as it is never utilized. Based on this information, we built an Erlangian degradation process for each operating mode where F = 10 and the rates out of each state to the next degradation state are 0.41, 0.00, 7.13, and 0.41 for harbor, transit, mission, and weather states, respectively. By doing this, we divided the lifetime of the component in 10 equal parts (see ). There is a small amount of data to statistically support the degradation process that we propose. Nevertheless, this degradation process is found to reflect actual operation based on the observations of FMS.

Table 7. Degradation rates.

The cost parameters differ between operating modes and are as listed in . The preventive replacement costs mainly consist of the cost of labor. The preventive delivery costs include the spare part's price, which is €3600 and the cost of transportation, which is €10 000 (applied if the vessel is in transit or on a mission). During a bad weather situation, the vessel takes shelter near the shore, which results in lower transportation costs. During a harbor stay, the cost of transportation is negligible. The corrective replacement and delivery costs can be significant during mission and transit-to-mission states, due to the resulting downtime. A corrective replacement was estimated to take 4 hours and an expedited delivery lasts almost 1 day. The downtime cost associated with the delay in operations is an intangible parameter and can vary to a major extent over time. In this case study, we took the downtime cost as €30 000 per day; however, this should not be taken as the actual cost incurred by FMS. We refer the reader to Section 5 for the impact of the downtime cost on the solution performance (this is reflected in corrective costs). In this case study, the holding cost rate was taken as 25% per year and the continuous rate is set to α = −ln (0.98).

Table 8. Cost parameters.

Considering the degradation information given in , we adapted the replacement interval of the current policy. Indeed, the initial service life of the component was overestimated and the current policy excludes the relation between operating modes and degradation. The replacement interval was adjusted using the bisection method. Under the adjusted policy, the component is replaced at fixed time intervals of 1.5 years instead of 5 years and the spare part is never stocked on-board the vessel.

summarizes the performance of different policies in terms of cost. We note that the total cost obtained by the current policy is coherent considering what FMS has faced for the last few years. Assuming that the degradation model that we have built reflects the degradation of the cooling fan, the current policy performs significantly worse compared with the adjusted policy. The adjusted policy is found to be 47% more expensive than the optimal policy. This is mainly a result of the operating modes and condition information being neglected and the replacements are made in a time-based manner under the adjusted policy.

Table 9. Cost performance of different policies.

demonstrates the threshold values obtained by the optimal policy and the benchmark policies. We observe that the never spare part policies encourage early replacements in the harbor to deal with the risk of corrective deliveries/replacements in non-harbor states. On the other hand, the always spare part policies postpone the replacement in the operating modes in which the corrective replacement costs are relatively low (harbor, transit-to-harbor, and weather states). Moreover, NP and AP give the same solution as NPP and APP, respectively. This is due to the fact that preventive delivery costs are high in non-harbor states and the frequency of harbor visits is very high compared with the frequency of failures. This makes preventive deliveries unfavorable in non-harbor states.

Figure 1. Threshold values obtained for different policies.

Figure 1. Threshold values obtained for different policies.

We observe that the never spare part policies outperform the always spare part policies. This is due to harbor visits being very frequent, offering many opportunities for cheap preventive replacements in the harbor. Nevertheless, compared with the optimal policy, both simple policies are shown to be very costly. Total cost increases of about 11 and 38% are observed under the never spare part policies and the always spare part policies, respectively. Even under very frequent harbor visits, it is not optimal to “never” keep a stock of the spare part on-board the vessel. The optimal policy provides a solution in which the component is stocked on-board the vessel when the risk of failure is high, providing a trade-off between spare part deliveries, part replacements, and holding costs.

7. Conclusions

In this article, we have considered an integrated maintenance and spare part optimization problem for a single critical component of a moving asset. We have proposed a Markov decision model and analyzed the optimal integrated policy. We have shown that the optimal policy is an operating mode–dependent bi-threshold policy. Numerical experiments have been provided to assess the value of the optimal policy compared with four benchmark policies. In these benchmark policies, replacement decisions are optimized but not integrated with spare part inventory decisions. These policies simplify spare part inventory decisions by “never” or “always” keeping a stock of the spare part on-board the vessel. Through a comparative analysis, we have shown that our integrated approach has significant value. Simplifying spare part inventory decisions results in an average cost increase of at least 20%. The benchmark policies are unlikely to be optimal, except under some specific conditions such as very high or very low prices of the spare part or transportation costs.

The problem studied in this article was motivated by a real-life problem faced at FMS. From our collaboration with the maritime sector, we observed that failure-related historical data are usually limited. This makes the characterization of the degradation process challenging. In modeling the degradation and failure behaviors of the components, physical models are known to be less data demanding than data-driven models. Moreover, the difference associated with operating modes can be incorporated into physical models, using the quantitative relation between usage, environment, and degradation. Nevertheless, considerable effort is generally needed to develop such models.

We considered a single critical component and at most one spare part kept on-board of the moving asset. Our numerical experiments show that under extreme conditions, performing more than one replacement in between home base visits would be cost-effective. Therefore, allowing a stock of more than one spare part on-board a vessel can potentially lead to a better solution. In this direction, extension of our model requires the consideration of fixed costs associated with delivering multiple spare parts at a time; i.e., the economies of scale in spare part deliveries. This can be captured by extending the state and the action space of our model. The investigation of the optimal policy structure would be an interesting topic in this extension. We note that if a non-consumable and critical component has to be replaced more than one time in between home base visits, one can question the existence of a design problem. For such cases, one should consider increasing reliability by redesign or built-in redundancy. Existing research considers CBM and inventory optimization problems for identical and independent components (see, e.g., Wang et al. (Citation2008), (Citation2009) and Xie and Wang, (Citation2008)). These works take non-zero lead times, fixed ordering costs, and shortage costs for spare parts into account. As these aspects increase problem complexity, the proposed solution methods are approximate, combining techniques such as simulation and genetic algorithms. To the best of our knowledge, there exists no work in the literature that considers non-identical components (see also van Horenbeek et al., (Citation2013)). The consideration of economic, structural, and stochastic dependencies between multiple components has a great practical relevance for moving assets. In practice, there exist fixed costs associated with maintenance and ordering spare parts, which would promote clustering or opportunistic policies. In this respect, a direct extension of our Markov decision process would be affected by the curse of dimensionality. In order to solve such real-life problems, the application of approximate dynamic programming can be investigated (Powell, Citation2007).

In this article, we assumed that spare parts are always available at the home base. In practice, the home base is a central warehouse where the stock of spare parts is limited and pooled to serve several moving assets. If spare parts are directly assigned to moving assets and stocked on-board, the benefit of inventory pooling at the central warehouse vanishes. In particular, this happens if lateral transshipments from one asset to another are not allowed. In our model, this effect can be taken into account implicitly, by inflating the holding cost rate. However, this approach is approximate. An exact approach would be to analyze this problem from multi-echelon spare parts inventory perspective, considering a number of moving assets served by a central warehouse (see, e.g., Muckstadt (Citation2005) and van Houtum and Kranenburg (Citation2015)).

Furthermore, in this article, the consequences associated with system failure have been translated into cost parameters. In practice, estimating intangible corrective cost terms is not straightforward, in particular, when failures affect people's health, safety, and environment. If modeling the trade-off between preventive and corrective costs is not obvious, the focus should lie on availability and reliability measures. The relation between system design and availability–reliability requirements needs to be closely investigated in the design phase of a system.

We note that although our motivation came from a maritime application, the model and the results presented in this article may be applied to other moving assets such as aircraft, commercial vehicles, and military equipment. The rapid development of advanced sensor technologies is making the condition monitoring of components more affordable and feasible. The condition information provided by sensors can be especially useful for critical systems operating in environments that vary randomly over time. This is due to the changes in environmental conditions and system usage being related to the rate at which degradation accumulates (Ulukus et al., Citation2012). The model that we propose is highly valuable considering the trend of shifting from time-based maintenance to CBM. Moreover, repair-by-replacement is a common practice for high-value capital assets in general (Driessen et al., Citation2015). Our model can be of value when there is an option of stocking the spare part on-board a moving asset. Depending on the application to be performed, the model can be easily tailored to adjust to the number of operating modes, their duration/sequence, associated costs, and degradation parameters.

Acknowledgements

The authors gratefully acknowledge the support of Fugro Marine Services and the discussions with Roy van Hugten and Fred Schulte.

Additional information

Funding

This research is funded by Dutch Institute for Advanced Logistics (Dinalog, 2011-4-085-R).

Notes on contributors

Ayse Sena Eruguz

Ayse Sena Eruguz is a Post-doctoral Researcher at Eindhoven University of Technology and is working on joint university–industry research projects, namely, integrated maintenance and service logistics concepts for maritime assets (MaSeLMa) and pro-active service logistics for capital goods—the next steps (ProSeLoNext). She received her Ph.D. in industrial engineering from Ecole Centrale Paris, France, in 2014. Her current research interests include service logistics, maintenance optimization, and multi-echelon inventory optimization.

Tarkan Tan

Tarkan Tan is an Associate Professor of sustainable operations management in the School of Industrial Engineering at Eindhoven University of Technology, The Netherlands. He received his Ph.D. from the Middle East Technical University, Ankara, Turkey, in 2002. He spent a year at Columbia University, New York, as a Fulbright Scholar. He joined Eindhoven University of Technology in 2003 and he is currently the director of the operations management and logistics M.Sc. program at this university. He has visited the University of California, Los Angeles, and the University of Sydney as a visiting scholar. He is an executive board member of the European Supply Chain Forum. His research interests include inventory theory, capacity management, spare parts management, and supply chain management with a particular focus on sustainability.

Geert-Jan van Houtum

Geert-Jan van Houtum is a Professor of maintenance and reliability at the School of Industrial Engineering, Eindhoven University of Technology. He is the scientific director of the BETA Research School for Operations Management and Logistics. His research is focused on spare parts management, production-inventory systems, maintenance and availability management of capital goods, and the effect of design decisions on the total cost of ownership of capital goods.

References

  • Alagoz, O. and Ayvaci, M.U.S. (2010) Uniformization in Markov decision processes, in Wiley Encyclopedia of Operations Research and Management Science, Cochran, J.J. (ed), John Wiley & Sons, Inc.
  • Alam, M. and Al-Saggaf, U.M. (1986) Quantitative reliability evaluation of repairable phased-mission systems using Markov approach. IEEE Transactions on Reliability, 35, 498–503. doi: 10.1109/TR.1986.4335529
  • Armstrong, M.J. and Atkins, D.R. (1996) Joint optimization of maintenance and inventory policies for a simple system. IIE Transactions, 28, 415–424. doi: 10.1080/07408179608966288
  • Armstrong, M.J. and Atkins, D.R. (1998) A note on joint optimization of maintenance and inventory. IIE Transactions, 30, 143–149. doi: 10.1080/07408179808966446
  • Çekyay, B. and Özekici, S. (2012) Optimal maintenance of systems with Markovian mission and deterioration. European Journal of Operational Research, 219, 123–133. doi: 10.1016/j.ejor.2011.12.037
  • Çekyay, B. and Özekici, S. (2015) Optimal maintenance of semi-Markov missions. Probability Engineering Information Science, 29, 77–98. doi: 10.1017/S0269964814000229
  • Çekyay, B., Özekici, S., Cochran, J.J., Cox, L.A., Keskinocak, P., Kharoufeh, J.P. and Smith, J.C. (2011) Condition-based maintenance under Markovian deterioration, in J.J. Cochran (ed), Wiley Encyclopedia of Operations Research and Management Science, John Wiley & Sons, Inc.
  • Derman, C. (1970) Finite State Markovian Decision Processes, Academic Press, Orlando, FL.
  • Driessen, M., Arts, J., van Houtum, G.J., Rustenburg, J.W. and Huisman, B. (2015) Maintenance spare parts planning and control: A framework for control and agenda for future research. Production Planning and Control, 26, 407–426. doi: 10.1080/09537287.2014.907586.
  • Elwany, A.H. and Gebraeel, N.Z. (2008) Sensor-driven prognostic models for equipment replacement and spare parts inventory. IIE Transactions, 40, 629–639. doi: 10.1080/07408170701730818
  • Elwany, A.H., Gebraeel, N.Z. and Maillart, L.M. (2011) Structured replacement policies for components with complex degradation processes and dedicated sensors. Operations Research, 59, 684–695. doi: 10.1287/opre.1110.0912
  • Esary, J.D. and Ziehms, H. (1975) Reliability analysis of phased missions, in R.E. Barlow, J.B. Fussel, N.D. Singpurwalla (eds), Proceedings of the Conference on Reliability and Fault Tree Analysis, SIAM, Philadelphia, PA, pp. 213–236.
  • Flint, P. (1995) Too much of a good thing: Better inventory management could save the industry millions while improving reliability. Air Transport World, 32, 103–106.
  • Flory, J.A., Kharoufeh, J.P. and Abdul-Malak, D.T. (2015) Optimal replacement of continuously degrading systems in partially observed environments. Naval Research Logistics, 62, 395–415. doi: 10.1002/nav.21638
  • Harris, T.A. and Barnsby, R.M. (2001) Life ratings for ball and roller bearings. Proceedings of the Institution of Mechanical Engineers-Part J: Journal of Engineering Tribology 215, 577–595. doi: 10.1243/1350650011543817
  • Kabir, A.B.M.Z. and Al-Olayan, A.S. (1996) A stocking policy for spare part provisioning under age based preventive replacement. European Journal of Operational Research, 90, 171–181. doi: 10.1016/0377-2217(94)00246-0
  • Kawai, H., Koyanagi, J. and Ohnishi, M. (2002) Optimal maintenance problems for Markovian deteriorating systems, in Stochastic Models in Reliability and Maintenance, Osaki, P.S. (ed), Springer, Berlin, Germany, pp. 193–218. doi: 10.1007/978-3-540-24808-8_8
  • Kharoufeh, J.P., Solo, C.J. and Ulukus, M.Y. (2010) Semi-Markov models for degradation-based reliability. IIE Transactions, 42, 599–612. doi: 10.1080/07408170903394371
  • Kim, K. and Park, K.S. (1994) Phased-mission system reliability under Markov environment. IEEE Transactions on Reliability, 43, 301–309. doi: 10.1109/24.295013
  • Kolesar, P. (1966) Minimum cost replacement under Markovian deterioration. Management Science, 12, 694–706.
  • Kurt, M. and Kharoufeh, J.P. (2010) Optimally maintaining a Markovian deteriorating system with limited imperfect repairs. European Journal of Operational Research, 205, 368–380. doi: 10.1016/j.ejor.2010.01.009
  • Lam, M. (1995) An introduction to airline maintenance, in D. Jenkins (ed), Handbook of Airline Economics, Aviation Week Group, pp. 397–406. McGraw-Hill, New York, NY.
  • Makis, V. and Jiang, X. (2003) Optimal replacement under partial observations. Mathematics & Operational Research, 28, 382–394. doi: 10.1287/moor.28.2.382.14484
  • Muckstadt, J.A. (2005) Analysis and Algorithms for Service Parts Supply Chains, Springer, New York, NY.
  • Mura, I. and Bondavalli, A. (1999) Hierarchical modeling and evaluation of phased-mission systems. IEEE Transactions on Reliability, 48, 360–368. doi: 10.1109/24.814518
  • Özekici, S. (1995) Optimal maintenance policies in random environments. European Journal of Operational Research, 82, 283–294. doi: 10.1016/0377-2217(94)00264-D
  • Powell, W.B. (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality, Wiley, New York, NY.
  • Puterman, M.L. (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming, first edition, John Wiley & Sons, Inc., New York, NY.
  • Tinga, T. (2010) Application of physical failure models to enable usage and load based maintenance. Reliability and Engineering System Safety, 95, 1061–1075. doi: 10.1016/j.ress.2010.04.015
  • Tinga, T. and Janssen, R. (2013) The interplay between deployment and optimal maintenance intervals for complex multi-component systems. Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability, 227, 227–240. doi: 10.1177/1748006X13480743
  • Turan, O., Ölçer, A.İ., Lazakis, I., Rigo, P. and Caprace, J.D. (2009) Maintenance/repair and production-oriented life cycle cost/earning model for ship structural optimisation during conceptual design stage. Ships and Offshore Structures, 4, 107–125. doi: 10.1080/17445300802564220
  • Ulukus, M.Y., Kharoufeh, J.P. and Maillart, L.M. (2012) Optimal replacement policies under environment-driven degradation. Probability Engineering and Information Science, 26, 405–424. doi: 10.1017/S0269964812000083
  • United Nations Conference on Trade and Development. (2013) Review of Maritime Transport 2013, United Nations, Geneva, Switzerland.
  • Van Horenbeek, A., Buré, J., Cattrysse, D., Pintelon, L. and Vansteenwegen, P. (2013) Joint maintenance and inventory optimization systems: A review. International Journal of Production Economics, 143, 499–508. doi: 10.1016/j.ijpe.2012.04.001
  • Van Houtum, G.J. and Kranenburg, B. (2015) Spare Parts Inventory Control under System Availability Constraints, Springer, Boston, MA.
  • Waldmann, K.-H. (1983) Optimal replacement under additive damage in randomly varying environments. Naval Research Logistics, 30, 377–386. doi:10.1002/nav.3800300302
  • Wang, L., Chu, J. and Mao, W. (2008) An optimum condition‐based replacement and spare provisioning policy based on Markov chains. Journal of Quality in Maintenance Engineering, 14, 387–401. doi: 10.1108/13552510810909984
  • Wang, L., Chu, J. and Mao, W. (2009) A condition-based replacement and spare provisioning policy for deteriorating systems with uncertain deterioration to failure. European Journal of Operational Research, 194, 184–205. doi: 10.1016/j.ejor.2007.12.012
  • Xie, J. and Wang, H. (2008) Joint optimization of condition-based preventive maintenance and spare ordering policy, in The 2008 4th International Conference on Wireless Communications, Networking and Mobile Computing, pp. 1–5. IEEE, Dalian, China. doi: 10.1109/WiCom.2008.1468.
  • Zhang, Z., Wu, S., Li, B. and Lee, S. (2013) Optimal maintenance policy for multi-component systems under Markovian environment changes. Expert Systems and Applications, 40, 7391–7399. doi: 10.1016/j.eswa.2013.07.003

Appendix

Appendix A: Markov decision process with immediate transitions

For illustration purposes, we depict an exemplary process in . We consider two operating modes I ≡ {0, 1} and three degradation levels J ≡ {0, 1, 2}. We assume that the system operates under the following policy. In operating mode i = 0, a spare is delivered when degradation level j is greater than one. In operating mode i = 1, spare part delivery is performed when the component has failed. In both operating modes, the component is replaced only correctively. In the uniformized process, when action 0 is taken under the given policy, the probability that the system jumps from degradation level to j + 1 is pij = λij/v and the probability that the system jumps from operating mode iI to ki is qik = μiQ(i, k)/v. In , transitions from a state to itself are not depicted for ease of exposition (the corresponding probabilities are qii = 1 − vij/v). Immediate transitions under the given policy are shown by dotted arcs.

Figure A1. Markov chain illustrating the uniformized process under a given policy.

Figure A1. Markov chain illustrating the uniformized process under a given policy.

Appendix B:

Value iteration algorithm

Let Vn(i, j, u) be the expected total discounted cost over a time horizon of n periods, where each period has an expected length 1/v. Without loss of generality, we select V0(i, j, u) as follows: (A1)

We introduce an additional notation σa to represent the instantaneous part replacements and spare part deliveries:

For (i, j, u) ∈ I × J × U and n ∈ ℕ0, where ℕ0 is the set of non-negative integers, Vn + 1(i, j, u) can be computed by (A2) where is defined as follows for a = 0, a = 1, and a = 2, respectively: (A3) (A4) (A5)

Due to instantaneous part replacements and spare part deliveries, value iteration should substitute updated values of Vn + 1(i, j, u) at the (n + 1)th iteration. In state (i, 0, 1), if the optimal decision is to replace the part, this decision will not be immediately followed by a spare part delivery. This is because Vn + 1(i, 0, 1) = cpri + cpdi + Vn + 1(i, 0, 1) cannot hold since cpri > 0 and cpdi ⩾ 0 for each iI. Therefore, Vn + 1(i, 0, 1) is equivalent to (A6)

Starting with Equation (EquationA6), Vn + 1(i, j, u) can be evaluated in decreasing order of uU and increasing order of jJ using Equation (EquationA2). As such, Equations (EquationA4) and (EquationA5) can be indirectly expressed in terms of Vn(i, j, u) for each (i, j, u) ∈ I × J × U. This overcomes instantaneous transitions for all states. What follows presents the value iteration algorithm that is designed to solve Equation (Equation1). We refer to Puterman (Citation1994, Theorem 6.3.1., p. 161) for further details about the convergence of such a value iteration algorithm to the optimal value of the infinite-horizon problem.

  • Step 1: Set n = 0 and ϵ = 0.0001. Select V0(i, j, u) as Equation (EquationA1).

  • Step 2: For each (i, j, u) ∈ I × J × U, compute Vn + 1(i, j, u) in decreasing order of uU and increasing order of jJ: If (j, u) = (0, 1) use Equation (EquationA6). Otherwise, use Equation (EquationA2).

  • Step 3: If for each (i, j, u) ∈ I × J × U, |Vn + 1(i, j, u) − Vn(i, j, u)| < ϵ(1 − β)/2β, then go to Step 4. Otherwise, increment n by one and return to Step 2.

  • Step 4: For each (i, j, u) ∈ I × J × U, choose the optimal stationary policy by

    and stop.

Appendix C:

Proof of Theorem 1

In order to prove Theorem 1, we first introduce Lemmas A1 to A4.

Lemma A1.

For each iI and n ∈ℕ0, if Γ0Vn(i, j, 1) is non-decreasing in , then Vn + 1(i, j, 1) is non-decreasing in .

Proof of Lemma A1.

From Equation (EquationA2), Vn + 1(i, j, 1) can be re-written as follows for each iI, , and n ∈ℕ0: (A7)

For each iI, the cost of preventive replacement cpri + Vn + 1(i, 0, 0) and the term (β/v)h are constant and do not depend on . Therefore, if Γ0Vn(i, j, 1) is non-decreasing in , then Equation (EquationA7) is also non-decreasing in .■

Lemma A2.

For each iI and n ∈ ℕ0:

(a)

Vn(i, j, 1) is non-decreasing in jJ.

(b)

Γ0Vn(i, j, 1) is non-decreasing in .

Proof of Lemma A2(a).

We first prove Lemma A2(a) by induction.

Basis: Lemma A2(a) holds for n = 0 by Equation (EquationA1).

Induction Step: For each iI, assume that Vn(i, j, 1) is non-decreasing in jJ for a given n > 0 (induction hypothesis). From Equation (EquationA3), we have the following for : (A8)

The second equality follows from the definition of vij and vi, j − 1, which are equal to μi + λij and μi + λi, j − 1, respectively. We note that v ⩾ max {vij|iI, jJ} by definition. Therefore, Equation (EquationA8) is greater than or equal to zero for each iI and under the induction hypothesis. Thus, Γ0Vn(i, j, 1) is non-decreasing in for each iI. Using Lemma A1, Vn + 1(i, j, 1) is non-decreasing in for each iI.

It remains to prove that Vn + 1(i, F, 1) − Vn + 1(i, F − 1, 1) ⩾ 0 for all iI. For j = F, we have (A9)

The first inequality follows from the definition of preventive and corrective cost parameters. The second inequality holds by the definition of Vn + 1(i, F − 1, 1). Thus, for each iI, Vn + 1(i, j, 1) is non-decreasing in jJ, completing the induction and the proof of Lemma A2(a).

From Lemma A2(a), Equation (EquationA8) is greater than or equal to zero for each iI, , and n ∈ ℕ0, implying that Lemma A2(b) holds. □

We note that Lemma A2(b) is required to prove Theorem 2.

Lemma A3.

For each iI and n ∈ ℕ0, if Γ0Vn(i, j, 0) is non-decreasing in , then Vn + 1(i, j, 0) is non-decreasing in .

Proof of Lemma A3.

From Equation (EquationA2), Vn + 1(i, j, 0) can be re-written as follows for each iI,, and n ∈ ℕ0: (A10)

From Lemma A2(a), Vn + 1(i, j, 1) is non-decreasing in jJ for each iI and n ∈ ℕ0. Lemma A3 follows from Equation (EquationA10), in which Vn + 1(i, j, 0) is the minimum between two non-decreasing terms in

Lemma A4.

For each iI and n ∈ ℕ0:

(a)

Vn(i, j, 0) is non-decreasing in jJ.

(b)

Γ0Vn(i, j, 0) is non-decreasing in .

Proof of Lemma A4.

The proof of Lemma A4 is along the same lines as the proof of Lemma A2, replacing u = 1 by u = 0, Lemma A1 by Lemma A3, Lemma A2 by Lemma A4, and Equation (EquationA9) by

The first inequality follows from Equation (EquationA9) and ccdicipd. The second equality follows from Equation (EquationA10). ■

We note that Lemma A4(b) is required to prove Theorem 3.

Proof of Theorem 1.

Theorem 1 follows from Lemma A2(a) and Lemma A4(a). Since they hold for all n ∈ ℕ0, they also hold for the infinite-horizon function V(i, j, u) where n tends to infinity.■

Appendix D:

Proof of Theorem 2

In order to prove Theorem 2, we first introduce Lemmas A5 to A7.

Lemma A5.

For each iI,, and n ∈ ℕ0, if action 2 is optimal in state (i, j − 1, 1), then action 2 is also optimal in state (i, j, 1).

Proof of Lemma A5.

Fix operating mode iI and n ∈ ℕ0. Suppose that .

From Equation (EquationA7), the optimality of action 2 means that the cost associated with action 2 is less than or equal to the cost associated with action 0. Hence, if action 2 is optimal in state (i, j − 1, 1), then

From Lemma A2(b), we have

Thus, action 2 is also optimal in state (i, j, 1), completing the proof. ■

Proof of Lemma A6.

The following inequality holds for all iI, , and n ∈ ℕ0: (A11)

Proof of Lemma A6.

Fix operating mode iI and n ∈ ℕ0. Suppose that .

From Equation (EquationA10), we have Vn + 1(i, j − 1, 0) ⩽ Γ0Vn(i, j − 1, 0). Similarly, from Equation (EquationA7), we have

Therefore, if then Equation (EquationA11) holds. Otherwise, if Vn + 1(i, j − 1, 1) = cpri + Vn + 1(i, 0, 0), then Vn + 1(i, j, 1) − Vn + 1(i, j − 1, 1) = 0 by Lemma A5. From Lemma A2(b), Γ0Vn(i, j, 1) − Γ0Vn(i, j − 1, 1) ⩾ 0 and, thus, Equation (EquationA11) holds.■

Lemma A7.

For each iI and n ∈ ℕ0, the following inequalities hold: (A12) (A13)

Proof of Lemma A7.

We first prove Lemma A7(a) by induction.

Basis: Lemma A7(a) holds for n = 0 under Equation (EquationA1).

Induction Step: For each iI and jJ\{0}, assume that Lemma A7(a) holds for a given n > 0 (induction hypothesis).

Case 1: j = F.

By definition, Vn + 1(i, F, 0) = ccdi + Vn + 1(i, F, 1). Hence, for n + 1, the left-hand side of Equation (EquationA12) is (A14)

Equation (EquationA14) is less than or equal to zero since Vn + 1(i, F − 1, 0) ⩽ Vn + 1(i, F − 1, 1) + cpdi and cpdicicd.

Case 2: and Vn + 1(i, j, 0) = cpdi + Vn + 1(i, j, 1).

For n + 1, the left-hand side of Equation (EquationA12) is (A15)

From Equation (EquationA10), Equation (EquationA15) is less than or equal to zero.

Case 3: and Vn + 1(i, j, 0) = Γ0Vn(i, j, 0).

For n + 1, the left-hand side of Equation (EquationA12) becomes (A16)

From the definition of operator Γ0, we have the following: (A17)

From the induction hypothesis, Equation (EquationA17) is less than or equal to zero for each iI and . Lemma A6 implies that Equation (EquationA16) is less than or equal to Equation (EquationA17). Hence, Equation (EquationA16) is less than or equal to zero for each iI and , completing the induction and the proof of Lemma A7(a).

From Lemma A7(a), Equation (EquationA17) is less than or equal to zero for each iI, , and n ∈ ℕ0, implying that Lemma A7(b) holds.■

Proof of Theorem 2.

Theorem 2 follows from Lemma A7(a). Since Equation (EquationA12) holds for all n ∈ ℕ0, it also holds for the infinite horizon function V(i, j, u) where n tends to infinity. ■

Appendix E:

Proof of Theorem 3

Lemma A8.

For each iI,, and n ∈ ℕ0, if action 1 is optimal in state (i, j − 1, 0), then action 1 is also optimal in state (i, j, 0).

Proof of Lemma A8.

Fix operating mode iI and n ∈ ℕ0. Suppose that .

From Equation (EquationA10), the optimality of action 1 means that the cost associated with action 1 is less than or equal to the cost associated with action 0. Hence, if action 1 is optimal in state (i, j − 1, 0), then (A18)

From Lemma A4(b), we have

If Vn + 1(i, j − 1, 1) = cpri + Vn + 1(i, 0, 0), then Vn + 1(i, j, 1) = cpri + Vn + 1(i, 0, 0) = Vn + 1(i, j − 1, 1) from Lemma A5. In this case, it holds that cpdi + Vn + 1(i, j, 1) = cpdi + Vn + 1(i, j − 1, 1) ⩽ Γ0Vn(i, j, 0) and, thus, action 1 is optimal in state (i, j, 0).

Otherwise, if Vn + 1(i, j − 1, 1) = (β/v)h + Γ0Vn(i, j − 1, 1), then Equation (EquationA18) can be re-written as (A19)

From Lemma A7(b), we have (A20)

Inequalities (EquationA19) and (EquationA20) imply

From Equation (EquationA7), Vn + 1(i, j, 1) ⩽ (β/v)h + Γ0Vn(i, j, 1). Thus,

In this case, action 1 is optimal in state (i, j, 0), completing the proof.■

Proof of Theorem 3.

The existence of an optimal replacement threshold ψ(i) ⩽ F in operating mode iI follows from Lemma A5. The existence of an optimal spare part delivery threshold ξ(i) ⩽ F in operating mode iI follows from Lemma A8. Since both lemmas hold for all n ∈ ℕ0, they also hold for the infinite-horizon problem where n tends to infinity.■

Appendix F:

Illustrative example for Remark 1

As an illustrative example to Remark 1, we use one of the instances of the test bed described in Section 5. We consider the combination given in . demonstrates the threshold values obtained by the optimal policy. For this example, under the optimal policy, a preventive spare part delivery would not be performed away from the home base since ξ(i) = F = 9 for all iI\{0}. That is, if the spare part is not on-board the vesssel, a preventive part replacement would not be optimal. However, if the spare part has already been delivered at the home base, it is optimal to perform a preventive part replacement in iI\{0} since ψ(i) ⩽ F.

Table A1. Illustrative example parameters.

Figure A2. Threshold values of the optimal policy.

Figure A2. Threshold values of the optimal policy.