Full article: Physical, data-driven and hybrid approaches to model engine exhaust gas temperatures in operational conditions

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

Fast diesel engine models for real-time prediction in dynamic conditions are required to predict engine performance parameters, to identify emerging failures early on and to establish trends in performance reduction. In order to address these issues, two main alternatives exist: one is to exploit the physical knowledge of the problem, the other one is to exploit the historical data produced by the modern automation system. Unfortunately, the first approach often results in hard-to-tune and very computationally demanding models that are not suited for real-time prediction, while the second approach is often not trusted because of its questionable physical grounds. In this paper, the authors propose a novel hybrid model, which combines physical and data-driven models, to model diesel engine exhaust gas temperatures in operational conditions. Thanks to the combination of these two techniques, the authors were able to build a fast, accurate and physically grounded model that bridges the gap between the physical and data driven approaches. In order to support the proposal, the authors will show the performance of the different methods on real-world data collected from the Holland Class Oceangoing Patrol Vessel.

KEYWORDS:

1. Introduction

Internal combustion engines (ICEs), Diesel Engines (DEs) in particular, have been the main power provider for shipping over the past century, since their efficiency made steam engines obsolete (Curley Citation2012). While advanced electrical and hybrid propulsion architectures have changed propulsion systems over the past decades, the DEs maintain their primary position, either as a propulsion engine driving the shaft or as a generator providing electrical power (Geertsma, Negenborn, Visser and Hopman Citation2017). However, concerns over hazardous emissions from shipping on air quality (Viana et al. Citation2014) and on global warming (Taljegard et al. Citation2014) have led to more stringent regulations on emissions, such as sulfur and NO $_{x}$ (IMO MARPOL Citation2011), and the target to reduce annual global shipping emissions with 50% by 2050 (IMO MEPC 72 Citation2018). Economic studies suggest that internal combustion engines will maintain their leading position over the next decades (Taljegard et al. Citation2014), due to the long operating profiles and the high energy requirement of transport ships, although alternative fuels, such as Liquefied Natural Gas (LNG) (Anderson et al. Citation2015), methanol (Svanberg et al. Citation2018; Amma Citation2019) and biodiesel (Geng et al. Citation2017; Hoang et al. Citation2019) could reduce the environmental impact of engine emissions.

Hence, keeping DEs functioning and efficient is a critical issue in the marine industry for reducing the environmental impact of engine emissions and for maintaining their availability (Lloyd and Cackette Citation2001; Xu et al. Citation2002). While crews previously performed maintenance on DEs themselves, the trend to reduce crew size and the increasing complexity of ship systems have led to an increase in support contracts, through which the original equipment manufacturers perform maintenance (Ghaderi Citation2019). As availability requirements have also increased (Zahedi et al. Citation2014; Geertsma, Negenborn, Visser and Hopman Citation2017), maintenance needs to be accurately planned and failures before planned maintenance need to be prevented (Verbert et al. Citation2017). In the near future, autonomous shipping will requires even more accurate maintenance planning and increased reliability (Banda et al. Citation2019; Ghaderi Citation2019). While work on automatic path planning and collision avoidance (Liu et al. Citation2017, Citation2019) is ongoing and practical experiments have demonstrated ships sailing autonomously, the development of reliable power and propulsion systems and their operating and maintenance concepts is equally important (Schwartz Citation2002). Therefore, work is required to increase the reliability and the efficiency of ships power systems, in particular the main power providers such as the ICE, and to develop methods to accurately predict when maintenance is required and identify developing failures before they obstruct reliable operation (Wu et al. Citation2013; Cipollini et al. Citation2018b). In this respect, the development of a real-time virtual model of an ICE, i.e. a digital twin, that can provide accurate predictions and offer insights regarding operational performance and health status can be of great importance. This has been identified both by academia and the industry, with researchers demonstrating the benefits of this technology in a wide variety of industrial applications (Bondarenko and Fukuda Citation2020; Liu et al. Citation2020; Bhatti et al. Citation2021; Teng et al. Citation2021; Xu et al. Citation2021).

A critical requirement of such a virtual model for an ICE is the precise reflection of its key characteristics, under all operating conditions and in real time (Bondarenko and Fukuda Citation2020). Focusing on the health status of a DE, key diagnostic parameters are exhaust gas temperatures, as they can provide valuable insights, with respect to the turbocharging system, the fuel supply system, and the working medium exchange system (Korczewski Citation2015, Citation2016). More specifically, exceedingly high exhaust gas temperatures can lead to severe damage on the cylinder valves, while exceeding the permissible values in the turbine inlet cross section can cause severe and irreversible damage in the turbine blades. Considering marine engines, a number of operating conditions can potentially lead to increased exhaust gas temperatures, these include: excessive load resulting from hull fouling or damaged propeller blades, malfunction of the water cooling system that cools cylinder liners, and pollution of the exhaust manifold that is usually caused by deposits of the products of incomplete combustion.

Unfortunately, taking into account the dimensions and cost of marine DEs, the process of carrying out experimental campaigns to test the efficiency and to diagnose possible decays requires significant resources. For this reason, modelling and simulation techniques are recognised as the most effective approaches to obtain a cost-efficient and reliable understanding of the engine performance and components' interactions (Theotokatos Citation2010). Numerical models play a pivotal role in predicting key engine performance parameters, such as the exhaust gas temperature, to identify emerging failures early on, and to establish trends in performance degradation (Grimmelius et al. Citation2007). The most advanced engine models that are available in the literature (Reitz and Rutland Citation1995; Baldi et al. Citation2015; Xiang et al. Citation2019) show that the complexity of diesel combustion requires simulations with many complex, interacting submodels to guarantee high accuracy. However, such modelling approaches are computationally demanding, and are unsuitable for accurate and real-time dynamic predictions. As such, their use is prohibitive in applications which require real-time simulations to be performed (Khaled et al. Citation2014) with strict accuracy requirements under both steady-state and dynamic conditions. In order to develop accurate models that can predict engine behaviour real-time, the authors propose a Hybrid Model (HM) approach, combining both Physical Models (PMs) and Data-Driven Models (DDMs) to the problem of modelling DEs exhaust gas temperatures in operational conditions.

PMs are models in which first principle equations represent the physical phenomena of the system. The majority of studies involving PMs (see Section 2.1) report results that are in very good agreement with measurements taken from shop trial data, or under static operating conditions with a limited number of operating points. Such validation approaches might be sufficient for the purpose of the respective studies, bearing in mind possible constraints posed by the lack of available data. Nevertheless, the suitability of each model to predict key performance indicators, and in particular exhaust gas temperatures under transient operating conditions, and in true operational conditions is not sufficiently demonstrated. Moreover, the literature does not report the statistic accuracy under dynamic conditions. Finally, the most effective physical models require extensive computational time for high accuracy results.

DDMs, also called black-box models, contrarily to PMs do not exploit any first principle equations but they are able to exploit robust statistical inference procedures and historical data collected through a logging system, in order to make predictions about the future behaviour of the modelled system. DDMs have gained substantial interest with the rapid growth of ship monitoring systems within the shipping industry, and several interesting applications can be found in the literature (see Section 2.2). An advantage of these methods is represented by the fact that there is no need of any a-priori knowledge about the underlying physical system. Furthermore, thanks to the nature of these approaches, it is possible to exploit even data regarding particular phenomena that cannot be easily modelled with a PM. Despite the impressive accuracy that can be obtained, DDMs usually produce non-parametric models that are not supported by any physical interpretation; this, despite representing a possible advantage, as mentioned above, may limit the capability of the models themselves, without exploiting important knowledge about the phenomena of interest. Moreover, a great amount of historical data is necessary in order to build reliable models. In the authors' opinion, a modelling approach that aims to identify emerging failures on a DE at an early stage and establish trends with respect to performance degradation, must be able to fast and accurately predict the most critical process parameters equally well under static and dynamic operation, across the entire operating envelope providing also insight and knowledge about the physical processes.

Therefore, in this paper, an existing DE PM (Geertsma, Negenborn, Visser, Loonstijn, et al. Citation2017) is improved by combining PMs and DDMs (Leifsson et al. Citation2008; Coraddu et al. Citation2017). The result of this combination is a, recently named (Coraddu et al. Citation2018), HM, also referred to as gray-box model, which allows to exploit both the mechanistic knowledge of the physical principles and historical data. As reported in Coraddu et al. (Citation2017), this approach provides more accurate outcomes when compared with the first principle PMs, requires a smaller amount of data when compared to the DDMs, and is extremely fast compared to advanced PMs of comparable performance. For this reason, this work aims to investigate how a combination of DDMs and PMs can improve prediction of the engine exhaust gas temperatures, using extensive measurement data from the Holland Class Oceangoing Patrol Vessel. In particular, in this paper:

authors review the performance of a mean value PM in factory acceptance conditions, in static and dynamic conditions at sea, thus demonstrating that current state-of-the-art PMs are not suitable for predicting operating parameters in true operational conditions;
authors test the DDMs to establish whether they can be used to predict the DE exhaust gas temperatures;
authors present a novel HM to predict DE exhaust gas temperature combining the PMs and the DDMs mentioned above;
authors exploit real-world data coming from a Holland Class Oceangoing Patrol Vessel to assess the accuracy and effectiveness of the different modelling approaches.

Results will demonstrate that the HM yields a more accurate representation of the DE, which will then be suitable for use in various aspects of off-line and real-time operational monitoring.

The rest of the paper is organised as follows. Section 2 gives an overview of related works. Section 3 gives a brief description of the system and reports the dataset used for this work. In Section 4, the different modelisation approaches are described, respectively the PM (see Section 4.2), the DDM (see Section 4.3) and the HM (see Section 4.4). Section 5 shows the results of the three modelling approaches on real data coming from an Holland Class Oceangoing Patrol Vessel. Finally, Section 6 summarises and concludes the paper with a description of the future scenarios opened by the authors' work.

2. Related works

In this section, the authors will review literature that deals with PMs, DDMs and HMs for DE modelling.

2.1. PMs

Intensive research has been conducted in PMs for DE modelling. The works of Grondin et al. (Citation2004), Grimmelius et al. (Citation2007), and Geertsma, Negenborn, Visser, Loonstijn, et al. (Citation2017) provide insightful reviews on the extensive work done on this field, as well as its evolution over the last decades. The general consensus is that the choice of a suitable model depends primarily on the requirements of each application and, of course, the available computational tools (Johnson et al. Citation2010). The same is also claimed in Hountalas (Citation2000), in which the author argues that, due to the uniqueness of marine DEs and their operation, computer programs for marine applications must be specifically designed, implying that each application needs a different model.

In Grimmelius (Citation2003), modelling approaches for any physical system are categorised according to different dimensions; in this work, the authors will address the dimension referred to as the model level. The model level divides approaches into three groups, according to the level of detail at which the physical processes are described: PMs, DDMs and HMs. PMs, or white-box models, are the most common type adopted to deal with performance prediction, they are built considering a set of a-priori equations, defined through the knowledge of the physical phenomenon governing the DE and its performance. State-of-the-art approaches in PMs report errors well within the tolerance margins given by engine manufacturers in static conditions, however, in dynamic predictions the reported errors are much larger. Moreover, most predictions are validated in a limited operating region, mostly the operating region used for model tuning.

In Baldi et al. (Citation2015), a combined mean value-zero dimensional model was developed and used to investigate the propulsion behaviour of a handymax-size product carrier under constant and variable engine speed operations. The modelling approach was validated against shop trials data, considering steady-state conditions with a load variation between 50% and 110%. The reported error was lower than the standard tolerance employed by the marine engine manufacturers, with a simulation time only slightly exceeding the one of the mean value model. Temperature estimation errors at compressor outlet, and turbine inlet and outlet averaged 2.7%, 1.9% and 1.5% respectively, with the lowest error margins occurring around the nominal point. They concluded that their proposed model provides a favourable time-accuracy trade-off and it can be used in cases where information, not provided by a mean-value approach, is needed. Llamas and Eriksson (Citation2018) developed a control-oriented mean value engine model of a large two-stroke engine with Exhaust Gas Re-circulation (EGR), to assess engine performance under transient operation. The model was validated against operational data from a container-ship engine under steady-state and transient operations. For steady-state conditions 52 operating points were used, spanning a load between 10% and 90% of the nominal, with and without EGR. The stationary relative errors were reported to be in general under 3.35%, for both estimation and validation data, while the error of the temperature estimation on the exhaust manifold was recorded at a root mean square value of 12 K. Dynamic validation was performed for four different scenarios including load increase and decrease, and EGR start and stop operations. All of them were focused on low load operation, as it was the most uncertain operating area for the model. Results showed that the model was capable of following the measured engine signals during transients with low computational times, and the estimation for the exhaust manifold temperature agreed quite well with the measurements and could thus be used for control purposes. Guan et al. (Citation2015) investigated a two-stroke marine DE with emphasis at part load operating conditions using a zero-dimensional model. The proposed model was validated against experimental data obtained from engine shop tests, which correspond to steady-state operating conditions at four different loads: 25%, 50%, 75% and 100% of the nominal. Very good accuracy was obtained for the entire operating region, and for all performance parameters. Relative percentage errors on the exhaust gas receiver temperature and the exhaust gas temperature after the turbocharger (TC) were reported to be 0.6% and 2% respectively, and errors of equal order of magnitude were observed for all process parameters.

In the work of Sui et al. (Citation2017), a Mean Value First Principle (MVFP) model was presented with the aim of investigating performance at preliminary stage design, based on both the basic and advanced six-point Seiliger process and applicable to both steady-state and transient-state conditions. The model was validated using experimental data in three operating conditions: operation at the nominal point, 50% of nominal speed at 30% of nominal load, and 80% of nominal speed at 50% of nominal load. Although actual error values were not reported, graphical illustration of the results of the in-cylinder process showed good correspondence with the test data across all process parameters, including in-cylinder temperatures, with satisfactory accuracy and adaptability to variable operating conditions. Sapra et al. (Citation2017) studied back pressure effects on the performance of a marine DE, by means of an MVFP model. The model was calibrated under steady-state conditions, using 9 points along a propeller curve. It was further validated at the same conditions for different back pressures. Although quantitative performance metrics for the model are not given, the graphical representation of the results indicates average relative percentage errors of around 4% for the turbine inlet temperature across all operating conditions.

In Larsen et al. (Citation2015), the zero-dimensional model of Scappin et al. (Citation2012) was further extended and validated for steady-state conditions within a load range of 25–100% of the nominal. The model showed good agreement with the measurements of the manufacturer across all performance parameters, with a root mean square deviation of around 1% for the exhaust gas temperature.

More recently, in Wang et al. (Citation2020) the authors performed a parametric investigation of a large four-stoke dual-fuel marine engine in order to identify the pre-injection effects on the engine combustion, knocking and emissions parameters. Their modelling approach consisted of the integration of a 1-D model and a 3-D computational fluid dynamics (CFD) model utilising the MAN 51/60DF marine engine as a case study. The authors validated their model under steady-state conditions in 4 points, within a range of 25–100% of the nominal load. Near-zero deviation was reported for most parameters, whereas the maximum deviation for NO $_{x}$ emissions was only 2.4%.

Finally, Hao et al. (Citation2021) studies and improves the in-cylinder fuel/air mixing process of heavy-duty DEs, utilising a new device named the ‘fuel split device’. Due to the nature of their research, detailed modelling of the in-cylinder process was required, which the authors performed utilising CFD methods. To this end, they developed and verified their simulations in terms of the spray liquid/vapour penetration, heat release rate and in-cylinder pressures, at a variety of operational and environmental conditions. Although quantitative performance metrics are not explicitly given, graphical representations per crank-angle degree, show very low discrepancy between experimental and simulated results.

In summary, models are available that can accurately predict process parameters and engine temperatures. However, the most accurate models in dynamic conditions, zero dimensional models, cannot run real time, which is required to perform online condition monitoring, and this is certainly the case for more detailed CFD simulations. Alternatively, mean value models can run real time and can be used for control system design and evaluation, but lack accuracy over the complete operating envelope under dynamic conditions, as will be demonstrated in Section 5.1.

2.2. DDMs

DDMs have proved to be valuable instruments in many marine applications (Coraddu et al. Citation2017; Zhang et al. Citation2017; Cipollini et al. Citation2018a, Citation2018b; Baldini et al. Citation2018; Gao et al. Citation2018; Karimi et al. Citation2018; Silva et al. Citation2018; Yang et al. Citation2018), and in industry Qi et al. (Citation2018). In particular, an older study of Antonić et al. (Citation2004) utilised an Adaptive Neuro Fuzzy Inference System (ANFIS) to model marine DE cylinder dynamics. Experimental data from a test-bed were used, and the resulting models presented very low errors, for medium to high loads (50–100%). Porteiro et al. (Citation2011) developed a multilayer neural network to provide load estimation and fault identification on a DE, for different faulty conditions: misfiring, shaft imbalance, clogged intake and leaking start plug, using vibration signals and exhaust temperature as inputs. The reported performance of the model, in terms of correctly classified cases, was roughly 90% using only two vibration signals for load estimation, and 89.6% for the failure type identifier. However, the work focused on mechanical failures as opposed to thermal failures that are the scope for this work. Basurko and Uriondo (Citation2015) developed a three-layer feed-forward Artificial Neural Network (ANN) to represent the behaviour of medium speed DE with the aim of enabling a condition based maintenance framework for a fishery vessel. More than 10,000 h of operational data was utilised, with the ANN to give predictions with the mean squared error spread between 0.3 and 2.1 depending on the operational parameter. In the work of Bukovac et al. (Citation2015), an ANN was used to replace a computationally demanding physical simulation model, and predict the steady-state performance of a two-strokes marine DE. They report that the ANN architecture did provide predictions of the same accuracy as the physical model (errors of the order of 3% compared to experimental data), while being 3000 times faster.

Furthermore, in Yu et al. (Citation2018) a recurrent neural network for a diesel-generator set was presented, aiming at reproducing the engine output characteristics (namely rotational speed), under changes of electrical load. The model was trained using data from steady-state operations, at 25%, 50%, 75% and 100% of the nominal load. Although quantitative results were not presented, very low errors were reported across the entire operating region in steady state operations by means of graphical representations. Nikzadfar and Shamekhi (Citation2014) utilised an ANN to study the relative contribution of several operational parameters to the performance of a DE. More specifically, the operational parameters included: injected fuel mass, pilot and main injection mass, main and pilot injection timing, inlet air pressure and temperature, exhaust pressure, fuel rail pressure and EGR, and their effects on brake torque, Soot, SO $_{x}$ , NO $_{x}$ and specific fuel consumption were investigated. The ANN was built utilising 4000 steady-state operating points covering the entire envelope of the DE, generated by means of a simulation model. In this case too, quantitative results on the performance of the ANN are not described, however, graphical representations show a relative difference of around 5% with respect to exhaust gas temperatures. Al-Hinti et al. (Citation2009) studied the effects of inlet air pressure on DE indicated mean effective pressure and specific fuel consumption. They introduced ANFIS as an efficient method for modelling and sensitivity analysis of a DE. Steady-state experimental data was used to develop the model, at four different air intake conditions with varying speed between 50% and 100% of the nominal. Validation results report average percentage errors of 4%, 0.15%, and 2.43% for efficiency, mean effective pressure and specific fuel consumption, respectively. Galindo et al. (Citation2005) developed an ANN to model the combustion of high speed direct injection DEs in a transient regime. More specifically, their aim was to develop a simulation model for the rate of combustion in DEs during transient operating conditions, accurate and fast enough to be incorporated in a one-dimensional gas dynamic model for global transient behaviour prediction. Detailed analysis on the selection of the optimal architecture of the ANN was presented, with the optimal model resulting in an $R^{2}$ value of 0.985.

In the work of Parlak et al. (Citation2006), an ANN was employed to model exhaust temperatures and specific fuel consumption as a function of mean effective pressure, engine speed and injection timing. The ANN was trained using experimental data at four constant engine speeds within 50–100% of the nominal speed, while varying load. The results report mean relative errors of 1.93% and 2.36%, respectively.

Another interesting application was reported in Yuanwang et al. (Citation2002). The authors developed an ANN to analyse the effect of cetane number on exhaust emissions from a DE. The total cetane number, base cetane number and cetane improver, total cetane number and nitrogen content in the diesel fuel were used as inputs, and the emissions studied included hydrocarbon, carbon monoxide, particulate matter and NO $_{x}$ . The ANN was trained using 20,000 measurements, and the relative percentage errors reported ranged between 0.14% and 2.52% depending on the combination of inputs employed. Namigtle-Jiménez et al. (Citation2019) developed an arrangement of 3 ANNs for a Fault Detection and Diagnosis scheme (FDD), based on the extraction of features from the pressure signal of the injection rail of an ICE. They showed that the proposed FDD was capable of detecting and isolating accurately the faulty injector of the electronic fuel injection system, with offline training results reporting a 100% classification accuracy in any possible fault scenario, and near 100% accurate classification in online scenarios.

A similar study was performed in Wang et al. (Citation2019), in which the authors presented a Bayesian network-based approach for fault isolation in a DE fuel injection system, under the presence of uncertainties. Special consideration was given in the simplification of the Bayesian network structures, due to which symptoms under multiple faults could be decoupled into symptoms corresponding to each individual fault. This greatly reduced the prior knowledge needed for the diagnosis, decreased the complexity of the application, and improved the computational efficiency. Palmer and Bollas (Citation2019) showed that model-based active Fault Detection and Isolation (FDI) tests can improve the capability of DDMs to predict and isolate faults. One of the case studies presented was the detection of actuator faults and manifold leakage on a DE. The authors showed that when proper FDI test designs are selected, even the relatively simple combination of principal component analysis and k-nearest neighbours classifier could provide satisfactory results in fault detection. Faults considered in that study included inlet and exhaust manifold leaks, variable-geometry turbine, and exhaust gas receiver actuator valve drifts.

More recently, in Wang, Chen, et al. (Citation2021) the authors proposed a random convolutional neural network structure for health monitoring of DEs, relying on vibration measurements. More specifically, the authors constructed several individual convolutional neural networks, and the diagnostic results from each individual model were fused by a combinatorial strategy using the Dempster–Shafer evidence theory. They evaluated their approach by utilising two vibration signal datasets from a DE, and they concluded that, compared to traditional methods based on signal analysis techniques and shallow classifiers, their approach can automatically learn high-level representative features from the raw vibration signals and eliminate the necessity of manual feature extraction.

A similar approach that combines several DDMs for fault detection of DEs was proposed in Cai et al. (Citation2020). The authors combined a rule-based algorithm with Bayesian networks, and utilising experimental data collected from an EV80 DE, they showed that their approach is able to identify seven different faults on a DE at a wide variety of rotational speeds, requiring only with few training samples at a fixed speed.

In Wang, Cai, et al. (Citation2021), the authors proposed a diagnostic framework that integrates variational mode decomposition and the Rihaczek distribution to acquire time-frequency representations of vibration measurements of diesel engines. Utilising these features, a graph regularised bi-directional non-negative matrix factorisation algorithm was proposed to find a parts-based representation corresponding to different fault models. Their approach was compared with several other methods on an experimental dataset of the 6135G diesel engine, with a 100% fault identification accuracy, requiring few training data and high computational efficiency.

A further example is reported in Coraddu et al. (Citation2021), in which the authors designed and proposed multiple DDMs for weakly supervised marine duel fuel engines health monitoring. The proposed framework relied on a digital twin of the engine or on novelty detection algorithms, which were compared against state-of-the-art fully supervised approaches. Utilising data from the validated simulation model of Stoumpos et al. (Citation2020), the authors demonstrated that their approach can overcome the problematic requirement for a large amount of labelled samples, that are rarely available, with a decrease in performance of less than 1% compared to state-of-the-art fully-supervised approaches.

2.3. HMs

HMs are a quite recent modelling approach in the maritime field and just very few works showed the advantage of a hybrid approach with respect to pure PMs and DDMs. For example, in Coraddu et al. (Citation2017) authors show that it is possible to effectively predict fuel consumption with HMs. Another example is the one reported in Miglianti et al. (Citation2019); authors showed that it is possible to predict the propeller cavitation noise characteristics via HMs. Finally, in our preliminary work Coraddu et al. (Citation2018), we attempt to model the engine exhaust gas temperature with a naive HM.

3. Vessel description and available data

The case study Holland Class Oceangoing Patrol Vessels, shown in , are naval vessels that can perform various security operations, such as counter terrorism, counter piracy, counter drug transport, disaster relief and coastguard operations. The small crew of 50 people requires a high degree of automation (Geertsma et al. Citation2013), but nevertheless maintenance load is high for the crew and needs to be reduced Horenberg and Melaet (Citation2013). Reducing the maintenance burden on DEs using predictive maintenance based on its current sensor fit can contribute to this. The propulsion system of the vessel consists of two shafts with Controllable Pitch Propellers (CPP), a gearbox, and one DE per shaft, as shown in . This configuration is typical for multi-function ships that require silent, maneuverable, highly reliable and low emission propulsion.

Figure 1. Holland class oceangoing patrol vessels.

The Patrol vessel is equipped with a data logging system which is used by the Royal Netherlands Navy both for on-board monitoring and control and for land-based performance analysis. For testing the developed PMs, DDMs, and HMs, the authors use the dataset of one of the two four-stroke, medium speed DEs on board. The dataset consists of 114 signals, from the on-board Integrated Platform Management System (IPMS), with a sample rate of 1/3 Hz that cover a time of 3347 h, totalling 3,988,939 data points. The dataset consists of several control and monitoring parameters of the engine, from engine speed and torque, to various operational pressures and temperatures of engine components such as the crankshaft, cylinder and turbo-charger and systems, such as water cooling, lubricating oil, exhaust-gas, and fuel systems. It should be noted that the authors consider engine performance by taking into account the interaction with gearbox, propeller and ship through the load, which is represented by measured outputs for shaft torque ( $M_{p}$ ) and fuel rack position ( $X_{g o v}$ ). summarises the subset of the available measurements, from the IPMS, that have been used in the modelling phase, while in (a) schematic layout of the measured outputs is reported.

Figure 2. Propulsion system layout for the Holland class oceangoing patrol vessels.

Figure 3. Schematic layout of the available data.

Table 1. Subset of the available measurements, from the continuous monitoring system, that have been used in the modelling phase.

Display Table

4. Modelisation

In the proposed context, namely modelling DE exhaust gas temperatures in operational conditions, a general modelisation framework can be defined, characterised by an input space $X \subseteq R^{d}$ , an output space $Y \subseteq R$ , and an unknown relation $μ : X \to Y$ to be learned. For what concerns this work, $X$ is composed by the features reported in , while the output space $Y$ refers to the exhaust gas temperatures reported in .

Table 2. Input space $X$ for the modelisation phase.

Display Table

Table 3. Output space $Y$ for the modelisation phase.

Display Table

In this context, the authors define as model $h : X \to Y$ an artificial simplification of μ. The model h can be obtained with different kinds of techniques, for example requiring some physical knowledge of the problem, as in PMs, or the acquisition of large amount of data, as in DDMs, or both of them, as in HMs.

4.1. Performance measures

Independently of the adopted technique, any model h requires some data in order to be tuned (or learned) on the problem specificity and to be validated (or tested) on a real-world scenario. For these purposes, two separate sets of data $D_{n} = {(x_{1}, y_{1}), \dots, (x_{n}, y_{n})}$ and $T_{m} = {(x_{1}^{t}, y_{1}^{t}), \dots, (x_{m}^{t}, y_{m}^{t})}$ , where $X \in X$ and $Y \in Y$ , need to be exploited, to respectively tune h and evaluate its performances. It is important to note that $T_{m}$ is needed since the error that h would commit over $D_{n}$ would be too optimistically biased since $D_{n}$ has been used to tune h.

Hence, the error that h commits on $T_{m}$ in approximating the real process is usually measured with reference to different indexes of performance (Ghelardoni et al. Citation2013):

the Mean Absolute Error (MAE) is computed by taking the absolute loss value of h over $T_{m}$ (1) $M A E (h) = \frac{1}{m} \sum_{i = 1}^{m} | h (x_{i}^{t}) - y_{i}^{t} |;$ (1)
the Mean Absolute Percentage Error (MAPE) is computed by taking the absolute loss value of h over $T_{m}$ in percentage (2) $M A P E (h) = \frac{100}{m} \sum_{i = 1}^{m} | \frac{h (x_{i}^{t}) - y_{i}^{t}}{y_{i}^{t}} |;$ (2)
the Pearson Product-Moment Correlation Coefficient (PPMCC) measures the linear dependency between $h (x_{i}^{t})$ and $y_{i}^{t}$ with $i \in {1, \dots, m}$ (3) $P P M C C (h) = \frac{\sum_{i = 1}^{m} (y_{i}^{t} - \bar{y}) (h (x_{i}^{t}) - \hat{\bar{y}})}{\sqrt{\sum_{i = 1}^{m} (y_{i}^{t} - \bar{y})^{2}} \sqrt{\sum_{i = 1}^{m} (h (x_{i}^{t}) - \hat{\bar{y}})^{2}}},$ (3) where $\bar{y} = \frac{1}{m} \sum_{i = 1}^{m} y_{i}^{t}$ and $\hat{\bar{y}} = \frac{1}{m} \sum_{i = 1}^{m} h (x_{i}^{t})$ .

Other measures of error exist, such as R-squared and the Mean Square Error. However, in this work the authors consider these three measures because, from a physical point of view, they give a complete description of the quality of the model, and adding more measures would make the results more difficult to interpret while not adding any new insights.

4.2. Physical models (PMs)

The PM used in this work is illustrated in . It is a Mean Value Engine Model (MVEM), and a slightly improved version of the one described in Geertsma, Negenborn, Visser, Loonstijn, et al. (Citation2017). The MVEM was developed to investigate the performance of the ship propulsion system and its control strategy, with respect to fuel consumption; acceleration time and minimum air excess ratio, during predefined acceleration manoeuvres at varying operating conditions (Geertsma, Negenborn, Visser and Hopman Citation2017). As many other engine models, the MVEM was calibrated against the Factory Acceptance Test (FAT) protocol and showed a mean absolute percentage error within 10%, as reported in . For the purpose of control strategy evaluation, the MVEM provided good resemblance with the measured system behaviour, but its accuracy was never reported with any statistically robust measures. For this reason, in Section 5.1, the authors will re-evaluate the model performance on the large dataset presented in Section 3, considering the following scenarios:

Steady state: The data described in Section 3 will be used to prove the MVEM limitations in predicting exhaust gas temperature in real world application characterised by steady-state conditions.
Transient: the remaining part of the data, described in Section 3, will be used to further assess the MVEM limitations in transient conditions.

In subsequent works, the MVEM was used to evaluate advanced control strategies for mechanical (Geertsma et al. Citation2018) and hybrid propulsion architectures (Geertsma, Negenborn, Visser and Hopman Citation2017), hybrid propulsion systems, and hybrid power supply architectures (Kalikatzarakis et al. Citation2018). As these studies considered benchmark ship manoeuvres (Geertsma, Negenborn, Visser and Hopman Citation2017; Geertsma et al. Citation2018) and fuel consumption over a typical operating profile (Kalikatzarakis et al. Citation2018), these studies exploited the main feature of the MVEM: runtimes between 100 and 2500 times real-time (Geertsma et al. Citation2018). This also enables to test the performance of this model on the dataset described in Section 3 and to develop the HMs detailed in Section 4.4.

Figure 4. Schematic representation of the DE model and the interaction of its subsystems, from Geertsma, Negenborn, Visser, Loonstijn, et al. (Citation2017).

The MVEM consists of three state variables: fuel injection per cylinder per cycle $m_{f}$ , charge pressure $p_{a i r}$ and exhaust receiver pressure $p_{t c}^{b}$ . The inputs of the model are engine speed $N_{r e f}$ and fuel pump set-point $X_{s e t}$ , the latter originating from the speed governor, and the output is engine torque $M_{e}$ .

The model is characterised by six modules, as illustrated in and described below.

(A)	the fuel pump module represents the combined effect of the fuel pump inertia and the ignition delay;
(B)	the air swallow module represents the air swallow characteristics of the engine to establish the air excess ratio $λ_{a}$ , the amount of air that is left after all fuel is combusted. The air excess ratio can be used as an indicator for engine thermal loading (Sapra et al. Citation2017) and EGR effectiveness (Asad and Zheng Citation2014; Nielsen et al. Citation2017; Llamas and Eriksson Citation2018);
(C)	the heat release module represents the heat release during combustion of fuel during the three combustion stages in the Seiliger cycle: isochoric, isobaric and isothermal combustion;
(D)	the Seiliger cycle module represents in-cylinder compression, combustion and expansion using the six stage Seiliger process. It establishes the work produced during the closed cylinder process $w_{i}$ , the temperature $T_{t c}^{b}$ and pressure $p_{6}$ at the end of the closed cylinder process, when the exhaust valve opens;
(E)	the exhaust receiver and TC module represents Zinner blowdown (Zinner Citation1980) and the Büchi power and flow balance (Dixon Citation1998; Stapersma Citation2010) with variable TC efficiency, heat release efficiency and slip ratio. This module establishes the charge pressure $p_{1}$ and the exhaust receiver pressure $p_{t c}^{b}$ . Furthermore, thermal inertias have been added to account for the effect of the location of the thermocouples, mounted on the surface of the exhaust pipes, as reported in . The authors have included thermal inertia effects by means of a first order response with a thermocouple time constant, $τ_{t c}$ , and a constant bias, $b_{t c}$ , to better align the physical prediction with the actual temperature measurements. The output of the module are the exhaust gas temperature at TC inlet (a), $T_{t c}^{a}$ and TC outlet (b) $T_{t c}^{b}$ (4) $\frac{T_{t c}^{b} (t)}{d t} = \frac{(T_{t c}^{b} (t - 1) - b_{t c}^{b}) - T_{t c}^{b} (t)}{τ_{t c}^{b}},$ (4) (5) $\frac{T_{t c}^{a} (t)}{d t} = \frac{(T_{t c}^{a} (t - 1) - b_{t c}^{a}) - T_{t c}^{a} (t)}{τ_{t c}^{a}};$ (5)
(F)	the mechanical conversion module represents the mechanical losses due to the conversion from pressure to rotation and the losses due to driving auxiliary equipment.

For a more detailed description of the modules, the reader is referred to Geertsma et al. (Citation2018).

In summary, the temperatures of the gas flow in the exhaust receiver and at the turbine exit, main subjects of this study, are represented by a system of Algebraic Equations (AE) and Differential and Algebraic Equations (DAE) featuring the input variables, state variables and the following mathematically related parameters: trapped mass in the cylinder $m_{1}$ , air excess ratio $λ_{a}$ , isobaric, isochoric and isothermal heat release $q_{23}$ , $q_{34}$ and $q_{45}$ , temperature and pressure after expansion of the Seiliger cycle $T_{t c}^{b}$ and $p_{6}$ and induced work during the Seiliger cycle $w_{i}$ . The original aspect of this model is that the TC dynamics are represented by the Büchi power and flow balance between compressor and turbine, and do not require compressor or turbine maps for calibration. By neglecting fast dynamics, the model's run-time is between 100 and 2500 times real-time, much faster than MVEMs using compressor and turbine maps, such as Nielsen et al. (Citation2017), Theotokatos and Tzelepis (Citation2015), Sapra et al. (Citation2017), and Kökkülünk et al. (Citation2016).

Lastly, in order to compare the real measurements from the IPMS with the PM outcomes, the authors considered the average value of the Bank A and B (6) ${\bar{T}}_{t c}^{b} = \frac{T_{{t c}_{A}}^{b} + T_{{t c}_{B}}^{b}}{2}, {\bar{T}}_{t c}^{a} = \frac{T_{{t c}_{A}}^{a} + T_{{t c}_{B}}^{a}}{2} .$ (6)

4.3. Data driven models (DDMs)

The problem considered here, from the data science point of view, can be mapped to a typical Machine Learning (ML) regression problem (Vapnik Citation1998; Shawe-Taylor and Cristianini Citation2004) in a straightforward approach. In fact, ML techniques aim at estimating the unknown relationship μ between input and output through a learning algorithm $A_{H}$ which exploits the data in $D_{n}$ to learn h and where $H$ is a set of hyperparameters which characterises the generalisation performance of $A$ (Oneto Citation2020).

In this paper, a method from the ML Kernel Methods family called Kernel Regularised Least Squares (KRLS) has been adopted in order to estimate the relation between the input variables of and the output variables of . The idea behind KRLS can be summarised as follows. During the training phase, the quality of the learned function $h (x)$ is measured according to a loss function $ℓ (h (x), y)$ (Rosasco et al. Citation2004) with the empirical error (7) ${\hat{L}}_{n} (h) = \frac{1}{n} \sum_{i = 1}^{n} ℓ (h (x_{i}), y_{i}) .$ (7) A simple criterion for selecting the final model during the training phase could then consist in simply choosing the approximating function that minimises the empirical error ${\hat{L}}_{n} (h)$ . This approach is known as Empirical Risk Minimisation (ERM) (Vapnik Citation1998). However, ERM is usually avoided in ML as it leads to severe overfitting of the model on the training dataset. As a matter of fact, in this case the training process could choose a model, complicated enough to perfectly describe all the training samples (including noise, which afflicts them). In other words, ERM implies memorisation of data rather than learning from them.

A more effective approach is to minimise a cost function where the tradeoff between accuracy on the training data and a measure of the complexity of the selected model is achieved (Tikhonov and Arsenin Citation1979), implementing the Occam's razor principle (8) $h^{*} : min_{h} {\hat{L}}_{n} (h) + λ C (h) .$ (8) In other words, the best approximating function $h^{*}$ is chosen as the one that is complicated enough to learn from data without overfitting them. In particular, $C (\cdot)$ is a complexity measure: depending on the exploited ML approach, different measures are realised. Instead, $λ \in [0, \infty)$ is a hyperparameter, that must be set a-priori and is not obtained as an output of the optimisation procedure: it regulates the trade-off between the overfitting tendency, related to the minimisation of the empirical error, and the underfitting tendency, related to the minimisation of $C (\cdot)$ . The optimal value for λ is problem-dependent, and tuning this hyperparameter is a non-trivial task, as will be discussed later in this section. In KRLS, models are defined as (9) $h (x) = w^{T} φ (x),$ (9) where $φ$ is an a-priori defined Feature Mapping (FM) (Shalev-Shwartz and Ben-David Citation2014), which strongly depends on the particular problem under examination and will be described later in this section, allowing to keep the structure of $h (x)$ linear. The complexity of the models, in KRLS, is measured as (10) $C (h) = ‖ w ‖^{2},$ (10) i.e. the Euclidean norm of the set of weights describing the regressor, which is a standard complexity measure in ML (Shalev-Shwartz and Ben-David Citation2014). Regarding the loss function, the square loss is typically adopted because of its convexity, smoothness, and statistical properties (Rosasco et al. Citation2004) (11) ${\hat{L}}_{n} (h) = \frac{1}{n} \sum_{i = 1}^{n} ℓ (h (x_{i}), y_{i}) = \frac{1}{n} \sum_{i = 1}^{n} [h (x_{i}) - y_{i}]^{2} .$ (11) Consequently, Problem (Equation8(8) $h^{*} : min_{h} {\hat{L}}_{n} (h) + λ C (h) .$ (8) ) can be reformulated as (12) $w^{*} : min_{w} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + λ ‖ w ‖^{2} .$ (12) By exploiting the Representer Theorem (Schölkopf et al. Citation2001), the solution $h^{*}$ of the RLS Problem (Equation12(12) $w^{*} : min_{w} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + λ ‖ w ‖^{2} .$ (12) ) can be expressed as a linear combination of the samples projected in the space defined by $φ$ (13) $h^{*} (x) = \sum_{i = 1}^{n} α_{i} φ (x_{i})^{T} φ (x) .$ (13) It is worth underlining that, according to the kernel trick, it is possible to reformulate $h^{*} (x)$ without an explicit knowledge of $φ$ , and consequently avoiding the course of dimensionality of computing $φ$ , by using a proper kernel function $K (x_{i}, x) = φ (x_{i})^{T} φ (x)$ (14) $h^{*} (x) = \sum_{i = 1}^{n} α_{i} K (x_{i}, x) .$ (14) Several kernel functions can be retrieved in literature (Cristianini and Shawe-Taylor Citation2000; Scholkopf Citation2001), each one with a particular property that can be exploited based on the problem under exam.

The KRLS problem of Equation (Equation12(12) $w^{*} : min_{w} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + λ ‖ w ‖^{2} .$ (12) ) can be reformulated by exploiting kernels as (15) $α^{*} : min_{α} ‖ Q α - y ‖^{2} + λ α^{T} Q α,$ (15) where $y = [y_{1}, \dots, y_{n}]^{T}$ , $α = [α_{1}, \dots, α_{n}]^{T}$ , the matrix Q such that $Q_{i, j} = K (x_{j}, x_{i})$ , and the identity matrix $I \in R^{n \times n}$ . By setting the gradient equal to zero w.r.t. $α$ it is possible to state that (16) $(Q + λ I) α^{*} = y,$ (16) which is a linear system for which effective solvers have been developed over the years, allowing it to cope with even very large sets of training data (Young Citation2003).

The problems that still have to be faced is how to choose $φ$ , the kernel K, and how to set up the hyperparameter λ. It is possible to start by setting $φ$ and the kernel K. Usually the Gaussian kernel is exploited in real world applications because of the theoretical reasons described in Keerthi and Lin (Citation2003) and because of its effectiveness (Fernández-Delgado et al. Citation2014; Wainberg et al. Citation2016). Basically the Gaussian kernel is able to implicitly create an infinite dimensional $φ$ and thanks to this, the KRLS are able to learn any possible function (Keerthi and Lin Citation2003). The last problem is how to tune the hyperparameters γ, and λ of the proposed method.

Since every ML model is characterised by a set of hyperparameters $H$ , influencing their ability to estimate μ, a proper Model Selection (MS) procedure needs to be adopted (Oneto Citation2020). Several methods exist for MS purpose but resampling methods, like the well-known k-Fold Cross Validation (KCV) (Kohavi Citation1995) or the nonparametric Bootstrap (BTS) (Efron and Tibshirani Citation1994) approaches, representing the state-of-the-art MS approaches when targeting real-world applications. Resampling methods rely on the following method: the original dataset $D_{n}$ is resampled once or many ( $n_{r}$ ) times, with or without replacement, to build two independent datasets called the training, and the validation sets, respectively $L_{l}^{r}$ and $V_{v}^{r}$ , with $r \in {1, \dots, n_{r}}$ . Note that $L_{l}^{r} \cap V_{v}^{r} = ⊘$ , $L_{l}^{r} \cup V_{v}^{r} = D_{n}$ . Then, in order to select the best combination the hyperparameters $H$ in a set of possible ones $H = {H_{1}, H_{2}, \dots}$ for the algorithm $A_{H}$ or, in other words, to perform the MS phase, the following procedure has to be applied: (17) $H^{*} : min_{H \in H} \frac{1}{n_{r}} \sum_{r = 1}^{n_{r}} \frac{1}{v} \sum_{(x_{i}, y_{i}) \in V_{v}^{r}} ℓ (A_{H, L_{l}^{r}} (x_{i}), y_{i}),$ (17) where $A_{H, L_{l}^{r}}$ is a model built with the algorithm $A$ with its set of hyperparameters $H$ and with the data $L_{l}^{r}$ . Since the data in $L_{l}^{r}$ are independent from the ones in $V_{v}^{r}$ , the idea is that $H^{*}$ should be the set of hyperparameters which allows to achieve a small error on a data set that is independent from the training set.

In this work, authors will exploit the BTS procedure and consequently r=500, if l=n and the resampling must be done with replacement (Oneto Citation2020).

4.4. Hybrid models (HMs)

The problem that authors face is how to construct a model able to take both, the physical knowledge about the problem encapsulated in the PMs of Section 4.2 and the information hidden in the available data as the DDMs of Section 4.3, into account. For this purpose authors will start from a simple observation: a HM, based on the previous observation, should be able to learn from the data without being too different, or too far away, from the PM.

From the Data Science point of view, this requirement can be straightforwardly mapped in a typical ML Multi Task Learning (MTL) problem (Caruana Citation1997; Baxter Citation2000; Bakker and Heskes Citation2003; Evgeniou and Pontil Citation2004; Argyriou et al. Citation2008). MTL aims at simultaneously learning two concepts, in this case the PM and the available data, through a learning algorithm $A_{H}$ which exploits the data in $D_{n}$ to learn a function h which is both close to the observation, the data $D_{n}$ and the PM, namely its forecasts.

Consequently, in this case a slightly different scenario is presented where the dataset is composed by a triple of points $D_{n} = {(x_{1}, y_{1}, p_{1}), \dots, (x_{n}, y_{n}, p_{n})}$ where $p_{i}$ is the output of the PM in the point $x_{n}$ with $i \in {1, \dots, n}$ . The target is to learn a function able to approximate both μ, namely the relation between the input $x \in X$ and the output $y \in Y$ , and the PM, namely the relation between the input and the output of the PM. Two tasks have to be learned. For this purpose, there are two main approaches: the first approach is called Shared Task Learning (STL) and the second Independent Task Learning (ITL). While the latter independently learns a different model for each task, the former aims to learn a model that is common between all tasks. A well-known weakness of these methods is that they tend to generalise poorly on one of the two tasks (Baxter Citation2000). In this paper, authors show that an appealing approach to overcome such limitations is provided by MTL (Caruana Citation1997; Baxter Citation2000; Bakker and Heskes Citation2003; Evgeniou and Pontil Citation2004; Argyriou et al. Citation2008). This methodology leverages on the information between the tasks to learn more accurate models.

In order to apply the MTL approach to this case, it is possible to modify the KRLS problem of Equation (Equation12(12) $w^{*} : min_{w} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + λ ‖ w ‖^{2} .$ (12) ) in order to simultaneously learn a shared model and a task specific model which should be close to the shared model. In this way, authors obtain a model which is able to simultaneously learn the two tasks. The model that authors are interested in is the shared model, while the task specific models are just used as a tool. A shared model is defined as (18) $h (x) = w^{T} φ (x),$ (18) and two task specific models as (19) $h_{i} (x) = w_{i}^{T} φ (x), i \in {y, p} .$ (19) Then, it is possible to state the MTL version of Equation (Equation12(12) $w^{*} : min_{w} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + λ ‖ w ‖^{2} .$ (12) ), as follows: (20) $\begin{aligned} w^{*}, w_{y}^{*}, w_{p}^{*} : & min_{w, w_{y}, w_{p}} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + [w^{T} φ (x) - p_{i}]^{2} \\ + \sum_{i = 1}^{n} [w_{y}^{T} φ (x) - y_{i}]^{2} + [w_{p}^{T} φ (x) - p_{i}]^{2} \\ + λ ‖ w ‖^{2} + θ (‖ w - w_{y} ‖^{2} + ‖ w - w_{p} ‖^{2}), \end{aligned}$ (20) where λ is the usual regularisation of KRLS and $θ \in [0, \infty)$ , instead, is another hyperparameter that forces the shared model to be close to the task specific models. Basically the MTL problem of Equation (Equation20(20) $\begin{aligned} w^{*}, w_{y}^{*}, w_{p}^{*} : & min_{w, w_{y}, w_{p}} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + [w^{T} φ (x) - p_{i}]^{2} \\ + \sum_{i = 1}^{n} [w_{y}^{T} φ (x) - y_{i}]^{2} + [w_{p}^{T} φ (x) - p_{i}]^{2} \\ + λ ‖ w ‖^{2} + θ (‖ w - w_{y} ‖^{2} + ‖ w - w_{p} ‖^{2}), \end{aligned}$ (20) ) is a concatenation of three learning problems solved with KRLS plus a term which tries to keep a relation between all the three different problems.

By exploiting the kernel trick as in KRLS, it is possible to reformulate Problem (Equation20(20) $\begin{aligned} w^{*}, w_{y}^{*}, w_{p}^{*} : & min_{w, w_{y}, w_{p}} \sum_{i = 1}^{n} [w^{T} φ (x) - y_{i}]^{2} + [w^{T} φ (x) - p_{i}]^{2} \\ + \sum_{i = 1}^{n} [w_{y}^{T} φ (x) - y_{i}]^{2} + [w_{p}^{T} φ (x) - p_{i}]^{2} \\ + λ ‖ w ‖^{2} + θ (‖ w - w_{y} ‖^{2} + ‖ w - w_{p} ‖^{2}), \end{aligned}$ (20) ), as follows: (21) $\begin{aligned} α^{*} : & min_{α} {‖ [\begin{matrix} Q & Q 0 & 0 \\ Q & Q 0 & 0 \\ 0 & 0 Q & 0 \\ 0 & 00 & Q \end{matrix}] α - [\begin{matrix} y \\ p \\ y \\ p \end{matrix}] ‖}^{2} \\ + α^{T} [\begin{matrix} (λ + 2 θ) Q & (λ + 2 θ) Q & - θ Q & - θ Q \\ (λ + 2 θ) Q & (λ + 2 θ) Q & - θ Q & - θ Q \\ - θ Q & - θ Q & θ Q & 0 \\ - θ Q & - θ Q & 0 θ Q \end{matrix}] α, \end{aligned}$ (21) where $p = [p_{1}, \dots, p_{n}]^{T}$ . The solution of this problem is again equivalent to solving a linear system (22) $[\begin{matrix} Q + (λ + 2 θ) I & Q + (λ + 2 θ) I & - θ I & - θ I \\ Q + (λ + 2 θ) I & Q + (λ + 2 θ) I & - θ I & - θ I \\ - θ I & - θ I & Q + θ I & 0 \\ - θ I & - θ I & 0 & Q + θ I \end{matrix}] α^{*} = [\begin{matrix} y \\ p \\ y \\ p \end{matrix}] .$ (22) The function that the authors are interested in, the shared one, can be expressed as follows: (23) $h (x) = w^{T} φ (x) = \sum_{i = 1}^{n} (α_{i} + α_{i + n}) K (x_{i}, x) .$ (23) What changes here, with respect to the MS phase of the DDMs described in Section 4.3, is the MS phase where just λ, γ, and also θ need to be tuned.

4.5. DDMs and HMs: taking into account the dynamics

The approaches described in Sections 4.3 and 4.4 are quite effective (as will be shown in Section 5), but naive. Moreover, they do not take into account all the possible information that the data has to offer. In fact, the variables reported in and are actually time-series produced by the IPMS. What the authors described in Sections 4.3 and 4.4 corresponds to the approach described in (a), where all the variables of at time $t_{0}$ are given as input to the model, and where one of the variables of at time $t_{0}$ is given as an output to the model.

Figure 5. How to take into account the dynamics in DDMs and HMs. (a) Input and output variables of the DDMs and the HMs as described in Sections 4.3 and 4.4. (b) Input and output variables of the DDMs and the HMs as described in Section 4.5.

This approach is obviously sub-optimal, since at time $t_{0}$ the values of all the variables in and are known for each of the measurement taken before $t_{0}$ . For this reason, as depicted in (b), it is possible to feed the model not just the variables of at time $t_{0}$ but also all the measurements of these variables in $[t_{0} - Δ, t_{0}]$ , and the variables of in $[t_{0} - Δ, t_{0}]$ as an input. Thanks to this approach the authors are now able to map a time-series problem again into a classical multivariate regression problem (Packard et al. Citation1980; Takens Citation1981), and exploit the methods described in Sections 4.3 and 4.4. Note that Δ is an application specific parameter that needs to be tuned and its effect will be tested in Section 5. Note that, the methodology described in Sections 4.3 and 4.4 is a special case of what described in this section, and correspond to the case when $Δ = 0$ .

4.6. Feature ranking

Once the models are built it is required to investigate how these models are affected by the different features used in the model identification phase to understand if the models have also a foundation which relies on the underlying phenomena or if the model just captures spurious correlations (Guyon and Elisseeff Citation2003). This procedure is called Feature Ranking (FR) and allows to detect if the importance of those features, that are known to be relevant from a physical perspective, are appropriately taken into account by the learned models. The failure of the computational model to properly account for the relevant features might indicate poor quality in the measurements or spurious correlations. FR therefore represents an important step of model verification, since it should generate consistent results with the available knowledge of the phenomena under exam.

For this purpose, authors will adopt the backward elimination techniques described in Guyon and Elisseeff (Citation2003). Note that, when $Δ = 0$ (see Section 4.5) the feature ranking will be the classical one where the authors consider the variables of and as features. When $Δ > 0$ a new concept of feature ranking will be defined by the authors, where the entire time-series of the variables of and will be considered as features.

5. Experimental results

In this section, the authors utilise the data described in Section 3 to test the models developed in Section 4, using the performance measures described in Section 4.1. To begin with, calibration results of the PM described in Section 4.2 are reported. Subsequently, the validation of the PM is carried out, both in steady and dynamic state as reported in Section 5.1. Then a comparison of the performance of PMs, DDMs, and HMs in operational conditions is reported in Section 5.2.

5.1. PM validation

In line with the standard academic and industrial procedure (Theotokatos and Tzelepis Citation2015), the PM has been calibrated with data provided by the manufacturer, namely the FAT protocol. The percentage error (PE) between the measured values during the engine shop trials, and the predicted values by the PM, is reported in .

Table 4. PM FAT validation results.

Display Table

The PM achieved predictions of sufficient accuracy for the entire speed range. The observed PEs are always lower than 10%, also considering the mean exhaust gas temperature after and before the turbine. Nevertheless, very high accuracy (less than 1%) is obtained at the MCR speed, this is attributed to the fact that the model hyperparameters were tuned specifically for this point and therefore, deviations of the PM performance are expected for the lower engine speed region. However, the PM predictions are satisfactory and the model can be used for the scope of this work.

Results of the calibration are reported in , from which it can be seen that the PM achieves a mean PE of ±5% for all parameters reported, apart from the mean exhaust gas temperature before and after turbine. These results are in agreement with the relevant available literature (Guan et al. Citation2015; Theotokatos and Tzelepis Citation2015; Sui et al. Citation2017). After the calibration phase, the PM validation was performed according to the discussion of Section 4.2, at different DE speeds and loads, to assess model performance with respect to exhaust gas temperatures on the real world data described in Section 3, considering steady-state and dynamic conditions separately.

Figure 6. PM FAT validation results: (a) charge air pressure, (b) relative exhaust gas receiver pressure, (c) relative maximum combustion pressure, (d) specific fuel oil consumption, (e) mean exhaust gas temperature after turbine, (f) mean exhaust gas temperature before turbine.

The results of the steady-state simulations are reported in and , while the results of the dynamic operating conditions are reported in and . From and , it can be observed that for ${\bar{T}}_{t c}^{a}$ the MAPE is significantly greater than 10% and significantly higher than the MAPE for the FAT data points. For example, for ${\bar{T}}_{t c}^{b}$ , the MAPE observed is greater than 20%. This is caused by running the engine at different operating points than the operating points at which the model was calibrated. For calibration, the FAT operating points were used, all on the theoretical propeller curve and above 650 rpm engine speed, while demonstrates that in static and dynamic operating conditions, the engine is running at speeds below 650 rpm and the control system forces the engine to run at loads below the theoretical propeller curve. The FAT measurement at 650 rpm already showed an error of +10% for temperature prediction, while the results in and demonstrate that the greatest prediction errors are below 700 rpm and between 150 and 350 $^{\circ}$ C and appear to get worse with further reducing loads and speeds. These large errors are clearly caused by the fact that the model was not calibrated for low speeds and low powers and by the modelling assumptions. Furthermore, the results in (a) and (b) also illustrate the two different control modes with two different combinator curves that lead to two distinct areas in the scatter plots.

Figure 7. PM steady state operating conditions: (a) engine power and speed, (b) mean exhaust gas temperature before turbine.

Figure 8. PM steady-state operating condition: (a) mean exhaust gas temperature after turbine and (b) mean exhaust gas temperature before turbine.

Figure 9. PM dynamic operating condition: (a) mean exhaust gas temperature after turbine and (b) mean exhaust gas temperature before turbine.

Table 5. PM steady-state performance measures.

Display Table

Table 6. PM Dynamic operating conditions performance measures.

Display Table

Moreover, comparing the results from and with the scatter plots reported in and , interesting observations can be made that cannot be established from the MAPE. While the prediction of the temperature in static conditions appears to be fairly consistent, and could possibly be predicted more accurately with more accurate assumptions, higher order dynamics appear to have a great effect on temperature prediction that cannot be captured by the PM. In particular, the model's predictions of ${\bar{T}}_{t c}^{a}$ are acting as a low-pass filter. In conclusion, the PM in this case is first characterised by highly biased predictions, as reported in the scatter plots of and , and second is acting as a low-pass filter for dynamic operations. This indicates that the Seiliger cycle module (module D in ) needs to be improved to accurately capture operation over the complete operating profile and presents limitations in dynamic operating conditions.

5.2. Models performance comparison

In this section, the authors will compare the performance of PMs, DDMs, and HMs, described in Section 4, in operational conditions using the data described in Section 3.

In order to build $D_{n}$ and $T_{m}$ , the authors split the data in different temporal slots in such a way that data belonging to $D_{n}$ corresponds to a different temporal slot with respect to $T_{m}$ . The two data sets consist of various different manoeuvres using the two control modes described in Geertsma, Negenborn, Visser and Hopman (Citation2017):

Manoeuvre Mode (MM): combinator curve with relative low pitch, high engine speed and fast acceleration rates;
Transit Mode (TM): combinator curve with higher pitch, lower engine speed and slow acceleration rates.

The error metrics reported in and , refer to $T_{m}$ . – have been included purely for illustrative purposes, and correspond to a subset of $T_{m}$ , which covers 24 h of continuous operation of the DE in a healthy mix of steady-state and dynamic conditions, as described in .

Figure 10. Scatter plot (measured vs predicted) and trend in time for ${\bar{T}}_{t c}^{a}$ using a PM with $Δ = 0$ .

Figure 10. Scatter plot (measured vs predicted) and trend in time for T¯tca using a PM with Δ=0.

Table 7. Testing dataset operational description – illustrative subset.

Display Table

Table 8. Indexes of performances (MAE, MAPE, and PPMCC) of the different models (PMs, DDMs, and HMs) for different $Δ \in {0, 10, 20, 30}$ for ${\bar{T}}_{t c}^{a}$ . Note that $Δ = 0$ means that the authors do not exploit time series information from the past, for $Δ > 0$ there is no PM result as described in Section 4.

Display Table

Table 9. Indexes of performances (MAE, MAPE and PPMCC) of the different models (PMs, DDMs, and HMs) for different $Δ \in {10, 20, 30}$ for ${\bar{T}}_{t c}^{b}$ . Note that $Δ = 0$ means that the authors do not exploit time series information from the past, for $Δ > 0$ there is no PM result as described in Section 4.

Display Table

As reported in Section 4.2, PMs are limited to only handling the case with $Δ = 0$ . More precisely, $Δ > 0$ does not improve the model. When it comes to the DDMs, the custom algorithm described in Section 4.3 will be exploited. The set of hyperparameters tuned during the MS phase are $H = {γ, λ}$ chosen in $H = {10^{- 4.0}, 10^{- 3.8}, \dots, 10^{+ 4.0}} \times {10^{- 4.0}, 10^{- 3.8}, \dots, 10^{+ 4.0}}$ .

Eventually, the HMs custom algorithm described in Section 4.4 will be exploited. The set of hyperparameters tuned during the MS phase are $H = {γ, λ, θ}$ chosen in $H = {10^{- 4.0}, 10^{- 3.8}, \dots, 10^{+ 4.0}} \times {10^{- 4.0}, 10^{- 3.8}, \dots, 10^{+ 4.0}}$ $\times {10^{- 4.0}, 10^{- 3.8}, \dots, 10^{+ 4.0}}$ .

All the tests have been repeated 30 times, and the average results are reported together with their t-student $95 %$ confidence interval, to ensure the statistical validity of the results.

5.2.1. PM results

As indicated by the error metrics of and , the PM does not predict the exhaust gas temperatures at turbine inlet ( ${\bar{T}}_{t c}^{b}$ ) and outlet ( ${\bar{T}}_{t c}^{a}$ ) to a satisfactory degree, regardless of the operating (steady-state or dynamic) conditions. As shown in and , the PM is characterised by low bias and high variance in predicting ${\bar{T}}_{t c}^{a}$ , and by high bias and high variance in predicting ${\bar{T}}_{t c}^{b}$ . The same applies to dynamic conditions, according to , and . For the sake of clarity, a representative time-series sample of the PMs' predictions is reported in for $\bar{T_{t c}^{a}}$ .

On one hand, these discrepancies can be attributed to the following assumptions and simplifications (Geertsma, Negenborn, Visser, Loonstijn, et al. Citation2017):

Pressure losses in the inlet duct, filter and air cooler are neglected.
Heat transfer effects along the air and exhaust-gas paths are neglected, namely, heat losses in the inlet duct, filter and intercooler.
Regarding the combustion process, the constant volume portion of combustion increases linearly with engine speed, and the temperature portion of combustion increases proportionately to fuel injection.
Fuel injection time delay is constant.
Scavenge efficiency is constant and equal to unity.
Heat loss modelling during the expansion and blowdown processes has been simplified.
Namely, the heat release efficiency is inversely related to engine speed.
Air temperature at the start of compression is constant.
Combustion efficiency is constant.
The expansion in the turbine is polytropic.
The polytropic efficiency between compressor and turbine has been split equally.
Turbine efficiency is a quadratic function with respect to charge pressure.
Air and exhaust gas properties have been kept constant throughout.
The lower heating value of the fuel is equal to 42,700 [kJ/kg], according to ISO standards.

The model calibration and more advanced assumptions could enable significant improvement to the PM, but only if sufficient calibration data is available over the complete engine operating envelop. On the other hand, the aforementioned assumptions and simplifications enabled the PM to reach a good trade-off between accuracy (in steady-state) and computational time, making the model's run-time close to 2500 times real-time, much faster than MVEMs characterised by the presence of the compressor and turbine maps (Theotokatos and Tzelepis Citation2015; Sapra et al. Citation2017). For the reasons discussed above, although the PM is suitable for real-time applications, its accuracy is not sufficient for accurate temperature prediction in dynamic conditions that allows early identification of emerging failures.

5.2.2. DDMs results

The proposed DDMs are more accurate in predicting both ${\bar{T}}_{t c}^{a}$ and ${\bar{T}}_{t c}^{b}$ compared to the PM, even without considering past information ( $Δ > 0$ ). Of course, when this information is also taken into account, the error metrics drop by around 50% (e.g. MAPE reduces from $2.1 % \pm 0.5 %$ to $1.3 % \pm 0.1 %$ ) as reported in and . In and , representative time-series of the predictions of $\bar{T_{t c}^{a}}$ are shown.

Figure 11. Scatter plot (measured vs predicted) and trend in time for ${\bar{T}}_{t c}^{a}$ using a DDM with $Δ = 0$ .

Figure 11. Scatter plot (measured vs predicted) and trend in time for T¯tca using a DDM with Δ=0.

From and , it is possible to observe that DDMs are capable of fully capturing the thermodynamic transients of the exhaust gases, both in steady-state and dynamic conditions, as shown in . From –, it can be observed that the DDMs are characterised by both lower bias and lower variance, with respect to the PM. The optimal time window (Δ) is found for a value equal to 20 s. For this value, minimal error metrics among all DDMs occur. According to , for this time window, the MAPE for ${\bar{T}}_{t c}^{b}$ is as low as $1.3 % \pm 0.1 %$ , whereas for ${\bar{T}}_{t c}^{a}$ , the MAPE is $1.5 % \pm 0.1 %$ , as reported in . Furthermore, from the scatter plot of , it can be observed that minimum variance is also achieved.

It should be noted that, although DDMs are computationally demanding in the training phase, they are characterised by lower computational complexity in the feed-forward phase, as they just require matrix manipulation methods, in contrast with the solution of a system of DAEs that the PM requires. The combination of both accurate and fast predictions, makes DDMs an ideal candidate for real-time performance and condition estimation. However, the necessary data to reach this level of performance is rather high (Cipollini et al. Citation2018a, Citation2018b), which makes this type of models applicable only after extensive measurement campaigns have been undertaken. Finally, another disadvantage of DDMs is the lack of interpretability as it is not supported by any physical interpretation (Shawe-Taylor and Cristianini Citation2004).

5.2.3. HMs Results

To overcome the limitations discussed in Sections 5.2.1 and 5.2.2 for the PMs and DDMs, respectively, the authors have proposed the use of HMs. These allow the exploitation of both the mechanistic knowledge of the underlying physical principles from the PM, and any available measurements taken during the operation of the vessel.

The novelty introduced by the HMs led to more accurate predictions of both ${\bar{T}}_{t c}^{a}$ and ${\bar{T}}_{t c}^{b}$ compared to the rest of the models, regardless of the time window considered (Δ), as can be seen from and . Furthermore, the same tables reveal that the optimal model is an HM with a time window of 20 s, which achieves MAPEs of $0.6 % \pm 0.1 %$ for ${\bar{T}}_{t c}^{b}$ , and $0.8 % \pm 0.1 %$ for ${\bar{T}}_{t c}^{a}$ . This is also supported by and , which show representative time-series of the predictions of ${\bar{T}}_{t c}^{a}$ for time windows of 10 and 20 s, respectively. It can be seen that the variance has been completely eliminated, whereas the bias has been reduced to near-zero levels.

Figure 12. Scatter plot (measured vs predicted) and trend in time for ${\bar{T}}_{t c}^{a}$ using an HM with $Δ = 0$ .

Figure 12. Scatter plot (measured vs predicted) and trend in time for T¯tca using an HM with Δ=0.

Figure 13. Scatter plot (measured vs predicted) and trend in time for ${\bar{T}}_{t c}^{a}$ using a DDM with $Δ = 20$ which is the best one as shown in .

Figure 13. Scatter plot (measured vs predicted) and trend in time for T¯tca using a DDM with Δ=20 which is the best one as shown in Table 9.

Figure 14. Scatter plot (measured vs predicted) and trend in time for ${\bar{T}}_{t c}^{a}$ using an HM with $Δ = 20$ which is the best one as shown in .

Figure 14. Scatter plot (measured vs predicted) and trend in time for T¯tca using an HM with Δ=20 which is the best one as shown in Table 9.

An advantage of the HMs is their ability to exploit the coarse, but physically supported, predictions of the PM. Therefore, they have much smaller requirements regarding the use of actual measurements for the learning phase (Coraddu et al. Citation2017). While they will still require a measurement campaign in order to be deployed, they can be reliably used already after a few months worth of measurements, in contrast with pure DDMs that would require at least half a year of available data, before they can be exploited.

5.2.4. Features ranking results

In and , the top ranked features are reported for the top performing models, namely the DDMs and HMs for every time window. Starting from $Δ = 0$ , it can be seen that the DDM model ranks the relevant features consistently with respect to engineering knowledge. As expected, the high-temperature (HT) and low-temperature (LT) cooling water temperatures after the coolers ( $T_{h t}^{a}$ , $T_{l t}^{a}$ ), in combination with the temperatures for main bearing 4 and 5 ( $T_{b_{4}}$ , $T_{b_{5}}$ ), have the highest predictive power for ${\bar{T}}_{t c}^{a}$ , according to . This is to be expected because both the lube oil system (where bearings 4 and 5 carry the highest load) and the cooling water system absorb the largest part of the overall heat rejection of the engine, which is tightly coupled with the power output of the engine and serves as an overall indicator for the average temperature increase at each of the measurement points as shown in . Moreover, the charge air temperature after compressor (Bank A – Column 6) $T_{c A}^{a}$ , the charge air temperature before turbine (Bank B – Column 8) $T_{c B}^{a}$ , and the turbine speed $N_{{t c}_{B}}$ (Bank B – Column 7) have influence on the prediction. It should be noted that to compare the real measurements from the IPMS with the PM outcomes, the authors considered the average value of the Banks A and B. For this reason, in , Bank A and Bank B contribution cannot be captured independently by the DDMs and HMs. The same conclusion can be drawn from .

Table 10. Top 10 feature in $X$ , ranked in descending importance, of the different models (DDMs and HMs) for different $Δ \in {0, 10, 20, 30}$ for ${\bar{T}}_{t c}^{a}$ . Note that, for $Δ > 0$ , the importance does not refer to the single feature in $X$ but the entire past temporal series.

Display Table

Table 11. Top 10 feature in $X$ , ranked in descending importance, of the different models (DDMs and HMs) for different $Δ \in {0, 10, 20, 30}$ for ${\bar{T}}_{t c}^{b}$ . Note that, for $Δ > 0$ , the importance does not refer to the single feature in $X$ but the entire past temporal series.

Display Table

Considering the HM's feature ranks from , it should be highlighted that they use as inputs ${\bar{T}}_{t c}^{a}$ and ${\bar{T}}_{t c}^{a}$ from the PM. Highest predictive power is observed for ${\bar{T}}_{t c}^{a} (P M)$ , a result that acts as a sanity check on the feature ranking procedure's robustness. The same features discussed for the DDMs are the most important ones also for the HMs. Nevertheless, non-linear correlations between the different features lead to a slight variation in the features' position. When time windows are also employed (models with $Δ > 0$ ), the most important feature for the prediction of each temperature, as expected, is the time-history of the temperature itself as reported in Sections 5.2.3 and 5.2.4, and depicted in (b).

It can be noted that the models rank approximately the same features among the different time windows. From a physical point of view, this can be considered as a sanity check for the reliability and robustness of the model.

6. Conclusion and discussion

In this work, the authors developed novel hybrid approaches to model diesel engine exhaust gas temperatures in operational conditions. With this purpose in mind, a hybrid modelling approach is introduced, to build a robust and reliable diesel engine model suitable for real-time performance assessment and condition monitoring applications. A state-of-the-art Kernel method has been presented, able to exploit the information provided by on-board measurements from one Holland Class Oceangoing Patrol Vessel, provided by the Royal Netherlands Navy and Damen Schelde Naval Shipbuilding.

To define the improvements brought by the proposed methodology, the authors first applied the standard approach used by industry experts and academics, by using and evaluating a first-principle-equation-based diesel engine model that is capable of providing real-time predictions. However, results in Section 5 show the following: while the calibration results indicated an adequate model that can capture the behaviour of the engine within 10% when using data from the factory acceptance test at operating points on the theoretical propeller curve, validation with real measurements revealed that the performance of the model over the true operating envelope is much worse. This greater error, up to 30% MAPE, is caused by running the engine at much lower loads and speeds, and due to the control strategy that forces the engine to other operating points than the theoretical propeller curve.

On the other hand, the data-driven models proposed in Section 4.3, are adequate in predicting the behaviour of the diesel engine, with a focus on exhaust gas temperatures. Classically, these exhaust gas temperatures would be approached by first principle thermodynamic and heat transfer equations, requiring very detailed design information and possibly lab scale tests to experimentally determine principle heat transfer coefficients. However, due to their nature, these data-driven models are hard to interpret.

To overcome the limitations of both the physical and the data-driven models, the authors developed a hybrid approach that can take into consideration past information, are capable of improving accuracy, are easily interpreted, and have low computational time requirements. These hybrid models can improve average errors by a factor 2 over purely data-driven models. These hybrid models can potentially also be used to improve accuracy of predictions for operation in other conditions than the measured ones, as purely data-driven models cannot be used for extrapolation, but the physical model contribution will improve hybrid model performance during extrapolation. While the hybrid approach will still require a measurement campaign in order to be deployed, this approach can be reliably used based on a significantly smaller dataset in comparison with the pure DDMs, for the same average error, as shown in Section 5.2.3. Moreover, the proposed methodology can also be applied to other industries facing problems of similar nature. Automotive, aviation, railway and process industries are potential candidates for the application of these types of models.

Next steps of the research will consider the utilisation of a more extensive data set containing engines of different vessels, the application of the proposed method to other systems installed on-board, and more importantly the application of the methodology for early fault detection and isolation.

Acknowledgments

This project is supported by the Royal Netherlands Navy supplying the operational measurement data from one Holland Class Oceangoing Patrol Vessel and Damen Schelde Naval Shipbuilding. The Royal Netherlands Navy supplied and maintains its copyright.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

Al-Hinti I, Samhouri M, Al-Ghandoor A, Sakhrieh A. 2009. The effect of boost pressure on the performance characteristics of a diesel engine: A neuro-fuzzy approach. Appl Energy. 86:113–121.
Web of Science ®Google Scholar
Amma NR. 2019. An environmental and economic analysis of methanol fuel for a cellular container ship. Trans Res D. 69:66–76.
Google Scholar
Anderson M, Salo K, Fridell E. 2015. Particle and gaseous emissions form an LNG powered ship. Environ Sci Technol. 49:12568–12575.
PubMed Web of Science ®Google Scholar
Antonić R, Vukić Z, Kuljača O. 2004. Neuro-fuzzy modelling of marine diesel engine cylinder dynamics. IFAC Proc Vol. 37:95–100.
Google Scholar
Argyriou A, Evgeniou T, Pontil M. 2008. Convex multi-task feature learning. Mach Learn. 73:243–272.
Web of Science ®Google Scholar
Asad U, Zheng M. 2014. Exhaust gas recirculation for advanced diesel combustion cycles. Appl Energy. 123:242–52.
Web of Science ®Google Scholar
Bakker B, Heskes T. 2003. Task clustering and gating for Bayesian multitask learning. J Mach Learn Res. 4:83–99.
Web of Science ®Google Scholar
Baldi F, Theotokatos G, Andersson K. 2015. Development of a combined mean value-zero dimensional model and application for a large marine four-stroke diesel engine simulation. Appl Energy. 154:402–415.
Web of Science ®Google Scholar
Baldini A, Ciabattoni L, Felicetti R, Ferracuti F, Freddi A, Monteriú A. 2018. Dynamic surface fault tolerant control for underwater remotely operated vehicles. ISA Trans. 78:10–20.Advanced Methods in Control and Signal Processing for Complex Marine Systems.
PubMed Web of Science ®Google Scholar
Banda OAV, Kannos S, Goerlandt F, van Gelder PH, Bergström M, Kujala P. 2019. A systemic hazard analysis and management process for the concept design phase of an autonomous vessel. Reliab Eng Syst Safety. 191:106584.
Google Scholar
Basurko OC, Uriondo Z. 2015. Condition-based maintenance for medium speed diesel engines used in vessels in operation. Appl Therm Eng. 80:404–412.
Web of Science ®Google Scholar
Baxter J. 2000. A model of inductive bias learning. J Artif Intell Res. 12:149–198.
Web of Science ®Google Scholar
Bhatti G, Mohan H, Singh R. 2021. Towards the future of smart electric vehicles. Digital twin technology. Renew Sustain Energy Rev. 141:110801.
Web of Science ®Google Scholar
Bondarenko O, Fukuda T. 2020. Development of a diesel engine's digital twin for predicting propulsion system dynamics. Energy. 196:117126.
Google Scholar
Bukovac O, Medica V, Mrzljak V. 2015. Steady state performances analysis of modern marine two-stroke low speed diesel engine using MLP neural network model. Brodogradnja. 66:57–70.
Web of Science ®Google Scholar
Cai B, Sun X, Wang J, Yang C, Wang Z, Kong X, Liu Z, Liu Y. 2020. Fault detection and diagnostic method of diesel engine by combining rule-based algorithm and BNs/BPNNs. J Manuf Syst. 57:148–157.
Web of Science ®Google Scholar
Caruana R. 1997. Multitask learning. Mach Learn. 28:41–75.
Web of Science ®Google Scholar
Cipollini F, Oneto L, Coraddu A, Murphy AJ, Anguita D. 2018a. Condition-based maintenance of naval propulsion systems: data analysis with minimal feedback. Reliab Eng Syst Safety. 177:12–23.
Web of Science ®Google Scholar
Cipollini F, Oneto L, Coraddu A, Murphy AJ, Anguita D. 2018b. Condition-based maintenance of naval propulsion systems with supervised data analysis. Ocean Eng. 149:268–278.
Web of Science ®Google Scholar
Coraddu A, Kalikatzarakis M, Oneto L, Meijn GJ, Godjevac M, Geertsmad RD. 2018. Ship diesel engine performance modelling with combined physical and machine learning approach. International Naval Engineering Conference and Exhibition (INEC).
Google Scholar
Coraddu A, Oneto L, Baldi F, Anguita S. 2017. Vessels fuel consumption forecast and trim optimisation: a data analytics perspective. Ocean Eng. 130:351–370.
Web of Science ®Google Scholar
Coraddu A, Oneto L, Ilardi D, Stoumpos S, Theotokatos G. 2021. Marine dual fuel engines monitoring in the wild through weakly supervised data analytics. Eng Appl Artif Intell. 100:104179.
Google Scholar
Cristianini N, Shawe-Taylor J. 2000. An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press.
Google Scholar
Curley R. 2012. The complete history of ships and boats. Britannica Educational Publishing.
Google Scholar
Dixon SL. 1998. Fluid mechanics and thermodynamics of thermomachinery. Butterworth Heinemann.
Google Scholar
Efron B, Tibshirani RJ. 1994. An introduction to the bootstrap. CRC Press.
Google Scholar
Evgeniou T, Pontil M. 2004. Regularized multi-task learning. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
Google Scholar
Fernández-Delgado M, Cernadas E, Barro S, Amorim D. 2014. Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res. 15:3133–3181.
Web of Science ®Google Scholar
Galindo J, Lujan JM, Serrano JR, Hernández L. 2005. Combustion simulation of turbocharger HSDI diesel engines during transient operation using neural networks. Appl Therm Eng. 25:877–898.
Web of Science ®Google Scholar
Gao Y, Karimi M, Kudreyko AA, Song W. 2018. Spare optimistic based on improved ADMM and the minimum entropy de-convolution for the early weak fault diagnosis of bearings in marine systems. ISA Trans. 78:98–104. Advanced Methods in Control and Signal Processing for Complex Marine Systems.
PubMed Web of Science ®Google Scholar
Geertsma RD, Badon Ghijben NA, Zwart OR. 2013. Fire fighting and damage control automation: enabling future crew reduction. Engine as a Weapon International Symposium.
Google Scholar
Geertsma RD, Negenborn RR, Visser K, Hopman JJ. 2017. Parallel control for hybrid propulsion of multifunction ships. IFAC Proc Vol. 50(1):2296–2303.
Google Scholar
Geertsma RD, Negenborn RR, Visser K, Loonstijn MA, Hopman JJ. 2017. Pitch control for ships with diesel mechanical and hybrid propulsion: modelling, validation and performance quantification. Appl Energy. 206:1609–1631.
Web of Science ®Google Scholar
Geertsma RD, Visser K, Negenborn RR. 2018. Adaptive pitch control for ships with diesel mechanical and hybrid propulsion. Appl Energy. 228:2490–2509.
Web of Science ®Google Scholar
Geng P, Mao H, Hang Y, Wei L, You K, Ju J, Chen T. 2017. Combustion characteristics and NOx emissions of a waste cooking oil biodiesel blend in a marine auxiliary diesel engine. Appl Therm Eng. 115:947–954.
Web of Science ®Google Scholar
Ghaderi H. 2019. Autonomous technologies in short sea shipping: trends, feasibility and implications. Trans Rev. 39:152–173.
Web of Science ®Google Scholar
Ghelardoni L, Ghio A, Anguita D. 2013. Energy load forecasting using empirical mode decomposition and support vector regression. IEEE Trans Smart Grid. 4:549–556.
Web of Science ®Google Scholar
Grimmelius HT. 2003. Simulation models in marine engineering: from training to concept exploration. In COMPIT.
Google Scholar
Grimmelius H, Mesbahi E, Schulten P, Stapersma D. 2007. The use of diesel engine simulation models in ship propulsion plant design and operation. CIMAC International Council on Combustion Engines. p. 1–12.
Google Scholar
Grondin O, Stobart R, Chafouk H, Maquet J. 2004. Modelling the compression ignition engine for control: review and future trends. Technical Report SAE Technical Paper.
Google Scholar
Guan C, Theotokatos G, Chen H. 2015. Analysis of two stroke marine diesel engine operation including turbocharger cut-out by using a zero-dimensional model. Energies. 8:5738–5764.
Web of Science ®Google Scholar
Guyon I, Elisseeff A. 2003. An introduction to variable and feature selection. J Mach Learn Res. 3:1157–1182.
Google Scholar
Hao C, Lu Z, Feng Y, Bai H, Wen M, Wang T. 2021. Optimization of fuel/air mixing and combustion process in a heavy-duty diesel engine using fuel split device. Appl Therm Eng. 186:116458.
Google Scholar
Hoang AT, Tran VD, Dong VH, Le AT. 2019. An experimental analysis on physical properties and spray characteristics of an ultrasound-assisted emulsion of ultra-low-sulphur diesel and jatropha-based biodiesel. J Mar Eng Technol.
Google Scholar
Horenberg SC, Melaet ACF. 2013. Uniting weapon and marine knowledge. In Engine as a Weapon International Symposium.
Google Scholar
Hountalas DT. 2000. Prediction of marine diesel engine performance under fault conditions. Appl Therm Eng. 20:1753–1783.
Web of Science ®Google Scholar
IMO MARPOL. 2011. International convention for the prevention of pollution from ships (MARPOL) annex VI. Consolidated edition International Maritime Organisation.
Google Scholar
IMO MEPC 72. 2018. Initial strategy on greenhouse gas emissions reduction for ships. Technical Report International Maritime Organisation.
Google Scholar
Johnson K, Mollenhauer K, Tschoke H. 2010. Handbook of diesel engines. Springer Science & Business Media.
Google Scholar
Kalikatzarakis M, Geertsma RD, Boonen EJ, Visser K, Negenborn RR. 2018. Ship energy management for hybrid propulsion and power supply with shore charging. Appl Energy. 76:133–154.
Google Scholar
Karimi HR, Zhang H, Ding S. 2018. Advanced methods in control and signal processing for complex marine systems. ISA Trans. 78:1–2. Advanced Methods in Control and Signal Processing for Complex Marine Systems.
PubMed Web of Science ®Google Scholar
Keerthi SS, Lin CJ. 2003. Asymptotic behaviors of support vector machines with Gaussian kernel. Neural Comput. 15:1667–1689.
PubMed Web of Science ®Google Scholar
Khaled AB, Gaid MB, Pernet N, Simon D. 2014. Fast multi-core co-simulation of cyber-physical systems: application to internal combustion engines. Simul Model Pract Theory. 47:79–91.
Web of Science ®Google Scholar
Kohavi R. 1995. A study of cross-validation and bootstrap for accuracy estimation and model selection. In International Joint Conference on Artificial Intelligence.
Google Scholar
Kökkülünk G, Parlak A, Erdam HH. 2016. Determination of performance degradation of a marine diesel engine by using curve based approach. Appl Therm Eng. 108:1136–1146.
Web of Science ®Google Scholar
Korczewski Z. 2015. Exhaust gas temperature measurements in diagnostics of turbocharged marine internal combustion engines part I standard measurements. Polish Marit Res. 22:47–54.
Web of Science ®Google Scholar
Korczewski Z. 2016. Exhaust gas temperature measurements in diagnostics of turbocharged marine internal combustion engines part II dynamic measurements. Polish Marit Res. 23:68–76.
Web of Science ®Google Scholar
Larsen U, Pierobon L, Baldi F, Haglind F, Ivarsson A. 2015. Development of a model for the prediction of the fuel consumption and nitrogen oxides emission trade-off for large ships. Energy. 80:545–555.
Web of Science ®Google Scholar
Leifsson L, Sævarsdóttir H, Sigurdhsson S, Vésteinsson A. 2008. Grey-box modeling of an ocean vessel for operational optimization. Simul Model Pract Theory. 16:923–932.
Web of Science ®Google Scholar
Liu M, Fang S, Dong H, Xu C. 2020. Review of digital twin about concepts, technologies, and industrial applications. J Manuf Syst. 58:346–361.
Web of Science ®Google Scholar
Liu C, Negenborn RR, Chu X, Zheng H. 2017. Predictive path following based on adaptive line-of-sight for underactuated autonomous surface vessels. J Mar Sci Technol. 23(3):483–494.
Web of Science ®Google Scholar
Liu C, Negenborn RR, Chu X, Zheng H. 2019. Colregs-constrained real-time path planning for autonomous ships using modified artificial potential fields. J Navig. 72(3):588–608.
Web of Science ®Google Scholar
Llamas X, Eriksson L. 2018. Control-oriented modeling of two-stroke diesel engines with exhaust gas recirculation for marine applications. Proc Inst Mech Eng M: J Eng Maritime Environ. 233:551–574. doi:https://doi.org/10.1177/1475090218768992
Google Scholar
Lloyd AC, Cackette TA. 2001. Diesel engines: environmental impact and control. J Air Waste Manage Assoc. 51:809–847.
Web of Science ®Google Scholar
Miglianti F, Cipollini F, Oneto L, Tani G, Viviani M. 2019. Model scale cavitation noise spectra prediction: combining physical knowledge with data science. Ocean Eng. 178:185–203.
Web of Science ®Google Scholar
Namigtle-Jiménez A, Escobar-Jiménez R, Gómez-Aguilar J, García-Beltrán C, Téllez-Anguiano A. 2019. Online ANN-based fault diagnosis implementation using an FPGA: application in the EFI system of a vehicle. ISA Trans.
PubMed Web of Science ®Google Scholar
Nielsen KV, Blanke M, Eriksson L. 2017. Control-oriented model of molar scavenge oxygen fraction for exhaust recirculation in large diesel engines. J Dyn Syst Meas Contr. 139:1–9.
Web of Science ®Google Scholar
Nikzadfar K, Shamekhi AH. 2014. Investigating the relative contribution of operational parameters on performance and emissions of a common-rail diesel engine using neural network. Fuel. 125:116–128.
Web of Science ®Google Scholar
Oneto L. 2020. Model selection and error estimation in a nutshell. Springer.
Google Scholar
Packard NH, Crutchfield JP, Farmer JD, Shaw RS. 1980. Geometry from a time series. Phys Rev Lett. 45:712.
Web of Science ®Google Scholar
Palmer KA, Bollas GM. 2019. Active fault diagnosis for uncertain systems using optimal test designs and detection through classification. ISA Trans. 93:354–369.
PubMed Web of Science ®Google Scholar
Parlak A, Islamoglu Y, Yasar H, Egrisogut A. 2006. Application of artificial neural network to predict specific fuel consumption and exhaust temperature for a diesel engine. Appl Therm Eng. 26:824–828.
Web of Science ®Google Scholar
Porteiro J, Collazo J, Patiño D, Míguez JL. 2011. Diesel engine condition monitoring using a multi-net neural network system with nonintrusive sensors. Appl Therm Eng. 31:4097–4105.
Web of Science ®Google Scholar
Qi G, Zhu Z, Erqinhu K, Chen Y, Chai Y, Sun J. 2018. Fault-diagnosis for reciprocating compressors using big data and machine learning. Simul Model Pract Theory. 80:104–127.
Web of Science ®Google Scholar
Reitz R, Rutland C. 1995. Development and testing of diesel engine CFD models. Prog Energy Combust Sci. 21:173–196.
Web of Science ®Google Scholar
Rosasco L, De Vito E, Caponnetto A, Piana M, Verri A. 2004. Are loss functions all the same? Neural Comput. 16:1063–1076.
PubMed Web of Science ®Google Scholar
Sapra H, Godjevac M, Visser K, Stapersma D, Dijkstra C. 2017. Experimental and simulation-based investigations of marine diesel engine performance against static back pressure. Appl Energy. 204:78–92.
Web of Science ®Google Scholar
Scappin F, Stefansson SH, Haglind F, Andreasen A, Larsen U. 2012. Validation of a zero-dimensional model for prediction of NOx and engine performance for electronically controlled marine two-stroke diesel engines. Appl Therm Eng. 37:344–352.
Web of Science ®Google Scholar
Scholkopf B. 2001. The kernel trick for distances. In Advances in neural information processing systems.
Google Scholar
Schölkopf B, Herbrich R, Smola AJ. 2001. A generalized representer theorem. In Computational learning theory.
Google Scholar
Schwartz G. 2002. Reliability and survivability in the reduced ship's crew by virtual presence system. In Proceedings International Conference on Dependable Systems and Networks. IEEE. p. 199–204.
Google Scholar
Shalev-Shwartz S, Ben-David S. 2014. Understanding machine learning: from theory to algorithms. Cambridge University Press.
Google Scholar
Shawe-Taylor J, Cristianini N. 2004. Kernel methods for pattern analysis. Cambridge University Press.
Google Scholar
Silva AA, Gupta S, Bazzi AM, Ulatowski A. 2018. Wavelet-based information filtering for fault diagnosis of electric drive systems in electric ships. ISA Trans. 78:105–115.Advanced Methods in Control and Signal Processing for Complex Marine Systems.
PubMed Web of Science ®Google Scholar
Stapersma D. 2010. Diesel engines: a fundamental approach to performance analysis, turbocharging, combustion, emissions and heat transfer, volume 2: turbocharging. Netherlands Defence Academy.
Google Scholar
Stoumpos S, Theotokatos G, Mavrelos C, Boulougouris E. 2020. Towards marine dual fuel engines digital twins–integrated modelling of thermodynamic processes and control system functions. J Mar Sci Eng. 8:200.
Web of Science ®Google Scholar
Sui C, Song E, Stapersma D, Ding Y. 2017. Mean value modelling of diesel engine combustion based on parameterized finite stage cylinder process. Ocean Eng. 136:218–232.
Web of Science ®Google Scholar
Svanberg M, Ellis J, Lundgren J, Landalv I. 2018. Renewable methanol as a fuel for the shipping industry. Renew Sustain Energy Rev. 94:1217–1228.
Web of Science ®Google Scholar
Takens F. 1981. Detecting strange attractors in turbulence. Springer.
Google Scholar
Taljegard M, Brynolf S, Grahn M, Andersson K, Johnson H. 2014. Cost-effective choices of marine fuels in a carbon-constrained world: results from a global energy model. Environ Sci Technol. 48:12986–12993.
PubMed Web of Science ®Google Scholar
Teng S, Touš M, Leong W, How B, Lam H, Máša V. 2021. Recent advances on industrial data-driven energy savings: digital twins and infrastructures. Renew Sustain Energy Rev. 135:110208.
Web of Science ®Google Scholar
Theotokatos G. 2010. On the cycle mean value modelling of a large two-stroke marine diesel engine. Proc Inst Mech Eng M: J Eng Maritime Environ. 224:193–205.
PubMed Web of Science ®Google Scholar
Theotokatos G, Tzelepis V. 2015. A computational study on the performance and emission parameters mapping of a ship propulsion system. J Eng Maritime Environ. 229(1):58–76.
Google Scholar
Tikhonov AN, Arsenin VY. 1979. Methods for solving ill-posed problems. Moscow: Nauka.
Google Scholar
Vapnik VN. 1998. Statistical learning theory. New York: Wiley.
Google Scholar
Verbert K, De Schutter B, Babuška R. 2017. Timely condition-based maintenance planning for multi-component systems. Reliab Eng Syst Safety. 159:310–321.
Web of Science ®Google Scholar
Viana M, Hammingh P, Colette A, Querol X, Degraeuwe B, de Vlieger I, van Aerdenne J. 2014. Impact of maritime transport emissions on coastal air quality in Europe. Atmos Environ. 90:96–105.
Web of Science ®Google Scholar
Wainberg M, Alipanahi B, Frey BJ. 2016. Are random forests truly the best classifiers? J Mach Learn Res. 17:3837–3841.
Web of Science ®Google Scholar
Wang X, Cai Y, Li A, Zhang W, Yue Y, Ming A. 2021. Intelligent fault diagnosis of diesel engine via adaptive VMD-Rihaczek distribution and graph regularized bi-directional NMF. Measurement. 172:108823.
Google Scholar
Wang R, Chen H, Guan C. 2021. Random convolutional neural network structure: an intelligent health monitoring scheme for diesel engines. Measurement. 171:108786.
Google Scholar
Wang H, Gan H, Theotokatos G. 2020. Parametric investigation of pre-injection on the combustion, knocking and emissions behaviour of a large marine four-stroke dual-fuel engine. Fuel. 281:118744.
Google Scholar
Wang J, Wang Z, Stetsyuk V, Ma X, Gu F, Li W. 2019. Exploiting Bayesian networks for fault isolation: a diagnostic case study of diesel fuel injection system. ISA Trans. 86:276–286.
PubMed Web of Science ®Google Scholar
Wu X, Chang Y, Mao J, Du Z. 2013. Predicting reliability and failures of engine systems by single multiplicative neuron model with iterated nonlinear filters. Reliab Eng Syst Safety. 119:244–250.
Web of Science ®Google Scholar
Xiang L, Theotokatos G, Ding Y. 2019. Investigation on gaseous fuels interchangeability with an extended zero-dimensional engine model. Energy Convers Manag. 183:500–514.
Web of Science ®Google Scholar
Xu Z, Ji F, Ding S, Zhao Y, Zhou Y, Zhang Q, Du F. 2021. Digital twin-driven optimization of gas exchange system of 2-stroke heavy fuel aircraft engine. J Manuf Syst. 58:132–145.
Web of Science ®Google Scholar
Xu K, Tang LC, Xie M, Ho SL, Zhu M. 2002. Fuzzy assessment of FMEA for engine systems. Reliab Eng Syst Safety. 75:17–29.
Web of Science ®Google Scholar
Yang Y, Linlin L, Ding SX, Qiu J, Peng K. 2018. Fault detection for piecewise affine systems with application to ship propulsion systems. ISA Trans. 78:3–9. Advanced Methods in Control and Signal Processing for Complex Marine Systems.
PubMed Web of Science ®Google Scholar
Young DM. 2003. Iterative solution of large linear systems. Dover Publications.
Google Scholar
Yu M, Tang X, Lin Y, Wang X. 2018. Diesel engine modeling based on recurrent neural networks for a hardware-in-the-loop simulation system of diesel generator sets. Neurocomputing. 283:9–19.
Web of Science ®Google Scholar
Yuanwang D, Meilin Z, Dong X, Xiaobei C. 2002. An analysis for effect of cetane number on exhaust emissions from engine with the neural network. Fuel. 81:1963–1970.
Web of Science ®Google Scholar
Zahedi B, Norum LE, Ludwigsen KB. 2014. Optimised efficiency of all-electric ship by dc hybrid power systems. J Power Sources. 255:341–354.
Web of Science ®Google Scholar
Zhang M, Wang T, Tang T, Benbouzid M, Diallo D. 2017. An imbalance fault detection method based on data normalization and EMD for marine current turbines. ISA Trans. 68:302–312.
PubMed Web of Science ®Google Scholar
Zinner K. 1980. Aufladung von Verbrennungsmotoren. Springer-Verlag.
Google Scholar

Physical, data-driven and hybrid approaches to model engine exhaust gas temperatures in operational conditions

ABSTRACT

1. Introduction