545
Views
0
CrossRef citations to date
0
Altmetric
Technical Papers

Nonlinear modeling of industrial boiler NOx emissions

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon &
Pages 556-569 | Received 04 Jun 2021, Accepted 02 Sep 2021, Published online: 25 Apr 2022

ABSTRACT

Pollutant emissions into the atmosphere are recognized as a significant problem in fossil fuel combustion. The pollution emission measurement in industrial boilers is difficult and expensive but fundamental for monitoring and controlling. Frequently continuous emissions monitoring (CEM) system is out of service or useless due to obsolescence, high maintenance cost, and so on or simply is not installed. When a system for measuring pollutant emissions is not available, an alternative method must be employed to get the pollutant emission value. According to the black-box model approach, this article describes the nonlinear modeling of NOx emissions from a utility boiler. Bayesian-Gaussian (BG), multilayer perceptron (MLP), and Volterra polynomial basis functions (VPBF) neural networks are developed for model benchmarking. Experimental data from a utility boiler was acquired in order to model definition and evaluation. The models process three boiler variables oxygen excess, fuel mass flow and flue gas recirculation gates for NOx emission estimation. Models with BG show better performance than models with MLP and VPBF for NOx prediction.

Implications: The technology to control NOx emissions generated by combustion operates under strict regulations. In order to reduce NOx emissions, theoretical models of NOx generation have been studied extensively, including nitrogen chemistry and the dynamic flow of gas particles which is very complex. The new technology trends would require the continuous measurement of high precision NOx emissions to achieve further reductions in NOx emissions. Currently, NOx emissions are measured by a Continuous Emission Monitoring system, which turns out to be extremely expensive and difficult to maintain, so alternative low-cost solutions are desirable. Our contribution shows how algorithms based on different artificial intelligence techniques are viable and quality alternatives for the measurement of continuous NOx emissions. The NOx emissions models based on IA algorithms are viable alternatives that have versatility and self-tuning capacity due to the fact that they are based on boiler operation parameters which have valuable information few explored nowadays.

Introduction

lectricity has become an essential element of everyday life. When the electricity supply fails, our daily life activities stop, all electrical systems are useless, and modern life cannot continue. Institutions, industries, and so on would have limited operation.

Even though there are several ways to produce electricity, most electricity is generated by fossil fuel power plants because they are plentiful and it will be decades before they run out. The World Energy Council (WEC) estimates that by 2020, 76% of the world’s primary energy will be generated by fossil fuels. The burning of fossil fuels in boilers for the production of electrical energy results in the discharge of greenhouse gases into the ambient air.

Nitrogen oxides (NOx) are one of the pollutants generated during combustion processes. NOx in the combustion gases consists of 90% to 95% nitric oxide (NO), and the rest is nitrogen dioxide (NO2). NOx emissions react in the atmosphere with the presence of sunlight and water to form acid components and ozone in the shallow altitude environment (tropospheric ozone). Low-altitude ozone is one of the significant components of smog in cities and some rural areas. Ozone well above the earth in the stratosphere provides a protective layer, but the ozone that we breathe at ground level has been related to respiratory diseases and other health problems. Acid rain can damage ecosystems by directly destroying plant tissues, and it can also be combined with other pollutants, such as ozone, weakening trees and leaving them vulnerable to pests. NOx emission reduction is the primary requirement for a power plant.

The technology that controls combustion-generated NOx emissions operates under strict regulations. Bowman (Citation1992) reviewed some of the classic and new technology for NOx emission reduction from combustion sources and examined future technologies to meet the stringent emission standards. He also indicated that new technologies for NOx emission reduction required deep analysis of the nitrogen combustion process as well as establishing pathways for improving reductions in continuous NOx emissions.

Theoretical NOx generation models have been studied extensively, to reduce NOx emissions. These include nitrogen chemistry, dynamic flow of complex gas particles, and data-based models, the primary focus of this research. In most cases, the main source of NOx emission is the nitrogen contained in the fuel that is converted into NOx during combustion. The formation occurs through a series of chemical reactions that are not yet fully understood.

For some plants, developing an enhanced NOx emissions control system is of great importance. Technically, advanced control methods require some form of modeling, which uses historical and current data from the plant, to determine the correlation between the plant operational inputs and NOx emissions. This model can later be optimized to manipulate the model inputs minimizing NOx emissions. These values are used for adjusting the plant inputs. Li, Thompson, and Peng (Citation2002) conclude that with these techniques NOx reductions are achieved.

Several studies have been carried out for data-based modeling NOx emissions. Li, Thompson, and Peng (Citation2002) use artificial neural networks (ANN) systematically trying to model NOx emissions. NOx formation is a complex and nonlinear process. Also, in the power generation plants, changes in operating conditions could affect NOx emission levels, and these variants could be plant-dependent. On the other hand, the power generation plant operation is not interruptible, and the only available data is the operational data. This data is time-dependent and can be obtained on a daily, weekly, or annual basis. In order to identify the operational variables that have NOx emissions information, sensitivity analysis, data selection for modeling, ANN structure model selection, and generalized ANN training are discussed in this paper.

Ikonen, Najim, and Kortela (Citation2000) used fuzzy neural networks to data-based model the NOx emissions from a fluidized bed combustion chamber. The distributed logic processor, sigmoid neural networks, and a recursive error prediction method were presented as a learning method. Simulations indicate the conclusion is that the distributed logic processor models for NOx emissions in a fluidized bed combustion chamber were able to compete with linear and nonlinear regression methods.

In another study conducted by Hocking, Johnson, and Flowers (Citation2002) to comply with regulations while improving the flexibility and response generation, public services in Texas implement hybrid systems that link advances in software and hardware for emission reduction to improve the boiler combustion efficiency. These solutions use advanced control processes (APC) with high-performance experiential model techniques created using plant data from neural network software. This method has resulted in a significant NOx reduction while the production of carbon monoxide (CO) is controlled. A NOx model was developed using neural networks with manipulable variables such as the O2 setpoint; the separate air registers (SOFA) gate and the oven differential pressure (DP), etc. In addition, nonlinear steady-state models and dynamic linear data-based models with a single input and a single output (SISO) were developed from planned tests, where one input was adjusted, and the others remained constant. An APC technique was developed using a combination of dynamic and steady-state models.

Zhou and Cen (Citation2018) used support vector regression (SVR) for the modeling of NOx emissions to find the best condition using global search tools. This model was developed using coal fuel features as the dependent variable in NOx emissions. In recent studies have been developed a NOx model based in a recurrent neural network (RNN) (Safdarnejad, Tuttle, and Powell Citation2019), in a combined ANN and genetic algorithm (GA) scheme (Shi et al. Citation2019) of a utility boiler for emission prediction and optimization, and in radial basis functions (RBF) (Iliyas et al. Citation2013) neural network in a natural gas-fired, water-tube boiler under various operating condition simulation.

Several works have been carried out in coal-fired power units. Song et al. (Citation2017) applied a General Regression Neural Network (GRNN) which is a kind of RBF network for NOx prediction. A model comparison study including linear and nonlinear modeling approaches was presented in (Smrekar, Potočnik, and Senegačnik Citation2013), did not find significant differences and an input variable selection analysis was recommended. Recently, Tuttle et al. conducted some studies to predict NOx emissions by identifying the optimal radial basis kernel for SVM classification algorithm (Tuttle, Blackburn, and Powell Citation2020), comparing 10 machine learning algorithms model methods (Tuttle et al. Citation2021), and selecting the neural network model structure based on genetic GA for combustion optimization process application (Tuttle et al. Citation2019). These models were defined through linear structure to find the ANN inputs, using the online NOx emission measurement.

In nonlinear modeling, ANNs are useful for identifying dynamic systems including NOx modeling, and MLP ANNs are the most used. It has been proven that an MLP with only a hidden layer using the sigmoid activation function can approximate any multivariable function (Hornik, Stinchcombe, and White Citation1989). There are significant neural network types for data-based modeling; Ye, Nicolai, and Reh (Citation1998) developed Bayesian-Gaussian neural networks; this approach enhances neural network performance for online estimations. The BG neural network algorithm was modified applying GA (Liu and Peng Citation2009), as an alternative to the simplex algorithm, making it an attractive algorithm in applications where nonlinear systems change in time, this feature makes it very attractive for dynamic online systems modeling.

Algorithms for model structure detection and online parameter estimation of the nonlinear systems have been studied using orthogonal estimation algorithm based on the nonlinear autoregressive-moving average (NARMA) polynomial models with QR orthogonal decomposition algorithm using a sliding data window (Luo and Billings Citation1995). Subsequently, Luo, Billings, and Tsang (Citation1996) used the same technique to apply to an exponential data window.

Radial basis function neural networks provide another alternative to nonlinear modeling (Han, Chen, and Qiao Citation2010). The standard training algorithm for RBF neural networks has some limitations. Kassam and Cha (Citation1993) introduced an RBF network training approach according to the stochastic gradient (SG), improving the performance of RBF models. This method forces a compromise between speed and accuracy in the learning process, Zeng, Zhao, and Jin (Citation2012) proposed to introduce multiple RBF convex combinations utilizing different step sizes in the SG learning algorithm to improve the SG algorithm performance.

The Volterra series has been used successfully in the nonlinear system identification (Schmidt et al. Citation2014; Wray and Green Citation1994) and modeling (Cheng et al. Citation2017; Ronquillo-Lomeli et al. Citation2018), controllers design, and model structure detection with online parameter estimation (Liu Citation2001). In this method, the orthogonal least squares (OLS) algorithm is applied for model structure detection and size control using an online model structure selection. The online structure selection is used to graduate the network complexity and make it suitable for providing a system approximation uniform with the actual data values that are being received, and we also developed an algorithm for recursive parameter estimation using the Lyapunov synthesis.

The objective of this work is the NOx emission model development for a heavy oil-fired 350 MW utility boiler. The NOx model was built using a data-based approach also known as black-box modeling (Sjöberg et al. Citation1995). The nonlinear model structure is based on BG, MLP, and VPBF neural networks assuming that it can uniformly approximate any continuous function (Cybenko Citation1989; Funahashi Citation1989) and with good precision according to the universal approximation theorem (Haykin Citation1999). The Nelder-Mead simplex algorithm (Nelder and Mead Citation1965) for the BG network model, the backpropagation algorithm for the MLP network model, and the orthogonal least squares algorithm (Billings, Chen, and Korenberg Citation1989) for the VPBF network model were used in the learning process. Finally, root mean square error (RMSE) and the mean absolute error (MAE) were used to compare BG, MLP, and VPBF model performance to select the best approach for NOx emissions modeling.

Materials and methods

For the ANN model development, a data-based approach was used; in the first stage, experimental data are collected. Once enough data is collected, a family and structure model are selected, and then the “best” model structure is chosen for the parameter estimation stage. When the model is completely defined, it is necessary to evaluate its performance using some error metric in the model evaluation stage; in this step, we can go back to the procedure to test different family and structure models. Finally, the model with the best performance is selected.

Experimental test

The experimental test purpose is to collect a data set that describes how the system behaves under all operating conditions. By changing the input, u enough, the output impact is observed in y. The corresponding input and output data set ut,yt are subsequently used to infer a system model.

The experiment was carried out in a 350 MW industrial boiler with an opposite wall that burns heavy fuel oil with 24 burners installed in 12 cells with a secondary air register per burner pair. The unit has a balanced draft furnace and is of the subcritical, single reheat design. The maximum continuous rating (MCR) for steam flow output is 170 kg/cm2 and 538°C is 1058 metric ton/h. Both main steam and hot reheat design temperatures are 538°C. Steam temperature control is achieved with flue gas recirculation (FGR). shows a simplified boiler scheme. Before the execution of the parametric test, some activities were carried out for combustion tunings such as preliminary tests, test planning, boiler inspection, burner balancing, sensitivity analysis of variables, and so on.

Figure 1. Boiler unit configuration.

Figure 1. Boiler unit configuration.

Combustion tuning and preliminary tests

A combustion tuning was performed to balance the boiler stoichiometric status. Preliminary tests were accomplished at the three loads, 100%, 75%, and 50% using a combination of economizer oxygen excess (O2), fuel temperature (Tf), steam atomization pressure (Pa), and FGR damper position. The oxygen excess was diverted by the total air flow to furnace boxes controlled by the forced draft fan. The fuel parameters such as temperature, pressure, and atomization were varied within a safe operating interval. The FGR gate was operated taking care not to exceed the boiler’s operating limits such as main steam temperature. Non-controllable variables (unit load, fuel mass flow (Qf), steam temperature, etc.) were monitored to perform a sensitivity analysis on NOx emissions. Operating ranges for all variables were defined so as not to break any operational constraints limited by the boiler design.

Sensitivity analysis

Some ANN limitations are the quantity of data required, slow training process, overfitting, and hidden parameters within black box models. To reduce these limitations, it is necessary to omit irelevant data components in the model inputs to obtain reduced networks, small model structure, and minimal redundancy in the data. Selecting the appropriate input parameters for a neural network is known as feature selection, which means finding the most relevant variables for a specific goal. The sensitivity analysis is a method used to select variables and is often used to identify and rank relevant inputs.

According to (Evaluation of measurement data-Guide to the expression of uncertainty in measurement Citation2008), the partial derivatives of a model function f/xi are called sensitivity coefficients, describing how the output estimate y varies with changes in the input values x1,x2,,xN. In particular, the change in y produced by a small change Δxi in the input estimate xi is given by Δyi=f/xiΔxi. Instead of being calculated from the function f, sensitivity coefficients f/xi are sometimes determined experimentally when the function f is unknown. For experimental sensitivity analysis, the Morris method is commonly used, which has been used to test the impact of parameters. The Morris method evaluates the model output response for small changes in each input variable. The mean impact of a single variable from the complete dataset points of size M and N variables is presented as

(1) μk=1Mk=1Nfx1,,xk+Δk,,xNfx1,,xNΔk,(1)

In the preliminary tests, a complete database was recorded where the input target variable xk was normally varied with time; however, small variations in all variables were unavoidable. To include these small uncontrollable changes in variables on sensitivity parameters μk, the method developed in (Chen et al. Citation2020) was used. The problem is exchanged for the linear equation solution to find the μk parameters that include the variations of the inputs in the output. The common approach for the linear equation solution is to find the least-square solution to minimize the unfitted error referring to all used data points for sensitivity analysis. The input database created during the preliminary test has 2574 data points for each variable. The input data have seven different variables including controllable and uncontrollable boiler operating variables as shown in .

Table 1. Input variables for sensitivity analysis

The sensitivity analysis result is shown in . The results indicate that the economizer oxygen excess has the highest sensitivity to NOx emissions, the top three highly sensitive variables are related to fuel/air ratio and flue gas temperature, which are involved with NOx generation.

Figure 2. Sensitivity analysis result of seven selected input variables.

Figure 2. Sensitivity analysis result of seven selected input variables.

The steam flow for temperature control (Qa), steam atomization pressure (Pa), superheated steam temperature (Ts), and fuel temperature (Tf) are the variables with the least sensitivity to NOx emissions. With the sensitivity analysis result, the number of input variables can be reduced by selecting only the three inputs with the highest sensitivity to the NOx emission model. Therefore, it was determined that the economizer oxygen excess (O2), fuel mass flow (Qf), and the flue gas recirculation (FGR) damper position are the most important variables with regard to NOx emissions.

Parametric test

Once the sensitivity analysis results had been determined, parametric tests were carried out at 50%, 75%, and 100% of the unit load to leave room for maneuver in the boiler process variables. There were 56 parametric tests including the baseline (BL) tests. For each test point, operational data was collected during at least 15 minutes intervals with 20 seconds sample time, once the stable operation was settled down. For data processing and test documentation, a data set from boiler data acquisition and the CEM system was collected automatically. The CEM system is installed in the boiler flue stack. shows the 56 parametric tests used for NOx modeling.

Table 2. Parametric tests for NOx modeling

During the tests, different O2 excess, FGR gate opening and unit load values were manipulated, while the other boiler parameters were adjusted according to the values established by the baseline.

The experimental data tests were divided into two subsets, one for the model development process (test 1–44) and the other for model evaluation (test 45–56). The test subset was selected considering include data from all the operation range of the input variables (O2, FGR and Of) 50%, 75% and 100% of the unit load.

Model selection

Model selection is a process where some model family is defined and in a second stage, the model structure is fixed. It is necessary to propose a candidate model family for the application. For NOx modeling, Bayesian-Gaussian, multilayer perceptron, and Volterra polynomial basis functions, ANNs were used as model families due to their capability to imitate nonlinear systems (Haykin Citation1999; Hornik, Stinchcombe, and White Citation1989).

A classical nonlinear system can be represented by

(2) zt+1=Gzt,xt,(2)
(3) yt=hzt,xt,(3)

where G is a vector of nonlinear functions, h is a scalar nonlinear function, zt is the vector state, yt is the system output, and xt is the system input.

ANN can uniformly approximate any continuous function including the nonlinear. The input-output ANN model is presented in , where Xt is the input and the ot is the output.

Figure 3. Identification based on artificial neural networks.

Figure 3. Identification based on artificial neural networks.

The nonlinear system defined in (1) and (2) can be described by an equivalent nonlinear discrete system that can be characterized by a BG, an MLP, or a VPBF network model, that is,

(4) ot=fXt,(4)

where fis a nonlinear function, Xt=x1t,x2t,,xnt the input vector and yt the output which represents a multiple-input single-output (MISO) system.

Bayesian-Gaussian ANN model

The Bayesian-Gaussian artificial neural network is a probabilistic model built with a neural network of five layers. The BG artificial neural network was proposed and developed in (Ye, Nicolai, and Reh Citation1998) under the Gaussian hypothesis and using Bayes’ theorem.

The BG model is built based mainly on the training data Xi,yi,i=1,,N set, where N is the net order, Xi is the sample input represented by a n×1 vector, and yi is the sample output. The BG model is based on the probability distribution of YX when the combined information sources Xi,yi,i=1,,N in known, details theory can be found in (Ye, Nicolai, and Reh Citation1998). The probability distribution of YX is approximately by

(5) pk12πσNe12YoBtN2σN2,(5)

where k is a normalizing constant independent of Y and,

(6) oBtN=σN2i=1Nσi2yi,(6)
(7) σN2=i=1Nσi2.(7)

Assume that

(8) σi2=σ02eri,(8)

where ri=XtXiTDXtXi and D is input threshold matrix

(9) D=d11200dnn2,(9)

and d11,d22,,dnn are the parameters that have to be estimated.

The BG artificial neural network algorithm is built through EquationEquations (5Equation9) as shown in , where the lines between layers without weight indication mean weight equal to one

Figure 4. The topology of Bayesian-Gaussian artificial neural networks.

Figure 4. The topology of Bayesian-Gaussian artificial neural networks.

The BG ANN can be naturally set with only the training data set. The network training is focused only on getting the adequate D matrix for some minimization error criteria.

Multilayer perceptron ANN model

MLPs are networks with one or more layers of artificial neurons between the input and output layers. MLP can be used for function approximation like the nonlinear system defined in EquationEquation (4). A three-layer MLP using the sigmoid activation function can approximate nonlinear function f. The structure of a three-layer MLP is shown in , where n is the neuron number in the input layer, l the neuron number in the hidden layer, the weight matrices WtRn×l and VtRl, the input vector XtRn and the output vector oMt.

Figure 5. MISO multilayer perceptron ANN.

Figure 5. MISO multilayer perceptron ANN.

Backpropagation (BP) learning is the most popular algorithm rule for performing supervised learning tasks. The optimization objective function is defined as the sum of the squared error EMt between the actual network output oMt and the desired output yt for all training pattern pairs Xt,yt for each t=1,,M.

The j neuron in the hidden layer is defined as the excitation function, corresponding to the t sample

(10) sjt=i=1nwi,jtxnt+wn+1,jt,(10)

each neuron has a sigmoidal activation function. The hidden layer output for the j neuron is

(11) hjt=11+eSjt,(11)

the neuron k excitation of the output layer is calculated analogously in the following form

(12) rMt=j=1lvjthjt+vl+1t,(12)

the output of the neuron k is expressed as

(13) oMt=11+erMt,(13)

finally, the criteria to be minimized in the sample t is

(14) EMt=12eMt2=12ytoMt2.(14)

Volterra polynomial basis functions ANN model

The nonlinear system defined in EquationEquation (4) can also be described by a VPBF model, which is an equivalent nonlinear discrete system.

It is assumed that the function f is estimated by an ANN with only one layer (Wray and Green Citation1994) in the VPBF model, consisting of a linear combination of the basis functions.

(15) oVXt=i=1NwiφiXt,(15)

where Xt=x1t,x2t,,xnt, φkXt is the basis function and wi the weight. shows a VPBF ANN structure.

Figure 6. Volterra polynomials basis function ANN.

Figure 6. Volterra polynomials basis function ANN.

There are a finite number of neurons in the network to guarantee some precision requirements (Haykin Citation1999). For this, it is necessary to implement a viable approach for selecting the best basis function set. The precision can be approximated by a tolerable neuron number, using Volterra polynomial functions. For NOx modeling, VPBF neural network is used. The nonlinear function oVXt is

(16) oVXt=w1+w2x1t++wn+1xnt+wn+2x1t2+wn+3x1tx2t++wNxntk=i=1NwiφiXt,(16)

where k is the polynomial order, and the set of the basis functions of the Volterra polynomial is

(17) φ1,φ2,.,φn+1,φn+2,φn+3,,φN,Xt=1,x1t,,xnt,x1t2,x1tx2t,.,xntk,(17)

and the polynomial basis function number is given by

(18) N=n+k!n!k!.(18)

Applying the VPBF ANN, the function f is determined by

(19) oVXt=fˆXt+eVXtk,(19)

where eVXtk it is the approximation error.

When the order k increasesk, the basis functions number N increases exponentially. So, the aim is to determine the function oVXt using an appropriate ANN, fitted to achieve that approximation precision is inside the required error limit. The model size and weights learning of the ANN are also known as structure selection and parameter estimation, respectively.

Parameters estimation

Once the family and structure model are selected, the next step is parameter estimation to obtain the final model that best represents this family and structure model taking into account error minimization criteria. In this procedure stage, the error minimization criteria are proposed for the BG, MLP, and VPBF networks.

Bayesian-Gaussian ANN training

The BG network training is the updating of input threshold matrix D where the EB in EquationEquation (20) is minimized,

(20) minEB=minD12Ni=1NyioBi2,(20)

where yi is the output training sample and oBi is the network output sample.

The minimization method for BG ANN training selected is the Nelder-Mead simplex method (Nelder and Mead Citation1965).

Multilayer perceptron ANN training

The EMt function can be minimized concerning the weight coefficient wi,j and vjby applying the gradient-descent procedure. The EMt gradient is,

(21) EMt=EMtvjtEMtwi,jt.(21)

The gradient method consists of moving in the negative gradient direction, giving an update step of the weight coefficient values wi,j and vj with each training pattern pair Xt,yt i.e.

(22) vjtwi,jt=vjt1wi,jt1ηEMt,(22)

where η is a constant that is called the learning constant.

Volterra polynomial basis functions ANN training

To build a model for NOx emissions with a given accuracy, it is first necessary to look for the basis functions that best represent (under some criteria) the nonlinear dynamics of NOx emissions within a set of functions that is surely large due to the polynomial expansion nature and finally to estimate the model parameter for the reduced function set using some minimizing criteria.

The best basis function selection will be carried out offline applying the orthogonal least-squares approach (Billings, Korenberg, and Chen Citation1988) to define the more relevant VPBF set of basis functions for NOx modeling.

Consider that disposes of a data set Xt,yt,t=1,2,,T from system. Based on EquationEquation (19), can be organized in vector form:

(23) OVX=ΦXW+EVXk,(23)

where YRT1is the output vector, is the weight vector, WRT1 is the approximation error vector, and is the basis functions matrix

(24) Y=y1y2yTT,(24)
(25) W=w1w2wNT,(25)
(26) EVXk=eVx1keVx2keVxTkT,(26)
(27) ΦX=φ1x1φ1x2φ1xTφ2x1φ2x2φ2xTφNx1φNx2φNxT.(27)

The weight vector W is generally getting with the minimum of the norm, that is,

(28) Wˆ=argminWYΦXW2,(28)

which is the least-squares solution.

The vector Φi=φix1,φix2,,φixTT, for i=1,2,,N, form base vectors set, and the OLS solution Wˆ, ΦXWˆ is the projection of Y on the space generated by the basis functions Φi. The factorization of ΦX matrix is:

(29) ΦX=PQ,(29)

where P=P1,P2,,PN is a matrix of T×N and P1,P2,,PN are orthogonal, and Q is a unitary upper triangular matrix of N×N.

The P orthogonality property is useful for OLS solution, EquationEquation (23) can be written as

(30) Y=PV+OVX,(30)
(31) W=Q1V,(31)

where estimated Vˆ=vˆ1,vˆ2,,vˆNT through

(32) vˆi=YTPiPiTPi,parai=1,2,,N.(32)

So YPVˆ2is minimal. The best W vector is

(33) Wˆ=Q1Vˆ.(33)

The classical Gram Schmidt method (Billings, Chen, and Korenberg Citation1989; Billings, Korenberg, and Chen Citation1988) is used to factorize the ΦX matrix and estimating Wˆ.

Note that i=1Nvˆi2PiTPi/T is the variance of the basis function i, and OTO/T is the model error variance not included in yt. Therefore, vi2PiTPi is the variance that corresponds to Pi and the standardized error reduction due to Pi is:

(34) ri=vˆi2PiTPiYTY.(34)

This ratio is a simple algorithm to look for a relevant basis function subset.

Model evaluation

When models have been estimated or trained, they must be evaluated to determine whether or not they meet the requirements. In this work, the evaluation process is interactive and consists of estimating the model parameters with different model structures in BG, MLP, and VPBF networks families. For all three models, (BG, MLP, and VPBF) were developed 24 model structures varying the input number (n in Xt=x1t,x2t,,xnt the input vector); and the neuron number in the hidden layer l in the MLP networks and the order k in VPBF networks.

The RMSE and the MAE (Chai and Draxler Citation2014) are the error metrics used to validate the models. These metrics were used for BG, MLP, and VPBF model validation.

The RMSE is defined by

(35) RMSE=1ni=1nei2,(35)

and the MAE is defined by

(36) MAE=1ni=1nei,(36)

n is the sample number of model errors ei,i=1,2,,n.

The aim is to obtain a model that estimates NOx emissions as accurately as possible according to RMSE and MAE error metrics criteria with a well-defined structure, easy implementation, and low computational cost.

Results

Considering that NOx model is a system of multiple input single output (MISO) and due to the NOx emissions being related to the O2, FGR and Qf boiler variables, the input vector is defined as

(37) Xt=x1t,,x1tm1,x2t,,x2tm2,x3t,,x3tm3,(37)

where x1=FGR, x2=O2 and x3=Qf, m1 is x1 maximum delays, m2 is x2 maximum delays, m3 is x3 maximum delays and the output yt=NOx for BG, MLP, and VPBF models.

From EquationEquation (4) the NOx model can be represented by

(38) yt=fx1t,,x1tm1,x2t,,x2tm2,x3t,,x3tm3.(38)

The model selection was made iteratively. Several model configurations were progressively developed and validated for BG, MLP, VPBF family models. The BG neural network was built using the experimental data training, the MLP ANN structure was settled with only one hidden layer, and the VPBF model was approximated by a single-layer neural network.

The data-based NOx models were properly implemented taking into consideration 27 typical combinations of m1, m2 and m3 with maximum delays from 0 to 2, each one in the input vector x1t,,x1tm1,x2t,,x2tm2,x3t,,x3tm3 for BG, MLP, and VPBF models. The order of BG models was N=2148 which was defined based on the training sample number. The number of neurons in the hidden layer was l=4,5,6 in the MLP models. The VPBF models were tested for three different orders k=2,3,4. Because the polynomial expansion can grow intolerably, the VPBF models were reduced at the 20 most significant basis functions, using the criterion defined in EquationEquation (34), only when the number of basis functions N was greater than 20.

For the development and validation of the models, the experimental samples were divided into two subsets. The first, called training data, contains 2148 samples, and the second, called validation data, contains 180 samples. The training data subset was used for parameter estimation of all model combinations, and the validation data subset was used to test the developed models using MAE and RMSE error metrics.

shows the RMSE and MAE error metrics of the eight best BG models developed.

Table 3. The best BG model validation results

The BG model structure that provides the best NOx prediction with RMSE=2.5295 and MAE=1.7318 was with m1=0, m2=0 and m3=2, maximum delays in x1, x2 and x3 respectively, which corresponds to the model input number n=5 whose input vector is Xt=x1t,x2t,x3t,x3t1,x3t2. When m1,m2=0,1 and m3=2 the errors are very close, but when m1,m2=2 and m3=0,1 the error increases.

shows the RMSE and MAE error metrics for the best-developed structures in the MLP models.

Table 4. The best MLP model validation results

The MLP model structure that provides the best NOx prediction with RMSE=14.9220 and MAE=12.5862 was with m1=0, m2=1 and m3=0 in x1, x2 and x3 maximum delays, respectively, which corresponds to n=4 maximum size of Xt=x1t,x2t,x2t1,x3t input vector and l=4 four neurons in the hidden layer. However, when l=4,6 no large differences are observed in the metrics of the NOx emissions prediction error, and in m1=m2=m3=0 are good models.

shows the RMSE and MAE error metrics for the best-defined structures for the VPBF models.

Table 5. VPBF model validation results

The VPBF model structure that provides the best NOx prediction with RMSE=6.64 and MAE=4.7878 was with m1=0, m2=0 and m3=0 without delays in x1, x2 and x3, which corresponds to n=3 maximum size of Xt=x1t,x2t,x3t input vector and k=4 fourth-order system; whose maximum polynomial expansion number is N=20 basis functions.

In the results shown in , and 5, it is evident that the model family BG has better performance than the model family MLP and VPBF in NOx emission prediction. The predictions of NOx emissions, with validation data, of the best models achieved in the three structures BG, MLP, and VPBF are shown in .

Figure 7. NOx measured and estimated in the model validation process.

Figure 7. NOx measured and estimated in the model validation process.

The three families of non-linear models BG, MLP, and VPBF can be used for modeling NOx emissions in an industrial boiler. The decision of which model structure is suitable for NOx modeling depends on specifications such as accuracy, computational cost, system variation over time, and so on.

Models with BG structure show the best precision but have a high computation cost because it depends on the number of samples in the training layer, which is usually large because the Gaussian hypothesis must be met. Models with VPBF structure would have a lower computation cost because the model is built with relatively few base functions. The structure and parameters of the BG and VPBF artificial neural networks can be modified relatively easily online, allowing the model to be adjusted and to compensate for potential deviations from the online model.

Conclusion

Currently, around the world, most energy is generated through fossil fuels, which generates large amounts of greenhouse gas. Polluting emissions are regulated, and it is necessary to measure them in order to control air pollution. An alternative approach was developed to predict NOx emissions from a utility boiler. A NOx model was built and evaluated using a data-based approach. The model structure proposed allows the estimation of NOx emissions caused by fossil fuel combustion, based on O2 excess, fuel flue Qf and the FGR opening values, with good results. The NOx emission estimation can be improved by including other plant operation inputs correlated with NOx emissions and increasing the data training set. BG and VPBF neural networks are viable and cheap alternatives for emission quantification for prediction in industrial processes that pollute the environment. These algorithms can recycle plant operation data that contains valuable information about the internal operation of the boiler. This data has incalculable potential and is currently not exploited.

The black-box model approach does not require sophisticated algorithms or expensive equipment. It is the proper solution to the lack of emissions measurements and is a low-cost system for complete environmental monitoring of large pollution sources. Although these methods tend to produce lower-quality data, they can be used in a great number of localations, allowing an adequate pollution assessment in several places near power plants.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Notes on contributors

Guillermo Ronquillo-Lomeli

Guillermo Ronquillo-Lomeli was born in México City in 1968. He received the Ph.D. degree in Mechatronics from PICYT (interinstitutional postgraduate in science and technology), México in 2015, the M.S. in Automatic Control from the Autonomous University of Queretaro in 2002 and the B.S. degree in Electrical Engineering from Technological Institute of Queretaro in 1993. Guillermo has extensive experience in the design and manufacturing of special systems for the industry and is member of the National Mexican System of Research. His research interests include dynamic system modeling, control theory and system identification with application to energy.

Noé Amir Rodríguez-Olivares

Noé Amir Rodríguez-Olivares received the Ph.D. Degree in Mechatronics from PICYT (interinstitutional postgraduate in science and technology), Mexico in 2019, the Master Degree in Mechatronics also from the PICYT in 2014, and Mechatronics Engineering from the Technological Institute of Poza Rica, Veracruz, Mexico, in 2011. He works for CIDESI (Engineer Center and Industrial Development) and the Anahuac Queretaro University; he is a member of the National Mexican System of Research (SNI); his main area of interest is the algorithm in embedded systems for instrumentation and digital control.

Leonardo Barriga-Rodríguez

Leonardo Barriga-Rodríguez received the Ph.D. degree in Mechatronics from PICYT (interinstitutional postgraduate in science and technology), México in 2014, the Master Degree in Mechatronics also from the PICYT in 2009, and Computer Systems from the Technological Institute of Jiquilpan, Michoacán México in 2004. He works for CIDESI (Engineering Center and Industrial Development), as research professor in the Electrical and electronic engineering department; he is a member of the National Mexican System of Research; his research interests include development of algorithms in artificial intelligence and computer vision.

Antonio Ramírez-Martínez

Antonio Ramírez-Martínez received a Ph.D. degree in Mechatronics from PICYT (interinstitutional postgraduate in science and technology), México in 2015, the M.S. in mechanical Design in 2002 from PICYT and the B.S. degree in mechanics with a specialty in thermal from Technological Institute of Celaya in 1991. His research interest is on robotic inspection, particularly in Pipeline inspection with instrumented pigs, and participating in projects for thermal cycles of electricity generation.

Jorge Alberto Soto-Cajiga

Jorge Alberto Soto-Cajiga was born in Queretaro México in 1980. He received the Ph.D. degree in Mechatronics from PICYT (interinstitutional postgraduate in science and technology) in 2012, the M.S. in Mechatronics also from the PICYT in 2006 and the B.S. degree in Electrical Engineering from Technological Institute of Queretaro in 2003. Jorge has extensive experience in development of special electronics systems for the industry, actually is research professor at the center for engineering and industrial development, responsible for the instrumented equipment laboratory and his areas of interest are in the development of instrumented equipment for inspection, real-time signal processing implemented in hardware and development of electronic systems for specific application.

Luciano Nava-Balanzar

Luciano Nava-Balanzar received the Ph.D. Degree in Mechatronics from PICYT (interinstitutional postgraduate in science and technology), México in 2016, the Master Degree in Mechatronics also from the PICYT in 2010 and Control and Instrumentation Engineering from the Technological Institute of San Juan del Río, Querétaro, Mexico, in 2006. He works for CIDESI (Engineer Center and Industrial Development), as research professor in the area of Robotics and Instrumentation, and his research interests include; Electronics, design electronic, embedded system, signal processing algorithms and electronic architecture development for underwater robots.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.