Journal of Nuclear Science and Technology Volume 56, 2019 - Issue 1

Generalized formulation of extended cross-section adjustment method based on minimum variance unbiased linear estimation

Kenji Yokoyama, Nuclear Science and Engineering Center, Japan Atomic Energy Agency, Ibaraki, Japan (corresponding author)

Takanori Kitada, Graduate School of Engineering, Osaka University, Osaka, Japan

Pages 87-104 | Received 20 Jul 2018, Accepted 25 Sep 2018, Published online: 24 Oct 2018

https://doi.org/10.1080/00223131.2018.1531078

ABSTRACT

By introducing a new assumption of linear estimation, we derive a new formulation of the extended cross-section adjustment (EA) method, which minimizes the variance of the design target core parameters. The new formulation is derived on the basis of minimum variance unbiased estimation with no use of the assumption of normal distribution. In this formulation, we found that EA has infinitely many solutions as the adjusted cross-section set. The new formulation of EA can represent all the possible solutions minimizing the variance of the design target core parameters and includes a special case identical to the classical Bayesian EA method, which was derived on the basis of the Bayes theorem under the assumption of normal distribution. Moreover, we prove that the special case minimizes not only the variance of the design target core parameters but also the variance of the nuclear data. Meanwhile, we show that the new assumption of linear estimation is consistent with the Kalman filter and demonstrate that we can formulate similarly the extended bias factor method, the conventional cross-section adjustment method, and the regressive cross-section adjustment method with no use of the assumption of normal distribution.

KEYWORDS:

Kalman filter
cross-section adjustment
linear estimation
minimum variance unbiased estimation
Bayes theorem
normal distribution
uncertainty quantification
nuclear data covariance

1. Introduction

The original cross-section adjustment methodology was established in the 1970s–1980s [1–4] as a neutronics design method for innovative nuclear reactor systems. A comprehensive and detailed comparison study of this methodology has been reported in references [5,6]. The methodology improves the prediction accuracy of neutronic characteristics by reducing uncertainties in nuclear data. The nuclear data are modified with the use of integral experimental quantities, such as measurement data acquired in critical experiments; the modified nuclear data are provided as an adjusted cross-section set for predicting the design target core parameters.

On the other hand, a formulation of the extended bias factor (EB) method [7] was proposed in 2007 as an alternative methodology for improving the prediction accuracy of the design target core parameters by using integral experimental quantities. In EB, a bias factor is obtained from a semifictitious experimental value, which is defined as a linear combination of integral experimental quantities; the design prediction value is corrected by the bias factor.

Subsequently, a formulation of the extended cross-section adjustment (EA) method [8] was proposed in 2012; EA can generate an adjusted cross-section set that reproduces design prediction values and accuracies equivalent to those of EB. This formulation revealed that EB minimizes the variance of the design prediction value of the target core parameter by considering cross-correlations in the analysis method error between the integral experimental quantities and the design target core parameters. Hence, when there are strong cross-correlations in the analysis method error, EB can improve the design prediction accuracy more than the original cross-section adjustment methodology. Thus, the advantage of EB has been integrated into the cross-section adjustment methodology. In addition, the formulation of EA revealed a difference between EA and EB: the assumption of normal distribution required in the derivation of EA is not used in the derivation of EB. To explain this difference, the cross-section adjustment methods based on minimum variance unbiased estimation (MVUE) [9] were proposed in 2016. Reference [9] showed that the MVUE-based rigorous extended cross-section adjustment (MREA) method, which was derived with no use of the assumption of normal distribution, can yield design prediction values and accuracies equivalent to those of EB. Thus, it has been confirmed that the design prediction values and accuracies of EA are consistent with those of EB.

Moreover, the derivation of the MVUE-based cross-section adjustment methods [9] clarified that the original cross-section adjustment methodology minimizes the variance of the nuclear data. This finding led to a formulation of the regressive cross-section adjustment (RA) method [9], which minimizes the variance of the integral experimental quantities. Although RA is impractical as a neutronics design method, its formulation is useful when we discuss what is optimized, that is, which variance is minimized, in the cross-section adjustment methodology. Thus, there are three variations of the cross-section adjustment methodology depending on the optimization target. In the present paper, we refer to the original cross-section adjustment methodology that minimizes the variance of the nuclear data as the conventional cross-section adjustment (CA) method. In particular, the original CA that was derived on the basis of the Bayes theorem under the assumption of normal distribution is called the classical Bayesian conventional cross-section adjustment (CBCA) method.

At that point, a difference in the adjusted cross-section set remained unexplained with regard to CA. That is, the adjusted cross-section set obtained by the MVUE-based rigorous conventional cross-section adjustment (MRCA) method [9], which was derived with no use of the assumption of normal distribution, differs from that obtained by CBCA. Reference [9] explained that a hypothetical mathematical operation, called ‘projection simplification,’ permits us to derive the MVUE-based simplified conventional cross-section adjustment (MSCA) method, which yields an adjusted cross-section set equivalent to that of CBCA. This explanation, however, is insufficient from the mathematical point of view. To explain this inconsistency in the adjusted cross-section sets between MRCA and CBCA, the dimension-reduced conventional cross-section adjustment (DRCA) method [10] was proposed in 2018. The derivation of DRCA revealed that the formulation for the adjusted cross-section set equivalent to CBCA can be derived with no use of the assumption of normal distribution by introducing a concept of dimensionality reduction. Thus, the difference in the adjusted cross-section set has been explained mathematically with regard to CA.

As for EA, however, a similar problem remains in the adjusted cross-section set. That is, the adjusted cross-section set obtained by the MREA method [9], which was derived with no use of the assumption of normal distribution, differs from that obtained by the classical Bayesian extended cross-section adjustment (CBEA) method [8], which was derived on the basis of the Bayes theorem under the assumption of normal distribution. Similarly to MSCA, the MVUE-based simplified extended cross-section adjustment (MSEA) method, which was derived by using the projection simplification, was proposed in reference [9] to explain the difference, but this explanation is also insufficient. Thus, it has not yet been clearly explained why this difference in the adjusted cross-section set occurs with regard to EA.

In the present paper, with no use of the assumption of normal distribution, we derive a new formulation that can yield the adjusted cross-section set equivalent to CBEA. To derive the new formulation of EA, we focus on the fact that the formulation of the Kalman filter [11,12] can be derived with no use of the assumption of normal distribution. For this reason, we review the derivation of the Kalman filter to clarify its relationship with the cross-section adjustment methodology. In accordance with this review, we introduce a new assumption of linear estimation into the derivation of the cross-section adjustment methods based on MVUE. In the present paper, we refer to this derivation procedure based on MVUE with the use of the new assumption as minimum variance unbiased linear estimation (MVULE).

First, we derive the formulation of CA based on MVULE, namely, the MVULE-based conventional cross-section adjustment (MLCA) method. Next, we demonstrate that the formulation of EB can also be derived by the same procedure. That is, we formulate the MVULE-based extended bias factor (MLEB) method. Finally, we derive the formulations of the MVULE-based extended cross-section adjustment (MLEA) method and the MVULE-based regressive cross-section adjustment (MLRA) method.

In Section 2, we review the cross-section adjustment methods, the extended bias factor method, and the Kalman filter. In Section 3, the derivations of MLCA, MLEB, MLEA, and MLRA are described. In Section 4, we discuss the derived formulation and conclude in Section 5. The acronyms are described in the Abbreviations section at the end of this article.

2. Review

2.1. Review of cross-section adjustment method

2.1.1. Precondition

To review the formulation for the cross-section adjustment methodology, we first define the following numbers of data used in the methodology:

$n_{a}$: the number of nuclear data to be adjusted;

$n^{(1)}$: the number of integral experimental quantities used for adjustment; and

$n^{(2)}$: the number of the design target core parameters to be considered in adjustment of EA.

According to reference [9], MRCA is equivalent to CBCA under the condition of the overdetermined (well-posed) problem, where $n^{(1)} \geq n_{a}$ (cf. Section 4.1.2 in reference [9]). For this reason, we review the formulation under the condition of the underdetermined (ill-posed) problem, where $n^{(1)} < n_{a}$. For the EA variants (MREA, MSEA, and CBEA), the precondition $n^{(2)} < n^{(1)} (< n_{a})$ is adopted. As for DRCA, we focus on the special case equivalent to CBCA, namely, DRCA2 [10].

2.1.2. Unified formula

The adjusted cross-section set obtained by each of the foregoing methods can be expressed in the unified form [9]:

$$T_{x} \equiv T_{0} + B_{x} D^{-1} (R_{e}^{(1)} - R_{c}^{(1)}(T_{0})), \tag{1}$$

where

$$B_{x} \equiv \begin{cases} M G^{(1)T} + G^{(2)+} V_{m}^{(12)T} & (x = \mathrm{CBEA}, \mathrm{MSEA}) \\ G^{(1)+} (G^{(2)} G^{(1)+})^{+} (G^{(2)} M G^{(1)T} + V_{m}^{(12)T}) & (x = \mathrm{MREA}) \\ M G^{(1)T} & (x = \mathrm{CBCA}, \mathrm{MSCA}, \mathrm{DRCA2}) \\ G^{(1)+} G^{(1)} M G^{(1)T} & (x = \mathrm{MRCA}), \end{cases} \tag{2}$$

$$D \equiv G^{(1)} M G^{(1)T} + V_{e+m}^{(1)}, \tag{3}$$

and

$$V_{e+m}^{(1)} \equiv V_{e}^{(1)} + V_{m}^{(1)}. \tag{4}$$

Here, the symbols are defined as follows:

$T_{0} \in \mathbb{R}^{n_{a} \times 1}$: the unadjusted cross-section set;
$R_{e}^{(1)} \in \mathbb{R}^{n^{(1)} \times 1}$: the measurement value of the integral experimental quantities;
$R_{c}^{(1)}(T) \in \mathbb{R}^{n^{(1)} \times 1}$: the calculation value of the integral experimental quantities obtained using the cross-section set $T$;
$G^{(1)} \in \mathbb{R}^{n^{(1)} \times n_{a}}$: the sensitivity coefficient matrix of the integral experimental quantities with respect to the nuclear data;
$G^{(2)} \in \mathbb{R}^{n^{(2)} \times n_{a}}$: the sensitivity coefficient matrix of the design target core parameters with respect to the nuclear data;
$M \in \mathbb{R}^{n_{a} \times n_{a}}$: the variance–covariance matrix of the unadjusted cross-section set;
$V_{e}^{(1)} \in \mathbb{R}^{n^{(1)} \times n^{(1)}}$: the variance–covariance matrix with respect to the experimental error;
$V_{m}^{(1)} \in \mathbb{R}^{n^{(1)} \times n^{(1)}}$: the variance–covariance matrix with respect to the analysis method error; and
$V_{m}^{(12)} \in \mathbb{R}^{n^{(1)} \times n^{(2)}}$: the cross-correlation matrix between the integral experimental quantities and the design target core parameters with respect to the analysis method error,

where $X \in \mathbb{R}^{m \times n}$ means that $X$ is an $m \times n$ real matrix. In addition, the superscript ‘$+$’ denotes the Moore–Penrose pseudoinverse, and the superscript ‘$T$’ represents the matrix transpose. Note that $D$ is the full variance–covariance matrix of the integral experimental quantities and is supposed to be invertible (i.e. nonsingular) since the inverse of $D$ appears in Equation (1). Moreover, it is assumed that $G^{(1)}$ and $G^{(2)}$ have full rank. Then, recalling the precondition that $n^{(2)} < n^{(1)} < n_{a}$, each $G^{(i)}$ has full row rank, and we have

$$G^{(i)} G^{(i)+} = I \quad (i = 1, 2), \tag{5}$$

where $I$ denotes the identity matrix.
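The unified form of Equations (1)–(5) can be illustrated numerically. The following is a minimal NumPy sketch, not part of the original paper: the dimensions, the random matrices, the helper `random_cov`, and the vector `residual` (standing in for $R_{e}^{(1)} - R_{c}^{(1)}(T_{0})$) are all assumptions made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_a, n1, n2 = 6, 3, 2  # illustrative sizes with n2 < n1 < n_a (underdetermined case)

def random_cov(n):
    """Hypothetical helper: a random symmetric positive definite matrix."""
    A = rng.standard_normal((n, n))
    return A @ A.T + n * np.eye(n)

G1 = rng.standard_normal((n1, n_a))  # sensitivities of the integral quantities
G2 = rng.standard_normal((n2, n_a))  # sensitivities of the target parameters
M = random_cov(n_a)                  # covariance of the unadjusted cross sections
Ve = random_cov(n1)                  # experimental error covariance
Vm_joint = random_cov(n1 + n2)       # joint analysis-method covariance (assumption)
Vm1 = Vm_joint[:n1, :n1]
Vm12 = Vm_joint[:n1, n1:]            # n1 x n2 cross-correlation block

D = G1 @ M @ G1.T + Ve + Vm1         # Equations (3) and (4)
G1p = np.linalg.pinv(G1)             # Moore-Penrose pseudoinverses
G2p = np.linalg.pinv(G2)

# Equation (2): one B_x per method
B = {
    "CBEA": M @ G1.T + G2p @ Vm12.T,
    "MREA": G1p @ np.linalg.pinv(G2 @ G1p) @ (G2 @ M @ G1.T + Vm12.T),
    "CBCA": M @ G1.T,
    "MRCA": G1p @ G1 @ M @ G1.T,
}

T0 = rng.standard_normal(n_a)
residual = rng.standard_normal(n1)   # stands in for R_e^(1) - R_c^(1)(T0)

# Equation (1): adjusted cross-section set for each method
T = {x: T0 + Bx @ np.linalg.solve(D, residual) for x, Bx in B.items()}
```

In this sketch, `G1 @ G1p` and `G2 @ G2p` reproduce the identity of Equation (5). As a consequence, `G1 @ B["MRCA"]` equals `G1 @ B["CBCA"]` and `G2 @ B["MREA"]` equals `G2 @ B["CBEA"]`, even though the adjusted sets `T["MRCA"]` and `T["CBCA"]` (and likewise `T["MREA"]` and `T["CBEA"]`) differ in general.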

In accordance with the formulation of the MVUE-based cross-section adjustment methods (cf. Equations (103), (113), and (112) in reference [9]), we can also write the variance–covariance matrix of the adjusted cross-section set, the variance of the calculation values of the integral experimental quantities reanalyzed using the adjusted cross-section set, and the variance of the prediction values of the design target core parameters in the unified forms:

$$M_{x} = M + B_{x} D^{-1} B_{x}^{T} - M G^{(1)T} D^{-1} B_{x}^{T} - B_{x} D^{-1} G^{(1)} M, \tag{6}$$

$$\mathrm{Var}(R_{c}^{(1)}(T_{x})) = G^{(1)} M_{x} G^{(1)T} + V_{m}^{(1)} - F_{x} V_{m}^{(1)} - V_{m}^{(1)} F_{x}^{T}, \tag{7}$$

and

$$\mathrm{Var}(R_{c}^{(2)}(T_{x})) = G^{(2)} M_{x} G^{(2)T} + V_{m}^{(2)} - L_{x} V_{m}^{(12)} - V_{m}^{(12)T} L_{x}^{T}, \tag{8}$$

respectively, where

$$F_{x} \equiv G^{(1)} B_{x} D^{-1}, \tag{9}$$

and

$$L_{x} \equiv G^{(2)} B_{x} D^{-1}. \tag{10}$$

Here, the symbols are defined as follows:

$R_{c}^{(2)}(T) \in \mathbb{R}^{n^{(2)} \times 1}$: the calculation value of the design target core parameters obtained using the cross-section set $T$; and
$V_{m}^{(2)} \in \mathbb{R}^{n^{(2)} \times n^{(2)}}$: the variance–covariance matrix of the design target core parameters with respect to the analysis method error.
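Equations (6)–(10) can be evaluated directly once $B_{x}$ is chosen. The sketch below is an illustration under assumed random inputs (the sizes, `random_cov`, and the joint analysis-method covariance `Vm` are hypothetical); it evaluates the three variances for CBCA and MRCA.

```python
import numpy as np

rng = np.random.default_rng(1)
n_a, n1, n2 = 5, 3, 2  # illustrative sizes with n2 < n1 < n_a

def random_cov(n):
    """Hypothetical helper: a random symmetric positive definite matrix."""
    A = rng.standard_normal((n, n))
    return A @ A.T + n * np.eye(n)

G1 = rng.standard_normal((n1, n_a))
G2 = rng.standard_normal((n2, n_a))
M = random_cov(n_a)
Ve = random_cov(n1)
Vm = random_cov(n1 + n2)  # joint analysis-method covariance (assumption)
Vm1, Vm2, Vm12 = Vm[:n1, :n1], Vm[n1:, n1:], Vm[:n1, n1:]
D = G1 @ M @ G1.T + Ve + Vm1
Dinv = np.linalg.inv(D)

def variances(B):
    """Evaluate Equations (6)-(8) for a given B_x."""
    Mx = M + B @ Dinv @ B.T - M @ G1.T @ Dinv @ B.T - B @ Dinv @ G1 @ M  # (6)
    F = G1 @ B @ Dinv                                                    # (9)
    L = G2 @ B @ Dinv                                                    # (10)
    var1 = G1 @ Mx @ G1.T + Vm1 - F @ Vm1 - Vm1 @ F.T                    # (7)
    var2 = G2 @ Mx @ G2.T + Vm2 - L @ Vm12 - Vm12.T @ L.T                # (8)
    return Mx, var1, var2

B_cbca = M @ G1.T
B_mrca = np.linalg.pinv(G1) @ G1 @ M @ G1.T
M_cbca, var1_cbca, var2_cbca = variances(B_cbca)
M_mrca, var1_mrca, var2_mrca = variances(B_mrca)
```

For $B_{x} = M G^{(1)T}$, Equation (6) collapses to $M - M G^{(1)T} D^{-1} G^{(1)} M$. In this example the trace of `M_cbca` does not exceed that of `M_mrca`, while both methods give the same `var1` because they share the same $G^{(1)} B_{x}$.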

2.1.3. Detailed definition

In addition, let us define the following symbols:

$T_{t} \in \mathbb{R}^{n_{a} \times 1}$: the (unknown) true cross-section set;
$R_{t}^{(i)} \in \mathbb{R}^{n^{(i)} \times 1}$: the true value of the integral experimental quantities ($i = 1$) or of the design target core parameters ($i = 2$);
$\Delta T_{0} \in \mathbb{R}^{n_{a} \times 1}$: the error of the unadjusted cross-section set;
$\Delta e^{(1)} \in \mathbb{R}^{n^{(1)} \times 1}$: the experimental error of the integral experimental quantities; and
$\Delta m^{(i)} \in \mathbb{R}^{n^{(i)} \times 1}$: the analysis method error of the integral experimental quantities ($i = 1$) or of the design target core parameters ($i = 2$).

Then, some symbols can be redefined by

$$G^{(i)} \equiv \left. \frac{d R_{c}^{(i)}}{d T} \right|_{T = T_{0}} \quad (i = 1, 2), \tag{11}$$

$$\Delta T_{0} \equiv T_{0} - T_{t}, \tag{12}$$

$$\Delta e^{(1)} \equiv R_{e}^{(1)} - R_{t}^{(1)}, \tag{13}$$

and

$$\Delta m^{(i)} \equiv R_{c}^{(i)}(T_{t}) - R_{t}^{(i)} \quad (i = 1, 2). \tag{14}$$

Similarly, we can redefine the variance–covariance and cross-correlation matrices by

$$M \equiv \mathrm{Var}(T_{0}), \tag{15}$$

$$V_{e}^{(1)} \equiv \mathrm{Var}(R_{e}^{(1)}), \tag{16}$$

$$V_{m}^{(i)} \equiv \mathrm{Var}(R_{c}^{(i)}(T_{t})) \quad (i = 1, 2), \tag{17}$$

and

$$V_{m}^{(ij)} \equiv \mathrm{Cov}(R_{c}^{(i)}(T_{t}), R_{c}^{(j)}(T_{t})) \quad (i = 1, 2;\ j = 1, 2). \tag{18}$$

Here, $\mathrm{Var}(\cdot)$ and $\mathrm{Cov}(\cdot, \cdot)$ are defined for column vectors $X$ and $Y$ by

$$\mathrm{Var}(X) \equiv E((X - E(X)) (X - E(X))^{T}), \tag{19}$$

and

$$\mathrm{Cov}(X, Y) \equiv E((X - E(X)) (Y - E(Y))^{T}), \tag{20}$$

respectively, where $E(\cdot)$ denotes the expectation value. Note that when $i = j$ in Equation (18), $V_{m}^{(ii)}$ reduces to $V_{m}^{(i)}$ by definition. Moreover, all of the following variance–covariance matrices are symmetric:

$$M^{T} = M, \tag{21}$$

$$V_{e}^{(1)T} = V_{e}^{(1)}, \tag{22}$$

and

$$V_{m}^{(i)T} = V_{m}^{(i)}. \tag{23}$$

On the other hand, a variance–covariance matrix is always nonnegative definite. That is, a variance–covariance matrix is either positive definite or positive semidefinite. Hence, $M$, $V_{e}^{(1)}$, and $V_{m}^{(1)}$ are nonnegative definite. Moreover, $G^{(1)} M G^{(1)T}$, $D$, and $D^{-1}$ are symmetric and nonnegative definite in accordance with the following theorems:

For an $n \times n$ matrix $A$ and an $n \times m$ matrix $B$, if $A$ is (symmetric and) nonnegative definite, $B^{T} A B$ is (symmetric and) nonnegative definite (e.g. Theorem 14.2.9 in reference [13]);
For $n \times n$ matrices $A$ and $B$, if both $A$ and $B$ are nonnegative definite, $A + B$ is nonnegative definite (e.g. Lemma 14.2.4 in reference [13]);
For an $n \times n$ matrix $A$, if $A$ is invertible and symmetric, $A^{-1}$ is symmetric (e.g. Equation (2.4) in reference [13]); and
For an $n \times n$ matrix $A$, if $A$ is invertible and nonnegative definite, $A^{-1}$ is nonnegative definite (e.g. Corollary 14.2.11 in reference [13]).

2.2. Review of extended bias factor method

Given that the design prediction values by CBEA are equal to those by EB [8], the design prediction values by EB can be defined by

$$R_{\mathrm{EB}}^{(2)} \equiv R_{c}^{(2)}(T_{0}) + (G^{(2)} M G^{(1)T} + V_{m}^{(12)T}) D^{-1} (R_{e}^{(1)} - R_{c}^{(1)}(T_{0})). \tag{24}$$

This definition can be obtained by assuming linearity, as is done in Equation (75) described later, and substituting Equations (1) and (2) as follows:

$$\begin{aligned} R_{c}^{(2)}(T_{\mathrm{CBEA}}) &= R_{c}^{(2)}(T_{0}) + G^{(2)} (T_{\mathrm{CBEA}} - T_{0}) \\ &= R_{c}^{(2)}(T_{0}) + G^{(2)} (M G^{(1)T} + G^{(2)+} V_{m}^{(12)T}) D^{-1} (R_{e}^{(1)} - R_{c}^{(1)}(T_{0})) \\ &= R_{\mathrm{EB}}^{(2)}, \end{aligned} \tag{25}$$

where Equation (5) is used.
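The chain of equalities in Equation (25) can be checked numerically. The following sketch assumes a purely linear model for the target parameters around $T_{0}$ and uses random illustrative inputs; `random_cov`, `Rc2`, `Rc2_T0`, and `residual` are hypothetical names introduced here, not quantities from the paper.

```python
import numpy as np

rng = np.random.default_rng(2)
n_a, n1, n2 = 6, 4, 2  # illustrative sizes with n2 < n1 < n_a

def random_cov(n):
    """Hypothetical helper: a random symmetric positive definite matrix."""
    A = rng.standard_normal((n, n))
    return A @ A.T + n * np.eye(n)

G1 = rng.standard_normal((n1, n_a))
G2 = rng.standard_normal((n2, n_a))
M = random_cov(n_a)
Vm = random_cov(n1 + n2)            # joint analysis-method covariance (assumption)
Vm1, Vm12 = Vm[:n1, :n1], Vm[:n1, n1:]
D = G1 @ M @ G1.T + random_cov(n1) + Vm1

T0 = rng.standard_normal(n_a)
Rc2_T0 = rng.standard_normal(n2)    # stands in for R_c^(2)(T0)
residual = rng.standard_normal(n1)  # stands in for R_e^(1) - R_c^(1)(T0)

def Rc2(T):
    """Assumed linear model of the target parameters around T0."""
    return Rc2_T0 + G2 @ (T - T0)

# CBEA adjusted set, Equations (1) and (2)
B_cbea = M @ G1.T + np.linalg.pinv(G2) @ Vm12.T
T_cbea = T0 + B_cbea @ np.linalg.solve(D, residual)

# EB prediction, Equation (24)
R_eb = Rc2_T0 + (G2 @ M @ G1.T + Vm12.T) @ np.linalg.solve(D, residual)
```

Here `Rc2(T_cbea)` agrees with `R_eb` to rounding error, because $G^{(2)} G^{(2)+} = I$ removes the pseudoinverse in front of $V_{m}^{(12)T}$.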

2.3. Review of Kalman filter

Next, we review a derivation of the Kalman filter [14] and examine its relationship with the cross-section adjustment methodology.

2.3.1. Non-linear Kalman filter and its linearization

Let us consider the following equations as a nonlinear discrete-time state model for time series data:

$$s_{k+1} = f(s_{k}) + w_{k}, \tag{26}$$

$$y_{k} = h(s_{k}) + v_{k}, \tag{27}$$

where

$s_{k} \in \mathbb{R}^{n \times 1}$: the process state vector at time $t_{k}$;
$w_{k} \in \mathbb{R}^{n \times 1}$: the system error, which is a white noise contribution to the state vector for the time interval $(t_{k}, t_{k+1})$;
$y_{k} \in \mathbb{R}^{m \times 1}$: the measurement (observation vector) at time $t_{k}$;
$v_{k} \in \mathbb{R}^{m \times 1}$: the measurement error at time $t_{k}$;
$f(\cdot)$: the time-independent state transition function; and
$h(\cdot)$: the time-independent observation function giving the ideal (noiseless) connection between the measurement and the state vector.

It is assumed that an approximate state vector $s_{k}^{*}$ can be determined by some means. When the difference between the actual and the approximate state vectors is denoted by $\Delta s_{k}$, the actual state vector is written as

$$s_{k} = s_{k}^{*} + \Delta s_{k}. \tag{28}$$

To linearize the equations of the discrete-time state model, let us approximate the state transition function and the observation function by the Taylor series expansion around $s_{k}^{*}$:

$$f(s_{k}) \approx f(s_{k}^{*}) + \left. \frac{\partial f}{\partial s_{k}} \right|_{s_{k} = s_{k}^{*}} \Delta s_{k}, \tag{29}$$

and

$$h(s_{k}) \approx h(s_{k}^{*}) + \left. \frac{\partial h}{\partial s_{k}} \right|_{s_{k} = s_{k}^{*}} \Delta s_{k}, \tag{30}$$

where the higher-order terms are ignored. Thus, Equation (26) can be written as

$$s_{k+1}^{*} + \Delta s_{k+1} = f(s_{k}^{*}) + \Phi_{k} \Delta s_{k} + w_{k}, \tag{31}$$

where

$$\Phi_{k} \equiv \left. \frac{\partial f}{\partial s_{k}} \right|_{s_{k} = s_{k}^{*}} \in \mathbb{R}^{n \times n}. \tag{32}$$

Meanwhile, when we need an approximation of the state vector at the next time step, we can adopt a transition of the approximate state vector at the previous time step. That is, $s_{k+1}^{*}$ can be determined such that $s_{k+1}^{*} = f(s_{k}^{*})$. Thus, Equation (31) can be rewritten as

$$\Delta s_{k+1} = \Phi_{k} \Delta s_{k} + w_{k}. \tag{33}$$

On the other hand, Equation (27) can be expressed by

$$y_{k} - h(s_{k}^{*}) = H_{k} \Delta s_{k} + v_{k}, \tag{34}$$

where

$$H_{k} \equiv \left. \frac{\partial h}{\partial s_{k}} \right|_{s_{k} = s_{k}^{*}} \in \mathbb{R}^{m \times n}. \tag{35}$$

To simplify the linearized model described by Equations (33) and (34), we replace the state vector and the observation vector by

$$x_{k} \equiv \Delta s_{k} = s_{k} - s_{k}^{*}, \tag{36}$$

and

$$z_{k} \equiv y_{k} - h(s_{k}^{*}), \tag{37}$$

respectively. Thus, the equations of the discrete-time state model can be rewritten as

$$x_{k+1} = \Phi_{k} x_{k} + w_{k}, \tag{38}$$

$$z_{k} = H_{k} x_{k} + v_{k}. \tag{39}$$

This formulation is known as the linear Kalman filter.
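The linearization in Equations (29)–(35) can be sketched in code. In the example below, the functions `f` and `h`, the reference state `s_star`, and the central-difference helper `jacobian` are hypothetical choices made for illustration; the paper does not prescribe how the derivatives are obtained.

```python
import numpy as np

def f(s):
    """Hypothetical nonlinear state transition function."""
    return np.array([s[0] + 0.1 * s[1], 0.9 * s[1] + 0.05 * np.sin(s[0])])

def h(s):
    """Hypothetical nonlinear observation function."""
    return np.array([s[0] ** 2 + s[1]])

def jacobian(func, s_star, eps=1e-6):
    """Central-difference Jacobian of func evaluated at s_star."""
    s_star = np.asarray(s_star, dtype=float)
    cols = []
    for j in range(s_star.size):
        step = np.zeros_like(s_star)
        step[j] = eps
        cols.append((func(s_star + step) - func(s_star - step)) / (2 * eps))
    return np.column_stack(cols)

s_star = np.array([1.0, 0.5])  # approximate state s_k^*
Phi = jacobian(f, s_star)      # Equation (32), shape (n, n)
H = jacobian(h, s_star)        # Equation (35), shape (m, n)

x = np.array([1e-3, -2e-3])    # x_k = Delta s_k, Equation (36)
s = s_star + x                 # Equation (28)

# Residuals of the first-order expansions (29) and (30)
lin_err_f = f(s) - (f(s_star) + Phi @ x)
lin_err_h = h(s) - (h(s_star) + H @ x)
```

For a small deviation `x`, both residuals are second order in `x`, which is what justifies replacing the nonlinear model by Equations (38) and (39) near `s_star`.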

2.3.2. Linear Kalman filter

Next, we review the state model equations of the linear Kalman filter. The covariance matrices of the system noise and the observation noise are defined by

$$E(w_{k} w_{i}^{T}) = \begin{cases} Q_{k} & (i = k) \\ O & (i \neq k), \end{cases} \tag{40}$$

$$E(v_{k} v_{i}^{T}) = \begin{cases} R_{k} & (i = k) \\ O & (i \neq k), \end{cases} \tag{41}$$

and

$$E(w_{k} v_{i}^{T}) = O \quad (\text{for all } k \text{ and } i), \tag{42}$$

where $O$ denotes the zero matrix. Let us denote the prior estimate by $\hat{x}_{k0}$, which is the state vector estimated from the information prior to time $t_{k}$, where the hat ($\hat{\ }$) represents an estimate, and the subscript ‘$0$’ means an estimate prior to assimilating the measurement at time $t_{k}$.

Let us assume that the covariance matrix of the prior estimate is known. That is, if the estimation error is defined explicitly by

$$e_{k0} \equiv x_{k} - \hat{x}_{k0}, \tag{43}$$

then the covariance matrix of the prior estimate can be written as

$$P_{k0} = E(e_{k0} e_{k0}^{T}) = E((x_{k} - \hat{x}_{k0}) (x_{k} - \hat{x}_{k0})^{T}). \tag{44}$$

Next, let us consider improving the prior estimate by using the measurement at time $t_{k}$. For this purpose, a linear blending of the measurement $z_{k}$ and the prior estimate $\hat{x}_{k0}$ is adopted in the form:

$$\hat{x}_{k} = \hat{x}_{k0} + K_{k} (z_{k} - H_{k} \hat{x}_{k0}), \tag{45}$$

where

$\hat{x}_{k}$: the updated estimate of the state vector; and

$K_{k}$: the linear blending factor.

Meanwhile, the covariance matrix of the updated estimate is expressed by

$$P_{k} = E(e_{k} e_{k}^{T}) = E((x_{k} - \hat{x}_{k}) (x_{k} - \hat{x}_{k})^{T}). \tag{46}$$

Substituting Equation (39) into Equation (45) yields

$$\hat{x}_{k} = \hat{x}_{k0} + K_{k} (H_{k} x_{k} + v_{k} - H_{k} \hat{x}_{k0}). \tag{47}$$

Thus, substituting Equation (47) into Equation (46) and considering that there is no correlation between $e_{k0}$ and $v_{k}$, we obtain

$$\begin{aligned} P_{k} &= E\big( ((x_{k} - \hat{x}_{k0}) - K_{k} (H_{k} x_{k} + v_{k} - H_{k} \hat{x}_{k0})) ((x_{k} - \hat{x}_{k0}) - K_{k} (H_{k} x_{k} + v_{k} - H_{k} \hat{x}_{k0}))^{T} \big) \\ &= E\big( (e_{k0} - K_{k} (H_{k} e_{k0} + v_{k})) (e_{k0} - K_{k} (H_{k} e_{k0} + v_{k}))^{T} \big) \\ &= (I - K_{k} H_{k}) E(e_{k0} e_{k0}^{T}) (I - K_{k} H_{k})^{T} + K_{k} E(v_{k} v_{k}^{T}) K_{k}^{T} \\ &= (I - K_{k} H_{k}) P_{k0} (I - K_{k} H_{k})^{T} + K_{k} R_{k} K_{k}^{T} \\ &= P_{k0} - K_{k} H_{k} P_{k0} - P_{k0} H_{k}^{T} K_{k}^{T} + K_{k} (H_{k} P_{k0} H_{k}^{T} + R_{k}) K_{k}^{T}. \end{aligned} \tag{48}$$

To derive this equation, we have used the fact that the following equation holds for arbitrary conformable matrices $A$ and $B$:

$$(A B)^{T} = B^{T} A^{T}. \tag{49}$$

To determine the blending factor that minimizes the trace of the covariance matrix of the updated estimate, we consider the partial derivative:

$$\begin{aligned} \frac{\partial}{\partial K_{k}} \mathrm{tr}(P_{k}) &= -2 \frac{\partial}{\partial K_{k}} \mathrm{tr}(P_{k0} H_{k}^{T} K_{k}^{T}) + \frac{\partial}{\partial K_{k}} \mathrm{tr}(K_{k} (H_{k} P_{k0} H_{k}^{T} + R_{k}) K_{k}^{T}) \\ &= -2 P_{k0} H_{k}^{T} + 2 K_{k} (H_{k} P_{k0} H_{k}^{T} + R_{k}), \end{aligned} \tag{50}$$

where $\mathrm{tr}(\cdot)$ denotes the matrix trace, which is the sum of the diagonal elements. To derive Equation (50), we have used the following equations for arbitrary square matrices $A$ and $B$ of the same size:

$$\mathrm{tr}(A + B) = \mathrm{tr}(A) + \mathrm{tr}(B), \tag{51}$$

and

$$\mathrm{tr}(A^{T}) = \mathrm{tr}(A). \tag{52}$$

Moreover, we have used the following derivative formulas for arbitrary constant matrices $A$, $B$, and $C$:

$$\frac{\partial}{\partial X} (\mathrm{tr}(A) + \mathrm{tr}(B)) = \frac{\partial}{\partial X} \mathrm{tr}(A) + \frac{\partial}{\partial X} \mathrm{tr}(B), \tag{53}$$

$$\frac{\partial}{\partial X} \mathrm{tr}(A X^{T} B) = B A, \tag{54}$$

and

$$\frac{\partial}{\partial X} \mathrm{tr}(A X B X^{T} C) = A^{T} C^{T} X B^{T} + C A X B. \tag{55}$$
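The trace derivative formulas (54) and (55) can be checked against finite differences. The sketch below uses random matrices of assumed, mutually compatible shapes; the helper `numerical_gradient` and all variable names are introduced here for illustration only.

```python
import numpy as np

def numerical_gradient(func, X, eps=1e-6):
    """Central-difference gradient of a scalar function of a matrix X."""
    grad = np.zeros_like(X)
    for i in range(X.shape[0]):
        for j in range(X.shape[1]):
            E = np.zeros_like(X)
            E[i, j] = eps
            grad[i, j] = (func(X + E) - func(X - E)) / (2 * eps)
    return grad

rng = np.random.default_rng(4)
p, q, r = 3, 2, 4                   # X is p x q; r is an auxiliary dimension
X = rng.standard_normal((p, q))

# Equation (54): d/dX tr(A X^T B) = B A, with A (r x q) and B (p x r)
A54 = rng.standard_normal((r, q))
B54 = rng.standard_normal((p, r))
grad54_num = numerical_gradient(lambda Y: np.trace(A54 @ Y.T @ B54), X)
grad54_formula = B54 @ A54

# Equation (55): d/dX tr(A X B X^T C) = A^T C^T X B^T + C A X B,
# with A (r x p), B (q x q), and C (p x r)
A55 = rng.standard_normal((r, p))
B55 = rng.standard_normal((q, q))
C55 = rng.standard_normal((p, r))
grad55_num = numerical_gradient(lambda Y: np.trace(A55 @ Y @ B55 @ Y.T @ C55), X)
grad55_formula = A55.T @ C55.T @ X @ B55.T + C55 @ A55 @ X @ B55
```

Setting $A = I$, $B = H_{k} P_{k0} H_{k}^{T} + R_{k}$ (symmetric), and $C = I$ in Equation (55) gives the term $2 K_{k} (H_{k} P_{k0} H_{k}^{T} + R_{k})$ of Equation (50).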

To determine the blending factor that minimizes the trace of $P_{k}$, we set Equation (50) equal to zero and solve for $K_{k}$. The result is

$$K_{k} = P_{k0} H_{k}^{T} (H_{k} P_{k0} H_{k}^{T} + R_{k})^{-1}. \tag{56}$$

This blending factor is called the Kalman gain. Substituting this equation into Equation (45), we obtain

$$\hat{x}_{k} = \hat{x}_{k0} + P_{k0} H_{k}^{T} (H_{k} P_{k0} H_{k}^{T} + R_{k})^{-1} (z_{k} - H_{k} \hat{x}_{k0}). \tag{57}$$

This equation enables us to estimate the state vector at time $t_{k}$ .
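The measurement update of Equations (45), (48), (56), and (57) can be sketched as follows. The function names and the random test problem are assumptions made for illustration; the sketch also compares the trace of $P_{k}$ at the Kalman gain with the trace at a perturbed gain.

```python
import numpy as np

def kalman_gain(P0, H, R):
    """Equation (56): K = P0 H^T (H P0 H^T + R)^{-1}."""
    S = H @ P0 @ H.T + R
    # S and P0 are symmetric, so (S^{-1} H P0)^T = P0 H^T S^{-1}.
    return np.linalg.solve(S, H @ P0).T

def updated_estimate(x0, K, z, H):
    """Equation (45): linear blending of prior estimate and measurement."""
    return x0 + K @ (z - H @ x0)

def updated_covariance(K, P0, H, R):
    """Equation (48), valid for any blending factor K."""
    I = np.eye(P0.shape[0])
    return (I - K @ H) @ P0 @ (I - K @ H).T + K @ R @ K.T

rng = np.random.default_rng(3)
n, m = 4, 2                                   # illustrative state and measurement sizes
A = rng.standard_normal((n, n))
P0 = A @ A.T + np.eye(n)                      # prior covariance
C = rng.standard_normal((m, m))
R = C @ C.T + np.eye(m)                       # measurement error covariance
H = rng.standard_normal((m, n))
x0 = rng.standard_normal(n)
z = rng.standard_normal(m)

K = kalman_gain(P0, H, R)
x_hat = updated_estimate(x0, K, z, H)         # Equation (57)
P = updated_covariance(K, P0, H, R)

# Any other blending factor gives a larger trace of the updated covariance.
K_perturbed = K + 0.1 * rng.standard_normal((n, m))
trace_opt = np.trace(P)
trace_perturbed = np.trace(updated_covariance(K_perturbed, P0, H, R))
```

No distributional assumption enters this computation: only the covariance matrices `P0` and `R` are used, which mirrors the point made in the text that the Kalman gain follows from variance minimization alone.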

2.3.3. Relationship with cross-section adjustment method

As defined by Equations (1)–(3), the adjusted cross-section set by CBCA is written as

$$T_{\mathrm{CBCA}} = T_{0} + M G^{(1)T} (G^{(1)} M G^{(1)T} + V_{e+m}^{(1)})^{-1} (R_{e}^{(1)} - R_{c}^{(1)}(T_{0})). \tag{58}$$

Although this equation is similar to Equation (57), it differs from Equation (57) in that it deals with the nonlinearity of $R_{c}^{(1)}(\cdot)$. To eliminate this difference, let us consider the following correspondence in light of the derivation procedure for linearizing the Kalman filter:

$$\hat{x}_{k} = \hat{s}_{k} - s_{k}^{*} \leftrightarrow T_{\mathrm{CBCA}} - T_{0}, \tag{59}$$

$$\hat{x}_{k0} = \hat{s}_{k0} - s_{k}^{*} \leftrightarrow T_{0} - T_{0} \ (= O), \tag{60}$$

$$z_{k} = y_{k} - h(s_{k}^{*}) \leftrightarrow R_{e}^{(1)} - R_{c}^{(1)}(T_{0}), \tag{61}$$

$$P_{k0} \leftrightarrow M, \tag{62}$$

$$H_{k} \leftrightarrow G^{(1)}, \tag{63}$$

and

$$R_{k} \leftrightarrow V_{e+m}^{(1)}. \tag{64}$$

Thus, we can understand that Equation (58) is identical to Equation (57).
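The correspondence (59)–(64) can be made concrete with a small numerical example. The calculation model `Rc1`, its analytic sensitivity matrix `Rc1_jac`, and all numerical values below are hypothetical and serve only to illustrate that Equation (58) and the Kalman update of Equation (57) return the same adjustment.

```python
import numpy as np

def Rc1(T):
    """Hypothetical nonlinear calculation model for two integral quantities."""
    return np.array([T[0] * T[1] + T[2], np.exp(0.1 * T[0]) + T[1] ** 2])

def Rc1_jac(T):
    """Analytic sensitivity matrix dRc1/dT of the hypothetical model."""
    return np.array([[T[1], T[0], 1.0],
                     [0.1 * np.exp(0.1 * T[0]), 2.0 * T[1], 0.0]])

T0 = np.array([1.0, 2.0, 0.5])         # unadjusted cross-section set
M = np.diag([0.04, 0.09, 0.01])        # its covariance
V = np.diag([0.02, 0.03])              # V_{e+m}^{(1)}
R_e = np.array([2.7, 5.0])             # measured integral quantities

G1 = Rc1_jac(T0)                       # Equation (11), evaluated at T0

# Equation (58): CBCA adjusted cross-section set
D = G1 @ M @ G1.T + V
T_cbca = T0 + M @ G1.T @ np.linalg.solve(D, R_e - Rc1(T0))

# Equation (57) with the correspondences (60)-(64)
x0 = np.zeros(3)                       # x_hat_k0 <-> T0 - T0 = O
P0, H, R = M, G1, V
z = R_e - Rc1(T0)
K = P0 @ H.T @ np.linalg.inv(H @ P0 @ H.T + R)
x_hat = x0 + K @ (z - H @ x0)          # corresponds to T_CBCA - T0 by (59)
```

In this example `x_hat` coincides with `T_cbca - T0`, so the CBCA adjustment is a single Kalman measurement update around the reference point $T_{0}$.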

Moreover, in light of Equation (64), we can find another correspondence:

$$v_{k} \leftrightarrow -\Delta e^{(1)} + \Delta m^{(1)}. \tag{65}$$

From this correspondence, we can understand that the cross-section adjustment method is a linearized model of the nonlinear Kalman filter described by the state model equations:

$$T_{1} = f(T_{0}) + w_{0}, \tag{66}$$

$$R_{e}^{(1)} + \Delta e^{(1)} = R_{c}^{(1)}(T_{0}) + \Delta m^{(1)}, \tag{67}$$

where $T_{1}$ represents the cross-section set at time $t_{1}$. Although Equation (66) is inessential for the cross-section adjustment methodology, the state model described by Equation (67) is reasonable as the observation of C/E (calculation/experiment) values for the integral experimental quantities. On the other hand, since the linearization of the nonlinear Kalman filter is known as the extended Kalman filter (EKF), we can understand that the cross-section adjustment methodology is a kind of EKF.

2.4. Discussion on assumption of linear estimation

As described before, the Kalman gain is determined by minimizing the variance with no use of the assumption of normal distribution. This procedure is similar to determining the linear combination factor $F$ in the derivation of the MVUE-based cross-section adjustment methods. Hence, we can find yet another correspondence:

K_{k} \leftrightarrow F .

(68)

Substituting the foregoing correspondences, that is, Equations (59)–(61) and (68), into Equation (45), we obtain

\hat{T} - T_{0} = F (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) .

(69)

This equation differs from the following equation, which represents the assumption of linear estimation used in the derivation of the MVUE-based cross-section adjustment methods [9]:

{\hat{R}}^{(1)} - R_{c}^{(1)} (T_{0}) = F (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) .

(70)

In light of the review of the Kalman filter, we expect that a new cross-section adjustment method can be formulated if we adopt Equation (69) as the assumption of linear estimation.

On the other hand, we can use another equation as the assumption of linear estimation. That is, if we consider the design target core parameters as the state vector, the assumption of linear estimation can be written as

{\hat{R}}^{(2)} - R_{c}^{(2)} (T_{0}) = F (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) .

(71)

Since this equation directly estimates the design target core parameters, we expect that a new formulation corresponding to EB can be derived if Equation (71) is used as the assumption of linear estimation.

In the following, we derive MLCA with the use of Equation (69) as the assumption of linear estimation. Subsequently, MLEB is derived by using Equation (71). Finally, we derive MLEA and MLRA with Equation (69).

3. Derivation

3.1. Assumptions

In the present paper, we adopt the same assumptions as those used in the derivation of the MVUE-based cross-section adjustment methods [9], except for the assumption of linear estimation. However, since these assumptions are essential for the formulation, we summarize them in the following.

3.1.1. Assumptions on number of data

Unlike the foregoing review of the cross-section adjustment methods, we employ no particular assumption (precondition) regarding the relationship between the number of integral experimental quantities and the number of nuclear data:

n^{(1)} \gtrless n_{a} .

(72)

That is, it is unnecessary to distinguish whether the problem is overdetermined or underdetermined in the present formulation. With regard to EA, it is assumed that the number of design target core parameters is smaller than the number of nuclear data:

n^{(2)} < n_{a} .

(73)

In addition, we assume that $G^{(2)}$ has full rank:

\mathrm{rank} (G^{(2)}) = n^{(2)} .

(74)

Note that $G^{(2)}$ is a wide matrix and has full row rank in light of Equations (73) and (74). By contrast, at this moment, we pay no attention to whether $G^{(1)}$ is of full rank.

3.1.2. Assumption of linearity

Let us assume that the difference between two calculation values obtained with different cross-section sets $T^{'}$ and $T$ can be evaluated by the first-order approximation in the form:

R_{c}^{(1)} (T^{'}) - R_{c}^{(1)} (T) \approx G^{(1)} (T^{'} - T) .

(75)

Then, the difference between the experimental value and the calculation value of integral experimental quantities can be expressed by

\begin{aligned} R_{e}^{(1)} - R_{c}^{(1)} (T_{0}) &= (R_{e}^{(1)} - R_{t}^{(1)}) - (R_{c}^{(1)} (T_{t}) - R_{t}^{(1)}) - (R_{c}^{(1)} (T_{0}) - R_{c}^{(1)} (T_{t})) \\ &\approx Δ e^{(1)} - Δ m^{(1)} - G^{(1)} Δ T_{0}, \end{aligned}

(76)

where Equations (12)–(14) are used.

3.1.3. Assumption of unbiased estimation

Let us assume that unbiased estimation is valid for all of the following: the nuclear data included in the unadjusted cross-section set, the experimental value and the calculation value of the integral experimental quantities, and the calculation value of the design target core parameters. That is, we assume that the expectation value is equal to the true value:

E (T_{0}) = T_{t},

(77)

E (R_{e}^{(1)}) = R_{t}^{(1)},

(78)

and

E (R_{c}^{(i)} (T_{t})) = R_{t}^{(i)} (i = 1, 2) .

(79)

Then, in light of Equations (19) and (20), the foregoing covariance and cross-correlation matrices can be rewritten as

M = E (Δ T_{0} Δ T_{0}^{T}),

(80)

V_{e}^{(1)} = E (Δ e^{(1)} Δ e^{(1) T}),

(81)

V_{m}^{(i)} = E (Δ m^{(i)} Δ m^{(i) T}) (i = 1, 2),

(82)

and

V_{m}^{(i j)} = E (Δ m^{(i)} Δ m^{(j) T}) (i = 1, 2; j = 1, 2) .

(83)

3.1.4. Assumption of error independence

Moreover, we assume that the experimental error, the analysis method error, and the cross-section-induced errors with respect to the integral experimental quantities and the design target core parameters are independent of each other:

E (Δ x_{i} Δ x_{j}^{T}) = O (i \neq j),

(84)

where

Δ x_{i} \equiv \begin{cases} Δ e^{(1)} & (i = 1) \\ Δ m^{(1)}, Δ m^{(2)} & (i = 2) \\ G^{(1)} Δ T_{0} & (i = 3) \\ G^{(2)} Δ T_{0} & (i = 4) . \end{cases}

(85)

3.2. Derivation of conventional cross-section adjustment method

Let us derive MLCA by using Equation (69) as the assumption of linear estimation.

3.2.1. Derivation of adjusted cross-section set

In light of Equation (12), the estimate of the cross-section set can be written as

\hat{T} = T_{t} + (\hat{T} - T_{0}) + Δ T_{0} .

(86)

Substituting Equations (69) and (76) into Equation (86), we have

\begin{aligned} \hat{T} &= T_{t} + F (Δ e^{(1)} - Δ m^{(1)} - G^{(1)} Δ T_{0}) + Δ T_{0} \\ &= T_{t} + (I - F G^{(1)}) Δ T_{0} + F Δ e^{(1)} - F Δ m^{(1)} . \end{aligned}

(87)

Thus, the variance of $\hat{T}$ can be expressed by

\begin{aligned} \mathrm{Var} (\hat{T}) &= E ((\hat{T} - T_{t}) {(\hat{T} - T_{t})}^{T}) \\ &= (I - F G^{(1)}) E (Δ T_{0} Δ T_{0}^{T}) {(I - F G^{(1)})}^{T} + F E (Δ e^{(1)} Δ e^{(1) T}) F^{T} + F E (Δ m^{(1)} Δ m^{(1) T}) F^{T} \\ &= M + F G^{(1)} M G^{(1) T} F^{T} - M G^{(1) T} F^{T} - F G^{(1)} M + F V_{e}^{(1)} F^{T} + F V_{m}^{(1)} F^{T} \\ &= M + F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) F^{T} - M G^{(1) T} F^{T} - {(M G^{(1) T} F^{T})}^{T}, \end{aligned}

(88)

where Equations (84), (80)–(82), and (4) are used. To determine the linear combination factor $F$ that minimizes the variance, let us consider the partial derivative of the trace of this variance with respect to $F$:

\begin{aligned} \frac{\partial}{\partial F} \mathrm{tr} (\mathrm{Var} (\hat{T})) &= \frac{\partial}{\partial F} \mathrm{tr} (F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) F^{T}) - 2 \frac{\partial}{\partial F} \mathrm{tr} (M G^{(1) T} F^{T}) \\ &= 2 F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) - 2 M G^{(1) T}, \end{aligned}

(89)

where Equations (51)–(55) are used. The linear combination factor matrix $F$ that minimizes the variance can be determined by setting Equation (89) equal to zero. Letting $F_{M L C A}$ denote the minimizing $F$, we have

F_{M L C A} (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) = M G^{(1) T} .

(90)

Postmultiplying both sides of this equation by the inverse of $D$ defined by Equation (3), we obtain

F_{M L C A} = M G^{(1) T} D^{- 1} .

(91)

Substituting $F_{M L C A}$ for $F$ in Equation (69) and denoting the resulting $\hat{T}$ by $T_{M L C A}$, we have

T_{M L C A} = T_{0} + M G^{(1) T} D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) .

(92)

This equation is identical to that for the adjusted cross-section set by CBCA:

T_{M L C A} = T_{C B C A} .

(93)

In summary, if Equation (69) is valid as the assumption of linear estimation, we can derive a formula identical to that of CBCA without using either the projection simplification or the concept of dimensionality reduction.
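The minimization in Equations (88)–(91) can be illustrated numerically. The sketch below assumes random toy matrices (the sizes and the helper `random_spd` are hypothetical) and checks two things: that the gradient of Equation (89) vanishes at $F_{M L C A}$, and that random perturbations of $F_{M L C A}$ only increase the trace of the variance. No normal distribution is invoked anywhere, in line with the derivation above.

```python
import numpy as np

rng = np.random.default_rng(1)
n_a, n1 = 6, 4  # hypothetical problem sizes


def random_spd(n):
    """Return a random symmetric positive-definite matrix (toy covariance)."""
    a = rng.standard_normal((n, n))
    return a @ a.T + n * np.eye(n)


M = random_spd(n_a)                  # covariance of the nuclear data
G1 = rng.standard_normal((n1, n_a))  # sensitivity coefficients
V_em = random_spd(n1)                # V_e^(1) + V_m^(1), Equation (4)
D = G1 @ M @ G1.T + V_em             # Equation (3)


def var_T(F):
    """Var(T_hat) for a given F, second line of Equation (88)."""
    A = np.eye(n_a) - F @ G1
    return A @ M @ A.T + F @ V_em @ F.T


F_mlca = M @ G1.T @ np.linalg.inv(D)        # Equation (91)
gradient = 2 * F_mlca @ D - 2 * M @ G1.T    # Equation (89) evaluated at F_mlca
trace_opt = np.trace(var_T(F_mlca))
trace_perturbed = [
    np.trace(var_T(F_mlca + 0.1 * rng.standard_normal(F_mlca.shape)))
    for _ in range(20)
]

print(np.abs(gradient).max(), trace_opt, min(trace_perturbed))
```

Because $D$ is positive definite in this toy setting, the trace is a strictly convex quadratic in $F$, so the stationary point is the unique minimizer.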

3.2.2. Derivation of variances

Given that Equations (6)–(8) can be derived with no use of the assumption of linear estimation (cf. Sections 4.2–4.4 in reference [9]), we can adopt these equations also in MLCA. Thus, we can understand that the prediction accuracies of MLCA are also equivalent to those of CBCA.

3.3. Derivation of extended bias factor method

Next, let us derive MLEB by using Equation (71) as the assumption of linear estimation.

3.3.1. Derivation of estimate of design target core parameters

The calculation values of the design target core parameters obtained using the unadjusted cross-section set can be written as

\begin{aligned} R_{c}^{(2)} (T_{0}) &= R_{t}^{(2)} + (R_{c}^{(2)} (T_{0}) - R_{c}^{(2)} (T_{t})) + (R_{c}^{(2)} (T_{t}) - R_{t}^{(2)}) \\ &\approx R_{t}^{(2)} + G^{(2)} (T_{0} - T_{t}) + Δ m^{(2)} \\ &= R_{t}^{(2)} + G^{(2)} Δ T_{0} + Δ m^{(2)}, \end{aligned}

(94)

where Equations (75), (14), and (12) are used. Substituting Equations (94) and (76) in Equation (71), we have

\begin{aligned} {\hat{R}}^{(2)} &= R_{t}^{(2)} + G^{(2)} Δ T_{0} + Δ m^{(2)} + F (Δ e^{(1)} - Δ m^{(1)} - G^{(1)} Δ T_{0}) \\ &= R_{t}^{(2)} + (G^{(2)} - F G^{(1)}) Δ T_{0} + F Δ e^{(1)} - F Δ m^{(1)} + Δ m^{(2)} . \end{aligned}

(95)

Thus, the variance of ${\hat{R}}^{(2)}$ can be written as

\begin{aligned} \mathrm{Var} ({\hat{R}}^{(2)}) &= E (({\hat{R}}^{(2)} - R_{t}^{(2)}) {({\hat{R}}^{(2)} - R_{t}^{(2)})}^{T}) \\ &= (G^{(2)} - F G^{(1)}) E (Δ T_{0} Δ T_{0}^{T}) {(G^{(2)} - F G^{(1)})}^{T} + F E (Δ e^{(1)} Δ e^{(1) T}) F^{T} + F E (Δ m^{(1)} Δ m^{(1) T}) F^{T} \\ &\quad + E (Δ m^{(2)} Δ m^{(2) T}) - F E (Δ m^{(1)} Δ m^{(2) T}) - E (Δ m^{(2)} Δ m^{(1) T}) F^{T} \\ &= G^{(2)} M G^{(2) T} + F G^{(1)} M G^{(1) T} F^{T} - G^{(2)} M G^{(1) T} F^{T} - F G^{(1)} M G^{(2) T} + F V_{e}^{(1)} F^{T} + F V_{m}^{(1)} F^{T} \\ &\quad + V_{m}^{(2)} - F V_{m}^{(12)} - V_{m}^{(12) T} F^{T} \\ &= G^{(2)} M G^{(2) T} + F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) F^{T} - G^{(2)} M G^{(1) T} F^{T} - {(G^{(2)} M G^{(1) T} F^{T})}^{T} \\ &\quad + V_{m}^{(2)} - {(V_{m}^{(12) T} F^{T})}^{T} - V_{m}^{(12) T} F^{T}, \end{aligned}

(96)

where Equations (84), (80)–(83), and (4) are used. To determine $F$ that minimizes the variance, let us consider the partial derivative:

\begin{aligned} \frac{\partial}{\partial F} \mathrm{tr} (\mathrm{Var} ({\hat{R}}^{(2)})) &= \frac{\partial}{\partial F} \mathrm{tr} (F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) F^{T}) - 2 \frac{\partial}{\partial F} \mathrm{tr} (G^{(2)} M G^{(1) T} F^{T}) - 2 \frac{\partial}{\partial F} \mathrm{tr} (V_{m}^{(12) T} F^{T}) \\ &= 2 F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) - 2 G^{(2)} M G^{(1) T} - 2 V_{m}^{(12) T}, \end{aligned}

(97)

where Equations (51)–(55) are used. We can determine the linear combination factor matrix that minimizes the variance by setting Equation (97) equal to zero. Letting $F_{M L E B}$ denote the minimizing factor, we have

F_{M L E B} (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) = G^{(2)} M G^{(1) T} + V_{m}^{(12) T} .

(98)

Postmultiplying both sides of this equation by $D^{- 1}$ yields

F_{M L E B} = (G^{(2)} M G^{(1) T} + V_{m}^{(12) T}) D^{- 1} .

(99)

Substituting $F_{M L E B}$ of this equation for $F$ in Equation (71) and denoting the resulting ${\hat{R}}^{(2)}$ by $R_{M L E B}^{(2)}$, we have

R_{M L E B}^{(2)} = R_{c}^{(2)} (T_{0}) + (G^{(2)} M G^{(1) T} + V_{m}^{(12) T}) D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) .

(100)

3.3.2. Comparison with original extended bias factor method

From a comparison between Equations (100) and (24), we obtain

R_{M L E B}^{(2)} = R_{E B}^{(2)} .

(101)

Thus, we can interpret EB as a kind of linear Kalman filter in which the design target core parameters are estimated directly as the state vector.
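The MLEB factor of Equation (99) can be checked in the same way as the MLCA factor. The sketch below is illustrative only: the matrices are random, and the joint covariance `V_m` of the analysis method errors (partitioned into $V_{m}^{(1)}$, $V_{m}^{(12)}$, and $V_{m}^{(2)}$) and the helper names are assumptions made for the example. It evaluates the variance of Equation (96) at $F_{M L E B}$, at $F = O$ (no correction), and at randomly perturbed factors.

```python
import numpy as np

rng = np.random.default_rng(2)
n_a, n1, n2 = 6, 4, 2  # hypothetical sizes


def random_spd(n):
    """Return a random symmetric positive-definite matrix (toy covariance)."""
    a = rng.standard_normal((n, n))
    return a @ a.T + n * np.eye(n)


M = random_spd(n_a)
G1 = rng.standard_normal((n1, n_a))
G2 = rng.standard_normal((n2, n_a))
V_e = random_spd(n1)
V_m = random_spd(n1 + n2)  # joint covariance of (dm1, dm2), an assumption
V_m1, V_m12, V_m2 = V_m[:n1, :n1], V_m[:n1, n1:], V_m[n1:, n1:]
D = G1 @ M @ G1.T + V_e + V_m1  # Equation (3)


def var_R2(F):
    """Var(R_hat^(2)) for a given F, second equality of Equation (96)."""
    A = G2 - F @ G1
    return (A @ M @ A.T + F @ (V_e + V_m1) @ F.T
            + V_m2 - F @ V_m12 - V_m12.T @ F.T)


F_mleb = (G2 @ M @ G1.T + V_m12.T) @ np.linalg.inv(D)  # Equation (99)
trace_opt = np.trace(var_R2(F_mleb))
trace_no_correction = np.trace(var_R2(np.zeros((n2, n1))))
trace_perturbed = [
    np.trace(var_R2(F_mleb + 0.1 * rng.standard_normal(F_mleb.shape)))
    for _ in range(20)
]

print(trace_opt, trace_no_correction, min(trace_perturbed))
```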

3.4. Derivation of extended cross-section adjustment method

Finally, let us derive MLEA by using Equation (69) as the assumption of linear estimation.

3.4.1. Minimization of variance of design target core parameters

Similarly to Equation (94), the calculation values of the design target core parameters obtained using the estimated cross-section set can be written as

\begin{aligned} R_{c}^{(2)} (\hat{T}) &= R_{t}^{(2)} + (R_{c}^{(2)} (\hat{T}) - R_{c}^{(2)} (T_{t})) + (R_{c}^{(2)} (T_{t}) - R_{t}^{(2)}) \\ &\approx R_{t}^{(2)} + G^{(2)} (\hat{T} - T_{t}) + Δ m^{(2)} \\ &= R_{t}^{(2)} + G^{(2)} (\hat{T} - T_{0}) + G^{(2)} Δ T_{0} + Δ m^{(2)} . \end{aligned}

(102)

Substituting Equations (69) and (76) in Equation (102), we have

\begin{aligned} R_{c}^{(2)} (\hat{T}) &= R_{t}^{(2)} + G^{(2)} F (Δ e^{(1)} - Δ m^{(1)} - G^{(1)} Δ T_{0}) + G^{(2)} Δ T_{0} + Δ m^{(2)} \\ &= R_{t}^{(2)} + (G^{(2)} - G^{(2)} F G^{(1)}) Δ T_{0} + G^{(2)} F Δ e^{(1)} - G^{(2)} F Δ m^{(1)} + Δ m^{(2)} . \end{aligned}

(103)

Thus, the variance of $R_{c}^{(2)} (\hat{T})$ can be written as

\begin{aligned} \mathrm{Var} (R_{c}^{(2)} (\hat{T})) &= E ((R_{c}^{(2)} (\hat{T}) - R_{t}^{(2)}) {(R_{c}^{(2)} (\hat{T}) - R_{t}^{(2)})}^{T}) \\ &= (G^{(2)} - G^{(2)} F G^{(1)}) E (Δ T_{0} Δ T_{0}^{T}) {(G^{(2)} - G^{(2)} F G^{(1)})}^{T} + (G^{(2)} F) E (Δ e^{(1)} Δ e^{(1) T}) {(G^{(2)} F)}^{T} \\ &\quad + (G^{(2)} F) E (Δ m^{(1)} Δ m^{(1) T}) {(G^{(2)} F)}^{T} + E (Δ m^{(2)} Δ m^{(2) T}) - (G^{(2)} F) E (Δ m^{(1)} Δ m^{(2) T}) - E (Δ m^{(2)} Δ m^{(1) T}) {(G^{(2)} F)}^{T} \\ &= G^{(2)} M G^{(2) T} + G^{(2)} F G^{(1)} M G^{(1) T} F^{T} G^{(2) T} - G^{(2)} M G^{(1) T} F^{T} G^{(2) T} - G^{(2)} F G^{(1)} M G^{(2) T} \\ &\quad + G^{(2)} F V_{e}^{(1)} F^{T} G^{(2) T} + G^{(2)} F V_{m}^{(1)} F^{T} G^{(2) T} + V_{m}^{(2)} - G^{(2)} F V_{m}^{(12)} - V_{m}^{(12) T} F^{T} G^{(2) T} \\ &= G^{(2)} M G^{(2) T} + G^{(2)} F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) F^{T} G^{(2) T} - G^{(2)} M G^{(1) T} F^{T} G^{(2) T} - {(G^{(2)} M G^{(1) T} F^{T} G^{(2) T})}^{T} \\ &\quad + V_{m}^{(2)} - {(V_{m}^{(12) T} F^{T} G^{(2) T})}^{T} - V_{m}^{(12) T} F^{T} G^{(2) T}, \end{aligned}

(104)

where Equations (84), (80)–(83), and (4) are used. To determine $F$ that minimizes the variance, let us consider the partial derivative:

\begin{aligned} \frac{\partial}{\partial F} \mathrm{tr} (\mathrm{Var} (R_{c}^{(2)} (\hat{T}))) &= \frac{\partial}{\partial F} \mathrm{tr} (G^{(2)} F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) F^{T} G^{(2) T}) - 2 \frac{\partial}{\partial F} \mathrm{tr} (G^{(2)} M G^{(1) T} F^{T} G^{(2) T}) - 2 \frac{\partial}{\partial F} \mathrm{tr} (V_{m}^{(12) T} F^{T} G^{(2) T}) \\ &= 2 G^{(2) T} G^{(2)} F (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) - 2 G^{(2) T} G^{(2)} M G^{(1) T} - 2 G^{(2) T} V_{m}^{(12) T}, \end{aligned}

(105)

where Equations (51)–(55) are used. By setting Equation (105) equal to zero, we can determine the linear combination factor matrix that minimizes the variance. Letting $F_{M L E A}$ denote the minimizing factor, we have

G^{(2) T} G^{(2)} F_{M L E A} (G^{(1)} M G^{(1) T} + V_{e + m}^{(1)}) = G^{(2) T} (G^{(2)} M G^{(1) T} + V_{m}^{(12) T}) .

(106)

Postmultiplying both sides of this equation by $D^{- 1}$ yields

G^{(2) T} G^{(2)} F_{M L E A} = G^{(2) T} (G^{(2)} M G^{(1) T} + V_{m}^{(12) T}) D^{- 1} .

(107)

Note that $G^{(2) T} G^{(2)}$ is a singular matrix in light of Equation (73): it is an $n_{a} \times n_{a}$ matrix whose rank cannot exceed $n^{(2)} < n_{a}$. Therefore, Equation (107) has infinitely many solutions for $F_{M L E A}$. In other words, we cannot uniquely determine the adjusted cross-section set for MLEA. Although a minimum norm solution was adopted in the derivation of MREA and MSEA to determine a unique solution by using the Moore–Penrose pseudoinverse (cf. Equation (64) in reference [9]), this treatment remains open to discussion. In the present derivation, we explicitly treat these numerous solutions.

3.4.2. Derivation of numerous solutions of adjusted cross-section set

To express the solutions of Equation (107), we introduce the generalized inverse, which is defined for singular matrices, including rectangular ones. Note that the generalized inverse of a matrix is not unique. The Moore–Penrose pseudoinverse is a special case of the generalized inverse and is determined uniquely. By definition, for the generalized inverse of an arbitrary matrix $A$, we have

A A^{-} A = A,

(108)

where the superscript ‘$-$’ denotes the generalized inverse. Furthermore, the generalized inverse of a matrix of full row rank is a right inverse, even if it is not the Moore–Penrose pseudoinverse (e.g. Lemma 9.2.8 in reference [13]). That is, for a matrix $B$ of full row rank, we always have

B B^{-} = I .

(109)

In light of Equations (73) and (74), $G^{(2)}$ has full row rank. Hence, $G^{(2) -}$ is a right inverse of $G^{(2)}$:

G^{(2)} G^{(2) -} = I .

(110)

Thus, Equation (107) can be rewritten as

G^{(2) T} G^{(2)} F_{M L E A} = G^{(2) T} G^{(2)} (M G^{(1) T} + G_{0}^{(2) -} V_{m}^{(12) T}) D^{- 1},

(111)

where $G_{0}^{(2) -}$ is a generalized inverse of $G^{(2)}$. To indicate that $G_{0}^{(2) -}$ is a particular one of the numerous generalized inverses of $G^{(2)}$, we have introduced the new symbol $G_{0}^{(2) -}$.
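The right-inverse property of Equations (109) and (110) can be observed on a small example. The sketch below builds a generalized inverse of a wide, full-row-rank matrix that differs from the Moore–Penrose pseudoinverse; the construction $G^{+} + (I - G^{+} G) W$ with an arbitrary $W$ is one convenient way to obtain such a matrix and is an assumption of this example, as are the sizes.

```python
import numpy as np

rng = np.random.default_rng(3)
n2, n_a = 2, 5  # hypothetical sizes with n2 < n_a, as in Equation (73)

G2 = rng.standard_normal((n2, n_a))  # wide matrix; full row rank here
G2_pinv = np.linalg.pinv(G2)         # Moore-Penrose pseudoinverse
W = rng.standard_normal((n_a, n2))   # arbitrary matrix

# A generalized inverse that is not the pseudoinverse (illustrative choice).
G2_ginv = G2_pinv + (np.eye(n_a) - G2_pinv @ G2) @ W

print(np.linalg.matrix_rank(G2) == n2)          # Equation (74)
print(np.allclose(G2 @ G2_ginv @ G2, G2))       # Equation (108)
print(np.allclose(G2 @ G2_ginv, np.eye(n2)))    # Equation (110)
print(np.allclose(G2_ginv @ G2, np.eye(n_a)))   # left-inverse check
```

The last check is included because a wide matrix cannot have a left inverse: $G^{(2) -} G^{(2)}$ has rank at most $n^{(2)} < n_{a}$, which is what leaves room for the term $(I - G_{1}^{(2) -} G^{(2)}) Y$ in the general solution.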

To simplify Equation (111), let us define

B_{M L E A 0} \equiv M G^{(1) T} + G_{0}^{(2) -} V_{m}^{(12) T},

(112)

then Equation (111) can be rewritten as

G^{(2) T} (G^{(2)} F_{M L E A} - G^{(2)} B_{M L E A 0} D^{- 1}) = O .

(113)

Premultiplying both sides of this equation by ${(F_{M L E A} - B_{M L E A 0} D^{- 1})}^{T}$ , we obtain

{(G^{(2)} F_{M L E A} - G^{(2)} B_{M L E A 0} D^{- 1})}^{T} (G^{(2)} F_{M L E A} - G^{(2)} B_{M L E A 0} D^{- 1}) = O,

(114)

where Equation (49) is used. In general, for an arbitrary matrix $A$, we have $A = O$ if and only if $A^{T} A = O$ (e.g. Corollary 5.3.2 in reference [13]). Hence, Equation (114) means that Equation (111) is equivalent to the equation:

G^{(2)} F_{M L E A} = G^{(2)} B_{M L E A 0} D^{- 1} .

(115)

Meanwhile, all the solutions of $A X = B$ can be expressed by $X^{*} = X_{0} + Z^{*}$, where $X_{0}$ is an arbitrary particular solution of $A X = B$ and $Z^{*}$ ranges over all the solutions of $A Z = O$ (e.g. Theorem 11.2.3 in reference [13]). Moreover, all the solutions of $A Z = O$ can be expressed by $Z^{*} = (I - A^{-} A) Y$, where $Y$ is an arbitrary matrix (e.g. Theorem 11.2.1 in reference [13]).

Let us adopt $B_{M L E A 0} D^{- 1}$ as a particular solution of Equation (115) for $F_{M L E A}$; then we obtain the following expression for all possible solutions of Equation (115):

F_{M L E A}^{*} = B_{M L E A 0} D^{- 1} + (I - G_{1}^{(2) -} G^{(2)}) Y,

(116)

where the superscript ‘$*$’ indicates that there are numerous solutions; $G_{1}^{(2) -}$ is a generalized inverse of $G^{(2)}$; and $Y \in \mathbb{R}^{n_{a} \times n^{(1)}}$ is an arbitrary matrix. Note that the generalized inverse $G_{1}^{(2) -}$ may differ from the previously defined $G_{0}^{(2) -}$. Hence, we have introduced the new symbol $G_{1}^{(2) -}$.

Substituting Equation (116) in Equation (69), we obtain the following equation that represents all the solutions of the adjusted cross-section set in MLEA:

T_{M L E A}^{*} = T_{0} + (B_{M L E A 0} + (I - G_{1}^{(2) -} G^{(2)}) Y D) D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) .

(117)
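The non-uniqueness expressed by Equations (116) and (117) can be made concrete with a few randomly generated members of the solution family. The sketch below assumes toy random matrices; the helper `ginv`, which builds generalized inverses of $G^{(2)}$ from the pseudoinverse, and all variable names are hypothetical. It checks that every member satisfies Equation (107) and yields the same product $G^{(2)} F$, although the factors themselves differ.

```python
import numpy as np

rng = np.random.default_rng(4)
n_a, n1, n2 = 6, 4, 2  # hypothetical sizes


def random_spd(n):
    """Return a random symmetric positive-definite matrix (toy covariance)."""
    a = rng.standard_normal((n, n))
    return a @ a.T + n * np.eye(n)


M = random_spd(n_a)
G1 = rng.standard_normal((n1, n_a))
G2 = rng.standard_normal((n2, n_a))
V_e = random_spd(n1)
V_m = random_spd(n1 + n2)  # joint covariance of (dm1, dm2), an assumption
V_m1, V_m12 = V_m[:n1, :n1], V_m[:n1, n1:]
D_inv = np.linalg.inv(G1 @ M @ G1.T + V_e + V_m1)
G2_pinv = np.linalg.pinv(G2)
I = np.eye(n_a)


def ginv(W):
    """A generalized inverse of G2 built from the pseudoinverse (assumed form)."""
    return G2_pinv + (I - G2_pinv @ G2) @ W


def F_mlea(ginv_0, ginv_1, Y):
    """Equation (116) for chosen generalized inverses and an arbitrary Y."""
    B0 = M @ G1.T + ginv_0 @ V_m12.T  # Equation (112)
    return B0 @ D_inv + (I - ginv_1 @ G2) @ Y


rhs = G2.T @ (G2 @ M @ G1.T + V_m12.T) @ D_inv  # right side of Equation (107)
solutions = [
    F_mlea(ginv(rng.standard_normal((n_a, n2))),
           ginv(rng.standard_normal((n_a, n2))),
           rng.standard_normal((n_a, n1)))
    for _ in range(5)
]
residuals = [np.abs(G2.T @ G2 @ F - rhs).max() for F in solutions]
spread = np.abs(solutions[0] - solutions[1]).max()

print(max(residuals), spread)
```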

3.4.3. Derivation of variances

Similarly to MLCA, since Equations (6)–(8) can be derived with no use of the assumption of linear estimation (cf. Sections 4.2–4.4 in reference [9]), we can adopt these equations in MLEA:

M_{M L E A}^{*} = M + B_{M L E A}^{*} D^{- 1} B_{M L E A}^{* T} - M G^{(1) T} D^{- 1} B_{M L E A}^{* T} - B_{M L E A}^{*} D^{- 1} G^{(1)} M,

(118)

\mathrm{Var} (R_{c}^{(1)} (T_{M L E A}^{*})) = G^{(1)} M_{M L E A}^{*} G^{(1) T} + V_{m}^{(1)} - F_{M L E A}^{*} V_{m}^{(1)} - V_{m}^{(1)} F_{M L E A}^{* T},

(119)

and

\mathrm{Var} (R_{c}^{(2)} (T_{M L E A}^{*})) = G^{(2)} M_{M L E A}^{*} G^{(2) T} + V_{m}^{(2)} - L_{M L E A}^{*} V_{m}^{(12)} - V_{m}^{(12) T} L_{M L E A}^{* T},

(120)

where

L_{M L E A}^{*} \equiv G^{(2)} B_{M L E A}^{*} D^{- 1} .

(121)

We can use these equations in the application of MLEA to nuclear reactor design. For instance, we can evaluate the variances (i.e. accuracies) of the design target core parameters by using Equation (120).
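As an illustration of how Equations (118), (120), and (121) might be evaluated, the sketch below compares two members of the MLEA family on random toy data. The first uses the Moore–Penrose pseudoinverse with $Y = O$, which corresponds to the CBEA case of Equation (2); the second adds a nonzero $Y$. All names and sizes are assumptions of the example. The variance of the design target core parameters should coincide for both, whereas the trace of the nuclear data covariance should be smaller for the pseudoinverse case, consistent with the statement in the abstract.

```python
import numpy as np

rng = np.random.default_rng(5)
n_a, n1, n2 = 6, 4, 2  # hypothetical sizes


def random_spd(n):
    """Return a random symmetric positive-definite matrix (toy covariance)."""
    a = rng.standard_normal((n, n))
    return a @ a.T + n * np.eye(n)


M = random_spd(n_a)
G1 = rng.standard_normal((n1, n_a))
G2 = rng.standard_normal((n2, n_a))
V_e = random_spd(n1)
V_m = random_spd(n1 + n2)  # joint covariance of (dm1, dm2), an assumption
V_m1, V_m12, V_m2 = V_m[:n1, :n1], V_m[:n1, n1:], V_m[n1:, n1:]
D = G1 @ M @ G1.T + V_e + V_m1
D_inv = np.linalg.inv(D)
G2_pinv = np.linalg.pinv(G2)


def M_star(B):
    """Adjusted nuclear data covariance, Equation (118)."""
    return (M + B @ D_inv @ B.T
            - M @ G1.T @ D_inv @ B.T - B @ D_inv @ G1 @ M)


def var_R2(B):
    """Variance of the design target core parameters, Equations (120)-(121)."""
    L = G2 @ B @ D_inv
    return G2 @ M_star(B) @ G2.T + V_m2 - L @ V_m12 - V_m12.T @ L.T


B_cbea = M @ G1.T + G2_pinv @ V_m12.T  # pseudoinverse, Y = O
Y = rng.standard_normal((n_a, n1))     # arbitrary matrix
B_other = B_cbea + (np.eye(n_a) - G2_pinv @ G2) @ Y @ D

same_target_variance = np.allclose(var_R2(B_cbea), var_R2(B_other))
trace_cbea = np.trace(M_star(B_cbea))
trace_other = np.trace(M_star(B_other))

print(same_target_variance, trace_cbea, trace_other)
```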

3.5. Derivation of regressive cross-section adjustment method

Although it is known that RA is impractical as a neutronics design method, we derive MLRA since it is meaningful for understanding the difference of the optimization target in comparison with CA and EA. By following a derivation procedure similar to that of MLEA, we can derive the formulation of MLRA while taking into account the difference of the optimization target. That is, by replacing the superscript ‘$(2)$’ in the derivation of MLEA with the superscript ‘$(1)$’, we can obtain the formula for the adjusted cross-section set in MLRA:

T_{M L R A}^{*} = T_{0} + (B_{M L C A 0} + (I - G_{1}^{(1) -} G^{(1)}) Y D) D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})),

(122)

where

B_{M L C A 0} \equiv M G^{(1) T} + G_{0}^{(1) -} V_{m}^{(1)},

(123)

and $G_{0}^{(1) -}$ and $G_{1}^{(1) -}$ are particular ones of the generalized inverses of $G^{(1)}$ . Similarly to MLEA, the following assumptions are required additionally to derive MLRA:

n^{(1)} < n_{a},

(124)

and

\mathrm{rank} (G^{(1)}) = n^{(1)} .

(125)

Meanwhile, to derive Equation (122), we have used the fact that $V_{m}^{(11)} = V_{m}^{(1)}$ holds in light of Equations (82) and (83).

3.6. Summary of derived results by unified formula

In accordance with Equations (1) and (2), $B_{x}$ for MLCA can be defined by

B_{x} \equiv M G^{(1) T} (x = M L C A),

(126)

and $B_{x}^{*}$ for MLEA and MLRA can be defined by

B_{x}^{*} \equiv \begin{cases} M G^{(1) T} + G_{0}^{(2) -} V_{m}^{(12) T} + (I - G_{1}^{(2) -} G^{(2)}) Y D & (x = M L E A) \\ M G^{(1) T} + G_{0}^{(1) -} V_{m}^{(1)} + (I - G_{1}^{(1) -} G^{(1)}) Y D & (x = M L R A), \end{cases}

(127)

where

$Y \in \mathbb{R}^{n_{a} \times n^{(1)}}$ : an arbitrary matrix; and

$G_{0}^{(i) -}, G_{1}^{(i) -} \in \mathbb{R}^{n_{a} \times n^{(i)}} (i = 1, 2)$ : particular generalized inverses of $G^{(i)}$; $G_{0}^{(i) -}$ and $G_{1}^{(i) -}$ may be chosen independently.

4. Discussion

4.1. Consistency of MLEA with EB

Let us confirm that MLEA reproduces the design prediction values of EB. Since the design prediction values are obtained using the adjusted cross-section set, we can confirm that the prediction values of MLEA are equal to those of Equation (24):

\begin{aligned} R_{c}^{(2)} (T_{M L E A}^{*}) &= R_{c}^{(2)} (T_{0}) + G^{(2)} (T_{M L E A}^{*} - T_{0}) \\ &= R_{c}^{(2)} (T_{0}) + G^{(2)} (B_{M L E A 0} + (I - G_{1}^{(2) -} G^{(2)}) Y D) D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) \\ &= R_{c}^{(2)} (T_{0}) + G^{(2)} B_{M L E A 0} D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) \\ &= R_{E B}^{(2)}, \end{aligned}

(128)

where Equations (75), (1), (127), (112), and (108) are used.

4.2. Quasi-solutions of CA and compensation effect

As an analogy from the foregoing discussion, we can deduce that all the adjusted cross-section sets defined by the following equation can reproduce the calculation values of the integral experimental quantities of CBCA:

$$T_{\mathrm{quasi\text{-}CA}}^{*} = T_{0} + (M G^{(1) T} + (I - G^{(1) -} G^{(1)}) Y D) D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) . \tag{129}$$

That is, for all the adjusted cross-section sets expressed by this equation, we have

$$\begin{aligned} R_{c}^{(1)} (T_{\mathrm{quasi\text{-}CA}}^{*}) &= R_{c}^{(1)} (T_{0}) + G^{(1)} (T_{\mathrm{quasi\text{-}CA}}^{*} - T_{0}) \\ &= R_{c}^{(1)} (T_{0}) + G^{(1)} M G^{(1) T} D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})) \\ &= R_{c}^{(1)} (T_{0}) + G^{(1)} (T_{\mathrm{CBCA}} - T_{0}) \\ &= R_{c}^{(1)} (T_{\mathrm{CBCA}}), \end{aligned} \tag{130}$$

where Equations (75), (129), (108), (1), and (2) are used. Note, however, that these adjusted cross-section sets are not always the optimal solution that minimizes the variance of the nuclear data.

Thus, we can understand that there are infinitely many adjusted cross-section sets if the only requirement is to reproduce the calculation values of the integral experimental quantities. This fact is known as the ‘compensation effect’ [15] in the use of integral experimental benchmarks in nuclear data evaluations.
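The step from the first to the second line of Equation (130) relies only on the defining property of a generalized inverse. As a short worked check that introduces no new symbols, the term containing the arbitrary matrix $Y$ drops out because

$$G^{(1)} (I - G^{(1) -} G^{(1)}) Y D = (G^{(1)} - G^{(1)} G^{(1) -} G^{(1)}) Y D = (G^{(1)} - G^{(1)}) Y D = O,$$

where Equation (108), $A A^{-} A = A$, is applied with $A = G^{(1)}$. Every choice of $Y$ therefore changes the adjusted cross-section set without changing $R_{c}^{(1)}$ within the linear approximation of Equation (75).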

4.3. Practical solution of MLEA

With regard to MLEA, we have obtained the mathematical formulation representing all the solutions of the adjusted cross-section set that minimize the variance of the design target core parameters. Next, let us consider which solution should be employed in practice. To determine uniquely the solution, we introduce an additional constraint condition. From a practical point of view, we adopt the constraint condition of minimizing the variance of the nuclear data.

4.3.1. Conjecture

Let us make a conjecture that the variance of the adjusted cross-section set in MLEA (cf. Equation (127)) is minimized when we adopt the case where $Y = O$ (n.b. the term of $G_{1}^{(2) -}$ vanishes) and $G_{0}^{(2) -} = G^{(2) +}$ .

Hence, we make the following definitions:

$$B_{\mathrm{MLEA}}^{\mathrm{min}} \equiv B_{\mathrm{MLEA}}^{*} |_{Y = O, G_{0}^{(2) -} = G^{(2) +}}, \tag{131}$$

$$T_{\mathrm{MLEA}}^{\mathrm{min}} \equiv T_{0} + B_{\mathrm{MLEA}}^{\mathrm{min}} D^{- 1} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})), \tag{132}$$

and

$$M_{\mathrm{MLEA}}^{\mathrm{min}} \equiv \mathrm{Var} (T_{\mathrm{MLEA}}^{\mathrm{min}}) . \tag{133}$$

Note that $B_{\mathrm{MLEA}}^{\mathrm{min}}$ is identical to $B_{x}$ for CBEA:

$$B_{\mathrm{MLEA}}^{\mathrm{min}} = M G^{(1) T} + G^{(2) +} V_{m}^{(12) T} = B_{\mathrm{CBEA}} . \tag{134}$$

In the following, we prove that the trace of $M_{\mathrm{MLEA}}^{\mathrm{min}}$ has the minimum value among the traces of $M_{\mathrm{MLEA}}^{*}$ :

$$\mathrm{tr} (M_{\mathrm{MLEA}}^{*}) \geq \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}}) . \tag{135}$$

4.3.2. Proof

The trace of Equation (6), which represents the covariance matrix of the adjusted cross-section set, can be expressed by

$$\mathrm{tr} (M_{x}) = \mathrm{tr} (M) + \mathrm{tr} (B_{x} D^{- 1} B_{x}^{T}) - 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} B_{x}^{T}), \tag{136}$$

where Equations (51) and (52) are used. Thus, the difference between the left-hand side and the right-hand side of Equation (135) can be written as

$$\begin{aligned} \mathrm{tr} (M_{\mathrm{MLEA}}^{*}) - \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}}) ={} & \mathrm{tr} (B_{\mathrm{MLEA}}^{*} D^{- 1} B_{\mathrm{MLEA}}^{* T}) - 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} B_{\mathrm{MLEA}}^{* T}) \\ & - \mathrm{tr} (B_{\mathrm{MLEA}}^{\mathrm{min}} D^{- 1} B_{\mathrm{MLEA}}^{\mathrm{min} T}) + 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} B_{\mathrm{MLEA}}^{\mathrm{min} T}) . \end{aligned} \tag{137}$$

To simplify this equation, let us define

$$P \equiv I - G_{1}^{(2) -} G^{(2)}, \tag{138}$$

then $B_{\mathrm{MLEA}}^{*}$ defined by Equation (127) can be rewritten as

$$B_{\mathrm{MLEA}}^{*} = B_{\mathrm{MLEA0}} + P Y D . \tag{139}$$

Note that $P$ is an idempotent matrix:

$$P^{2} = P . \tag{140}$$

In addition, with respect to $P$ , we have

$$G^{(2)} P = P G^{(2) -} = O . \tag{141}$$

To derive Equations (140) and (141), we have used Equation (108).

Substituting Equation (139) in Equation (137), we obtain

$$\begin{aligned} & \mathrm{tr} (M_{\mathrm{MLEA}}^{*}) - \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}}) \\ &= \mathrm{tr} ((B_{\mathrm{MLEA0}} + P Y D) D^{- 1} {(B_{\mathrm{MLEA0}} + P Y D)}^{T}) - 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} {(B_{\mathrm{MLEA0}} + P Y D)}^{T}) \\ &\quad - \mathrm{tr} (B_{\mathrm{MLEA}}^{\mathrm{min}} D^{- 1} B_{\mathrm{MLEA}}^{\mathrm{min} T}) + 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} B_{\mathrm{MLEA}}^{\mathrm{min} T}) \\ &= \mathrm{tr} (P Y D Y^{T} P^{T}) + \mathrm{tr} (B_{\mathrm{MLEA0}} D^{- 1} B_{\mathrm{MLEA0}}^{T}) + 2 \, \mathrm{tr} (B_{\mathrm{MLEA0}} Y^{T} P^{T}) \\ &\quad - 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} B_{\mathrm{MLEA0}}^{T}) - 2 \, \mathrm{tr} (M G^{(1) T} Y^{T} P^{T}) \\ &\quad - \mathrm{tr} (B_{\mathrm{MLEA}}^{\mathrm{min}} D^{- 1} B_{\mathrm{MLEA}}^{\mathrm{min} T}) + 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} B_{\mathrm{MLEA}}^{\mathrm{min} T}), \end{aligned} \tag{142}$$

where Equations (52) and (49) are used. Substituting Equations (112) and (134) into Equation (142), we obtain

$$\begin{aligned} & \mathrm{tr} (M_{\mathrm{MLEA}}^{*}) - \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}}) \\ &= \mathrm{tr} (P Y D Y^{T} P^{T}) + \mathrm{tr} ((M G^{(1) T} + G_{0}^{(2) -} V_{m}^{(12) T}) D^{- 1} {(M G^{(1) T} + G_{0}^{(2) -} V_{m}^{(12) T})}^{T}) \\ &\quad + 2 \, \mathrm{tr} ((M G^{(1) T} + G_{0}^{(2) -} V_{m}^{(12) T}) Y^{T} P^{T}) - 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} {(M G^{(1) T} + G_{0}^{(2) -} V_{m}^{(12) T})}^{T}) \\ &\quad - 2 \, \mathrm{tr} (M G^{(1) T} Y^{T} P^{T}) - \mathrm{tr} ((M G^{(1) T} + G^{(2) +} V_{m}^{(12) T}) D^{- 1} {(M G^{(1) T} + G^{(2) +} V_{m}^{(12) T})}^{T}) \\ &\quad + 2 \, \mathrm{tr} (M G^{(1) T} D^{- 1} {(M G^{(1) T} + G^{(2) +} V_{m}^{(12) T})}^{T}) \\ &= \mathrm{tr} (P Y D Y^{T} P^{T}) + 2 \, \mathrm{tr} (G_{0}^{(2) -} V_{m}^{(12) T} Y^{T} P^{T}) \\ &\quad + \mathrm{tr} (G_{0}^{(2) -} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} G_{0}^{(2) - T}) - \mathrm{tr} (G^{(2) +} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} G^{(2) + T}) . \end{aligned} \tag{143}$$

Note that Equation (143) is considered as a function of $Y$ , $G_{0}^{(2) -}$ , and $P$ .

Meanwhile, if a specific generalized inverse of an $m \times n$ matrix $A$ is given by $A^{\circ}$ , all the generalized inverses of $A$ can be expressed by

$$A^{-} = A^{\circ} + Z - A^{\circ} A Z A A^{\circ}, \tag{144}$$

where $Z$ is an $n \times m$ arbitrary matrix (e.g. Theorem 9.2.7 in reference [13]). Since the Moore–Penrose pseudoinverse is a special case of the generalized inverse, we may adopt $G^{(2) +}$ as the specific generalized inverse in the above equation. Hence, by using an arbitrary matrix $Z_{0} \in R^{n_{a} \times n^{(2)}}$ , we can express all the generalized inverses in the form:

$$\begin{aligned} G_{0}^{(2) -} &= G^{(2) +} + Z_{0} - G^{(2) +} G^{(2)} Z_{0} G^{(2)} G^{(2) +} \\ &= G^{(2) +} + Q Z_{0}, \end{aligned} \tag{145}$$

where

$$Q \equiv I - G^{(2) +} G^{(2)} . \tag{146}$$

To derive Equation (145), we have used that $G^{(2)} G^{(2) +} = I$ in light of Equation (110). Note that $Q$ is symmetric and idempotent:

$$Q^{T} = Q = Q^{2} . \tag{147}$$

In addition, with regard to $Q$ , we have

$$G^{(2)} Q = Q G^{(2) +} = O . \tag{148}$$

Similarly to $G_{0}^{(2) -}$ , by using an arbitrary matrix $Z_{1} \in R^{n_{a} \times n^{(2)}}$ , all the generalized inverses can be expressed by

$$G_{1}^{(2) -} = G^{(2) +} + Q Z_{1} . \tag{149}$$

Thus, Equation (138) can be rewritten as

$$\begin{aligned} P &= I - (G^{(2) +} + Q Z_{1}) G^{(2)} \\ &= Q (I - Z_{1} G^{(2)}), \end{aligned} \tag{150}$$

where Equation (146) is used. Note that $G_{0}^{(2) -}$ is a function of $Z_{0}$ as shown in Equation (145), and that $P$ is a function of $Z_{1}$ as shown in Equation (150). Hence, we can express Equation (143) as a function of arbitrary matrices $Y$ , $Z_{0}$ , and $Z_{1}$ by substituting Equations (145) and (150) in Equation (143).
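The algebraic properties collected in Equations (138) to (150) are easy to spot-check numerically. The sketch below is an illustration under stated assumptions: $G^{(2)}$ is a random matrix (full row rank with probability one), $Z_{1}$ is random, and $G^{(2) -}$ in Equation (141) is read as $G_{1}^{(2) -}$, which is our interpretation rather than something the text states. The dictionary `checks` and all variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
n2, na = 3, 36

G2 = rng.standard_normal((n2, na))     # stand-in for G^(2)
G2_pinv = np.linalg.pinv(G2)           # Moore-Penrose pseudoinverse G^(2)+
I = np.eye(na)
Q = I - G2_pinv @ G2                   # Equation (146)

Z1 = rng.standard_normal((na, n2))     # arbitrary matrix Z_1
G1_inv = G2_pinv + Q @ Z1              # Equation (149)
P = I - G1_inv @ G2                    # Equation (138)

# General form of Equation (144) with A = G^(2) and A° = G^(2)+.
G1_inv_general = G2_pinv + Z1 - G2_pinv @ G2 @ Z1 @ G2 @ G2_pinv

checks = {
    "(144) reduces to (149)": np.allclose(G1_inv_general, G1_inv),
    "(108) G G^- G = G": np.allclose(G2 @ G1_inv @ G2, G2),
    "(110) G G^- = I": np.allclose(G2 @ G1_inv, np.eye(n2)),
    "(140) P^2 = P": np.allclose(P @ P, P),
    "(141) G P = O": np.allclose(G2 @ P, 0.0),
    "(141) P G_1^- = O": np.allclose(P @ G1_inv, 0.0),
    "(147) Q^T = Q = Q^2": np.allclose(Q, Q.T) and np.allclose(Q @ Q, Q),
    "(148) G Q = O and Q G^+ = O": np.allclose(G2 @ Q, 0.0) and np.allclose(Q @ G2_pinv, 0.0),
    "(150) P = Q (I - Z_1 G)": np.allclose(P, Q @ (I - Z1 @ G2)),
}
for name, ok in checks.items():
    print(name, ok)
```

Note that $P$ is idempotent but in general not symmetric, whereas $Q$ is both; this is why the proof rewrites $P$ in terms of $Q$ before bounding the trace.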

First, substituting Equation (145) into Equation (143), we obtain

$$\begin{aligned} & \mathrm{tr} (M_{\mathrm{MLEA}}^{*}) - \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}}) \\ &= \mathrm{tr} (P Y D Y^{T} P^{T}) + 2 \, \mathrm{tr} ((G^{(2) +} + Q Z_{0}) V_{m}^{(12) T} Y^{T} P^{T}) \\ &\quad + \mathrm{tr} ((G^{(2) +} + Q Z_{0}) V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} {(G^{(2) +} + Q Z_{0})}^{T}) - \mathrm{tr} (G^{(2) +} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} G^{(2) + T}) \\ &= \mathrm{tr} (P Y D Y^{T} P^{T}) + 2 \, \mathrm{tr} ((G^{(2) +} + Q Z_{0}) V_{m}^{(12) T} Y^{T} P^{T}) \\ &\quad + \mathrm{tr} (Q Z_{0} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} Z_{0}^{T} Q) + 2 \, \mathrm{tr} (G^{(2) +} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} Z_{0}^{T} Q) . \end{aligned} \tag{151}$$

In light of Equation (148), the fourth term on the rightmost side of Equation (151) vanishes:

$$2 \, \mathrm{tr} (G^{(2) +} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} Z_{0}^{T} Q) = 2 \, \mathrm{tr} (Q G^{(2) +} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} Z_{0}^{T}) = 0. \tag{152}$$

To derive this equation, we have used the following equation for arbitrary matrices $A$ , $B$ , and $C$ :

$$\mathrm{tr} (A B C) = \mathrm{tr} (C A B) = \mathrm{tr} (B C A) . \tag{153}$$

Second, let us substitute Equation (150) into Equation (151); then the second term on the rightmost side of Equation (151) can be written as

$$\begin{aligned} 2 \, \mathrm{tr} ((G^{(2) +} + Q Z_{0}) V_{m}^{(12) T} Y^{T} P^{T}) &= 2 \, \mathrm{tr} (P^{T} (G^{(2) +} + Q Z_{0}) V_{m}^{(12) T} Y^{T}) \\ &= 2 \, \mathrm{tr} ({(I - Z_{1} G^{(2)})}^{T} Q (G^{(2) +} + Q Z_{0}) V_{m}^{(12) T} Y^{T}) \\ &= 2 \, \mathrm{tr} ({(I - Z_{1} G^{(2)})}^{T} Q Z_{0} V_{m}^{(12) T} Y^{T}) \\ &= 2 \, \mathrm{tr} (Q Z_{0} V_{m}^{(12) T} Y^{T} {(I - Z_{1} G^{(2)})}^{T}), \end{aligned} \tag{154}$$

where Equations (153), (49), (148), and (147) are used.

Finally, Equation (151) is expressed as a function of $Y$ , $Z_{0}$ , and $Z_{1}$ :

$$\begin{aligned} \mathrm{tr} (M_{\mathrm{MLEA}}^{*}) - \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}}) ={} & \mathrm{tr} (Q (I - Z_{1} G^{(2)}) Y D Y^{T} {(I - Z_{1} G^{(2)})}^{T} Q) \\ & + 2 \, \mathrm{tr} (Q Z_{0} V_{m}^{(12) T} Y^{T} {(I - Z_{1} G^{(2)})}^{T}) \\ & + \mathrm{tr} (Q Z_{0} V_{m}^{(12) T} D^{- 1} V_{m}^{(12)} Z_{0}^{T} Q) . \end{aligned} \tag{155}$$

Note that the right-hand side of this equation is a quadratic form in $Y$ , $Z_{0}$ , and $Z_{1}$ , and that it can be factorized. That is, we can rewrite Equation (155) as

$$\mathrm{tr} (M_{\mathrm{MLEA}}^{*}) - \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}}) = \mathrm{tr} \bigl( (Q (I - Z_{1} G^{(2)}) Y D + Q Z_{0} V_{m}^{(12) T}) D^{- 1} {(Q (I - Z_{1} G^{(2)}) Y D + Q Z_{0} V_{m}^{(12) T})}^{T} \bigr) . \tag{156}$$

By expanding the right-hand side of Equation (156), we can confirm that it is equal to the right-hand side of Equation (155). As mentioned in Section 2.1.3, for an $n \times n$ matrix $A$ and an $n \times m$ matrix $B$ , if $A$ is symmetric and nonnegative definite, $B^{T} A B$ is a symmetric nonnegative definite matrix. Moreover, for an arbitrary symmetric nonnegative definite matrix $C$ , $\mathrm{tr} (C) \geq 0$ holds (e.g. Corollary 14.7.3 in reference [13]). Recalling that $D^{- 1}$ is symmetric and nonnegative definite, we find that Equation (135) holds for arbitrary matrices $Y$ , $Z_{0}$ , and $Z_{1}$ . Q.E.D.
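The expansion of Equation (156) invoked in the proof can be written out explicitly. As a sketch, introduce the shorthand symbols $X$ and $U$ for the two terms inside the parentheses of Equation (156); they are not part of the original notation and are used only here:

$$X \equiv Q (I - Z_{1} G^{(2)}) Y, \qquad U \equiv Q Z_{0} V_{m}^{(12) T} .$$

Since $D$ is symmetric,

$$\begin{aligned} \mathrm{tr} ((X D + U) D^{- 1} {(X D + U)}^{T}) &= \mathrm{tr} (X D X^{T}) + \mathrm{tr} (X U^{T}) + \mathrm{tr} (U X^{T}) + \mathrm{tr} (U D^{- 1} U^{T}) \\ &= \mathrm{tr} (X D X^{T}) + 2 \, \mathrm{tr} (U X^{T}) + \mathrm{tr} (U D^{- 1} U^{T}), \end{aligned}$$

where $\mathrm{tr} (X U^{T}) = \mathrm{tr} (U X^{T})$ follows from Equation (52). The terms $\mathrm{tr} (X D X^{T})$ and $\mathrm{tr} (U D^{- 1} U^{T})$ are the first and third terms of Equation (155), and the cross term reduces to the second term through Equations (153) and (147):

$$\mathrm{tr} (U X^{T}) = \mathrm{tr} (Q Z_{0} V_{m}^{(12) T} Y^{T} {(I - Z_{1} G^{(2)})}^{T} Q) = \mathrm{tr} (Q^{2} Z_{0} V_{m}^{(12) T} Y^{T} {(I - Z_{1} G^{(2)})}^{T}) = \mathrm{tr} (Q Z_{0} V_{m}^{(12) T} Y^{T} {(I - Z_{1} G^{(2)})}^{T}) .$$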

4.3.3. Numerical verification

The correctness of Equation (156) derived in the proof was verified by numerical calculations using a simplified case ( $n^{(1)} = 5$ , $n^{(2)} = 3$ , $n_{a} = 36$ ). In this numerical verification, random numbers were generated to determine the arbitrary matrices $Y$ , $Z_{0}$ , and $Z_{1}$ ; and the right-hand side of Equation (156) was evaluated. Subsequently, $G_{0}^{(2) -}$ and $G_{1}^{(2) -}$ were determined by using Equations (145) and (149); and the left-hand side of Equation (156) was obtained by using Equation (6). Finally, we confirmed that the left-hand side of Equation (156) agreed well with the right-hand side of Equation (156).
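A check of this kind can be sketched in a few lines of NumPy. The following is not the authors' script: the matrices $G^{(1)}$, $G^{(2)}$, $M$, $D$, and $V_{m}^{(12)}$ are random stand-ins (with $M$ and $D$ made symmetric positive definite), only the sizes are taken from the simplified case, and the helper names `random_spd` and `trace_Mx` are hypothetical. Because Equation (156) is an algebraic identity, it should hold for such stand-ins as well.

```python
import numpy as np

rng = np.random.default_rng(2)
n1, n2, na = 5, 3, 36            # sizes of the simplified case

def random_spd(n):
    """Random symmetric positive definite matrix (illustrative stand-in)."""
    a = rng.standard_normal((n, n))
    return a @ a.T + n * np.eye(n)

G1 = rng.standard_normal((n1, na))    # G^(1)
G2 = rng.standard_normal((n2, na))    # G^(2)
M = random_spd(na)                    # stand-in for the nuclear data covariance M
D = random_spd(n1)                    # stand-in for D (assumption, not its true definition)
V12 = rng.standard_normal((n1, n2))   # V_m^(12)
Y = rng.standard_normal((na, n1))     # arbitrary matrices Y, Z_0, Z_1
Z0 = rng.standard_normal((na, n2))
Z1 = rng.standard_normal((na, n2))

I = np.eye(na)
G2_pinv = np.linalg.pinv(G2)
Q = I - G2_pinv @ G2                  # Equation (146)
G0_inv = G2_pinv + Q @ Z0             # Equation (145)
G1_inv = G2_pinv + Q @ Z1             # Equation (149)
D_inv = np.linalg.inv(D)

def trace_Mx(B):
    """Trace of the adjusted covariance of Equation (6) for a given B_x."""
    Mx = M + B @ D_inv @ B.T - M @ G1.T @ D_inv @ B.T - B @ D_inv @ G1 @ M
    return np.trace(Mx)

B_star = M @ G1.T + G0_inv @ V12.T + (I - G1_inv @ G2) @ Y @ D   # Equation (127)
B_min = M @ G1.T + G2_pinv @ V12.T                               # Equation (134)
lhs = trace_Mx(B_star) - trace_Mx(B_min)      # left-hand side of Equation (156)

W = Q @ (I - Z1 @ G2) @ Y @ D + Q @ Z0 @ V12.T
rhs = np.trace(W @ D_inv @ W.T)               # right-hand side of Equation (156)

print(lhs, rhs)
```

Setting `Y`, `Z0` to zero matrices makes `W` vanish, so both sides become zero, which corresponds to the special case $B_{\mathrm{MLEA}}^{\mathrm{min}}$.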

Moreover, Figures 1 and 2 show another calculation result of the difference of traces, namely $\mathrm{tr} (M_{\mathrm{MLEA}}^{*}) - \mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}})$ , for the simplified case. In this calculation, we defined $Y = y J$ and $Z_{0} = Z_{1} = z J$ , where $y$ and $z$ are scalar variables, and $J$ represents the all-ones matrix, in which all elements are unity. Thus, by using $y$ and $z$ , we changed $Y$ and $Z_{0} (= Z_{1})$ around the zero matrix (i.e. around $y = z = 0$ ). From these figures, we can see that the difference of traces becomes zero when $Y = O$ and $Z_{0} = O$ (i.e. $G_{0}^{(2) -} = G^{(2) +}$ ). That is, $\mathrm{tr} (M_{\mathrm{MLEA}}^{\mathrm{min}})$ has the minimum value among $\mathrm{tr} (M_{\mathrm{MLEA}}^{*})$ .

Figure 1. Difference of the traces of $M_{M L E A}^{*}$ and $M_{M L E A}^{m i n}$ as a function of $Y (= y J)$ and $Z_{0} (= Z_{1} = z J)$ within the ranges of $0 \leq y \leq 2$ and $0 \leq z \leq 4$ .


Figure 2. Difference of the traces of $M_{M L E A}^{*}$ and $M_{M L E A}^{m i n}$ as a function of $Y (= y J)$ and $Z_{0} (= Z_{1} = z J)$ within the ranges of $- 2 \leq y \leq 0$ and $- 4 \leq z \leq 0$ .


4.4. Role of assumption of normal distribution

With regard to MLCA, it was found that the formulation equivalent to CBCA can be derived by introducing the assumption of linear estimation expressed by Equation (69). Meanwhile, the same formula can be derived by using other assumptions, that is, those of MSCA and DRCA. For this reason, we examine the assumptions necessary for deriving the formulation equivalent to CBCA. Table 1 summarizes the assumptions used in the derivation of the CAs, namely CBCA, MRCA, MSCA, DRCA, and MLCA.

Table 1. Assumptions used in the derivation of the conventional cross-section adjustment methods

Note that the assumption of normal distribution includes an assumption that the mean value is equal to the true value. Meanwhile, the mean value of the normal distribution is equal to the expectation value. Hence, we can understand that the assumption of normal distribution includes the assumption of unbiased estimation. Moreover, it should be noted that, when we assume the linear estimation of Equation (70), it is necessary to introduce the assumption of minimum norm solution:

$$\hat{T} = T_{0} + G^{(1) +} F^{'} (R_{e}^{(1)} - R_{c}^{(1)} (T_{0})), \tag{157}$$

where $F^{'} \in R^{n^{(1)} \times n^{(1)}}$ is another linear combination factor (cf. Equation (40) in reference [9]). Note that the dimension of $F^{'}$ differs from that of $F$ used in the present derivation. We can see that both Equations (157) and (69) include a concept of dimensionality expansion in the case that $n_{a} > n^{(1)}$ : the former expands the dimension by $G^{(1) +} \in R^{n_{a} \times n^{(1)}}$ , and the latter by $F \in R^{n_{a} \times n^{(1)}}$ . Thus, we can deduce that either the projection simplification or the dimensionality reduction is required to compensate for the dimensionality expansion introduced by the assumption of minimum norm solution.
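Equation (157) uses the Moore–Penrose pseudoinverse $G^{(1) +}$, which selects the minimum norm solution of an underdetermined linear system and thereby expands an $n^{(1)}$-dimensional quantity to $n_{a}$ dimensions. A minimal NumPy sketch of that property follows; the matrix and the right-hand side are random stand-ins, and the variable names are illustrative rather than taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(3)
n1, na = 5, 36                          # n_a > n^(1): underdetermined problem

G1 = rng.standard_normal((n1, na))      # stand-in for G^(1)
r = rng.standard_normal(n1)             # stand-in for F'(R_e^(1) - R_c^(1)(T_0))

G1_pinv = np.linalg.pinv(G1)            # G^(1)+, shape (n_a, n^(1))
t_min = G1_pinv @ r                     # minimum norm solution of G1 t = r

# Any other solution differs from t_min by a null-space component of G1.
null_projector = np.eye(na) - G1_pinv @ G1
t_other = t_min + null_projector @ rng.standard_normal(na)

print(np.linalg.norm(t_min), np.linalg.norm(t_other))
```

Both `t_min` and `t_other` satisfy `G1 @ t = r`, but `t_min` has the smaller Euclidean norm because the added null-space component is orthogonal to it.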

Moreover, from this table, it is seen that the assumption of normal distribution adopted in CBCA is equivalent to any one of the following sets of assumptions:

unbiased estimation, linear estimation of Equation (70), minimum norm solution, and projection simplification;
unbiased estimation, linear estimation of Equation (70), minimum norm solution, and dimensionality reduction;
unbiased estimation and linear estimation of Equation (69).

In summary, when we apply CBCA under the condition of the underdetermined problem, the assumption of normal distribution plays an important role to expand properly the information of the integral experimental quantities to the adjusted cross-section set.

4.5. Comparison with derivation based on Bayes theorem

From the derivations of MLCA, MLEA, MLRA, and MLEB, we can deduce that the assumptions of linear estimation expressed by Equations (69)–(71) correspond to the conditional probabilities maximized in derivations based on the Bayes theorem:

$$P (\hat{T} | R_{e}^{(1)}), \tag{158}$$

$$P ({\hat{R}}^{(1)} | R_{e}^{(1)}), \tag{159}$$

and

$$P ({\hat{R}}^{(2)} | R_{e}^{(1)}), \tag{160}$$

respectively, where $P (A | B)$ denotes the probability distribution of $A$ under the condition that $B$ is given. In fact, Equation (158) is used explicitly in the derivation of CBCA [1,4], and Equation (160) is utilized in the derivation of CBEA [8] in the modified form:

$$P (R_{c}^{(2)} (\hat{T}) | R_{e}^{(1)}) . \tag{161}$$

Besides CBEA, many new techniques based on the Bayes theorem have been proposed recently (e.g. [16–18]). Hence, a detailed comparison with these techniques is left as a future task. The foregoing correspondence between the linear estimations and the conditional probabilities, however, will be useful for comparing our methodologies with other techniques based on the Bayes theorem.

5. Conclusions

In comparison with the classical Bayesian conventional cross-section adjustment (CBCA) method, the derivation of the Kalman filter was reviewed. This review revealed that CBCA is equivalent to a linearized model of the nonlinear Kalman filter, while the assumption of linear estimation used in the cross-section adjustment methodology differs from that in the Kalman filter. For this reason, by introducing the assumption used in the Kalman filter, we formulated the conventional, extended, and regressive cross-section adjustment methods (CA, EA, and RA) with no use of the assumption of normal distribution; these methods are called the conventional, extended, and regressive cross-section adjustment methods based on minimum variance unbiased linear estimation (MLCA, MLEA, and MLRA), respectively. Consequently, we found that MLCA has the formulation equivalent to CBCA. From these findings, we discussed the role of the assumption of normal distribution adopted in CBCA. Moreover, we demonstrated that the extended bias factor (EB) method can be formulated with the same procedure; this method is referred to as MLEB. The derivation of MLEB is simple, and would be helpful to understand the difference between EB and EA.

With regard to MLEA, we derived a generalized formulation that can express all of the infinitely many solutions minimizing the variance of the design target core parameters. The formulation of MLEA includes a special case identical to the classical Bayesian extended cross-section adjustment (CBEA) method; when we adopt the special case, the variance of the adjusted cross-section set is also minimized. Thus, we can interpret that CBEA minimizes not only the variance of the design target core parameters but also the variance of the nuclear data. For this reason, we recommend using the special case of MLEA, which is equivalent to CBEA, from a practical point of view.

On the other hand, in our review, we showed that CBCA is a type of Kalman filter. Since many enhanced techniques based on the Kalman filter have been proposed in other fields, we expect that applying these techniques will lead to further improvement in the cross-section adjustment methodology. Moreover, MLEA itself is an enhancement of the Kalman filter. Therefore, MLEA would be applicable in various fields other than nuclear reactor design.

Abbreviations

derivation procedures
CB : classical Bayesian (inference)
MVUE : minimum variance unbiased estimation
MVULE : minimum variance unbiased linear estimation
variations of cross-section adjustment methodology
CA : conventional cross-section adjustment method
EA : extended cross-section adjustment method
RA : regressive cross-section adjustment method
CAs
CBCA : classical Bayesian conventional cross-section adjustment method
MRCA : MVUE-based rigorous conventional cross-section adjustment method
MSCA : MVUE-based simplified conventional cross-section adjustment method
MLCA : MVULE-based conventional cross-section adjustment method
DRCA : dimension-reduced conventional cross-section adjustment method
EAs
CBEA : classical Bayesian extended cross-section adjustment method
MREA : MVUE-based rigorous extended cross-section adjustment method
MSEA : MVUE-based simplified extended cross-section adjustment method
MLEA : MVULE-based extended cross-section adjustment method
RA
MLRA : MVULE-based regressive cross-section adjustment method
extended bias factor methods
EB : (original) extended bias factor method
MLEB : MVULE-based extended bias factor method

Acknowledgments

The authors express their deep gratitude to the coauthors of references [8–10], since the present study grew out of discussions with them throughout that series of studies. The authors are also grateful to Dr S. Takeda of Osaka University for useful comments on the present study. K. Yokoyama wishes to extend his special thanks to Mr M. Ishikawa of JAEA for his great encouragement and for helpful discussions based on his expertise.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

[1] Dragt JB. Statistical considerations on techniques for adjustment of differential cross sections with measured integral parameters. In: STEK, The fast-thermal coupled facility of RCN at Petten. RCN-122. Petten: Reactor Centrum Nederland; 1970. p. 85–105.
[2] Gandini A, Petilli M, Salvatores M. Nuclear data and integral measurement correlation for fast reactors. Statistical formulation and analysis of methods. The consistent approach. International Symposium of Physics of Fast Reactors, 1973 October 16–19. Tokyo, Japan; Vol. 1. p. 612–628.
[3] Dragt JB, Dekker JWM, Gruppelaar H, et al. Methods of adjustment and error evaluation of neutron capture cross sections; application to fission product nuclides. Nucl Sci Eng. 1977;62:117–129.
[4] Takeda T, Yoshimura A, Kamei T. Prediction uncertainty evaluation methods of core performance parameters in large liquid-metal fast breeder reactors. Nucl Sci Eng. 1989;103:157–165.
[5] Salvatores M, Palmiotti G, Aliberti G, et al. Methods and issues for the combined use of integral experiments and covariance data: results of a NEA international collaborative study. Nuclear Data Sheets. 2014;118:38–71.
[6] de Saint-Jean C, Dupont E, Ishikawa M, et al. Assessment of existing nuclear data adjustment methodologies. Paris: OECD/NEA; 2011. NEA/NSC/WPEC/DOC(2010)429.
[7] Kugo T, Mori T, Takeda T. Theoretical study on new bias factor methods to effectively use critical experiments for improvement of prediction accuracy of neutronic characteristics. J Nucl Sci Technol. 2007;44(12):1509–1517.
[8] Yokoyama K, Ishikawa M, Kugo T. Extended cross-section adjustment method to improve the prediction accuracy of core parameters. J Nucl Sci Technol. 2012;49(12):1165–1174.
[9] Yokoyama K, Yamamoto A. Cross-section adjustment methods based on minimum variance unbiased estimation. J Nucl Sci Technol. 2016;53(10):1622–1638.
[10] Yokoyama K, Yamamoto A, Kitada T. Dimension-reduced cross-section adjustment methods based on minimum variance unbiased estimation. J Nucl Sci Technol. 2018;55(3):319–334.
[11] Kalman RE. A new approach to linear filtering and prediction problems. Trans ASME J Basic Eng. 1960;82:35–45.
[12] Kalman RE, Bucy RS. New results in linear filtering and prediction theory. Trans ASME J Basic Eng. 1961;83:95–108.
[13] Harville DA. Matrix algebra from a statistician’s perspective. 1st ed. New York (NY): Springer; 1997.
[14] Brown RG, Hwang PYC. Introduction to random signals and applied Kalman filtering. 4th ed. Hoboken (NJ): John Wiley & Sons, Inc; 2012.
[15] Palmiotti G, Salvatores M, Yokoyama K, et al. Methods and approaches to provide feedback from nuclear and covariance data adjustment for improvement of nuclear data files. Paris: OECD/NEA; 2017. NEA/NSC/WPEC/DOC(2016)6.
[16] Cacuci DG, Ionescu-Bujor M. Best-estimate model calibration and prediction through experimental data assimilation – I: mathematical framework. Nucl Sci Eng. 2010;165:18–44.
[17] Hoefer A, Buss O, Hennebach M, et al. MOCABA: a general Monte Carlo-Bayes procedure for improved predictions of integral functions of nuclear data. Ann Nucl Energy. 2015;77:514–521.
[18] Endo T, Yamamoto A, Watanabe T. Bias factor method using random sampling technique. J Nucl Sci Technol. 2016;53:1494–1501.
