
The uncertainty analysis in linear and nonlinear regression revisited: application to concrete strength estimation

Pages 1740-1764 | Received 04 Jul 2018, Accepted 21 Nov 2018, Published online: 13 Dec 2018

ABSTRACT

Regression is a common technique in engineering when physical laws are unknown. Practitioners usually look for a unique set of true parameters that optimally explain the observed data. This is, for instance, the case in concrete strength estimation, where engineers have long looked for a universal law to estimate this magnitude. We show that this approach is incorrect if the uncertainty of the regression problem is not properly taken into account. The uncertainty analysis of linear regression problems is revisited, providing an analytical expression for the direction of maximum uncertainty, along which most of the models are sampled when partial information is used. We also analyse the case of 1D nonlinear regression models (exponential and potential models) and the multivariate case. We show a simple way of sampling the posterior distribution of the model parameters by performing least-squares fits of different data bags (bootstrap), introduce percentile curves for concrete strength estimation, and compare the results obtained from the linearized and nonlinear bootstrap procedures in the case of the nonlinear regression models. The methodology introduced in this paper constitutes a robust and simple way of assessing the intrinsic uncertainty of these well-known parameter identification problems and of adopting more robust decisions.


1. Introduction

Inverse problems constitute a discipline of applied mathematics with many applications in science and engineering. Most inverse problems in geosciences can be written in discrete form $F(\mathbf{m})=\mathbf{d}$. In this relationship, $\mathbf{m}=(m_1,m_2,\ldots,m_n)\in M\subset\mathbb{R}^n$ is the estimated model (or model parameters), belonging to a set of admissible models $M$ defined in terms of some prior knowledge, $\mathbf{d}\in\mathbb{R}^s$ are the observed data, and $F(\mathbf{m})=(f_1(\mathbf{m}),f_2(\mathbf{m}),\ldots,f_s(\mathbf{m}))$ is the vector field representing the forward model, $f_j(\mathbf{m})$ being the scalar field component of $F$ accounting for the $j$-th datum.

The inverse problem consists in finding the models $\mathbf{m}$ whose predictions $F(\mathbf{m})$ accurately match the observed data $\mathbf{d}$. Inverse problems are ill-posed: either the problem does not admit a solution, or the solution is not unique, or it is unstable, that is, the solution does not depend continuously on the observed data (ill-conditioned problem). Uncertainty analysis consists in sampling the family of models $\mathbf{m}\in\mathbb{R}^n$ that fit the observed data $\mathbf{d}\in\mathbb{R}^s$ within the same error bounds, $\|F(\mathbf{m})-\mathbf{d}\|_2<tol$, and are compatible with the prior information at our disposal, if any. These models are called equivalent.

It has been shown that the topography of the data error cost function corresponds to a straight, flat, elongated valley if the inverse problem is linear, whereas in the nonlinear case the cost function topography consists of one or more curvilinear valleys (or basins) of low misfit, eventually connected by saddle points [Citation1,Citation2]. Besides, in the linear case, the equivalent models belong to a hyperquadric whose axes and orientations are related to the ill-conditioning of the system matrix. This theoretical development complements the stochastic approach to inverse problems originally proposed by Tarantola and Valette [Citation3,Citation4], showing that the uncertainty region of linear and nonlinear inverse problems has a mathematical structure that is embedded in the forward physics $F$ and also in the observed data. In addition, the noise in the data perturbs the least-squares solution and the size of the linear and nonlinear uncertainty regions for the same value of the data misfit (see Fernández Martínez et al. [Citation21,Citation22] for more details). Therefore, uncertainty analysis and model appraisal is always a necessary step in inversion and parameter identification problems (see for instance [Citation5,Citation6]). Despite its importance, these results are barely known in the engineering community.

2. Problem statement in the application domain of non-destructive concrete strength estimation

The research presented in this paper has been practically inspired by the need of concrete specialists to have a reliable methodology for assessing concrete strength from non-destructive measurements. Assessing the concrete strength of existing buildings is a common engineering activity which is driven, among other reasons, by the need for structural seismic retrofitting, where the load capacity must be checked or improved [Citation7].

Strength evaluation based on non-destructive measurements is justified by the facts that: (a) taking cores to assess strength directly from these samples is expensive and time-consuming, (b) coring can induce some damage in the structures and sometimes even put them at risk, and (c) non-destructive test results are supposed to be correlated with concrete strength. This correlation has been justified on physical grounds for several non-destructive techniques, among which the velocity of ultrasonic waves, which is directly correlated with the elastic rigidity of the material, and the rebound number, measured after the impact of a steel ball on the concrete surface, which is correlated with the material hardness [Citation8]. However, this correlation is not perfect, since many other parameters can influence the relation between the non-destructive test result and the strength (humidity, type of aggregates, carbonation …).

Therefore, it is required to identify an empirical 'conversion' (or regression) model, which makes it possible to derive strength estimates from the non-destructive test results, for instance rebound measurements [Citation9] or ultrasonic pulse velocity measurements [Citation10]. The researchers in this engineering field, as in other similar fields (forestry, petroleum exploration, mining, etc.), are usually looking for empirical laws that relate these parameters. The same methodology also applies to other construction materials, such as masonry [Citation11]. The common idea is that, once a conversion model $y=M(x)$ has been identified between a measured property $x$ (typically the result of a non-destructive technique) and a required target $y$ (typically the material strength), this mathematical model can be used to estimate $y$ for new values of $x$ where the value of $y$ has not been measured.

The different steps of the conversion model identification have been recently described in detail in Breysse et al. [Citation12]. Linear empirical relationships are commonly established, and the researchers in this field are often looking for a hypothetical universal relationship [Citation8]. The accuracy of the assessed strength highly depends on the accuracy and relevancy of the prediction model. Relevancy means that the model must be used in a context corresponding to that in which it has been validated. The accuracy of the model can be quantified through the goodness of fit between the predicted strengths and the true strength values, for instance by quantifying the root mean squared error (RMS error). Thus, a key issue is the identification of adequate model parameters. The number of model parameters depends on the mathematical expression of the model; it is often two for monovariate models and three for bivariate models. If one takes, for instance, the example of the prediction of concrete strength $r$ from rebound hammer measurements $R$, a literature review has shown that a variety of mathematical expressions are used by researchers [Citation8], the most common ones being linear models ($r(R)=a_0+a_1R$ [Citation13]), power models ($r(R)=a_0R^{a_1}$ [Citation14]) and exponential models ($r(R)=a_0e^{a_1R}$ [Citation9]). A similar statement can be made for prediction models based on other non-destructive techniques, like the ultrasonic pulse velocity $V$, replacing the rebound hammer value $R$ in the above expressions [Citation15]. Whatever the case, the identification problem comes down to finding the best estimates $a_0^{est}$ and $a_1^{est}$ of the two model parameters. Bilinear estimators of the kind $r(R,V)=a_0+a_1R+a_2V$ are also used in common practice [Citation16] in order to try to reduce the uncertainty in the estimation of the concrete strength by introducing different variables correlated with the output variable $r$. For any given dataset made of $N$ triplets $(r_i,V_i,R_i)$, $i=1,\ldots,N$, these estimates will correspond to the minimization of the RMS error referred to above.

Despite the huge number of research works in the domain of non-destructive testing applied to concrete, some important issues justify the original developments proposed in this paper. There is no consensus between experts about how a conversion model must be chosen and/or calibrated, and various options can be considered [Citation17], while their advantages and drawbacks have been analysed only very recently [Citation18].

The common engineering practice is unfortunately limited to the model calibration stage, i.e. to the identification of the estimates of the model parameters, without considering the effective ability of the model to estimate strength at locations which have not been used during the parameter identification stage. In other words, only the fitting error is quantified, and the effective prediction uncertainty remains unknown. An extensive literature review has shown that: (a) such a universal relationship does not exist; (b) when analysing existing relations, a trade-off between the model parameters is revealed as a consistent pattern [Citation19]. This trade-off has been explained by the fact that the model estimates can be viewed as the solution of an inverse problem with a given uncertainty space, which causes each single identification problem to lead to new estimates. Besides, when looking at how this problem is commonly treated, several factors exist which have a significant influence on the quality of the regression model, and therefore on the magnitude of the prediction error on strength. These factors have been recently listed in Alwash et al. [Citation20], and include the dataset size, the magnitude of the data uncertainty, and the range of variation of the parameter to be identified. The objective of this paper is to further analyse the identification stage in order to understand how these factors influence the uncertainty on strength estimates. On this basis, concrete experts will be able to justify a more robust methodology for on-site concrete strength estimation with NDT measurements, and to properly consider the uncertainty attached to this estimate.

In this paper, we revisit the uncertainty in linear regression using linear algebra, providing an analytical expression for the direction of maximum uncertainty, along which most of the models of the linear regression problem are sampled when partial information is used to perform the regression. In the linear case, this direction implies a trade-off between the slope and the y-intercept of the least-squares regression line. We show that, taking this trade-off into account, it is possible to solve a one-parameter regression problem where the slope is the unique parameter to be identified, and its solution coincides with the regression line found via least-squares if first-order approximations are considered. This analysis is also performed for two different 1D nonlinear regression models (exponential and potential models) and for the multivariate case (bilinear). We illustrate this analysis with a practical example consisting of concrete strength assessment by means of non-destructive measurements, showing the existing trade-off among the parameters of these empirical relationships between the Non-Destructive Technique (NDT) measurement and the concrete strength. We show that the use of partial data information (sampling of the data space) greatly influences the model parameters, which are inverted along the direction of maximum anisotropy. Therefore, no universal relationship can be found via linear regression. In conclusion, the problem does not reside in finding the least-squares solution, which coincides with the maximum likelihood solution in Bayesian approaches, but in appraising this solution in order to quantify the uncertainty introduced by partial data knowledge, noise in data gathering and/or wrong modelling assumptions, since the correct regression model is a priori unknown and is in any case a simplification of reality.

3. The algebraic interpretation of a simple linear regression problem

In this section, we present the analytic formulas for the uncertainty region of a simple linear regression problem of the kind $y(x)=a_0+a_1x$ fitted to the experimental data table given by $\{(x_1,y_1),(x_2,y_2),\ldots,(x_m,y_m)\}$.

As is very well known, the problem consists of finding the parameters $\mathbf{m}^T=(a_0,a_1)$ so that the distance between the observed data $\mathbf{y}=(y_1,y_2,\ldots,y_m)^T$ and the corresponding predictions $\mathbf{y}^{pre}=(y_1^{pre},y_2^{pre},\ldots,y_m^{pre})^T$ is minimum according to the Euclidean distance in the data space $\mathbb{R}^m$.

The vector of predictions can be written as $\mathbf{y}^{pre}=a_0\mathbf{1}_{\mathbb{R}^m}+a_1\mathbf{x}$ and therefore belongs to the subspace of predictions $Col(F)=\langle\mathbf{1}_{\mathbb{R}^m},\mathbf{x}\rangle$, which is the column space of the forward operator $F=[\mathbf{1}_{\mathbb{R}^m}\ \ \mathbf{x}]$. The fact that the observed data do not belong to the subspace $Col(F)$ makes the linear system $F\mathbf{m}=\mathbf{y}$ have no solution. Hence, the problem is solved in the least-squares sense, that is, finding $\mathbf{m}_{LS}$ so that the prediction error $\|F\mathbf{m}_{LS}-\mathbf{y}\|_2$ is minimum. For this purpose, $F\mathbf{m}_{LS}$ must coincide with the orthogonal projection of $\mathbf{y}$ onto $Col(F)$:
(1) $F\mathbf{m}_{LS}=\mathrm{Proj}_{Col(F)}(\mathbf{y}).$
Consequently, the prediction error $E=F\mathbf{m}_{LS}-\mathbf{y}$ belongs to the null space of the adjoint operator $F^T$, $\ker(F^T)$, providing the normal equations:
(2) $F^T(F\mathbf{m}_{LS}-\mathbf{y})=\mathbf{0}\ \Rightarrow\ F^TF\,\mathbf{m}_{LS}=F^T\mathbf{y}.$

The matrix $F^TF$ is square of size 2, symmetric, and has the same rank as $F$. As a consequence, this linear system has a unique solution in the case of purely overdetermined linear systems, that is, if the number of data $m$ is greater than 2 and the rank of $F$ is 2. The least-squares solution writes:
(3) $\mathbf{m}_{LS}=\begin{pmatrix}a_{0LS}\\ a_{1LS}\end{pmatrix}=(F^TF)^{-1}F^T\mathbf{y},$
the system matrix of (2) being in this case:
(4) $F^TF=\begin{pmatrix}m & \sum_{k=1}^m x_k\\ \sum_{k=1}^m x_k & \sum_{k=1}^m x_k^2\end{pmatrix}.$

The least-squares solution can be analytically written as follows:
(5) $a_{0LS}=\mu_y-\mu_x a_{1LS},\qquad a_{1LS}=\dfrac{\mathrm{cov}(y,x)}{\mathrm{var}(x)},$
where $\mu_y$ represents the arithmetic mean of the components of the observed data $\mathbf{y}$, $\mu_x$ the arithmetic mean of $\mathbf{x}$, $\mathrm{var}(x)$ the variance of the random variable $x$, and $\mathrm{cov}(y,x)$ the covariance between the random variable $x$ and the observed data $y$. Taking formula (5) into account, the regression straight line writes:
(6) $y-\mu_y=\dfrac{\mathrm{cov}(y,x)}{\mathrm{var}(x)}(x-\mu_x).$

Matrix $F^TF$ admits orthogonal diagonalization as follows:
(7) $F^TF=VDV^T=[\mathbf{v}_1\ \mathbf{v}_2]\begin{pmatrix}\lambda_1 & 0\\ 0 & \lambda_2\end{pmatrix}\begin{pmatrix}\mathbf{v}_1^T\\ \mathbf{v}_2^T\end{pmatrix},\qquad \lambda_1,\lambda_2\in\mathbb{R},\ \lambda_1>\lambda_2>0.$
Besides, using the singular value decomposition $F=U\Sigma V^T$, the least-squares solution in expression (3) can be written as follows:
(8) $\mathbf{m}_{LS}=VD^{-1}\Sigma^TU^T\mathbf{y}=\dfrac{y_{u_1}}{\sqrt{\lambda_1}}\mathbf{v}_1+\dfrac{y_{u_2}}{\sqrt{\lambda_2}}\mathbf{v}_2,$
where $y_{u_1},y_{u_2}$ are the first two coordinates of the vector $\mathbf{y}$ in the $\{\mathbf{u}_1,\mathbf{u}_2\}$ basis. Expression (8) shows that the least-squares solution can be expanded as a linear combination of the eigenvectors of the column correlation matrix of $F$, $F^TF$. Therefore, the least-squares solution will change when a partial subset of the data (called a data bag) is considered. We will show numerically that this iterative bagging procedure serves to sample the uncertainty region of the linear regression problem, following the direction of maximum uncertainty, $\mathbf{v}_2$, associated with the smallest eigenvalue $\lambda_2$.
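As a minimal numerical illustration of expressions (3) and (8), the following Python sketch computes the least-squares solution both through the normal equations and through the SVD expansion (the synthetic data and variable names are ours, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(3640, 5510, 60)                  # synthetic 'velocities'
y = -46.0 + 0.016 * x + rng.normal(0, 2.0, 60)   # synthetic 'strengths'

F = np.column_stack([np.ones_like(x), x])        # forward operator [1 | x]

# Least-squares solution via the normal equations (2)-(3)
m_ls = np.linalg.solve(F.T @ F, F.T @ y)

# Same solution via the SVD expansion (8): m_LS = sum_k (u_k . y / s_k) v_k
U, s, Vt = np.linalg.svd(F, full_matrices=False)
m_svd = sum((U[:, k] @ y) / s[k] * Vt[k] for k in range(2))

print(m_ls, m_svd)   # both give (a0_LS, a1_LS)
```

Here the singular values satisfy $s_k=\sqrt{\lambda_k}$, so dropping part of the data changes the coordinates of $\mathbf{y}$ in the $U$ basis and hence the expansion (8), which is the mechanism exploited by the bagging procedure analysed below.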

4. The uncertainty region in linear inverse problems

The uncertainty region of the linear regression problem is:
(9) $M_{tol}=\{\mathbf{m}: \|F\mathbf{m}-\mathbf{y}\|_2\le tol\},$
and coincides with the region of the model space inside the ellipse:
(10) $\mathbf{m}^TF^TF\mathbf{m}-2\mathbf{m}^TF^T\mathbf{y}+\mathbf{y}^T\mathbf{y}=tol^2,$
which is centred in the least-squares solution and oriented in the direction of the eigenvectors of $F^TF$. Hence, referred to its principal axes (provided by the $V$ basis), this ellipse is:
(11) $\dfrac{(a_{0V}-a_{0V}^{LS})^2}{(tol/\sqrt{\lambda_1})^2}+\dfrac{(a_{1V}-a_{1V}^{LS})^2}{(tol/\sqrt{\lambda_2})^2}=1.$
A theoretical analysis of the uncertainty in linear and nonlinear inverse problems is shown in Fernández Martínez et al. [Citation1,Citation2]. Besides, the effect of noise and that of regularization is provided in Fernández Martínez et al. [Citation21,Citation22], showing how noise affects the size of the uncertainty region in linear and nonlinear inverse problems, and the effect of regularization in stabilizing the inversion through local optimization methods. Nevertheless, despite the regularization, the equivalent models do not vanish. Therefore, uncertainty analysis in linear and nonlinear inverse problems for model quality assessment is always needed.

The statistical approach to uncertainty in linear inverse problems can be consulted, for instance, in Aster et al. [Citation23]; it is based on the fact that the squared data prediction error $\|F\mathbf{m}-\mathbf{y}\|_2^2$ follows a $\chi^2$ distribution with $m-n$ degrees of freedom. That way, a p-value test can be performed to analyse whether the least-squares solution produces an acceptable data fit and whether the statistical assumptions on the data errors are consistent. Wrong models produce extraordinarily small p-values (e.g. 10−12). If the p-value is close to 1, the fit of the model predictions to the data is almost exact. This approach also allows us to find the confidence regions, which are related to the region of uncertainty stated in Fernández Martínez et al. [Citation1]. In addition to this, it is possible to show that $F^TF$ is the inverse of the posterior covariance matrix of the model parameters (see for instance [Citation23]). An alternative approach consists in solving the orthogonal distance regression problem, as shown in Mandel [Citation24] and Cruz de Oliveira and Fernandes de Aguiar [Citation25].
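A hedged sketch of this p-value test (it assumes independent Gaussian data errors with a known standard deviation `sigma`, which is our assumption; the paper only cites the test):

```python
import numpy as np
from scipy import stats

def misfit_p_value(F, y, m_ls, sigma):
    """p-value of the chi-square test on the squared, weighted data misfit.

    Assumes independent Gaussian errors of known std `sigma`, so that
    ||F m - y||^2 / sigma^2 follows a chi2 law with m - n degrees of freedom.
    """
    m, n = F.shape
    chi2_obs = np.sum((F @ m_ls - y) ** 2) / sigma ** 2
    return stats.chi2.sf(chi2_obs, df=m - n)   # extremely small => wrong model
```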

Continuing with the linear algebra approach of the simple linear regression problem, let us call:
(12) $S=\sum_{k=1}^m x_k=m\mu_x,\qquad S_c=\sum_{k=1}^m x_k^2=m\mu_{x^2};$
then the matrix
(13) $F^TF=\begin{pmatrix}m & S\\ S & S_c\end{pmatrix}$
has the following eigenvalues:
(14) $\lambda_{1,2}=\dfrac{(S_c+m)\pm\sqrt{(S_c-m)^2+4S^2}}{2}=m\,\dfrac{\mu_{x^2}+1\pm\sqrt{(\mu_{x^2}-1)^2+4\mu_x^2}}{2}.$
The corresponding eigenvectors $\mathbf{v}_i\in\ker(F^TF-\lambda_i I_2)=\langle(S,\lambda_i-m)\rangle$ are orthogonal, since they are associated with different eigenvalues of a symmetric matrix, and the following relationship holds:
(15) $S^2+(\lambda_i-m)(S_c-\lambda_i)=0.$

The region of uncertainty for an absolute error value $tol$ is the ellipse with minor axis (minimum uncertainty) in the direction of $\mathbf{v}_1=(S,\lambda_1-m)$ and semi-length $tol/\sqrt{\lambda_1}$, and major axis (maximum uncertainty) in the direction of $\mathbf{v}_2=(S,\lambda_2-m)$ and semi-length $tol/\sqrt{\lambda_2}$.

Both eigenvalues can be approximated using Taylor's first-order approximation as follows:
(16) $\tilde\lambda_1=\dfrac{(S_c+m)+S_c\left(1-\dfrac{m}{S_c}+\dfrac{m^2}{2S_c^2}+\dfrac{2S^2}{S_c^2}\right)}{2}=m\left(\mu_{x^2}+\dfrac{\mu_x^2}{\mu_{x^2}}+\dfrac{1}{4\mu_{x^2}}\right),$
(17) $\tilde\lambda_2=\dfrac{(S_c+m)-S_c\left(1-\dfrac{m}{S_c}+\dfrac{m^2}{2S_c^2}+\dfrac{2S^2}{S_c^2}\right)}{2}=m\left(1-\dfrac{\mu_x^2}{\mu_{x^2}}-\dfrac{1}{4\mu_{x^2}}\right).$

The condition number and its approximate value are:
(18) $\kappa=\dfrac{\lambda_1}{\lambda_2}=\dfrac{S_c^2+m^2+2S^2+(S_c+m)\sqrt{(S_c-m)^2+4S^2}}{2(mS_c-S^2)}\approx\dfrac{\tilde\lambda_1}{\tilde\lambda_2}=\dfrac{\mu_{x^2}^2+\mu_x^2+\frac14}{\mu_{x^2}-\mu_x^2-\frac14}.$
Furthermore, the equations of the axes of the ellipse of uncertainty are:

  1. Axis of minimum uncertainty (associated with the largest eigenvalue): (19) $a_1-a_{1LS}=p_1(a_0-a_{0LS}).$

  2. Axis of maximum uncertainty (associated with the smallest eigenvalue): (20) $a_1-a_{1LS}=p_2(a_0-a_{0LS}).$

The exact and simplified slope values of the axes of uncertainty are:
(21) $p_1=\dfrac{\lambda_1-m}{S},\quad \tilde p_1=\dfrac{S_c-m}{S}\approx\dfrac{S_c}{S}=\dfrac{\mu_{x^2}}{\mu_x};\qquad p_2=\dfrac{\lambda_2-m}{S},\quad \tilde p_2=-\dfrac{S}{S_c}=-\dfrac{\mu_x}{\mu_{x^2}}.$
This simplification holds if $S_c\gg S$ and $S\gg m$.

This is an interesting result, since the slope of the axis of the ellipse of uncertainty is mainly controlled by the ratio of two terms of the matrix $F^TF$. The fact that $\lambda_2\ll\lambda_1$ causes the uncertainty region to be elongated in the direction of the eigenvector $\mathbf{v}_2$. Most of the equivalent models are sampled along this axis, and they are therefore related as follows:
(22) $a_1=a_{1LS}+p_2(a_0-a_{0LS})=a_{1LS}-p_2a_{0LS}+p_2a_0.$
Taking into account relationships (5) and (22), we reach:
(23) $a_{1LS}-a_{0LS}p_2=a_{1LS}(1+p_2\mu_x)-p_2\mu_y=\dfrac{\mathrm{cov}(y,x)}{\mathrm{var}(x)}\left(1-\dfrac{S}{S_c+\frac{m^2}{4S_c}}\mu_x\right)+\dfrac{S}{S_c+\frac{m^2}{4S_c}}\mu_y\approx\dfrac{\mathrm{cov}(y,x)}{\mathrm{var}(x)}\left(1-\dfrac{S}{S_c}\mu_x\right)+\dfrac{S}{S_c}\mu_y,$
and
(24) $p_2a_0\approx-\dfrac{S}{S_c+\frac{m^2}{4S_c}}\,a_0\approx-\dfrac{S}{S_c}\,a_0.$
In a first approximation, the equation of the axis of maximum uncertainty is:
(25) $a_1\approx\dfrac{\mathrm{cov}(y,x)}{\mathrm{var}(x)}\left(1-\dfrac{S}{S_c}\mu_x\right)+\dfrac{S}{S_c}\mu_y-\dfrac{S}{S_c}\,a_0.$

Therefore, taking into account the existing trade-off relationship between $a_0$ and $a_1$, the simplified model of the simple regression using just one parameter ($a_1$) is:
(26) $y-\mu_y-\dfrac{\mathrm{cov}(y,x)}{\mu_x}=a_1\left(x-\mu_x-\dfrac{\mathrm{var}(x)}{\mu_x}\right).$

Moreover, the same regression line without any simplification writes as follows:
(27) $y-\mu_y+a_{1LS}\left(\mu_x+\dfrac{1}{p_2}\right)=a_1\left(x+\dfrac{1}{p_2}\right),$
where $p_2$ is the slope of the axis of maximum uncertainty stated in (21).

The parameter $a_1$ can be found by least-squares using the following expression:
(28) $a_1=\dfrac{\left\langle \mathbf{y}-\left(\mu_y+\frac{\mathrm{cov}(y,x)}{\mu_x}\right)\mathbf{1},\ \mathbf{x}-\left(\mu_x+\frac{\mathrm{var}(x)}{\mu_x}\right)\mathbf{1}\right\rangle}{\left\langle \mathbf{x}-\left(\mu_x+\frac{\mathrm{var}(x)}{\mu_x}\right)\mathbf{1},\ \mathbf{x}-\left(\mu_x+\frac{\mathrm{var}(x)}{\mu_x}\right)\mathbf{1}\right\rangle}.$
In most practical cases, expression (28) can be simplified as follows:
(29) $a_1=\dfrac{\mathrm{cov}(y,x)}{\mathrm{var}(x)},$
and the final one-parameter regression model writes:
(30) $y-\mu_y-\dfrac{\mathrm{cov}(y,x)}{\mu_x}=\dfrac{\mathrm{cov}(y,x)}{\mathrm{var}(x)}\left(x-\mu_x-\dfrac{\mathrm{var}(x)}{\mu_x}\right),$
which coincides with the least-squares regression line (6). This property means that (in first approximation) the standard regression line has the additional property of optimally taking into account the trade-off between the slope and the y-intercept of the regression line.
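The following sketch checks this result numerically: it computes the exact slope $p_2$ of the axis of maximum uncertainty from (14) and (21), and then identifies the single parameter $a_1$ of the one-parameter model (26)-(28), which matches $\mathrm{cov}(y,x)/\mathrm{var}(x)$ (synthetic data and variable names are our own):

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.uniform(3640, 5510, 60)
y = -46.0 + 0.016 * x + rng.normal(0, 2.0, 60)

m = len(x)
S, Sc = x.sum(), (x ** 2).sum()

# Exact slope of the axis of maximum uncertainty, eqs. (14) and (21)
lam2 = ((Sc + m) - np.sqrt((Sc - m) ** 2 + 4 * S ** 2)) / 2
p2 = (lam2 - m) / S
print(p2, -S / Sc)          # p2 and its approximation -mu_x / mu_{x^2}

# One-parameter regression (28): a0 eliminated through the trade-off (26)
cov = np.cov(x, y, bias=True)[0, 1]
var, mx, my = x.var(), x.mean(), y.mean()
yt = y - my - cov / mx      # shifted data of eq. (26)
xt = x - mx - var / mx
a1 = (yt @ xt) / (xt @ xt)  # least-squares estimate of the single slope
print(a1, cov / var)        # coincide, eqs. (29)-(30)
```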

5. Nonlinear regression: the exponential model

A common nonlinear regression model is the exponential regression, which consists of fitting the model $y(x)=a_0e^{a_1x}$ to the observed data. This nonlinear regression problem can be linearized by the logarithmic parameterization $\ln y(x)=\ln a_0+a_1x$, which implies a linear regression of the observed data $\ln\mathbf{y}=(\ln y_1,\ln y_2,\ldots,\ln y_m)^T$ in $\langle\mathbf{1}_{\mathbb{R}^m},\mathbf{x}\rangle\subset\mathbb{R}^m$. The vector of predictions can be written as $\ln\mathbf{y}^{pre}=\ln a_0\,\mathbf{1}_{\mathbb{R}^m}+a_1\mathbf{x}$ and belongs to the subspace of predictions $\langle\mathbf{1}_{\mathbb{R}^m},\mathbf{x}\rangle$, which is the column space of the forward operator $F=[\mathbf{1}_{\mathbb{R}^m}\ \ \mathbf{x}]$. The system matrix of this linear regression problem has been stated in (4), and the relation between the parameters $\ln a_{0LS}$ and $a_{1LS}$ is:
(31) $\ln a_{0LS}=\ln\bar y_g-\mu_x a_{1LS},\qquad a_{1LS}=\dfrac{\mathrm{cov}(\ln y,x)}{\mathrm{var}(x)},$
where $\mathrm{cov}(\ln y,x)$ is the covariance between the random variable $x$ and the observed data $\ln y$, and
(32) $\ln\bar y_g=\dfrac{1}{m}\sum_{k=1}^m\ln y_k,$
where $\bar y_g$ is the geometric mean of the observed data. Taking formula (31) into account, the regression straight line writes:
(33) $\ln y-\ln\bar y_g=\dfrac{\mathrm{cov}(\ln y,x)}{\mathrm{var}(x)}(x-\mu_x),$
(34) $y=\bar y_g\,e^{\frac{\mathrm{cov}(\ln y,x)}{\mathrm{var}(x)}(x-\mu_x)}.$
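A minimal sketch of this linearized exponential fit, following (31)-(32) (the function name is ours):

```python
import numpy as np

def fit_exponential_linearized(x, y):
    """Fit y = a0 * exp(a1 * x) by linear regression of ln(y) on x, eq. (31)."""
    ln_y = np.log(y)
    a1 = np.cov(ln_y, x, bias=True)[0, 1] / x.var()
    ln_a0 = ln_y.mean() - x.mean() * a1   # ln(geometric mean) - mu_x * a1
    return np.exp(ln_a0), a1              # (a0_LS, a1_LS)
```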

The symmetric matrix $F^TF$ admits orthogonal diagonalization with the same eigenvalues and eigenvectors as in the simple linear regression case. Hence, the condition number is the same as in the linear case.

Considering the previous results shown in (21), the equations of the axes of the ellipse of uncertainty are:

  1. Axis of minimum uncertainty (associated with the largest eigenvalue): (35) $a_1-a_{1LS}=p_1(\ln a_0-\ln a_{0LS}).$

  2. Axis of maximum uncertainty (associated with the smallest eigenvalue): (36) $a_1-a_{1LS}=p_2(\ln a_0-\ln a_{0LS}).$

The linearized uncertainty region is elongated in the direction of the eigenvector $\mathbf{v}_2$, and the relationship between the model parameters is in this case:
(37) $a_1=a_{1LS}+p_2(\ln a_0-\ln a_{0LS})=a_{1LS}-p_2\ln a_{0LS}+p_2\ln a_0.$

Considering relationships (31) and (36), relationship (37) writes as follows:
(38) $a_1=a_{1LS}(1+p_2\mu_x)-p_2\ln\bar y_g+p_2\ln a_0.$

Therefore, the final regression model using the only parameter $a_1$ is:
(39) $y=\bar y_g\,e^{-a_{1LS}\left(\mu_x+\frac{1}{p_2}\right)}\,e^{a_1\left(x+\frac{1}{p_2}\right)}.$

6. Nonlinear regression: the potential model

The potential regression model writes $y(x)=a_0x^{a_1}$ and can be linearized by the logarithmic parameterization $\ln y(x)=\ln a_0+a_1\ln x$, which implies the orthogonal projection of $\ln\mathbf{y}=(\ln y_1,\ln y_2,\ldots,\ln y_m)^T$ onto the subspace $\langle\mathbf{1}_{\mathbb{R}^m},\ln\mathbf{x}\rangle\subset\mathbb{R}^m$. In this case, the model parameters found in the linearized regression are $\mathbf{m}^T=(\ln a_0,a_1)$ and the system matrix is:
(40) $F^TF=\begin{pmatrix}m & \sum_{k=1}^m\ln x_k\\ \sum_{k=1}^m\ln x_k & \sum_{k=1}^m(\ln x_k)^2\end{pmatrix}.$
Calling
(41) $\mathbf{m}_{LS}=\begin{pmatrix}\ln a_{0LS}\\ a_{1LS}\end{pmatrix}=(F^TF)^{-1}F^T\ln\mathbf{y},$
the relationship between the parameters $\ln a_{0LS}$ and $a_{1LS}$ is:
(42) $\ln a_{0LS}=\ln\bar y_g-\ln\bar x_g\,a_{1LS},\qquad a_{1LS}=\dfrac{\mathrm{cov}(\ln x,\ln y)}{\mathrm{var}(\ln x)},$
with
(43) $\ln\bar x_g=\dfrac{1}{m}\sum_{k=1}^m\ln x_k,$
that is, $\bar x_g$ is the geometric mean of the x-coordinates. The regression straight line writes in this case:
(44) $\ln y-\ln\bar y_g=\dfrac{\mathrm{cov}(\ln x,\ln y)}{\mathrm{var}(\ln x)}(\ln x-\ln\bar x_g),$
(45) $\dfrac{y}{\bar y_g}=\left(\dfrac{x}{\bar x_g}\right)^{\frac{\mathrm{cov}(\ln x,\ln y)}{\mathrm{var}(\ln x)}}.$

On the other hand, calling $S_{cn}=\sum_{k=1}^m(\ln x_k)^2=m\mu_{cn}$, the eigenvalues of the matrix $F^TF$ are:
(46) $\lambda_{1,2}=\dfrac{S_{cn}+m\pm\sqrt{(S_{cn}-m)^2+4(m\ln\bar x_g)^2}}{2}=m\,\dfrac{1+\mu_{cn}\pm\sqrt{(\mu_{cn}-1)^2+4(\ln\bar x_g)^2}}{2},$
and their approximate values are:
(47) $\tilde\lambda_1=m\left(1+\dfrac{\mu_{cn}^2}{4}+(\ln\bar x_g)^2\right),\qquad \tilde\lambda_2=m\left(\mu_{cn}-\dfrac{\mu_{cn}^2}{4}-(\ln\bar x_g)^2\right).$

The corresponding eigenvectors are:
(48) $\mathbf{v}_1=(m\ln\bar x_g,\ \lambda_1-m),\qquad \mathbf{v}_2=(m\ln\bar x_g,\ \lambda_2-m).$

Following a similar reasoning as in the previous cases, the equation of the axis of maximum uncertainty is in this case:
(49) $a_1-a_{1LS}=p_2(\ln a_0-\ln a_{0LS}),$
and the slope $p_2$ and its approximate value are:
(50) $p_2=\dfrac{\lambda_2-m}{m\ln\bar x_g},\qquad \tilde p_2=-\dfrac{1+\frac{\mu_{cn}^2}{4}+(\ln\bar x_g)^2-\mu_{cn}}{\ln\bar x_g}.$

Therefore, the relationship between $\ln a_0$ and $a_1$ is:
(51) $a_1=a_{1LS}(1+p_2\ln\bar x_g)-p_2\ln\bar y_g+p_2\ln a_0.$

And the final regression model using just one parameter, $a_1$ (without any simplification), is as follows:
(52) $y=\bar y_g\,e^{-a_{1LS}\left(\ln\bar x_g+\frac{1}{p_2}\right)}\,e^{a_1\left(\ln x+\frac{1}{p_2}\right)}.$

7. The multivariate linear model

In this case, we fit a linear regression model $y(x_1,x_2,\ldots,x_n)=a_0+a_1x_1+\cdots+a_nx_n$ to the experimental data table given by:
(53) $\{(x_{k1},x_{k2},\ldots,x_{kn},y_k),\ k=1,\ldots,m\}.$

The problem consists of finding the parameters $\mathbf{m}^T=(a_0,a_1,\ldots,a_n)$ so that the distance between the observed data $\mathbf{y}=(y_1,y_2,\ldots,y_m)^T$ and the corresponding predictions $\mathbf{y}^{pre}$ is minimum according to the Euclidean distance in the data space $\mathbb{R}^m$. The vector of predictions can now be written as $\mathbf{y}^{pre}=a_0\mathbf{1}_{\mathbb{R}^m}+a_1\mathbf{x}_1+\cdots+a_n\mathbf{x}_n$, belonging to the column space of the forward operator $F=[\mathbf{1}_{\mathbb{R}^m},\mathbf{x}_1,\ldots,\mathbf{x}_n]$. The normal equations $F^TF\mathbf{m}_{LS}=F^T\mathbf{y}$ have a symmetric system matrix $F^TF$ of size $n+1$, and the least-squares solution writes:
(54) $\mathbf{m}_{LS}=(a_{0LS},a_{1LS},\ldots,a_{nLS})^T=(F^TF)^{-1}F^T\mathbf{y}.$

The system matrix writes in this case:
(55) $F^TF=\begin{pmatrix}m & \sum_k x_{k1} & \sum_k x_{k2} & \cdots & \sum_k x_{kn}\\ \sum_k x_{k1} & \sum_k x_{k1}^2 & \sum_k x_{k1}x_{k2} & \cdots & \sum_k x_{k1}x_{kn}\\ \vdots & \vdots & \vdots & \ddots & \vdots\\ \sum_k x_{kn} & \sum_k x_{kn}x_{k1} & \sum_k x_{kn}x_{k2} & \cdots & \sum_k x_{kn}^2\end{pmatrix}.$
From the first equation of the system of normal equations we have:
(56) $a_{0LS}=\mu_y-\sum_{k=1}^n a_{kLS}\,\mu_{x_k},$
where $\mu_{x_i}=\frac{1}{m}\sum_{k=1}^m x_{ki}$ and $\mu_y=\frac{1}{m}\sum_{k=1}^m y_k$.

Taking relationship (56) into account, the system of normal equations becomes:
(57) $\begin{pmatrix}\mathrm{var}(x_1) & \mathrm{cov}(x_1,x_2) & \cdots & \mathrm{cov}(x_1,x_n)\\ \mathrm{cov}(x_1,x_2) & \mathrm{var}(x_2) & \cdots & \mathrm{cov}(x_2,x_n)\\ \vdots & \vdots & \ddots & \vdots\\ \mathrm{cov}(x_1,x_n) & \mathrm{cov}(x_2,x_n) & \cdots & \mathrm{var}(x_n)\end{pmatrix}\begin{pmatrix}a_{1LS}\\ a_{2LS}\\ \vdots\\ a_{nLS}\end{pmatrix}=\begin{pmatrix}\mathrm{cov}(x_1,y)\\ \mathrm{cov}(x_2,y)\\ \vdots\\ \mathrm{cov}(x_n,y)\end{pmatrix},$
which involves the covariance matrix $C$ of the data vectors $\{\mathbf{x}_k\}_{k=1,\ldots,n}$ and the covariance vector $\mathbf{c}_{xy}$ between the data vectors $\{\mathbf{x}_k\}_{k=1,\ldots,n}$ and the observed data $\mathbf{y}$. The covariance matrix $C$ is symmetric and positive definite, and can be considered as the Gram matrix of a scalar product in the subspace $\langle\mathbf{x}_1,\mathbf{x}_2,\ldots,\mathbf{x}_n\rangle$ of $\mathbb{R}^m$. This matrix admits the orthogonal decomposition $C=VDV^T$, where the diagonal elements of $D$ (the spectrum of $C$) are the real positive eigenvalues $\{\lambda_1,\lambda_2,\ldots,\lambda_n\}$, and $V$ is an orthogonal matrix whose columns are the corresponding eigenvectors $\mathbf{v}_k$ of $C$. These results are very well known in linear algebra (see, for instance, [Citation26]). Therefore, the coefficients $\mathbf{a}_{LS}=(a_{1LS},\ldots,a_{nLS})^T$ in the linear system (57) are:
(58) $\mathbf{a}_{LS}=VD^{-1}V^T\mathbf{c}_{xy}=\sum_{k=1}^n\dfrac{c_{Vk}}{\lambda_k}\mathbf{v}_k,$
where $c_{Vk}$ are the coordinates of $\mathbf{c}_{xy}$ referred to the $V$ orthogonal basis of $\mathbb{R}^n$. Calling $\mathbf{x}_c=(x_1-\mu_{x_1},x_2-\mu_{x_2},\ldots,x_n-\mu_{x_n})$, the least-squares estimator writes:
(59) $y(\mathbf{x})-\mu_y=\sum_{k=1}^n a_{kLS}(x_k-\mu_{x_k})=\sum_{k=1}^n\dfrac{c_{Vk}}{\lambda_k}\langle\mathbf{v}_k,\mathbf{x}_c\rangle,$
which is a hyperplane passing through the point $(\mu_{x_1},\mu_{x_2},\ldots,\mu_{x_n},\mu_y)$. Also, the prediction model has its largest uncertainty associated with the smallest eigenvalues of $C$. In fact, the ill-conditioning of (57) depends on how the spectrum of $C$ decays.
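A short sketch of this covariance form of the normal equations, eqs. (56)-(58) (our own function; it assumes the columns of `X` are the variables $x_1,\ldots,x_n$):

```python
import numpy as np

def multivariate_fit(X, y):
    """Solve the normal equations in covariance form, eq. (57).

    X : (m, n) array whose columns are the regression variables.
    Returns (a0_LS, a_LS) following eqs. (56) and (58).
    """
    C = np.atleast_2d(np.cov(X, rowvar=False, bias=True))  # covariance matrix C
    c_xy = (X - X.mean(0)).T @ (y - y.mean()) / len(y)     # vector cov(x_k, y)
    a = np.linalg.solve(C, c_xy)                           # a_LS = C^{-1} c_xy
    a0 = y.mean() - a @ X.mean(0)                          # eq. (56)
    return a0, a
```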

8. The bivariate case

In the particular case where $y(x_1,x_2)=a_0+a_1x_1+a_2x_2$, we have:
(60) $F^TF=\begin{pmatrix}m & \sum_k x_{k1} & \sum_k x_{k2}\\ \sum_k x_{k1} & \sum_k x_{k1}^2 & \sum_k x_{k1}x_{k2}\\ \sum_k x_{k2} & \sum_k x_{k1}x_{k2} & \sum_k x_{k2}^2\end{pmatrix}=\begin{pmatrix}m & S_1 & S_2\\ S_1 & S_{c1} & S_{12}\\ S_2 & S_{12} & S_{c2}\end{pmatrix},$
and the relation between the parameters $a_{0LS}$, $a_{1LS}$ and $a_{2LS}$ is:
(61) $a_{0LS}=\mu_y-a_{1LS}\mu_{x_1}-a_{2LS}\mu_{x_2},\qquad a_{1LS}=\dfrac{\mathrm{cov}(x_1,y)-\mathrm{cov}(x_1,x_2)\,a_{2LS}}{\mathrm{var}(x_1)},\qquad a_{2LS}=\dfrac{\mathrm{var}(x_1)\mathrm{cov}(x_2,y)-\mathrm{cov}(x_1,x_2)\mathrm{cov}(x_1,y)}{\mathrm{var}(x_1)\mathrm{var}(x_2)-\mathrm{cov}^2(x_1,x_2)}.$

Taking formula (61) into account, the regression plane writes:
(62) $y-\mu_y=\dfrac{\mathrm{var}(x_2)\mathrm{cov}(x_1,y)-\mathrm{cov}(x_1,x_2)\mathrm{cov}(x_2,y)}{\mathrm{var}(x_1)\mathrm{var}(x_2)-\mathrm{cov}^2(x_1,x_2)}(x_1-\mu_{x_1})+\dfrac{\mathrm{var}(x_1)\mathrm{cov}(x_2,y)-\mathrm{cov}(x_1,x_2)\mathrm{cov}(x_1,y)}{\mathrm{var}(x_1)\mathrm{var}(x_2)-\mathrm{cov}^2(x_1,x_2)}(x_2-\mu_{x_2}).$

From system (57), the normal equations for $a_{1LS}$ and $a_{2LS}$ in the bivariate case are:
(63) $\begin{pmatrix}\mathrm{var}(x_1) & \mathrm{cov}(x_1,x_2)\\ \mathrm{cov}(x_1,x_2) & \mathrm{var}(x_2)\end{pmatrix}\begin{pmatrix}a_{1LS}\\ a_{2LS}\end{pmatrix}=\begin{pmatrix}\mathrm{cov}(x_1,y)\\ \mathrm{cov}(x_2,y)\end{pmatrix}.$

The eigenvalues of the covariance matrix are:
(64) $\eta_{1,2}=\dfrac{\mathrm{var}(x_1)+\mathrm{var}(x_2)\pm\sqrt{(\mathrm{var}(x_1)-\mathrm{var}(x_2))^2+4\,\mathrm{cov}^2(x_1,x_2)}}{2},$
and the corresponding eigenvectors are:
(65) $\mathbf{v}_1=(C_{12},\ \eta_1-\mathrm{var}(x_1)),\qquad \mathbf{v}_2=(C_{12},\ \eta_2-\mathrm{var}(x_1)),$
where $C_{12}=\mathrm{cov}(x_1,x_2)$. Calling
(66) $S_T=\mathrm{var}(x_1)+\mathrm{var}(x_2),\qquad D=\mathrm{var}(x_1)-\mathrm{var}(x_2),$
the eigenvalues are:
(67) $\eta_{1,2}=\dfrac{S_T\pm\sqrt{D^2+4C_{12}^2}}{2}=\dfrac{S_T\pm D\sqrt{1+\frac{4C_{12}^2}{D^2}}}{2}\approx\dfrac{S_T\pm D\left(1+\frac{2C_{12}^2}{D^2}\right)}{2},$
which can be approximated as follows:
(68) $\tilde\eta_1\approx\dfrac{S_T+D}{2}=\mathrm{var}(x_1),\qquad \tilde\eta_2\approx\dfrac{S_T-D}{2}=\mathrm{var}(x_2).$
Therefore, the conditioning of this regression problem depends, in a first approximation, on the variances of the regression variables.

From Equation (58) we have:
(69) $\begin{pmatrix}a_{1LS}\\ a_{2LS}\end{pmatrix}=C^{-1}\begin{pmatrix}\mathrm{cov}(x_1,y)\\ \mathrm{cov}(x_2,y)\end{pmatrix}=[\mathbf{v}_1\ \mathbf{v}_2]\begin{pmatrix}\frac{1}{\eta_1} & 0\\ 0 & \frac{1}{\eta_2}\end{pmatrix}\begin{pmatrix}\mathbf{v}_1^T\\ \mathbf{v}_2^T\end{pmatrix}\begin{pmatrix}\mathrm{cov}(x_1,y)\\ \mathrm{cov}(x_2,y)\end{pmatrix},$
where we have taken into account the orthogonal decomposition of the covariance matrix $C$ to calculate its inverse. Besides, naming
(70) $\begin{pmatrix}c_1\\ c_2\end{pmatrix}=\begin{pmatrix}\mathbf{v}_1^T\\ \mathbf{v}_2^T\end{pmatrix}\begin{pmatrix}\mathrm{cov}(x_1,y)\\ \mathrm{cov}(x_2,y)\end{pmatrix}=\begin{pmatrix}C_{12}\,\mathrm{cov}(x_1,y)+(\eta_1-\mathrm{var}(x_1))\,\mathrm{cov}(x_2,y)\\ C_{12}\,\mathrm{cov}(x_1,y)+(\eta_2-\mathrm{var}(x_1))\,\mathrm{cov}(x_2,y)\end{pmatrix},$
we arrive at:
(71) $\begin{pmatrix}a_{1LS}\\ a_{2LS}\end{pmatrix}=\begin{pmatrix}C_{12}\left(\frac{c_1}{\eta_1}+\frac{c_2}{\eta_2}\right)\\ c_1+c_2-\mathrm{var}(x_1)\left(\frac{c_1}{\eta_1}+\frac{c_2}{\eta_2}\right)\end{pmatrix}=\begin{pmatrix}C_{12}\left(\frac{c_1}{\eta_1}+\frac{c_2}{\eta_2}\right)\\ c_1+c_2-\mathrm{var}(x_1)\frac{a_{1LS}}{C_{12}}\end{pmatrix}.$
Finally, taking into consideration relationships (59) and (71), the equation of the regression plane is:
(72) $y(\mathbf{x})-\mu_y=a_{1LS}\left(x_1-\mu_{x_1}-\dfrac{\mathrm{var}(x_1)}{C_{12}}(x_2-\mu_{x_2})\right)+(c_1+c_2)(x_2-\mu_{x_2}),$
where
(73) $c_1+c_2=2C_{12}\,\mathrm{cov}(x_1,y)-D\,\mathrm{cov}(x_2,y).$

9. Application to concrete strength analysis

We provide a real example consisting of the linear regression of the mechanical strength of concrete (in MPa) on the measured velocity $V$ of ultrasonic waves (in m/s). Such NDT measurements are a very common way of assessing concrete strength after having calibrated an empirical relationship between the NDT measurement and strength. To illustrate the theoretical results numerically, we have used the data set published by Oktar et al. [Citation27]. The original dataset contains 60 values of core strength, ultrasonic pulse velocity and rebound index. The mean value, standard deviation, minimum and maximum are, respectively:

  • 23.6, 7.7, 7.5 and 42.2 for the concrete strength (in MPa),

  • 4461, 416, 3640 and 5510 for the velocity (in m/s),

  • 30.5, 4.5, 14.4 and 42.8 for rebound index (without unit).

9.1. The linear model

The least-squares solution for the concrete strength $r$ via the ultrasonic velocity $V$ is the regression line:
(74) $r(V)=-46.37+0.01569\,V,\qquad V>2956,$
with a relative misfit of $\frac{\|F\mathbf{m}-\mathbf{y}\|_2}{\|\mathbf{y}\|_2}=16.57\%$. This model provides positive concrete strength for velocity values greater than 2956 m/s.

Besides, the eigenvalues, eigenvectors and condition number of the $F^TF$ matrix are:
(75) $\lambda_1=1.2\times10^9,\quad \lambda_2=0.508,\quad \kappa=\sqrt{\lambda_1/\lambda_2}=4.87\times10^4,\quad \mathbf{v}_1=(3\times10^5,\ 1.2\times10^9),\quad \mathbf{v}_2=(2.68\times10^5,\ -5.95\times10^1).$
The equation of the axis of maximum uncertainty is in this case:
(76) $a_1=5.4\times10^{-3}-2.22\times10^{-4}\,a_0.$

Besides, to mimic the process of partial information, we have built $N_S=100$ different simulations, randomly generating different data bags and computing the least-squares fit of each bag, thus calculating 100 different sets of least-squares parameters $\{(a_{0LS}^k,a_{1LS}^k)\}_{k=1,\ldots,N_S}$. In this process, the observed data are randomly selected for each bag, and the least-squares solution is found. As each investigator of a structure gets a specific dataset and identifies a specific model after inversion, data bagging reproduces the results that would be obtained on the same structure by a series of investigators who each get their own sample and model. Data bagging is, therefore, a suitable means of sampling the uncertainty region.
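A hedged sketch of this data bagging procedure (the bag size is not specified above; the fraction `frac` of the data kept in each bag is our assumption):

```python
import numpy as np

def bag_least_squares(F, y, n_bags=100, frac=0.5, seed=None):
    """Sample the uncertainty region by least-squares fits of random data bags."""
    rng = np.random.default_rng(seed)
    m = len(y)
    models = []
    for _ in range(n_bags):
        idx = rng.choice(m, size=int(frac * m), replace=False)  # one data bag
        mb, *_ = np.linalg.lstsq(F[idx], y[idx], rcond=None)    # LS fit of the bag
        models.append(mb)
    return np.array(models)   # rows (a0_LS^k, a1_LS^k), scattered along v2
```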

Figure 1(A) shows the plot of these sets together with the ellipse of uncertainty for a relative data misfit $\frac{\|F\mathbf{m}-\mathbf{y}\|_2}{\|\mathbf{y}\|_2}\le20\%$. The error tolerance (20%) has been established based on the minimum misfit of the least-squares solution that has been found (16.57%). The main axes of this ellipse are the directions of uncertainty of the identification problem, which correspond to the $\mathbf{v}_1,\mathbf{v}_2$ vectors of the $V$ orthonormal basis provided by the singular value decomposition of $F$. Besides, as explained above, the longer axis of uncertainty corresponds to the smallest singular value of $F$. The main conclusion of this analysis is that the model parameters identified by the bootstrapping procedure (least-squares fits of random data bags) are located (or 'sampled') along the axis of maximum uncertainty of this linear regression problem, which is ill-conditioned. Finally, Figure 1(B) shows the observed data, the regression line and the one-parameter regression obtained with the simplified formulas. As can be observed, the numerical results confirm the theoretical analysis shown above.

Figure 1. Linear regression model. (A) Ellipse of uncertainty for a relative misfit of 20% and the different sets of parameters $\{(a_{0LS}^k,a_{1LS}^k)\}_{k=1,\ldots,N_S}$ found in the $N_S$ bagging experiments. The longer axis of the ellipse corresponds to the direction of largest uncertainty (smallest singular value of the system matrix). (B) Data points, regression line (black line) and one-parameter regression line. Both lines are coincident and cannot be distinguished. This numerical result shows the accuracy of our theoretical analysis.

9.2. Exponential model

The linearized regression line is in this case:
(77) $\ln r(V)=0.0463+7\times10^{-4}\,V,$
with a linearized relative misfit of $\frac{\|F\mathbf{m}-\ln\mathbf{y}\|_2}{\|\ln\mathbf{y}\|_2}=5.8\%$, where $r$ is the strength and $V$ the ultrasonic pulse velocity. The misfit is lower in this case due to the logarithmic reparameterization of $r$. The corresponding nonlinear relative misfit for the least-squares model amounts to 17.1%, which is close to the relative misfit of the linear model.

The eigenvalues, eigenvectors and condition number are the same as in the linear case. The equation of the axis of maximum uncertainty is:
(78) $a_1=10^{-4}\,(6.96-2.22\ln a_0).$
The process of building the simulations to calculate the different sets of least-squares parameters $\{(\ln a_{0LS}^k,a_{1LS}^k)\}_{k=1,\ldots,N_S}$ is the same as in the linear case.

Figure 2(A) shows the plot of these sets together with the ellipse of uncertainty for a nonlinear relative data misfit $\frac{\|F(\mathbf{m})-\mathbf{y}\|_2}{\|\mathbf{y}\|_2}\le20\%$. Figure 2(B) shows the observed data, the regression line and the one-parameter regression line deduced from the theoretical developments.

Figure 2. Exponential regression model. (A) Ellipse of uncertainty for a relative misfit of 6% and the different sets of parameters $\{(a_{0LS}^k,a_{1LS}^k)\}_{k=1,\ldots,N_S}$ found in the $N_S$ bagging experiments. The same considerations as in the previous case apply to the ellipse of uncertainty. (B) Data points, regression line (black line) and one-parameter regression line. Both lines are coincident and cannot be distinguished.

9.3. Potential model

The linearized regression line is in this case:
(79) $\ln r(V)=-23.11+3.12\ln V,$
with a relative misfit of $\frac{\|F\mathbf{m}-\ln\mathbf{y}\|_2}{\|\ln\mathbf{y}\|_2}=5.76\%$, where $\ln r$ and $\ln V$ are the natural logarithms of the strength and the ultrasonic pulse velocity, respectively. The corresponding nonlinear relative misfit for the least-squares model was 16.8%.

Besides, the eigenvalues, eigenvectors and condition number of the $F^TF$ matrix are in this case:
(80) $\lambda_1=4.3\times10^3,\quad \lambda_2=0.0073,\quad \kappa=\sqrt{\lambda_1/\lambda_2}=769.34,\quad \mathbf{v}_1=(5.04\times10^2,\ 4.23\times10^3),\quad \mathbf{v}_2=(5.04\times10^2,\ -5.99\times10^1).$
The equation of the axis of maximum uncertainty is in this case:
(81) $a_1=0.37-0.12\ln a_0.$
The process of building the simulations to calculate the different sets of least-squares parameters $\{(\ln a_{0LS}^k,a_{1LS}^k)\}_{k=1,\ldots,N_S}$ is the same as in the previous cases.

Figure 3(A) shows the plot of these sets together with the ellipse of uncertainty for a relative data misfit $\frac{\|F\mathbf{m}-\mathbf{y}\|_2}{\|\mathbf{y}\|_2}\le20\%$. Figure 3(B) shows the observed data, the regression line and the one-parameter regression line deduced from the theoretical developments.

Figure 3. Potential case. (A) Ellipse of uncertainty for a relative misfit of 6% and the different sets of parameters $\{(a_{0LS}^k,a_{1LS}^k)\}_{k=1,\ldots,N_S}$ found in the $N_S$ bagging experiments. The same considerations as in the previous case apply to the ellipse of uncertainty. (B) Data points, regression line (black line) and one-parameter regression line. Both lines are coincident and cannot be distinguished.

Besides, in the case of the nonlinear models (exponential and potential), the bootstrapping procedure could also be applied to the original nonlinear identification problems by solving, for each data bag, the corresponding nonlinear least-squares problem. In this case, instead of sampling the linearized equivalence region, the procedure would directly sample the nonlinear equivalence region, which exhibits a curvilinear shape [Citation1].
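A sketch of this nonlinear variant for the exponential model (the starting point `p0` and the bagging parameters are our assumptions):

```python
import numpy as np
from scipy.optimize import curve_fit

def nonlinear_bag_fits(V, r, n_bags=100, frac=0.5, seed=None):
    """Sample the nonlinear equivalence region of r(V) = a0 * exp(a1 * V)."""
    rng = np.random.default_rng(seed)
    m = len(V)
    params = []
    for _ in range(n_bags):
        idx = rng.choice(m, size=int(frac * m), replace=False)
        p, _ = curve_fit(lambda v, a0, a1: a0 * np.exp(a1 * v),
                         V[idx], r[idx], p0=(1.0, 7e-4), maxfev=10000)
        params.append(p)
    return np.array(params)   # samples along the curvilinear valley of low misfit
```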

Figure 4(A,B) shows the sampling of the nonlinear equivalence region for the exponential and potential cases induced by the bootstrapping procedure. It can be observed how the parameters are located along the curvilinear valley of lower misfits. In this case, instead of solving the linearized systems, the algorithm iteratively solves the corresponding nonlinear systems via nonlinear least-squares.

Figure 4. Potential and exponential cases. Bootstrapping procedure applied to the sampling of the nonlinear equivalence region.

9.4. Bivariate model

The linear regression plane is:
(82) $r(V,R)=-44.365+0.0104\,V+0.709\,R,$
with a relative misfit of $\frac{\|F\mathbf{m}-\mathbf{y}\|_2}{\|\mathbf{y}\|_2}=13.8\%$, where $r$ is the strength, $V$ the ultrasonic pulse velocity and $R$ the rebound index. Therefore, constraining the concrete strength with both the ultrasonic pulse velocity and the rebound index is the regression method that provides the smallest relative data misfit.

The eigenvalues, eigenvectors and condition number of the $F^TF$ matrix are:
(83) $\lambda_1=1.2\times10^9,\quad \lambda_2=615.684,\quad \lambda_3=0.504,\quad \kappa=\sqrt{\lambda_1/\lambda_3}=4.89\times10^4,$
$\mathbf{v}_1=(2\times10^{-4},\ 1,\ 6.8\times10^{-3}),\quad \mathbf{v}_2=(2.3\times10^{-3},\ -6.8\times10^{-3},\ 1),\quad \mathbf{v}_3=(1,\ -2\times10^{-4},\ -2.3\times10^{-3}).$
This regression problem is also ill-conditioned. As in the previous cases, we have run the simulations to calculate the different sets of least-squares parameters $\{(a_{0LS}^k,a_{1LS}^k,a_{2LS}^k)\}_{k=1,\ldots,N_S}$.

Figure 5(A) shows the ellipsoid of 20% relative misfit for this regression problem. It can be observed that the length of the main axis corresponding to $\lambda_1$ is much smaller than the other two, and the ellipsoid becomes 'almost' an ellipse in this case (a flat ellipsoid). Finally, Figure 5(B) shows the regression plane and the observed data.

Figure 5. Bivariate linear model. (A) Ellipsoid of uncertainty for a relative misfit of 15% and the different sets of parameters $\{(a_{0LS}^k,a_{1LS}^k,a_{2LS}^k)\}_{k=1,\ldots,N_S}$ found in the $N_S$ bagging experiments. (B) Data points and regression plane.

In all the cases that have been analysed, a perfect match can be observed between the numerical experiments and the theoretical results.

9.5. Percentile curves estimation

One of the most important achievements of the present methodology is the possibility of estimating the posterior distribution of the concrete strength for different values of the control variables: the ultrasonic pulse velocity and the rebound index. For that purpose, we use the different sets of least-squares parameters found with relative misfit lower than 20%, using different data bags for each regression model. The idea consists in predicting the concrete strength for different concrete configurations over a grid of values of $(V,R)$. Based on these estimations, it is possible to compute the percentile curves for each regression model over these grids of values.
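A minimal sketch of the computation of these percentile curves for the linear model (for the other models the prediction line changes accordingly; the names are ours):

```python
import numpy as np

def percentile_curves(models, V_grid, percentiles=(5, 25, 50, 75, 95)):
    """Percentile curves of predicted strength over a grid of velocities.

    `models` is the (n_bags, 2) array of bagged parameters (a0, a1)
    whose misfit is below the tolerance.
    """
    preds = models[:, [0]] + models[:, [1]] * V_grid[None, :]  # (bags, grid)
    return np.percentile(preds, percentiles, axis=0)           # one row per percentile
```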

Figure 6 shows the percentile curves obtained for this data set as a function of $V$. We show the percentiles 5, 25, 50, 75 and 95 of the posterior distribution of the concrete strength estimation. This graphic works as follows: let us imagine that we have adopted the bivariate model for concrete strength estimation. Given a concrete whose ultrasonic velocity is 4800 m/s, the median strength estimation (percentile 50) is 27.98 MPa, with an interquartile range of 0.72 MPa accounting for the uncertainty in the estimation. If the concrete velocity increases to 5400 m/s, these numbers become 40.70 ± 1.32 MPa. It can be observed that in all the regression models the uncertainty is smaller for middle velocities around 4500 m/s and increases towards the extremes of the interval of variation of $V$. All the models provide a similar uncertainty quantification for middle concrete ultrasonic velocities, while the nonlinear models lead to larger uncertainties. These curves can be directly used by concrete assessment experts who need to assess the risk of making a wrong assessment of local strength, since they directly include the effect of model error due to lack of fit.

Figure 6. Percentile curves for concrete strength estimation depending on the ultrasonic pulse velocity. (A) Linear model. (B) Exponential model. (C) Potential model. (D) Bivariate model (constrained with the rebound).

Finally, Figure 7 shows the percentile curves for concrete strength estimation as a function of the ultrasonic pulse velocity for the exponential and potential cases. It can be observed that in the exponential case the curves are almost indistinguishable, whereas in the potential case they differ by around 5 MPa for almost all velocities except around 4500 m/s. In any case, we have shown that both the linearized and the nonlinear bootstrap serve to sample the uncertainty region in these nonlinear regression models and provide a robust method to assess the regression.

Figure 7. Percentile curves for concrete strength estimation depending on the ultrasonic pulse velocity deduced from the nonlinear bootstrapping sampling shown in Figure 4. (A) Exponential model. (B) Potential model. Similarities to Figure 6(B,C) can be observed.

10. Conclusions

In this paper, we have revisited the uncertainty analysis of linear regression problems through the use of linear algebra techniques, inspired by the problem of concrete strength assessment by means of non-destructive measurements. We have provided an analytical expression for the direction of maximum uncertainty, along which the models of the linear regression problem are sampled when partial data information is used. This situation always happens in practice, since data gathering is always discrete and has an economic cost. This direction of maximum uncertainty implies a trade-off between the slope and the y-intercept of the least-squares regression line. We have shown that, using this trade-off, it is possible to write a one-parameter regression problem where the slope is the unique parameter to be identified, and its solution coincides in first approximation with the least-squares regression line. This implies that the standard regression line has the additional property of optimally taking into account the trade-off between the slope and the y-intercept. We have also generalized this analysis to the exponential and potential regression models and to the multivariate case, with special emphasis on the bivariate case.

Finally, as a practical example, we have performed the estimation of the concrete strength by means of non-destructive measurements (ultrasonic pulse velocity and rebound index), showing the existing trade-off between the least-squares parameters for these different regression models. This fact explains why experts in NDT concrete strength estimation have identified a large variety of empirical relationships describing the same phenomenon, and it opens new ways of improving the estimation of this property by sampling the posterior distribution of the regression model parameters via data bagging. We have introduced the percentile curves, which provide a robust method to accomplish the concrete strength estimation while quantifying at the same time its uncertainty, without the need to look for the right universal regression model, which does not exist. This approach generalizes the statistical results provided by ANOVA to establish critical regions using the Fisher-Snedecor distribution, but in this case only linear algebra and optimization techniques are used, and they serve to explain deterministically the uncertainty involved in regression problems. Besides, we have shown, in the case of the nonlinear regression models, that the nonlinear bootstrap procedure provides results very similar to those of the bootstrap analysis of the corresponding linearized systems.

Therefore, we can conclude that the methodology presented in this paper is a robust way of assessing the intrinsic uncertainty of these simple parameter identification problems, introduced by noise in the data, partial knowledge of the system under research, and modelling errors. The generalization to higher dimensions is straightforward and is currently under research.

Acknowledgements

We thank Ms. Celia Fernández-Brillet for the English and style corrections that served to improve the quality of this paper.

Disclosure statement

No potential conflict of interest was reported by the authors.

ORCID

Fernández-Martínez Juan Luis http://orcid.org/0000-0002-4758-2832

Fernández-Muñiz Zulima http://orcid.org/0000-0002-6544-2753

References

  • Fernández-Martínez JL, Fernández-Muñiz MZ, Tompkins MJ. On the topography of the cost functional in linear and nonlinear inverse problems. Geophysics. 2012;77(1):W1–W15. doi:10.1190/geo2011-0341.1.
  • Fernández-Martínez JL, Pallero JLG, Fernández-Muñiz Z, et al. From Bayes to Tarantola: new insights to understand uncertainty in inverse problems. J Appl Geophys. 2013;98:62–72. doi: 10.1016/j.jappgeo.2013.07.005
  • Tarantola A, Valette B. Inverse problems = quest for information. J Geophys. 1982a;50(3):159–170.
  • Tarantola A, Valette B. Generalized nonlinear inverse problems solved using the least squares criterion. Rev Geophys Space Phys. 1982b;20:219–232. doi: 10.1029/RG020i002p00219
  • Scales JA, Snieder R. The anatomy of inverse problems. Geophysics. 2000;65(6):1708–1710. doi:10.1190/geo2000-0001.1.
  • Scales JA, Tenorio L. Prior information and uncertainty in inverse problems. Geophysics. 2001;66(2):389–397. doi: 10.1190/1.1444930
  • Biondi S. The knowledge level in existing buildings assessment. In: 14th World Conference on Earthquake Engineering; 2008 October 12–17; Beijing, China. CAEE Chinese Association. Earthquake Engineering, IAEE International Association. https://www.researchgate.net/publication/228871324_The_Knowledge_Level_in_existing_buildings_assessment
  • Breysse D. Nondestructive evaluation of concrete strength: an historical review and a new perspective by combining NDT methods. Constr Build Mat. 2012;33:139–163. doi: 10.1016/j.conbuildmat.2011.12.103
  • Szilagyi K, Borosnyoi A, Zsigovics I. Rebound surface hardness of concrete: introduction of an empirical constitutive model. Constr Build Mat. 2011;25:2480–2487. doi: 10.1016/j.conbuildmat.2010.11.070
  • Bogas JA, Gomes MG, Gomes A. Compressive strength evaluation of structural lightweight concrete by non-destructive ultrasonic pulse velocity method. Ultrasonics. 2013;53:962–972. doi: 10.1016/j.ultras.2012.12.012
  • Vasanelli E, Colangiuli D, Calia A, et al. Combining non-invasive techniques for reliable prediction of soft stone strength in historic masonries. Constr Build Mat. 2017;146:744–754. doi: 10.1016/j.conbuildmat.2017.04.146
  • Breysse D, Villain G, Sbartai ZM, Garnier V. Construction of conversion models of observables into indicators, Chap. 7. In: Balayssac JP, Garnier V, editors. Non-destructive testing and evaluation of civil engineering structures. London: ISTE Press, Elsevier publication; 2018. p. 231–257.
  • Qasrawi HY. Concrete strength by combined nondestructive methods simply and reliably predicted. Cem Concr Res. 2000;30:739–746. doi: 10.1016/S0008-8846(00)00226-X
  • Amini K, Jalalpour M, Delatte N. Advancing concrete strength prediction using non-destructive testing: development and verification of a generalizable model. Constr Build Mat. 2016;102:762–768. doi: 10.1016/j.conbuildmat.2015.10.131
  • Bashar SM, Najwa JA, Abdullahi M. Evaluation of rubbercrete based on ultrasonic pulse velocity and rebound hammer tests. Constr Build Mat. 2011;25:1388–1397. doi: 10.1016/j.conbuildmat.2010.09.004
  • Nada Mahdi F, AbdMuttalib Issa S, Ali Khalid J. Prediction of compressive strength of reinforced concrete structural elements by using combined non destructive tests. J Eng. 2013;10(19):1189–1211.
  • ACI 228 1R. In-place methods to estimate concrete strength. Farmington Hills (MI): American Concrete Institute; 2003.
  • Alwash M, Breysse D, Sbartaï ZM. Using Monte-Carlo simulations to evaluate the efficiency of different strategies for nondestructive assessment of concrete strength. Mater Struct. 2017. doi:10.1617/s11527-016-0962-x.
  • Breysse D, Fernández-Martínez JL. Assessing concrete strength with rebound hammer: review of key issues and ideas for more reliable conclusions. Mater Struct. 2014;47(9):1589–1604. doi: 10.1617/s11527-013-0139-9
  • Alwash M, Breysse D, Sbartaï ZM. Non-destructive strength evaluation of concrete: analysis of some key factors using synthetic simulations. Constr Build Mat. 2015;99:235–245. doi: 10.1016/j.conbuildmat.2015.09.023
  • Fernández-Martínez JL, Pallero JLG, Fernández-Muñiz Z, et al. The effect of noise and Tikhonov’s regularization in inverse problems. Part I: The linear case. J Appl Geophys. 2014a;108:176–185. doi:10.1016/j.jappgeo.2014.05.006.
  • Fernández-Martínez JL, Pallero JLG, Fernández-Muñiz Z, et al. The effect of noise and Tikhonov’s regularization in inverse problems. Part II: the nonlinear case. J Appl Geophys. 2014b;108:186–193. doi:10.1016/j.jappgeo.2014.05.005.
  • Aster RC, Borchers B, Thurber CH. Parameter estimation and inverse problems. 1st ed. San Diego (CA): Elsevier Academic Press; 2005.
  • Mandel J. Fitting straight lines when both variables are subject to error. J Qual Tech. 1984;16:1–14. doi: 10.1080/00224065.1984.11978881
  • Cruz de Oliveira E, Fernandes de Aguiar P. Least squares regression with errors in both variables: case studies. Quim Nova. 2013;36(6):885–889. doi: 10.1590/S0100-40422013000600025
  • Strang G. Linear algebra and its applications. 2nd ed. New York (NY): Academic Press; 1980.
  • Oktar ON, Moral H, Tasdemir MA. Factors determining the correlations between concrete properties. Cem Concr Res. 1996;26(11):1629–1637. doi: 10.1016/S0008-8846(96)00167-6
