Research Article

Response surface designs using the generalized variance inflation factors

Diarmuid O'Driscoll & Donald E. Ramirez | (Reviewing Editor)
Article: 1053728 | Received 22 Dec 2014, Accepted 15 May 2015, Published online: 14 Jun 2015

Abstract

We study response surface designs using the generalized variance inflation factors for subsets as an extension of the variance inflation factors.

Public Interest Statement

Response surface designs are a mainstay in applied statistics. The variance inflation factors VIF are a measure of collinearity for a single variable in a linear regression model. The generalization to subsets of variables is the generalized variance inflation factor GVIF. This research introduces GVIF as a penalty measure for extending a linear response model to a response surface with the included quadratic terms. The methodology is demonstrated with case studies, and, in particular, it is shown that using GVIF, the H310 design can be improved for the standard global optimality criteria of A, D, and E.

1. Introduction

We consider a linear regression Y = Xβ + ε with X a full rank n×p matrix and L(ε) = N(0, σ²I_n). The variance inflation factor VIF, Belsley (1986), measures the penalty for adding one non-orthogonal additional explanatory variable to a linear regression model, and it can be computed as a ratio of determinants. The extension of the VIF to a measure of the penalty for adding a subset of variables to a model is the generalized variance inflation factor GVIF of Fox and Monette (1992), which will be used to study response surface designs, in particular, as the penalty for adding the quadratic terms to the model.

2. Variance inflation factors

For our linear model Y = Xβ + ε, let D_X be the diagonal matrix with diagonal entries D_X[i,i] = (X'X)_{i,i}^{-1/2}. When the design has been standardized, the VIFs are the diagonal entries of the inverse of S_X = D_X(X'X)D_X. That is, the VIFs are the ratios of the actual variances for the explanatory variables to the "ideal" variances had the columns of X been orthogonal. Note that we follow Stewart (1987) and do not necessarily center the explanatory variables.

For our linear model Y = Xβ + ε, view X = [X_[p], x_p] with x_p the pth column of X and X_[p] the matrix formed by the remaining columns. The variance inflation factor VIF_p measures the effect of adding column x_p to X_[p]. For notational convenience, we demonstrate VIF_p with the last column p. An ideal column would be orthogonal to the previous columns, with the entries in the off-diagonal elements of the pth row and pth column of X'X all zeros. Denote by M_p the idealized moment matrix

M_p = [ X_[p]'X_[p]  0_{p-1} ; 0_{p-1}'  x_p'x_p ].

The VIFs are the diagonal entries of S_X^{-1} = D_X^{-1}(X'X)^{-1}D_X^{-1}. It remains to note that the inverse S_X^{-1} can be computed using cofactors C_{i,j}. In particular,

(1)  VIF_p = [S_X^{-1}]_{p,p} = [D_X^{-1}(X'X)^{-1}D_X^{-1}]_{p,p} = (x_p'x_p)^{1/2} [det(C_{p,p})/det(X'X)] (x_p'x_p)^{1/2} = det(M_p)/det(X'X),

the ratio of the determinant of the idealized moment matrix M_p to the determinant of the moment matrix X'X. This definition extends naturally to subsets and is discussed in the next section.
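The two characterizations in Equation 1 can be checked numerically. A minimal sketch (numpy, with a small hypothetical design matrix invented for illustration) computes the VIFs both as the diagonal of S_X^{-1} and as the determinant ratios det(M_p)/det(X'X):

```python
import numpy as np

# a small hypothetical 6x3 design with correlated columns
X = np.array([[1., -2., 4.1],
              [1., -1., 1.2],
              [1.,  0., 0.1],
              [1.,  1., 0.9],
              [1.,  2., 3.8],
              [1.,  3., 9.2]])
M = X.T @ X                                    # moment matrix X'X

# VIFs as the diagonal of S_X^{-1} = D_X^{-1} (X'X)^{-1} D_X^{-1}
D = np.diag(1.0 / np.sqrt(np.diag(M)))
vifs_inverse = np.diag(np.linalg.inv(D @ M @ D))

def vif_det(M, p):
    """VIF_p = det(M_p)/det(X'X), with M_p the idealized moment matrix."""
    Mp = M.copy()
    Mp[p, :] = 0.0                             # zero row/column p off the diagonal
    Mp[:, p] = 0.0
    Mp[p, p] = M[p, p]
    return np.linalg.det(Mp) / np.linalg.det(M)

vifs_det = np.array([vif_det(M, p) for p in range(M.shape[0])])
assert np.allclose(vifs_inverse, vifs_det)
```

Zeroing row and column p off the diagonal is exactly the block-diagonal M_p of Equation 1 after a symmetric permutation, which leaves the determinant unchanged.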

For an alternate view of how collinearities in the explanatory variables inflate the model variances of the regression coefficients when compared to a fictitious orthogonal reference design, consider the formula for the model variance

Var_M(β̂_j) = [σ² / Σ_{i=1}^n (x_{ij} − x̄_j)²] · [1/(1 − R_j²)]

where R_j² is the square of the multiple correlation from the regression of the jth column of X = [x_{ij}] on the remaining columns, as in Liao and Valliant (2012). The first term σ²/Σ(x_{ij} − x̄_j)² is the model variance for β̂_j had the jth explanatory variable been orthogonal to the remaining variables. The second term 1/(1 − R_j²) is a standard definition of the jth VIF, as in Theil (1971).
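The equivalence of the 1/(1 − R_j²) definition with the inverse-correlation-matrix form can be illustrated on synthetic data (invented for this sketch); the code compares the diagonal of the inverse correlation matrix with 1/(1 − R_j²) from explicit regressions:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50
z1 = rng.normal(size=n)
z2 = z1 + 0.5 * rng.normal(size=n)    # strongly correlated with z1
z3 = rng.normal(size=n)
Z = np.column_stack([z1, z2, z3])

def vif_r2(Z, j):
    """VIF_j = 1/(1 - R_j^2), R_j^2 from regressing column j on the others."""
    y = Z[:, j]
    X = np.column_stack([np.ones(len(Z)), np.delete(Z, j, axis=1)])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    r2 = 1.0 - resid @ resid / ((y - y.mean()) @ (y - y.mean()))
    return 1.0 / (1.0 - r2)

vifs_reg = np.array([vif_r2(Z, j) for j in range(Z.shape[1])])
# equivalently, the diagonal of the inverse correlation matrix
vifs_corr = np.diag(np.linalg.inv(np.corrcoef(Z, rowvar=False)))
assert np.allclose(vifs_corr, vifs_reg)
```

This is the centered version of the VIF; as noted above, the paper itself follows Stewart (1987) and does not necessarily center.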

3. Generalized variance inflation factors

In this section, we introduce the GVIFs as an extension of the classical variance inflation factors VIF from Equation 1. For the linear model Y = Xβ + ε, view X = [X₁, X₂] partitioned with X₁ of dimension n×r, usually consisting of the lower order terms, and X₂ of dimension n×s, usually consisting of the higher order terms. The idealized moment matrix for the (r,s) partitioning of X is

M(r,s) = [ X₁'X₁  0_{r×s} ; 0_{s×r}  X₂'X₂ ].

Following Equation 1, to measure the effect of adding X₂ to the design X₁, that is for X₂|X₁, we define the generalized variance inflation factor as

(2)  GVIF(X₂|X₁) = det(M(r,s))/det(X'X) = det(X₁'X₁)det(X₂'X₂)/det(X'X)

as in Equation 10 of Fox and Monette (1992), who compared the sizes of the joint confidence regions for β for partitioned designs and noted, when X = [X_[p], x_p], that GVIF(x_p|X_[p]) = VIF_p. Equation 2 is in the spirit of the efficiency comparisons in linear inference introduced in Theorems 4 and 5 of Jensen and Ramirez (1993). A similar measure of collinearity is mentioned in Note 2 of Wichers (1975), Theorem 1 of Berk (1977), and Garcia, Garcia, and Soto (2011). For the simple linear regression model with p = 2, Equation 2 gives VIF = 1/(1 − ρ²) with ρ the correlation coefficient, as required. Fox and Monette (1992) suggested that X₁ contain the variables which are of "simultaneous interest," while X₂ contains additional variables selected by the investigator. We will set X₁ to the constant and main effects and X₂ to the (optional) quadratic terms with values from X₁.
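Equation 2 is straightforward to compute. A sketch (numpy, with a hypothetical one-factor quadratic design invented for illustration) evaluates GVIF(X₂|X₁) and confirms that, when X₂ is a single column, it reduces to VIF_p:

```python
import numpy as np

def gvif(X1, X2):
    """GVIF(X2|X1) = det(X1'X1) det(X2'X2) / det(X'X), Equation 2."""
    X = np.hstack([X1, X2])
    return (np.linalg.det(X1.T @ X1) * np.linalg.det(X2.T @ X2)
            / np.linalg.det(X.T @ X))

# hypothetical one-factor quadratic design: X1 = [1, x], X2 = [x^2]
x = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])
X1 = np.column_stack([np.ones_like(x), x])
X2 = (x ** 2).reshape(-1, 1)

g = gvif(X1, X2)
# with s = 1 this is exactly VIF_p for the added column
X = np.hstack([X1, X2])
vif_last = (X2.T @ X2)[0, 0] * np.linalg.inv(X.T @ X)[-1, -1]
assert np.isclose(g, vif_last)
```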

Willan and Watts (1978) measured the effect of collinearity using the ratio of the volume of the actual joint confidence region for β̂ to the volume of the joint confidence region in the fictitious orthogonal reference design. Their ratio is in the spirit of GVIF, as det(X'X) is inversely proportional to the square of the volume of the joint confidence region for β̂. They also introduced a measure of relative predictability, and they note: "The existence of near linear relations in the independent variables of the actual data reduces the overall predictive efficiency by this factor." For a simple case study, consider the simple linear regression model with n = 4, x₁ = [−2, −1, 1, 2], and y = [4, 1, 1, 4]. The 95% prediction interval for x₁ = 0 is 2.5 ± 10.20. If the model also includes x₂ = [−2.001, −1.001, 1.001, 2.001], then the 95% prediction interval for (x₁, x₂) = (0, 0) is 2.5 ± 46.02, demonstrating the loss of predictive efficiency due to the collinearity introduced by x₂.
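The first interval of this case study can be reproduced directly. The sketch below (numpy, with the t quantiles t(0.975, 2) = 4.3027 and t(0.975, 1) = 12.7062 hard-coded to avoid a scipy dependency) recovers the ±10.20 margin and shows that adding the nearly collinear x₂ widens the prediction interval:

```python
import numpy as np

x1 = np.array([-2.0, -1.0, 1.0, 2.0])
y  = np.array([ 4.0,  1.0, 1.0, 4.0])

def prediction_margin(X, y, x0, t_quantile):
    """Half-width of the prediction interval at x0 for the model y = Xb + e."""
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ b
    s2 = resid @ resid / (len(y) - X.shape[1])          # residual variance
    se = np.sqrt(s2 * (1.0 + x0 @ np.linalg.inv(X.T @ X) @ x0))
    return t_quantile * se

# simple linear regression; t(0.975, 2) = 4.3027
X = np.column_stack([np.ones(4), x1])
m1 = prediction_margin(X, y, np.array([1.0, 0.0]), 4.3027)

# adding the nearly collinear x2; t(0.975, 1) = 12.7062
x2 = np.array([-2.001, -1.001, 1.001, 2.001])
X12 = np.column_stack([np.ones(4), x1, x2])
m2 = prediction_margin(X12, y, np.array([1.0, 0.0, 0.0]), 12.7062)

assert abs(m1 - 10.20) < 0.02          # matches the quoted +-10.20
assert m2 > m1                         # collinearity widens the interval
```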

For the (r,s) partition of X = [X₁, X₂] with X₁ of dimension n×r and X₂ of dimension n×s, set

D(r,s) = [ (X₁'X₁)^{-1/2}  0 ; 0  (X₂'X₂)^{-1/2} ],

and denote the canonical moment matrix as

(3)  R = D(r,s)(X'X)D(r,s) = [ I_{r×r}  (X₁'X₁)^{-1/2}(X₁'X₂)(X₂'X₂)^{-1/2} ; (X₂'X₂)^{-1/2}(X₂'X₁)(X₁'X₁)^{-1/2}  I_{s×s} ];

with determinant

det(R) = det(X'X)/[det(X₁'X₁) det(X₂'X₂)] = 1/GVIF(X₂|X₁);

equivalently,

det(R) = det(I_{r×r} − B_{r×s}B_{s×r}) = det(I_{s×s} − B_{s×r}B_{r×s})

where B_{r×s} = (X₁'X₁)^{-1/2}(X₁'X₂)(X₂'X₂)^{-1/2}.

In the case {r = p − 1, s = 1}, X₂ = x_p is an n×1 vector and the partitioned design X = [X₁, x_p] has det(R) = 1 − [x_p'X₁(X₁'X₁)^{-1}X₁'x_p]/x_p'x_p. From standard facts for the inverse of a partitioned matrix, for example, Myers (1990, p. 459), VIF_p = [R^{-1}]_{p,p} = [D(p−1,1)^{-1}(X'X)^{-1}D(p−1,1)^{-1}]_{p,p} can be computed directly as

(x_p'x_p)^{1/2}[(X'X)^{-1}]_{p,p}(x_p'x_p)^{1/2} = x_p'x_p / [x_p'x_p − x_p'X₁(X₁'X₁)^{-1}X₁'x_p] = 1/(1 − [x_p'X₁(X₁'X₁)^{-1}X₁'x_p]/x_p'x_p) = 1/det(R) = GVIF(X₂|X₁).

Table 1. CCD with parameter a, canonical index γX2, and GVIF

We study the eigenvalue structure of M(r,s) in Appendix 1. Let {λ₁ ≥ λ₂ ≥ ⋯ ≥ λ_{min(r,s)} ≥ 0} be the non-negative singular values of (X₁'X₁)^{-1/2}(X₁'X₂)(X₂'X₂)^{-1/2}. It is shown in Appendix 1 that an alternative formulation for the GVIF is

(4)  GVIF(X₂|X₁) = ∏_{i=1}^{min(r,s)} (1 − λᵢ²)^{-1}.
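Equation 4 can be verified against the determinant form of Equation 2. The sketch below (numpy, with a random correlated design invented for illustration) builds B, takes its singular values, and compares the two formulas:

```python
import numpy as np

def inv_sqrt(A):
    """Symmetric inverse square root of a positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

rng = np.random.default_rng(1)
X1 = np.column_stack([np.ones(12), rng.normal(size=(12, 2))])   # r = 3
X2 = rng.normal(size=(12, 2)) + 0.5 * X1[:, 1:2]                # s = 2, correlated with X1
X = np.hstack([X1, X2])

# B = (X1'X1)^{-1/2} (X1'X2) (X2'X2)^{-1/2}; its singular values are the lambda_i
B = inv_sqrt(X1.T @ X1) @ (X1.T @ X2) @ inv_sqrt(X2.T @ X2)
lam = np.linalg.svd(B, compute_uv=False)

gvif = (np.linalg.det(X1.T @ X1) * np.linalg.det(X2.T @ X2)
        / np.linalg.det(X.T @ X))
assert np.isclose(gvif, np.prod(1.0 / (1.0 - lam ** 2)))   # Equation 4
```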

4. Quadratic model with p=3

For the partitioning X = [X_r|X_s], the canonical moment matrix, Equation 3, has the identity matrices I_r, I_s down the diagonal and off-diagonal array (X₁'X₁)^{-1/2}(X₁'X₂)(X₂'X₂)^{-1/2}. For the quadratic model y = β₀ + β₁x + β₂x² and partitioning X = [1, x | x²], we have

R = [ 1  0  ρ₁ ; 0  1  ρ₂ ; ρ₁  ρ₂  1 ].

From Equation 4, GVIF(x²|1, x) = (1 − λ²)^{-1}, where λ = √(ρ₁² + ρ₂²) is the unique positive singular value of [ρ₁, ρ₂]'. Denote

γ_{X₂} = ρ₁² + ρ₂²

as the canonical index, with GVIF(x²|1, x) = 1/(1 − γ_{X₂}) = 1/det(R). Surprisingly, many higher order designs also have an off-diagonal array of the canonical moment matrix with a unique positive singular value, so that GVIF(X₂|X₁) = 1/(1 − γ_{X₂}), with the collinearity between the lower order terms and the higher order terms a function of the canonical index γ_{X₂}.
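For the p = 3 quadratic model, the canonical index can be computed directly; a sketch (numpy, with hypothetical design points chosen for illustration) confirms GVIF(x²|1, x) = 1/(1 − γ_{X₂}):

```python
import numpy as np

def inv_sqrt(A):
    """Symmetric inverse square root of a positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

# hypothetical one-factor quadratic design X = [1, x | x^2]
x = np.array([-1.0, -0.5, 0.0, 0.5, 1.0, 1.5])
X1 = np.column_stack([np.ones_like(x), x])
X2 = (x ** 2).reshape(-1, 1)

# off-diagonal of the canonical moment matrix: the 2x1 array [rho1, rho2]'
B = inv_sqrt(X1.T @ X1) @ (X1.T @ X2) @ inv_sqrt(X2.T @ X2)
gamma = float((B ** 2).sum())          # canonical index rho1^2 + rho2^2

X = np.hstack([X1, X2])
gvif = (np.linalg.det(X1.T @ X1) * np.linalg.det(X2.T @ X2)
        / np.linalg.det(X.T @ X))
assert np.isclose(gvif, 1.0 / (1.0 - gamma))
```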

5. Central composite and factorial designs for quadratic models (p=6)

In this section, we compare the central composite design (CCD) X of Box and Wilson (1951) and the factorial design Z. The design points are shown in Table A1 of Appendix 2. Both designs are 9×6 and use the quadratic response model

y = β₀ + β₁x₁ + β₂x₂ + β₁₁x₁² + β₂₂x₂² + β₁₂x₁x₂ + ε.

The CCD traditionally uses the value a = √2 in four entries, while the factorial design uses the value a = 1. To study the difference between the designs with these different values, we computed the GVIF to compare the "orthogonality" between the lower order terms X₁ of dimension 9×3 and the higher order quadratic terms X₂ of dimension 9×3. The off-diagonal entry B_{3×3} of R from Equation 3 in Section 3 has the form

B_{3×3} = [ ρ₁  ρ₂  0 ; 0  0  0 ; 0  0  0 ]

with ρ₁ = ρ₂ = 2(2 + a²)/(3√(8 + 2a⁴)), canonical index γ_{X₂} = ρ₁² + ρ₂², and GVIF(X₂|X₁) = 1/(1 − γ_{X₂}), as in the quadratic model case with p = 3 shown in Section 4. For Table 1, if a = 1, then ρ₁ = ρ₂ = 2/√10, γ_{X₂} = 8/10, and GVIF(X₂|X₁) = 5. Surprisingly, the classical choice of a = √2 gives the largest value for GVIF(X₂|X₁), that is, the worst value, indicating the greatest collinearity between the lower and higher order terms, as noted in O'Driscoll and Ramirez (in press).
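The closed form for ρ₁ = ρ₂ and the GVIF values above can be verified by constructing the nine-run CCD explicitly. A sketch (numpy; the design points follow the standard two-factor CCD layout of four factorial, four axial, and one center run) checks a = 1 (GVIF = 5) and a = √2 (GVIF = 9):

```python
import numpy as np

def inv_sqrt(A):
    """Symmetric inverse square root of a positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

def ccd_gvif(a):
    """GVIF(X2|X1) and B for the nine-run two-factor CCD with axial value a."""
    pts = [(-1, -1), (-1, 1), (1, -1), (1, 1),
           (-a, 0), (a, 0), (0, -a), (0, a), (0, 0)]
    x1, x2 = np.array(pts, dtype=float).T
    X1 = np.column_stack([np.ones(9), x1, x2])
    X2 = np.column_stack([x1**2, x2**2, x1*x2])
    B = inv_sqrt(X1.T @ X1) @ (X1.T @ X2) @ inv_sqrt(X2.T @ X2)
    lam = np.linalg.svd(B, compute_uv=False)
    return np.prod(1.0 / (1.0 - lam**2)), B

gvif1, B1 = ccd_gvif(1.0)
rho = 2 * (2 + 1**2) / (3 * np.sqrt(8 + 2 * 1**4))   # closed form at a = 1: 2/sqrt(10)
assert np.isclose(B1[0, 0], rho)
assert np.isclose(gvif1, 5.0)          # factorial design, a = 1

gvif2, _ = ccd_gvif(np.sqrt(2.0))
assert np.isclose(gvif2, 9.0)          # a = sqrt(2): largest GVIF
```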

6. Larger designs (p=10)

We consider the quadratic response surface designs for

(5)  y = β₀ + β₁x₁ + β₂x₂ + β₃x₃ + β₁₁x₁² + β₂₂x₂² + β₃₃x₃² + β₁₂x₁x₂ + β₁₃x₁x₃ + β₂₃x₂x₃ + ε

with n responses and with X partitioned into [X₁|X₂], with X₁ the four lower order terms (r = 4) and X₂ the six quadratic terms (s = 6). Four popular designs are given in Appendix 2. They are the hybrid designs (H310 and H311B) of Roquemore (1976), Tables A2 and A3; the Box and Behnken (1960) design (BBD), Table A4; and the CCD of Box and Wilson (1951), Table A5.

For each design, we compute the 10×10 canonical moment matrix. It is striking that, for all these designs, the off-diagonal 4×6 array in R has only one non-zero singular value, with its square the canonical index γ_{X₂}. It follows that GVIF(X₂|X₁) = 1/(1 − γ_{X₂}).
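The rank-one structure can be checked for the CCD of Table A5, reconstructed here from its standard definition (2³ factorial points, six axial points at ±√3, and a center run; this reconstruction is an assumption, since the table itself is not reproduced above):

```python
import numpy as np
from itertools import product

def inv_sqrt(A):
    """Symmetric inverse square root of a positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

a = np.sqrt(3.0)
axial = []
for j in range(3):
    for s in (-a, a):
        p = [0.0, 0.0, 0.0]
        p[j] = s
        axial.append(p)
pts = [list(t) for t in product([-1.0, 1.0], repeat=3)] + axial + [[0.0, 0.0, 0.0]]
D = np.array(pts)                       # 15 runs, 3 factors
x1, x2, x3 = D.T

X1 = np.column_stack([np.ones(len(D)), x1, x2, x3])              # r = 4
X2 = np.column_stack([x1**2, x2**2, x3**2, x1*x2, x1*x3, x2*x3]) # s = 6
B = inv_sqrt(X1.T @ X1) @ (X1.T @ X2) @ inv_sqrt(X2.T @ X2)
lam = np.linalg.svd(B, compute_uv=False)

assert lam[1] < 1e-10                   # a single non-zero singular value
gamma = lam[0] ** 2                     # canonical index
gvif = np.prod(1.0 / (1.0 - lam ** 2))
assert np.isclose(gvif, 1.0 / (1.0 - gamma))
```

For this reconstructed CCD, the computation gives GVIF = 15.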

Table 2. Hybrid designs H310, H311B, Box and Behnken BBD, and CCD

Table 3. Singular values for off-diagonal array of R for BDD and SCD with GVIF

Table 2 reports that the design H310 is the best conditioned with respect to the GVIF, with the least amount of collinearity between the lower and higher order terms.

7. More complicated designs with ordered singular values

Let X be the minimal design of Box and Draper (1974) (BDD) with n = 11 from Table A6, and let Z be the small composite design of Hartley (1959) (SCD) with n = 11 from Table A7, for the quadratic response surface model (r = 4 and s = 6) as in Equation 5. Let α = {α₁ ≥ ⋯ ≥ α_r ≥ 0} and β = {β₁ ≥ ⋯ ≥ β_r ≥ 0} be the non-negative singular values of the off-diagonal arrays of R_X and R_Z, respectively. As αᵢ ≤ βᵢ (1 ≤ i ≤ r) in Table 3, it follows that GVIF(X₂|X₁) ≤ GVIF(Z₂|Z₁), showing less collinearity between the lower and higher order terms for the BDD design.

8. An improved H310 design

When the diagonal matrix Λ_{r×s} in Equation A1 in Appendix 1 has only one non-zero entry, we have denoted the square of this value the canonical index. We extend this definition to the case when (X₁'X₁)^{-1/2}(X₁'X₂)(X₂'X₂)^{-1/2} has multiple positive singular values. The Frobenius norm of a rectangular matrix A_{r×s} is defined by ||A||_F² = Σ_{i=1}^r Σ_{j=1}^s a_{ij}² = trace(A'A). For a design matrix X, we extend the definition of the canonical index with γ_{X₂} = ||Λ_{r×s}||_F². Equivalently, γ_{X₂} = trace((X₂'X₂)^{-1}(X₂'X₁)(X₁'X₁)^{-1}(X₁'X₂)), as in Equation A2.
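The Frobenius-norm and trace expressions for the canonical index agree, as a quick numerical sketch (numpy, with a random correlated design invented for illustration) confirms:

```python
import numpy as np

def inv_sqrt(A):
    """Symmetric inverse square root of a positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

rng = np.random.default_rng(2)
X1 = np.column_stack([np.ones(10), rng.normal(size=(10, 3))])   # r = 4
X2 = rng.normal(size=(10, 2)) + 0.4 * X1[:, 1:3]                # s = 2, correlated
B = inv_sqrt(X1.T @ X1) @ (X1.T @ X2) @ inv_sqrt(X2.T @ X2)
lam = np.linalg.svd(B, compute_uv=False)

gamma_frob = np.sum(B ** 2)   # ||Lambda||_F^2 = sum of lambda_i^2
gamma_trace = np.trace(np.linalg.inv(X2.T @ X2) @ (X2.T @ X1)
                       @ np.linalg.inv(X1.T @ X1) @ (X1.T @ X2))
assert np.isclose(gamma_frob, np.sum(lam ** 2))
assert np.isclose(gamma_frob, gamma_trace)
```

The Frobenius norm is invariant under the orthogonal factors P and Q, so ||B||_F² = ||Λ||_F², and the trace form follows from the cyclic property of the trace.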

We examine, in detail, the H310 design matrix X_{11×10}, Table A2 in Appendix 2, with our attention on the value of −0.1360 in row 2 for x₃. In succession, we will replace the values {1.1736, 0.6386, −0.9273, 1.0000, 1.2906, −0.1360} by a free parameter and use γ_{X₂} to determine an optimal value. For example, replacing the four entries which are 1.1736 with c₁, we calculate the minimum value of γ_{X₂} = 0.8199 at c₁ = 1.1768, denoted c_min in Table 4. These values are within the four digit accuracy of the data. We performed a similar calculation with c₂ for the four entries which are 0.6386; with c₃ for the four entries which are −0.9273; with c₄ for the eight entries which are 1; and with c₅ for the single entry 1.2906. The original design has γ_{X₂} = 0.8199. The entries in the H310 design are given to four significant digits. With this precision, the original design is nearly optimal with respect to the canonical index γ_{X₂} for the first five entries in Table 4. The sixth entry, c₆ = −0.1360, was not optimal: γ_{X₂} = 0.8181 at c_min = −0.01264, an order of magnitude smaller.

Table 4. Optimal values cmin for γX2

Denote the "improved" H310 design as the H310 design with the value c₆ = −0.01264. The "improved" H310 also has a unique positive singular value for the off-diagonal array of R, with its square the canonical index γ_{X₂}. All of the standard design criteria favor the "improved" H310 design over the H310 design, which was originally constructed from the rotatability criterion to maintain equal variances for predicted responses at points with the same distance from the design center. As usual, A(X) = tr((X'X)^{-1}), D(X) = det((X'X)^{-1}), and E(X) = max{eigenvalues of (X'X)^{-1}}. The small relative changes Δ in the design criteria are shown in Column 4 of Table 5.

Table 5. Design criteria for the “improved” H310 with Δ the relative change

The abnormality of the second row of H310 has been noted in Jensen (1998), who showed that the design is least sensitive to the second row of X, the row containing the value c₆ = −0.1360.

9. Conclusions

The VIF measures the penalty for adding a non-orthogonal variable to a linear regression. The VIF can be computed as a ratio of determinants as in Equation 1. A similar ratio criterion was studied by Fox and Monette (1992) to measure the effect of adding a subset of new variables to a design; they dubbed it the generalized variance inflation factor GVIF, Equation 2. We have noted the relationship between GVIF and the singular values of the off-diagonal array in the canonical moment matrix and have used GVIF to study standard quadratic response designs. The H310 design of Roquemore (1976) was shown not to be optimal with respect to GVIF, and an "improved" H310 design was introduced which was favored over H310 using the standard design criteria A, D, and E.

Additional information

Funding

The authors received no direct funding for this research.

Notes on contributors

Diarmuid O’Driscoll

Diarmuid O’Driscoll is the head of the Mathematics and Computer Studies Department at Mary Immaculate College, Limerick. He was awarded a Travelling Studentship for his MSc at University College Cork in 1977. He has taught at University College Cork, Cork Institute of Technology, University of Virginia, and Frostburg State University. His research interests are in mathematical education, errors in variables regression, and design criteria. In 2014, he was awarded a Teaching Heroes Award by the National Forum for the Enhancement of Teaching and Learning (Ireland).

Donald E. Ramirez

Donald E Ramirez is a full professor in the Department of Mathematics at the University of Virginia in Charlottesville, Virginia. He received his PhD in Mathematics from Tulane University in New Orleans, Louisiana. His research is in harmonic analysis and mathematical statistics. His current research interests are in statistical outliers and ridge regression.

References

  • Belsley, D. A. (1986). Centering, the constant, first-differencing, and assessing conditioning. In E. Kuh & D. A. Belsley (Eds.),Model reliability (pp. 117–153). Cambridge: MIT Press.
  • Berk, K. (1977). Tolerance and condition in regression computations. Journal of the American Statistical Association, 72, 863–866.
  • Box, G. E. P., & Behnken, D. W. (1960). Some new three-level designs for the study of quantitative variables. Technometrics, 2, 455–475.
  • Box, M. J., & Draper, N. R. (1974). On minimum-point second-order designs. Technometrics, 16, 613–616.
  • Box, G. E. P., & Wilson, K. B. (1951). On the experimental attainment of optimum conditions. Journal of the Royal Statistical Society, Series B, 13, 1–45.
  • Eaton, M. L. (1983). Multivariate statistics. New York, NY: Wiley.
  • Fox, J., & Monette, G. (1992). Generalized collinearity diagnostics. Journal of the American Statistical Association, 87, 178–183.
  • Garcia, C. B., Garcia, J., & Soto, J. (2011). The raise method: An alternative procedure to estimate the parameters in presence of collinearity. Quality and Quantity, 45, 403–423.
  • Hartley, H. O. (1959). Smallest composite design for quadratic response surfaces. Biometrics, 15, 611–624.
  • Jensen, D. R. (1998). Principal predictors and efficiency in small second-order designs. Biometrical Journal, 40, 183–203.
  • Jensen, D. R., & Ramirez, D. E. (1993). Efficiency comparisons in linear inference. Journal of Statistical Planning and Inference, 37, 51–68.
  • Liao, D., & Valliant, R. (2012). Variance inflation in the analysis of complex survey data. Survey Methodology, 38, 53–62.
  • Myers, R. (1990). Classical and modern regression with applications (2nd ed.). Boston, MA: PWS-Kent.
  • O’Driscoll, D., & Ramirez, D. E. (in press). Revisiting some design criteria (under review).
  • Roquemore, K. G. (1976). Hybrid designs for quadratic response surfaces. Technometrics, 18, 419–423.
  • Stewart, G. W. (1987). Collinearity and least squares regression. Statistical Science, 2, 68–84.
  • Theil, H. (1971). Principles of econometrics. New York, NY: Wiley.
  • Wichers, R. (1975). The detection of multicollinearity: A comment. Review of Economics and Statistics, 57, 366–368.
  • Willan, A. R., & Watts, D. G. (1978). Meaningful multicollinearity measures. Technometrics, 20, 407–412.

Appendix 1

We study the eigenvalue structure of M(r,s). Let {λ₁ ≥ λ₂ ≥ ⋯ ≥ λ_{min(r,s)} ≥ 0} be the non-negative singular values of (X₁'X₁)^{-1/2}(X₁'X₂)(X₂'X₂)^{-1/2}.

As with the canonical correlation coefficients, Eaton (1983), write the off-diagonal rectangular array B_{r×s} of R as PΛQ' with P and Q orthogonal matrices and Λ_{r×s} the rectangular diagonal matrix with the non-negative singular values down the diagonal. Set

L = [ P_{r×r}  0_{r×s} ; 0_{s×r}  Q_{s×s} ].

For notational convenience, we assume r ≤ s. The matrix L is orthogonal and transforms R into

(A1)  L'RL = [ I_r  Λ_{r×s} ; Λ_{s×r}  I_s ] = [ I_r  [SV_{r×r} | 0_{r×(s−r)}] ; [SV_{r×r} | 0_{r×(s−r)}]'  I_s ]

with Λ_{r×s} = [SV_{r×r} | 0_{r×(s−r)}], where SV_{r×r} is the diagonal matrix of the non-negative singular values. Since L is orthogonal, this transformation has not changed the eigenvalues. To compute the determinant of R, convert the matrix in Equation A1 into an upper triangular matrix by Gauss elimination on Λ_{s×r}. This changes r of the 1s on the diagonal, in rows r + 1 to 2r, into 1 − λᵢ², and thus det(R) = ∏_{i=1}^{min(r,s)}(1 − λᵢ²), with

GVIF(X₂|X₁) = ∏_{i=1}^{min(r,s)} 1/(1 − λᵢ²).

The singular values of R₁₂ = (X₁'X₁)^{-1/2}(X₁'X₂)(X₂'X₂)^{-1/2} are the non-negative square roots of the eigenvalues of Λ'Λ, denoted by

(A2)  eigvals(Λ'Λ) = eigvals((Q'R₁₂'P)(P'R₁₂Q)) = eigvals((X₁'X₁)^{-1}(X₁'X₂)(X₂'X₂)^{-1}(X₂'X₁)).

If the trace of the inverse of the matrix in Equation A1 is required, then we note that

[ I_r  Λ_{r×s} ; Λ_{s×r}  I_s ]^{-1} = [ (I_r − Λ_{r×s}Λ_{s×r})^{-1}   −Λ_{r×s}(I_s − Λ_{s×r}Λ_{r×s})^{-1} ; −Λ_{s×r}(I_r − Λ_{r×s}Λ_{s×r})^{-1}   (I_s − Λ_{s×r}Λ_{r×s})^{-1} ]

with trace given by tr((L'RL)^{-1}) = |r − s| + 2 Σ_{i=1}^{min(r,s)} 1/(1 − λᵢ²).
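The trace formula can be verified numerically; since L is orthogonal, tr((L'RL)^{-1}) = tr(R^{-1}), so the sketch below (numpy, random correlated design invented for illustration) builds R directly:

```python
import numpy as np

def inv_sqrt(A):
    """Symmetric inverse square root of a positive definite matrix."""
    w, V = np.linalg.eigh(A)
    return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

rng = np.random.default_rng(3)
X1 = rng.normal(size=(15, 3))                                  # r = 3
X2 = rng.normal(size=(15, 5)) + 0.6 * X1[:, [0, 1, 2, 0, 1]]   # s = 5, correlated
B = inv_sqrt(X1.T @ X1) @ (X1.T @ X2) @ inv_sqrt(X2.T @ X2)
lam = np.linalg.svd(B, compute_uv=False)

r, s = B.shape
R = np.block([[np.eye(r), B], [B.T, np.eye(s)]])   # canonical moment matrix
lhs = np.trace(np.linalg.inv(R))                   # tr(R^{-1}) = tr((L'RL)^{-1})
rhs = abs(r - s) + 2 * np.sum(1.0 / (1.0 - lam ** 2))
assert np.isclose(lhs, rhs)
```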

Appendix 2

Table A1. The lower order matrix for the CCD with center run with a = √2, n = 9, and the lower order matrix for the factorial design with center run, n = 9

Table A2. The lower order matrix for the hybrid (H310) design of Roquemore (1976) with center run, n = 11

Table A3. The lower order matrix for the hybrid (H311B) design of Roquemore (1976) with center run, n = 11

Table A4. The lower order matrix for the Box and Behnken (1960) design (BBD) with center run, n = 13

Table A5. The lower order matrix for the Box and Wilson (1951) CCD for α = 1.732 with center run, n = 15

Table A6. The lower order matrix for the Box and Draper (1974) minimal design (BDD) with center run, n = 11

Table A7. The lower order matrix for the small composite design of Hartley (1959) (SCD) for α = 1.732 with center run, n = 11