Full article: D-optimal saturated designs for main effects and interactions in 2k-factorial experiments

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

In a $2^{k}$ -factorial experiment with limited resources, when practitioners can identify the non-negligible effects and interactions beforehand, it is common to run an experiment with a saturated design that ensures the unbiased estimation of the non-negligible parameters of interest. We propose a method for the construction of D-optimal saturated designs for the mean, the main effects, and the second-order interactions of one factor with the remaining factors. In the process, we show the problem is just as hard as the Hadamard determinant problem.

Keywords:

1. Introduction

A saturated design (SD) in a two-level factorial experiment is a design with the minimum number of runs that ensures the unbiased estimation of the effects and interactions of interest given the remaining parameters are negligible. The number of runs n retained in an SD is equal to the total number of parameters of interest. Thus a saturated design matrix is a square non-singular matrix of order n with entries from ${- 1, 1}$ that is chosen to satisfy the conditions of the parameters of interest. Saturated designs are one of the most important designs in practice. They are desirable to practitioners mainly when the important effects and interactions to be estimated are known beforehand. In general, the statistical model retained for an SD is the regular linear model $Y = X β + ϵ$ , where Y is the response variable and ϵ is the usual error term. The matrix X is a saturated design matrix for the given vector parameter of interest β. Once X is chosen, the ordinary least square method (OLS) can be used to obtain the unbiased estimation of the parameters of interest. That is $\hat{β} = (X^{⊤} X)^{- 1} X^{⊤} Y = X^{- 1} Y$ . As a result of the estimator $\hat{β} = X^{- 1} Y$ , the determinant of the Fisher information $X^{⊤} X$ of an SD is maximal if the absolute value of the determinant of X is maximal. Such an SD is known as a D-optimal saturated design. Since the determinant of the Fisher information is inversely proportional to the volume of the confidence ellipsoid of the vector parameter β, the D-optimality criterion is one of the key criteria used to search for a design because it guarantees that the volume of the confidence ellipsoid of the vector parameter β is minimized. See (Wald, Citation1943) and Kiefer (Citation1959). However, it turns out that the construction of a D-optimal SD is not a trivial problem. There has been a vast literature as well as ongoing investigation about the construction of SD under certain conditions. Hedayat and Pesotan in Hedayat and Pesotan (Citation1992) and Hedayat and Pesotan (Citation2007) have discussed how to construct a saturated design that includes the estimation of the mean, the main effects, and a selected number of second-order interactions. Furthermore, various computer algorithms have been developed to search for SDs for two-level factorial experiments, some of which are SPAN, and DETMAX. As a case in point, see Hedayat and Zhu (Citation2011).

In this paper, we propose methods for the construction of D-optimal saturated design matrices for the estimation of the mean, the main effects, and the second-order interaction(s) of one factor with the remaining factors. Specifically, we consider a two-level factorial experiment with k factors $F_{1}, \dots, F_{k}$ and we develop algorithms for the construction of a saturated design matrix as well as a D-optimal saturated design matrix that includes the estimation of the main effects $F_{1}, \dots, F_{k}$ , the second-order interactions of factor $F_{1}$ with each of the remaining factors namely $F_{1} F_{2}, F_{1} F_{2}, \dots, F_{1} F_{k}$ , and the mean that we denote by $F_{0}$ . For simplicity, in the rest of the paper, $F_{1} F_{2}, F_{1} F_{2}, \dots, F_{1} F_{k}$ will be called $F_{1}$ -two-factor interactions and we write them as $F_{12}, \dots, F_{1 k}$ . We also define $D (k, 1)$ as the set of all saturated designs that ensure the unbiased estimation of the mean, k main effects, and the $F_{1}$ -interactions for a given k.

2. Construction of D-optimal saturated designs in $D (k, 1)$

2.1. Motivation

In Table , Vander Heyden et al. (Citation1999) used high-performance liquid chromatography (HPLC) to conduct an experiment that studied the assay of ridogrel and its related compounds in ridogrel oral film-coated tablet simulations. In the original experiment, multiple responses were of interest. One of the responses was the percentage recovery of the main compound. For scientific reasons, only eight factors were considered in the experiment that assessed the importance of the factors on the response which in this case is the percentage recovery of the main compound (MC). The eight factors retained were pH of the buffer (A), column manufacturer (B), column temperature (D), percent of organic solvent in the mobile phase at the start of the gradient (E), percent of organic solvent in the mobile phase at the end of the gradient (F), the flow of the mobile phase (H), the detection wavelength (I), and the concentration of the buffer(J). For this specific experiment factors C, G, and K in Table were not used in the experiment. It is worth pointing out that in Table , each factor has two levels coded as $+ 1$ and $- 1$ that are represented by + and − respectively. The 12-run Plackett-Burman design in Table was used to assess the importance of the eight factors on the response variable. Fitting a main effects model to the data yields (1) $\hat{y} = 101.04 + 0.34 A - 0.22 B - 0.36 D - 0.56 E + 0.44 F - 0.01 H + 0.26 I - 0.31 J .$ (1) This model has an $R^{2} = 0.78$ with $\hat{σ} = 1.045$ on 3 degrees of freedom. The most significant factors are E and F with p-values of 0.16 and 0.24 respectively. The experimenters decided the test was not significant and concluded there was no significant relationship between any of the factors and the response variable because, at the $10 %$ significance level, none of the effects is significant. Phoa et al. (Citation2009) reanalyzed the experiment in Table taking into account interactions, and found the following model (2) $\hat{y} = 101.04 - 0.56 E + 0.44 F - 0.30 H + 0.88 EF .$ (2) This model has an $R^{2} = 0.96$ which indicates a good fit. Furthermore, factor H is significant at the $5 %$ level (p-value = 0.012) and E, F, and EF are significant at the $1 %$ level. Here, the takeaway message is that in Plackett-Burman designs, the main effects are partially aliased with second-order interactions. Thus, since one or more second-order interactions are not negligible, some effects in the main-effect model in Equation (Equation1(1) $\hat{y} = 101.04 + 0.34 A - 0.22 B - 0.36 D - 0.56 E + 0.44 F - 0.01 H + 0.26 I - 0.31 J .$ (1) ) are biased. This misled the experimenters to draw the wrong conclusion that none of the effects is important. On the other hand, by taking into account second-order interactions, the experimenters were able to identify the important main effects and the second-order interaction given by the model in Equation (Equation2(2) $\hat{y} = 101.04 - 0.56 E + 0.44 F - 0.30 H + 0.88 EF .$ (2) ). In general, if for scientific reasons, the experimenter can identify the potential main effects and interactions, he may cut down the number of runs and conduct a saturated design for the experiment. That is the main purpose of the remainder of this paper.

Table 1. Experiment reported by Vander Heyden et al. (Citation1999).

Display Table

2.2. Preliminaries

In the rest of this paper, we consider a two-level factorial experiment with k factors $F_{1}, \dots, F_{k}$ . We investigate the class of saturated design matrices for a vector parameter β that includes the mean, the k main effects, and the second-order interactions of factor $F_{1}$ with the remaining factors $F_{2}, \dots, F_{k}$ . More precisely, for such a problem there are k main effects $F_{1}, \dots, F_{k}$ , the mean $F_{0}$ and k−1 second-order interactions $F_{12}, \dots, F_{1 k}$ . The total number of parameters to estimate is $2 k$ . A saturated design would therefore require $2 k$ runs. The corresponding linear model is on the form (3) $Y_{i} = β_{0} (F_{0})_{i} + β_{1} (F_{1})_{i} + \dots + β_{k} (F_{k})_{i} + β_{12} (F_{12})_{i} + \dots + β_{1 k} (F_{1 k})_{i} + ϵ_{i}$ (3) where $i \in {1, \dots, 2 k}$ , $ϵ_{i}$ , $Y_{i}$ and $(F_{. .})_{i}$ are respectively the ith error term, the response variable and the corresponding runs. $β = [β_{0}, β_{1}, \dots, β_{k}, β_{12}, \dots, β_{1 k}]^{⊤}$ is the vector parameter of interest.

To gain more intuition about the problem, we give an example of the particular case of $k = 3$ as follows. For k = 3 the number of parameters to estimate is 6, namely, $F_{0}, F_{1}, F_{2}, F_{3}, F_{12}, F_{13}$ . It follows that a saturated design would require 6 runs. Suppose we choose the candidate design with the runs ${(+ + +), (+ - -), (+ - +), (- - +), (- + +), (- + -)}$ . Then the candidate saturated design matrix would be a square matrix of order 6 that is obtained by converting the runs into the underlying design matrix. As illustrated below, the first matrix underlies the main effects plus mean $F_{1}, F_{2}, F_{3}$ and $F_{0}$ . The second matrix underlies the second order interactions $F_{12}$ and $F_{13}$ and is obtained by taking the Schür product of $F_{1}$ with $F_{2}$ and $F_{3}$ respectively. The third matrix is the candidate saturated design matrix obtained by combining the first and second matrices. It is worth pointing out that for convenience we set the factors in the order $F_{1}, F_{2}, F_{3}, F_{0}$ so that the first and last entries of each run correspond to $F_{1}$ and $F_{0}$ respectively. $\begin{aligned} \begin{array}{cccc} F_{1} & F_{2} & F_{3} & F_{0} \end{array} \\ [\begin{array}{cccc} + & + & + & + \\ + & - & - & + \\ + & - & + & + \\ - & - & + & + \\ - & + & + & + \\ - & + & - & + \end{array}] \Rightarrow \\ \begin{array}{cc} F_{12} & F_{13} \end{array} \\ [\begin{array}{cc} + & + \\ - & - \\ - & + \\ + & + \\ - & - \\ - & + \end{array}] \Rightarrow \\ \begin{array}{cccccc} F_{1} & F_{2} & F_{3} & F_{0} & F_{12} & F_{13} \end{array} \\ [\begin{array}{cccccc} + & + & + & + & + & + \\ + & - & - & + & - & - \\ + & - & + & + & - & + \\ - & - & + & + & + & - \\ - & + & + & + & - & - \\ - & + & - & + & - & + \end{array}] \end{aligned}$

It is important to observe that for the given candidate design matrix given above, $F_{1}$ is of the form $F_{1} = [\begin{matrix} 1_{3} \\ - 1_{3} \end{matrix}]$ , where the Schür product of $F_{1}$ by itself ( $F_{11}$ ) yields

$F_{11} = [\begin{matrix} 1_{3} * 1_{3} \\ - 1_{3} * (- 1_{3}) \end{matrix}] = [\begin{matrix} 1 \\ 1 \\ 1 \\ 1 \\ 1 \\ 1 \end{matrix}] = F_{0}$ $F_{12} = [\begin{matrix} 1_{3} \\ - 1_{3} \end{matrix}] * F_{2} = [\begin{matrix} 1_{3} \\ - 1_{3} \end{matrix}] * [\begin{matrix} - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \end{matrix}] = [\begin{matrix} - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \end{matrix}]$

and $F_{13} = [\begin{matrix} 1_{3} \\ - 1_{3} \end{matrix}] * F_{3} = [\begin{matrix} 1_{3} \\ - 1_{3} \end{matrix}] [\begin{matrix} - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \end{matrix}] = [\begin{matrix} - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \\ - 1 \end{matrix}]$ .

Furthermore, the Schür product of $F_{1}$ with $F_{2}$ and $F_{3}$ leaves the first 3 entries of $F_{2}$ and $F_{3}$ unchanged and negates the last 3 entries. It turns out from the above observations that the candidate saturated design can be written as: $\begin{aligned} \begin{array}{cccccc} F_{1} & F_{2} & F_{3} & F_{11} & F_{12} & F_{13} \end{array} \\ [\begin{array}{cccccc} + & + & + & + & + & + \\ + & - & - & + & - & - \\ + & - & + & + & - & + \\ - & - & + & + & + & - \\ - & + & + & + & - & - \\ - & + & - & + & - & + \end{array}] = [\begin{matrix} M & M \\ - N & N \end{matrix}] \end{aligned}$ where $M = [\begin{matrix} + & + & + \\ + & - & - \\ + & - & + \end{matrix}]$ and $N = [\begin{matrix} + & + & - \\ + & - & - \\ + & - & + \end{matrix}]$ .

Remark 2.1

A few remarks can be made as follows.

The mean $F_{0}$ can be written as the Schür product of $F_{1}$ by itself. This simple fact will be crucial in the theorems we develop in the upcoming section.
For any choice of candidate saturated design the corresponding candidate saturated design matrix is necessarily of the form $[\begin{matrix} M & M \\ - N & N \end{matrix}]$ as shown above. In the example $F_{1}$ has as many $+ 1$ entries as $- 1$ entries which means $F_{1}$ is balanced. Therefore M and N are square matrices of order k.
The candidate design matrix as displayed above will be a valid design matrix if it is a non-singular matrix. We shall see in the remainder of this paper that in general, a candidate design matrix is a valid design matrix if and only if the design is chosen so that $F_{1}$ is balanced and that M and N are non-singular matrices.

2.3. Construction of saturated and D-optimal saturated design matrices in $D (k, 1)$

In the remainder of this section, we explore the construction of a D-optimal design matrix for mean, main effects, and the $F_{1}$ -second-order interactions from a general perspective. We assume without loss of generality that the vector parameter of interest is of the form $β = [β_{1}, \dots, β_{k}, β_{0}, β_{12}, \dots, β_{1 k}]^{⊤}$ . For convenience, we make the following definitions.

Definition 2.1

We make the following definitions:

We define $D (k, 1)$ to be the set of all the saturated design matrices that ensure the unbiased estimation of the vector parameter of interest β. We purposely use the notation $D (k, 1)$ to indicate that the vector parameter of interest β includes the k main effects, the mean, and all the $F_{1}$ -second-order interactions.
We define $M_{k} {- 1, 1}$ as the set of non-singular matrices of order k with entries from ${- 1, 1}$ for which the first column is the vector $1_{k}$ .
We define $Θ_{k}$ to be the maximal value of the absolute value of the determinant of matrices in $M_{k} {- 1, + 1}$ .

The factor $F_{1}$ plays a key role in the construction of a saturated design for the vector parameter β as specified above because it is the only factor that interacts with all the remaining factors. Therefore, we define the factor $F_{1}$ as the pivot factor. Since the entries of $F_{1}$ takes values from ${- 1, 1}$ we assume without loss of generality that $F_{1}$ is of the form $F_{1} = {[\begin{matrix} 1_{f_{+}}^{⊤} & - 1_{f_{-}}^{⊤} \end{matrix}]}^{⊤}$ , where $f_{+}$ and $f_{-}$ are respectively the frequencies of 1 and $- 1$ entries in the vector $F_{1}$ with $f_{+} + f_{-} = 2 k$ . For convenience we write $F_{2}, \dots, F_{k}$ as block vectors $F_{2} = {[\begin{matrix} m_{2}^{⊤} & n_{2}^{⊤} \end{matrix}]}^{⊤},$ $\dots,$ $F_{k} = {[\begin{matrix} m_{k}^{⊤} & n_{k}^{⊤} \end{matrix}]}^{⊤}$ , where $m_{2}, \dots, m_{k}$ are vectors of length $f_{+}$ and $n_{2}, \dots, n_{k}$ are vectors of length $f_{-}$ with entries from ${- 1, 1}$ . We enumerate the following key observations.

The $F_{1}$ -second-order interactions $F_{12}, \dots, F_{1 k}$ are obtained by the Schür product of $F_{1}$ with $F_{2}, \dots, F_{k}$ as follows.
$F_{12} = {[\begin{matrix} (1_{f_{+}} * m_{2})^{⊤} & (- 1_{f_{-}} * n_{2})^{⊤} \end{matrix}]}^{⊤} = {[\begin{matrix} m_{2}^{⊤} & - n_{2}^{⊤} \end{matrix}]}^{⊤}$
$⋮ ⋮ ⋮$
$F_{1 k} = {[\begin{matrix} (1_{f_{+}} * m_{k})^{⊤} & (- 1_{f_{-}} * n_{k})^{⊤} \end{matrix}]}^{⊤} = {[\begin{matrix} m_{k}^{⊤} & - n_{k}^{⊤} \end{matrix}]}^{⊤}$ .
The mean $F_{0}$ which is a $1_{2 k}$ column vector can be written as
$F_{0} = {[\begin{matrix} 1_{f_{+}}^{⊤} & 1_{f_{-}}^{⊤} \end{matrix}]}^{⊤} = {[\begin{matrix} (1_{f_{+}} * 1_{f_{+}})^{⊤} & (- 1_{f_{-}} * (- 1_{f_{-}})^{⊤} \end{matrix}]}^{⊤} = F_{1} * F_{1}$ . That is the mean $F_{0}$ can be obtained by the Schür product of $F_{1}$ with itself.

By preserving the order in which the parameters in the vector

β = [β_{1}, \dots, β_{k}, β_{0}, β_{12}, \dots, β_{1 k}]^{⊤}

appear, each element of

D (k, 1)

can be written as

$[\begin{matrix} 1_{f_{+}} & m_{2} & \dots & m_{k} & 1_{f_{+}} & m_{2} & \dots & m_{k} \\ - 1_{f_{-}} & n_{2} & \dots & n_{k} & 1_{f_{-}} & - n_{2} & \dots & - n_{k} \end{matrix}] = [\begin{matrix} M & M \\ - N & N \end{matrix}]$ ,

where $M = [\begin{matrix} 1_{f_{+}} & m_{2} & \dots & m_{k} \end{matrix}]$ and $N = [\begin{matrix} 1_{f_{-}} & n_{2} & \dots & n_{k} \end{matrix}]$ with dimensions $f_{+} \times k$ and $f_{-} \times k$ respectively. Thus each element of $D (k, 1)$ is necessarily on the block matrix form $[\begin{matrix} M & M \\ - N & N \end{matrix}]$ . Now just because we have the block matrix form $[\begin{matrix} M & M \\ - N & N \end{matrix}]$ doesn't mean that we have obtained an element of $D (k, 1)$ . The question one may ask is “ What are the necessary and sufficient conditions on the matrix $[\begin{matrix} M & M \\ - N & N \end{matrix}]$ to be an element of $D (k, 1)$ ? ”.

Our goal in what follows is to provide necessary and sufficient conditions to construct an element of $D (k, 1)$ . In the theorem below we provide the necessary and sufficient conditions to construct an element of $D (k, 1)$ .

Theorem 2.1

A square matrix D of order $2 k$ is a design matrix in $D (k, 1)$ if and only if it is in the form $D = [\begin{matrix} M & M \\ - N & N \end{matrix}]$ where M and N are elements of $M_{k} {- 1, 1}$ .

Proof.

We have seen that any element of $D (k, 1)$ is necessarily on the form $F = [\begin{matrix} M & M \\ - N & N \end{matrix}]$ , where M and N are ${- 1, 1}$ -matrices of dimensions $f_{+} \times k$ and $f_{-} \times k$ respectively. We will first show that if $f_{+} \neq f_{-}$ then the matrix F is a singular matrix. In that case, F is not an element of $D (k, 1)$ . We then show that M and N have to be both non-singular matrices of order k for F to be an element of $D (k, 1)$ .

Assume without loss of generality that $f_{+} > k$ . Then, since M is of dimensions $f_{+} \times k$ we have $rank (M)$ is at most k. Therefore, the rows of M that we define as $m_{1}^{⊤}, \dots, m_{f_{+}}^{⊤}$ are linearly dependent. We may assume without loss of generality that $m_{1}^{⊤}$ is linearly dependent on $m_{2}^{⊤}, \dots, m_{f_{+}}^{⊤}$ , so that $m_{1} = \sum_{i = 2}^{f_{+}} c_{i} m_{i}$ with some $c_{i} \neq 0$ , $2 \leq i \leq f_{+}$ . This implies that $[\begin{matrix} m_{1} \\ m_{1} \end{matrix}] = \sum_{i = 2}^{f_{+}} c_{i} [\begin{matrix} m_{i} \\ m_{i} \end{matrix}]$ . It means that the rows $[\begin{matrix} m_{1}^{⊤} & m_{1}^{⊤} \end{matrix}], \dots, [\begin{matrix} m_{f_{+}}^{⊤} & m_{f_{+}}^{⊤} \end{matrix}]$ of F are linearly dependent, which would make F a singular matrix. In a similar manner, one can show that if $f_{-} > k$ then F is a singular matrix. Thus, it turns out that $f_{-} = f_{+} = k$ is a necessary condition for F to be non-singular. It follows that any element F of $D (k, 1)$ is on the form $F = [\begin{matrix} M & M \\ - N & N \end{matrix}]$ , where M and N are ${- 1, 1}$ -matrices of order k.
Now, if the matrix M is singular the rows of F would be linearly dependent and F would be a singular matrix by the analogy of the argument above. By the same argument, if N is singular, F would be a singular matrix.
Now suppose both M and N are non-singular matrices, that is M and N are elements of $M_{k} {- 1, 1}$ . Then, $\det (F) = \det [\begin{matrix} M & M \\ - N & N \end{matrix}] = \det (N) \det (M + M N^{- 1} N) = 2^{k} \det (N) \det (M) \neq 0$ . It follows that F is an element of $D (k, 1)$ if and only if $F = [\begin{matrix} M & M \\ - N & N \end{matrix}]$ , where M and N are elements of $M_{k} {- 1, 1}$ .

Corollary 2.1

A design matrix $D^{*}$ is a D-optimal saturated design in $D (k, 1)$ if and only if it can be written as $D^{*} = [\begin{matrix} M^{*} & M^{*} \\ - N^{*} & N^{*} \end{matrix}]$ where $M^{*}$ and $N^{*}$ are elements of $M_{k} {- 1, 1}$ with maximal absolute value determinant. Furthermore $| \det (D^{*}) | = 2^{k} Θ_{k}^{2}$ .

Proof.

By Theorem 2.1 for any element D of $D (k, 1)$ , $\det (D) = 2^{k} \det (N) \det (M)$ for some N and M elements $M_{k} {- 1, 1}$ . This determinant is maximal in absolute value when both M and N have maximal absolute value determinants in $M_{k} {- 1, 1}$ .

2.4. Algorithm for the construction of an element of $D (k, 1)$

We use Theorem 2.1 and Corollary 2.1 to develop an algorithm for the construction of a saturated and a D-optimal saturated design matrix of $D (k, 1)$ .

Step 1: Select two matrices M and N from $M_{k} {- 1, 1}$ (For a D-optimal design select the matrices M and N with maximal absolute value of determinant).
Step 2: The design matrices $D_{1} = [\begin{matrix} M & M \\ - M & M \end{matrix}]$ and $D_{2} = [\begin{matrix} M & M \\ - N & N \end{matrix}]$ obtained through the above steps are saturated design matrices for the estimation of the mean $F_{0}$ , the k main effects $F_{1}, \dots, F_{k}$ and the interactions $F_{12}, \dots, F_{1 k}$ . $D_{1}$ is a D-optimal design matrix in $D (k, 1)$ if $| \det (M) |$ is maximal in $M_{k} {- 1, 1}$ . $D_{2}$ is a D-optimal design matrix in $D (k, 1)$ if both $| \det (M) |$ and $| \det (N) |$ have maximal determinant in $M_{k} {- 1, 1}$ .

In the appendix we give two examples of D-optimal design matrices in

D (15, 1)

and

D (16, 1)

3. Concluding remarks

The construction of saturated design matrices for two-level factorial experiments has gained a lot of interest over a long period of time by both mathematicians and statisticians. In general, mathematicians are interested in finding a matrix with maximal determinant in $M_{k} {- 1, 1}$ , as well as investigating the spectrum of the determinant function which is the set of the value(s) taken by the $| \det (D_{k}) | / 2^{k - 1}$ for $D_{k}$ element of $M_{k} {- 1, 1}$ . Thus, numerous papers have been written about the classification of saturated design matrices of fixed order via the spectrum of the determinant function. The spectra of the determinant function $S_{k}$ for ${- 1, + 1}$ -matrices of order k are well known in the literature for orders up to 11. The spectrum of order k = 8 is due to Metropolis (Citation1969). For k = 9 and k = 10, the spectra were computed by Živković (Citation2006) and the spectrum for k = 11 is due to Orrick (Citation2005). Furthermore, many other papers have investigated D-optimal saturated design matrices for a fixed order. Orrick (Citation2005) constructed a D-optimal design matrix of order 15. Chadjipantelis et al. (Citation1987) came up with a D-optimal design of order 21. The D-optimal design matrix discussed by these papers is a matrix with the maximal absolute value of the determinant in $M_{k} {- 1, 1}$ . Statisticians on the other hand are not only interested in the global D-optimal design matrices in $M_{k} {- 1, 1}$ but also they are interested in the local D-optimal design matrices that satisfy certain restrictions on the columns of matrices in $M_{k} {- 1, 1}$ . In fact, more than often it is desirable for design statisticians to find a D-optimal design matrix to estimate the mean, the main effects, and a selected number of two-factor interactions. The restriction imposed by the interactions on the columns of saturated design matrices makes it impossible to construct a saturated design matrix that achieves the maximal determinant in $M_{k} {- 1, 1}$ under certain conditions. The work we did in the current paper is a good illustration. We showed that the construction of saturated D-optimal design matrices in $D (k, 1)$ is equivalent to finding matrices with maximal determinant in $M_{k} {- 1, 1}$ . Thus, this problem is just as hard as the Hadamard determinant problem discussed in the introduction.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work is partially supported by the US National Science Foundation (NSF) [grant number 1809681].

References

Chadjipantelis, T., Kounias, S., & Moyssiadis, C. (1987). The maximum determinant of 21×21 (+1, -1)-matrices and D-optimal designs. Journal of Statistical Planning and Inference, 16(2), 167–178. https://doi.org/10.1016/0378-3758(87)90066-8
Web of Science ®Google Scholar
Cohn, J. H. E. (1989). On determinants with elements ±1, II. Bulletin of the London Mathematical Society, 21(1), 36–42. https://doi.org/10.1112/blms/21.1.36
Google Scholar
Cohn., J. H. E. (2000). Almost D-optimal designs. Utilitas Mathematica, 57, 121–128.
Web of Science ®Google Scholar
Hedayat, A. S., & Pesotan, H. (1992). Two-level factorial designs for main-effects and selected two-factor interactions. Statistica Sinica, 2, 453–464.
Web of Science ®Google Scholar
Hedayat, A. S., & Pesotan, H. (2007). Tools for constructing optimal two-level factorial designs for a linear model containing main effects and one two-factor interaction. Journal of Statistical Planning and Inference, 137(4), 1452–1463. https://doi.org/10.1016/j.jspi.2006.04.005
Web of Science ®Google Scholar
Hedayat, A. S., & Zhu, H. (2011). An effective algorithm for searching for D-optimal saturated two-level factorial designs. Journal of Statistical Theory and Applications, 10(2), 209–227.
Google Scholar
Kiefer, J. (1959). Optimum experimental designs. Journal of the Royal Statistical Society, B21, 272–319.
Google Scholar
Metropolis, N. (1969). Spectra of determinant values in (0,1) matrices. In A. O. L. Atkin & B. J. Birch (Eds.), Computers in Number Theory: Proceedings of the Science Research Atlas Symposium No. 2 held at Oxford, 18–23 August 1969 (pp. 271–276). Academic Press.
Google Scholar
Orrick, W. P. (2005). The maximal {−1,1}-determinant of order 15. Metrika, 62(2-3), 195–219. https://doi.org/10.1007/s00184-005-0410-3
Web of Science ®Google Scholar
Phoa, F. K. H., Wong, W. K., & Xu, H. (2009). The need of considering the interactions in the analysis of screening designs. Journal of Chemometrics, 23(10), 545–553. https://doi.org/10.1002/cem.v23:10
Web of Science ®Google Scholar
Smith, Warren D. (1988). Studies in computational geometry motivated by mesh generation [Unpublished doctoral dissertation]. Princeton University.
Google Scholar
Vander Heyden, Y., Jimidar, M., Hund, E., Niemeijer, N., Peeters, R., Smeyers-Verbeke, J., Massart, D. L., & Hoogmartens, J. (1999). Determination of system suitability limits with a robustness test. Journal of Chromatography A, 845(1-2), 145–154. https://doi.org/10.1016/S0021-9673(99)00328-3
Web of Science ®Google Scholar
Wald, A. (1943). On the efficient design of statistical investigations. The Annals of Mathematical Statistics, 14(2), 134–140. https://doi.org/10.1214/aoms/1177731454
Google Scholar
Živković, M. (2006). Classification of small (0,1) matrices. Linear Algebra and Its Applications, 414(1), 310–346. https://doi.org/10.1016/j.laa.2005.10.010
Web of Science ®Google Scholar

Appendix

Figure A1. Maximal determinant matrix in

M_{15} {- 1, 1}

. A result of the work of Smith (Citation1988), Cohn (Citation1989), Cohn. (Citation2000) and Orrick (Citation2005).

Figure A1. Maximal determinant matrix in M15{−1,1}. A result of the work of Smith (Citation1988), Cohn (Citation1989), Cohn. (Citation2000) and Orrick (Citation2005).

Figure A2. Normalized maximal determinant matrix in $M_{15} {- 1, 1}$ obtained from $M_{15}^{*}$ .

Figure A3. The opposite matrix of $M_{15}^{* +}$ .

Figure A4. Hadamard matrix of order 16.

Figure A5. Saturated D-optimal design matrix for

$F_{1}, F_{2}, F_{3}, F_{4}, F_{5}, F_{6}, F_{7}, F_{8}, F_{9}, F_{10}, F_{11}, F_{12}, F_{13}, F_{14}, F_{15}, F_{0}, F_{1, 2}, F_{1, 3}, F_{1, 4}, F_{1, 5}, F_{1, 6}, F_{1, 7}, F_{1, 8}, F_{1, 9}, F_{1, 10}, F_{1, 11}, F_{1, 12}, F_{1, 13}, F_{1, 14}, and F_{1, 15}$ .

Figure A6. Saturated D-optimal design matrix for

$F_{1}, F_{2}, F_{3}, F_{4}, F_{5}, F_{6}, F_{7}, F_{8}, F_{9}, F_{10}, F_{11}, F_{12}, F_{13}, F_{14}, F_{15}, F_{16} F_{0}, F_{1, 2}, F_{1, 3}, F_{1, 4}, F_{1, 5}, F_{1, 6}, F_{1, 7}, F_{1, 8}, F_{1, 9}, F_{1, 10}, F_{1, 11}, F_{1, 12}, F_{1, 13}, F_{1, 14}, F_{1, 15}, and F_{1, 16}$ .

D-optimal saturated designs for main effects and interactions in 2^k-factorial experiments

Abstract

1. Introduction