78
Views
0
CrossRef citations to date
0
Altmetric
Research Article

D-optimal saturated designs for main effects and interactions in 2k-factorial experiments

, &
Received 24 Nov 2023, Accepted 05 Apr 2024, Published online: 18 Apr 2024

Abstract

In a 2k-factorial experiment with limited resources, when practitioners can identify the non-negligible effects and interactions beforehand, it is common to run an experiment with a saturated design that ensures the unbiased estimation of the non-negligible parameters of interest. We propose a method for the construction of D-optimal saturated designs for the mean, the main effects, and the second-order interactions of one factor with the remaining factors. In the process, we show the problem is just as hard as the Hadamard determinant problem.

1. Introduction

A saturated design (SD) in a two-level factorial experiment is a design with the minimum number of runs that ensures the unbiased estimation of the effects and interactions of interest given the remaining parameters are negligible. The number of runs n retained in an SD is equal to the total number of parameters of interest. Thus a saturated design matrix is a square non-singular matrix of order n with entries from {1,1} that is chosen to satisfy the conditions of the parameters of interest. Saturated designs are one of the most important designs in practice. They are desirable to practitioners mainly when the important effects and interactions to be estimated are known beforehand. In general, the statistical model retained for an SD is the regular linear model Y=Xβ+ϵ, where Y is the response variable and ϵ is the usual error term. The matrix X is a saturated design matrix for the given vector parameter of interest β. Once X is chosen, the ordinary least square method (OLS) can be used to obtain the unbiased estimation of the parameters of interest. That is βˆ=(XX)1XY=X1Y. As a result of the estimator βˆ=X1Y, the determinant of the Fisher information XX of an SD is maximal if the absolute value of the determinant of X is maximal. Such an SD is known as a D-optimal saturated design. Since the determinant of the Fisher information is inversely proportional to the volume of the confidence ellipsoid of the vector parameter β, the D-optimality criterion is one of the key criteria used to search for a design because it guarantees that the volume of the confidence ellipsoid of the vector parameter β is minimized. See (Wald, Citation1943) and Kiefer (Citation1959). However, it turns out that the construction of a D-optimal SD is not a trivial problem. There has been a vast literature as well as ongoing investigation about the construction of SD under certain conditions. Hedayat and Pesotan in Hedayat and Pesotan (Citation1992) and Hedayat and Pesotan (Citation2007) have discussed how to construct a saturated design that includes the estimation of the mean, the main effects, and a selected number of second-order interactions. Furthermore, various computer algorithms have been developed to search for SDs for two-level factorial experiments, some of which are SPAN, and DETMAX. As a case in point, see Hedayat and Zhu (Citation2011).

In this paper, we propose methods for the construction of D-optimal saturated design matrices for the estimation of the mean, the main effects, and the second-order interaction(s) of one factor with the remaining factors. Specifically, we consider a two-level factorial experiment with k factors F1,,Fk and we develop algorithms for the construction of a saturated design matrix as well as a D-optimal saturated design matrix that includes the estimation of the main effects F1,,Fk, the second-order interactions of factor F1 with each of the remaining factors namely F1F2,F1F2,,F1Fk, and the mean that we denote by F0. For simplicity, in the rest of the paper, F1F2,F1F2,,F1Fk will be called F1-two-factor interactions and we write them as F12,,F1k. We also define D(k,1) as the set of all saturated designs that ensure the unbiased estimation of the mean, k main effects, and the F1-interactions for a given k.

2. Construction of D-optimal saturated designs in D(k,1)

2.1. Motivation

In Table , Vander Heyden et al. (Citation1999) used high-performance liquid chromatography (HPLC) to conduct an experiment that studied the assay of ridogrel and its related compounds in ridogrel oral film-coated tablet simulations. In the original experiment, multiple responses were of interest. One of the responses was the percentage recovery of the main compound. For scientific reasons, only eight factors were considered in the experiment that assessed the importance of the factors on the response which in this case is the percentage recovery of the main compound (MC). The eight factors retained were pH of the buffer (A), column manufacturer (B), column temperature (D), percent of organic solvent in the mobile phase at the start of the gradient (E), percent of organic solvent in the mobile phase at the end of the gradient (F), the flow of the mobile phase (H), the detection wavelength (I), and the concentration of the buffer(J). For this specific experiment factors C, G, and K in Table were not used in the experiment. It is worth pointing out that in Table , each factor has two levels coded as +1 and 1 that are represented by + and − respectively. The 12-run Plackett-Burman design in Table  was used to assess the importance of the eight factors on the response variable. Fitting a main effects model to the data yields (1) yˆ=101.04+0.34A0.22B0.36D0.56E+0.44F0.01H+0.26I0.31J.(1) This model has an R2=0.78 with σˆ=1.045 on 3 degrees of freedom. The most significant factors are E and F with p-values of 0.16 and 0.24 respectively. The experimenters decided the test was not significant and concluded there was no significant relationship between any of the factors and the response variable because, at the 10% significance level, none of the effects is significant. Phoa et al. (Citation2009) reanalyzed the experiment in Table  taking into account interactions, and found the following model (2) yˆ=101.040.56E+0.44F0.30H+0.88EF.(2) This model has an R2=0.96 which indicates a good fit. Furthermore, factor H is significant at the 5% level (p-value = 0.012) and E, F, and EF are significant at the 1% level. Here, the takeaway message is that in Plackett-Burman designs, the main effects are partially aliased with second-order interactions. Thus, since one or more second-order interactions are not negligible, some effects in the main-effect model in Equation (Equation1) are biased. This misled the experimenters to draw the wrong conclusion that none of the effects is important. On the other hand, by taking into account second-order interactions, the experimenters were able to identify the important main effects and the second-order interaction given by the model in Equation (Equation2). In general, if for scientific reasons, the experimenter can identify the potential main effects and interactions, he may cut down the number of runs and conduct a saturated design for the experiment. That is the main purpose of the remainder of this paper.

Table 1. Experiment reported by Vander Heyden et al. (Citation1999).

2.2. Preliminaries

In the rest of this paper, we consider a two-level factorial experiment with k factors F1,,Fk. We investigate the class of saturated design matrices for a vector parameter β that includes the mean, the k main effects, and the second-order interactions of factor F1 with the remaining factors F2,,Fk. More precisely, for such a problem there are k main effects F1,,Fk, the mean F0 and k−1 second-order interactions F12,,F1k. The total number of parameters to estimate is 2k. A saturated design would therefore require 2k runs. The corresponding linear model is on the form (3) Yi=β0(F0)i+β1(F1)i++βk(Fk)i+β12(F12)i++β1k(F1k)i+ϵi(3) where i{1,,2k}, ϵi, Yi and (F..)i are respectively the ith error term, the response variable and the corresponding runs. β=[β0,β1,,βk,β12,,β1k] is the vector parameter of interest.

To gain more intuition about the problem, we give an example of the particular case of k=3 as follows. For k = 3 the number of parameters to estimate is 6, namely, F0,F1,F2,F3,F12,F13. It follows that a saturated design would require 6 runs. Suppose we choose the candidate design with the runs {(+++),(+),(++),(+),(++),(+)}. Then the candidate saturated design matrix would be a square matrix of order 6 that is obtained by converting the runs into the underlying design matrix. As illustrated below, the first matrix underlies the main effects plus mean F1,F2,F3 and F0. The second matrix underlies the second order interactions F12 and F13 and is obtained by taking the Schür product of F1 with F2 and F3 respectively. The third matrix is the candidate saturated design matrix obtained by combining the first and second matrices. It is worth pointing out that for convenience we set the factors in the order F1,F2,F3,F0 so that the first and last entries of each run correspond to F1 and F0 respectively. F1F2F3F0[++++++++++++++++]F12F13[++++++]F1F2F3F0F12F13[+++++++++++++++++++++]

It is important to observe that for the given candidate design matrix given above, F1 is of the form F1=[1313], where the Schür product of F1 by itself (F11) yields

F11=[131313(13)]=[111111]=F0  F12=[1313]F2=[1313][111111]=[111111]

and F13=[1313]F3=[1313][111111]=[111111].

Furthermore, the Schür product of F1 with F2 and F3 leaves the first 3 entries of F2 and F3 unchanged and negates the last 3 entries. It turns out from the above observations that the candidate saturated design can be written as: F1F2F3F11F12F13[+++++++++++++++++++++]=[MMNN] where M=[++++++] and N=[+++++].

Remark 2.1

A few remarks can be made as follows.

  1. The mean F0 can be written as the Schür product of F1 by itself. This simple fact will be crucial in the theorems we develop in the upcoming section.

  2. For any choice of candidate saturated design the corresponding candidate saturated design matrix is necessarily of the form [MMNN] as shown above. In the example F1 has as many +1 entries as 1 entries which means F1 is balanced. Therefore M and N are square matrices of order k.

  3. The candidate design matrix as displayed above will be a valid design matrix if it is a non-singular matrix. We shall see in the remainder of this paper that in general, a candidate design matrix is a valid design matrix if and only if the design is chosen so that F1 is balanced and that M and N are non-singular matrices.

2.3. Construction of saturated and D-optimal saturated design matrices in D(k,1)

In the remainder of this section, we explore the construction of a D-optimal design matrix for mean, main effects, and the F1-second-order interactions from a general perspective. We assume without loss of generality that the vector parameter of interest is of the form β=[β1,,βk,β0,β12,,β1k]. For convenience, we make the following definitions.

Definition 2.1

We make the following definitions:

  1. We define D(k,1) to be the set of all the saturated design matrices that ensure the unbiased estimation of the vector parameter of interest β. We purposely use the notation D(k,1) to indicate that the vector parameter of interest β includes the k main effects, the mean, and all the F1-second-order interactions.

  2. We define Mk{1,1} as the set of non-singular matrices of order k with entries from {1,1} for which the first column is the vector 1k.

  3. We define Θk to be the maximal value of the absolute value of the determinant of matrices in Mk{1,+1}.

The factor F1 plays a key role in the construction of a saturated design for the vector parameter β as specified above because it is the only factor that interacts with all the remaining factors. Therefore, we define the factor F1 as the pivot factor. Since the entries of F1 takes values from {1,1} we assume without loss of generality that F1 is of the form F1=[1f+1f], where f+ and f are respectively the frequencies of 1 and 1 entries in the vector F1 with f++f=2k. For convenience we write F2,,Fk as block vectors F2=[m2n2], , Fk=[mknk], where m2,,mk are vectors of length f+ and n2,,nk are vectors of length f with entries from {1,1}. We enumerate the following key observations.

  1. The F1-second-order interactions F12,,F1k are obtained by the Schür product of F1 with F2,,Fk as follows.

    F12=[(1f+m2)(1fn2)]=[m2n2]

    F1k=[(1f+mk)(1fnk)]=[mknk].

  2. The mean F0 which is a 12k column vector can be written as

    F0=[1f+1f]=[(1f+1f+)(1f(1f)]=F1F1. That is the mean F0 can be obtained by the Schür product of F1 with itself.

By preserving the order in which the parameters in the vector β=[β1,,βk,β0,β12,,β1k] appear, each element of D(k,1) can be written as

[1f+m2mk1f+m2mk1fn2nk1fn2nk]=[MMNN],

where M=[1f+m2mk] and N=[1fn2nk] with dimensions f+×k and f×k respectively. Thus each element of D(k,1) is necessarily on the block matrix form [MMNN]. Now just because we have the block matrix form [MMNN] doesn't mean that we have obtained an element of D(k,1). The question one may ask is “ What are the necessary and sufficient conditions on the matrix [MMNN] to be an element of D(k,1) ? .

Our goal in what follows is to provide necessary and sufficient conditions to construct an element of D(k,1). In the theorem below we provide the necessary and sufficient conditions to construct an element of D(k,1).

Theorem 2.1

A square matrix D of order 2k is a design matrix in D(k,1) if and only if it is in the form D=[MMNN] where M and N are elements of Mk{1,1}.

Proof.

We have seen that any element of D(k,1) is necessarily on the form F=[MMNN], where M and N are {1,1}-matrices of dimensions f+×k and f×k respectively. We will first show that if f+f then the matrix F is a singular matrix. In that case, F is not an element of D(k,1). We then show that M and N have to be both non-singular matrices of order k for F to be an element of D(k,1).

  1. Assume without loss of generality that f+>k. Then, since M is of dimensions f+×k we have rank(M) is at most k. Therefore, the rows of M that we define as m1,,mf+ are linearly dependent. We may assume without loss of generality that m1 is linearly dependent on m2,,mf+, so that m1=i=2f+cimi with some ci0, 2if+. This implies that [m1m1]=i=2f+ci[mimi]. It means that the rows [m1m1],,[mf+mf+] of F are linearly dependent, which would make F a singular matrix. In a similar manner, one can show that if f>k then F is a singular matrix. Thus, it turns out that f=f+=k is a necessary condition for F to be non-singular. It follows that any element F of D(k,1) is on the form F=[MMNN], where M and N are {1,1}-matrices of order k.

    Now, if the matrix M is singular the rows of F would be linearly dependent and F would be a singular matrix by the analogy of the argument above. By the same argument, if N is singular, F would be a singular matrix.

  2. Now suppose both M and N are non-singular matrices, that is M and N are elements of Mk{1,1}. Then, det(F)=det[MMNN]=det(N)det(M+MN1N)=2kdet(N)det(M)0. It follows that F is an element of D(k,1) if and only if F=[MMNN], where M and N are elements of Mk{1,1}.

Corollary 2.1

A design matrix D is a D-optimal saturated design in D(k,1) if and only if it can be written as D=[MMNN] where M and N are elements of Mk{1,1} with maximal absolute value determinant. Furthermore |det(D)|=2kΘk2.

Proof.

By Theorem 2.1 for any element D of D(k,1), det(D)=2kdet(N)det(M) for some N and M elements Mk{1,1}. This determinant is maximal in absolute value when both M and N have maximal absolute value determinants in Mk{1,1}.

2.4. Algorithm for the construction of an element of D(k,1)

We use Theorem 2.1 and Corollary 2.1 to develop an algorithm for the construction of a saturated and a D-optimal saturated design matrix of D(k,1).

  • Step 1: Select two matrices M and N from Mk{1,1} (For a D-optimal design select the matrices M and N with maximal absolute value of determinant).

  • Step 2: The design matrices D1=[MMMM] and D2=[MMNN] obtained through the above steps are saturated design matrices for the estimation of the mean F0, the k main effects F1,,Fk and the interactions F12,,F1k. D1 is a D-optimal design matrix in D(k,1) if |det(M)| is maximal in Mk{1,1}. D2 is a D-optimal design matrix in D(k,1) if both |det(M)| and |det(N)| have maximal determinant in Mk{1,1}.

In the appendix we give two examples of D-optimal design matrices in D(15,1) and D(16,1).

3. Concluding remarks

The construction of saturated design matrices for two-level factorial experiments has gained a lot of interest over a long period of time by both mathematicians and statisticians. In general, mathematicians are interested in finding a matrix with maximal determinant in Mk{1,1}, as well as investigating the spectrum of the determinant function which is the set of the value(s) taken by the |det(Dk)|/2k1 for Dk element of Mk{1,1}. Thus, numerous papers have been written about the classification of saturated design matrices of fixed order via the spectrum of the determinant function. The spectra of the determinant function Sk for {1,+1}-matrices of order k are well known in the literature for orders up to 11. The spectrum of order k = 8 is due to Metropolis (Citation1969). For k = 9 and k = 10, the spectra were computed by Živković (Citation2006) and the spectrum for k = 11 is due to Orrick (Citation2005). Furthermore, many other papers have investigated D-optimal saturated design matrices for a fixed order. Orrick (Citation2005) constructed a D-optimal design matrix of order 15. Chadjipantelis et al. (Citation1987) came up with a D-optimal design of order 21. The D-optimal design matrix discussed by these papers is a matrix with the maximal absolute value of the determinant in Mk{1,1}. Statisticians on the other hand are not only interested in the global D-optimal design matrices in Mk{1,1} but also they are interested in the local D-optimal design matrices that satisfy certain restrictions on the columns of matrices in Mk{1,1}. In fact, more than often it is desirable for design statisticians to find a D-optimal design matrix to estimate the mean, the main effects, and a selected number of two-factor interactions. The restriction imposed by the interactions on the columns of saturated design matrices makes it impossible to construct a saturated design matrix that achieves the maximal determinant in Mk{1,1} under certain conditions. The work we did in the current paper is a good illustration. We showed that the construction of saturated D-optimal design matrices in D(k,1) is equivalent to finding matrices with maximal determinant in Mk{1,1}. Thus, this problem is just as hard as the Hadamard determinant problem discussed in the introduction.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work is partially supported by the US National Science Foundation (NSF) [grant number 1809681].

References

  • Chadjipantelis, T., Kounias, S., & Moyssiadis, C. (1987). The maximum determinant of 21×21 (+1, -1)-matrices and D-optimal designs. Journal of Statistical Planning and Inference, 16(2), 167–178. https://doi.org/10.1016/0378-3758(87)90066-8
  • Cohn, J. H. E. (1989). On determinants with elements ±1, II. Bulletin of the London Mathematical Society, 21(1), 36–42. https://doi.org/10.1112/blms/21.1.36
  • Cohn., J. H. E. (2000). Almost D-optimal designs. Utilitas Mathematica, 57, 121–128.
  • Hedayat, A. S., & Pesotan, H. (1992). Two-level factorial designs for main-effects and selected two-factor interactions. Statistica Sinica, 2, 453–464.
  • Hedayat, A. S., & Pesotan, H. (2007). Tools for constructing optimal two-level factorial designs for a linear model containing main effects and one two-factor interaction. Journal of Statistical Planning and Inference, 137(4), 1452–1463. https://doi.org/10.1016/j.jspi.2006.04.005
  • Hedayat, A. S., & Zhu, H. (2011). An effective algorithm for searching for D-optimal saturated two-level factorial designs. Journal of Statistical Theory and Applications, 10(2), 209–227.
  • Kiefer, J. (1959). Optimum experimental designs. Journal of the Royal Statistical Society, B21, 272–319.
  • Metropolis, N. (1969). Spectra of determinant values in (0,1) matrices. In A. O. L. Atkin & B. J. Birch (Eds.), Computers in Number Theory: Proceedings of the Science Research Atlas Symposium No. 2 held at Oxford, 18–23 August 1969 (pp. 271–276). Academic Press.
  • Orrick, W. P. (2005). The maximal {−1,1}-determinant of order 15. Metrika, 62(2-3), 195–219. https://doi.org/10.1007/s00184-005-0410-3
  • Phoa, F. K. H., Wong, W. K., & Xu, H. (2009). The need of considering the interactions in the analysis of screening designs. Journal of Chemometrics, 23(10), 545–553. https://doi.org/10.1002/cem.v23:10
  • Smith, Warren D. (1988). Studies in computational geometry motivated by mesh generation [Unpublished doctoral dissertation]. Princeton University.
  • Vander Heyden, Y., Jimidar, M., Hund, E., Niemeijer, N., Peeters, R., Smeyers-Verbeke, J., Massart, D. L., & Hoogmartens, J. (1999). Determination of system suitability limits with a robustness test. Journal of Chromatography A, 845(1-2), 145–154. https://doi.org/10.1016/S0021-9673(99)00328-3
  • Wald, A. (1943). On the efficient design of statistical investigations. The Annals of Mathematical Statistics, 14(2), 134–140. https://doi.org/10.1214/aoms/1177731454
  • Živković, M. (2006). Classification of small (0,1) matrices. Linear Algebra and Its Applications, 414(1), 310–346. https://doi.org/10.1016/j.laa.2005.10.010

Appendix

Figure A1. Maximal determinant matrix in M15{1,1}. A result of the work of Smith (Citation1988), Cohn (Citation1989), Cohn. (Citation2000) and Orrick (Citation2005).

Figure A1. Maximal determinant matrix in M15{−1,1}. A result of the work of Smith (Citation1988), Cohn (Citation1989), Cohn. (Citation2000) and Orrick (Citation2005).

Figure A2. Normalized maximal determinant matrix in M15{1,1} obtained from M15.

Figure A2. Normalized maximal determinant matrix in M15{−1,1} obtained from M15∗.

Figure A3. The opposite matrix of M15+.

Figure A3. The opposite matrix of M15∗+.

Figure A4. Hadamard matrix of order 16.

Figure A4. Hadamard matrix of order 16.

Figure A5. Saturated D-optimal design matrix for

F1,F2,F3,F4,F5,F6,F7,F8,F9,F10,F11,F12,F13,F14,F15,F0,F1,2,F1,3,F1,4,F1,5,F1,6,F1,7,F1,8,F1,9,F1,10,F1,11,F1,12,F1,13,F1,14,andF1,15.

Figure A5. Saturated D-optimal design matrix forF1,F2,F3,F4,F5,F6,F7,F8,F9,F10,F11,F12,F13,F14,F15,F0,F1,2,F1,3,F1,4,F1,5,F1,6,F1,7,F1,8,F1,9,F1,10,F1,11,F1,12,F1,13,F1,14,andF1,15.

Figure A6. Saturated D-optimal design matrix for

F1,F2,F3,F4,F5,F6,F7,F8,F9,F10,F11,F12,F13,F14,F15,F16F0,F1,2,F1,3,F1,4,F1,5,F1,6,F1,7,F1,8,F1,9,F1,10,F1,11,F1,12,F1,13,F1,14,F1,15, and F1,16.

Figure A6. Saturated D-optimal design matrix forF1,F2,F3,F4,F5,F6,F7,F8,F9,F10,F11,F12,F13,F14,F15,F16F0,F1,2,F1,3,F1,4,F1,5,F1,6,F1,7,F1,8,F1,9,F1,10,F1,11,F1,12,F1,13,F1,14,F1,15, and F1,16.