471

Views

CrossRef citations to date

Altmetric

Articles

Selecting baseline designs using a minimum aberration criterion when some two-factor interactions are important

Anqi ChenDepartment of Statistics and Actuarial Science, Simon Fraser University, Burnaby, CanadaView further author information

Cheng-Yu SunDepartment of Statistics and Actuarial Science, Simon Fraser University, Burnaby, CanadaView further author information

Boxin TangDepartment of Statistics and Actuarial Science, Simon Fraser University, Burnaby, CanadaCorrespondence[email protected]
View further author information

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

This article considers the problem of selecting two-level designs under the baseline parameterisation when some two-factor interactions are important. We propose a minimum aberration criterion, which minimises the bias caused by the non-negligible effects. Using this criterion, a class of optimal designs can be further distinguished from one another, and we present an algorithm to find the minimum aberration designs among the D-optimal designs. Sixteen-run and twenty-run designs are summarised for practical use.

Keywords:

1. Introduction

Fractional factorial designs, due to their run size economy, are widely used in many industrial or scientific areas. Particularly, two-level fractional factorial designs have received the most attention among practitioners. The experimenter often has some prior knowledge that allows a model containing certain important factorial effects to be postulated. While the most commonly used definition of factorial effects is given by a set of mutually orthogonal contrasts, which we refer to as the orthogonal parameterisation, an alternative considered in this article is the baseline parameterisation. Baseline parameterisation is more appropriate when each factor has a default or null state, and its usefulness has been increasingly recognised in recent years. For example, Yang and Speed (Citation2002), Kerr (Citation2006), and Banerjee and Mukerjee (Citation2008) investigated factorial designs under the baseline parameterisation in the context of cDNA microarray experiments.

Firstly proposed by Fries and Hunter (Citation1980), the minimum aberration is a popular criterion for selecting two-level fractional factorial designs. One justification for minimum aberration designs, given by Tang and Deng (Citation1999), is that they provide a protection for the estimation of main effects by minimising the bias caused by the non-negligible interactions. The minimum aberration criterion can further distinguish orthogonal arrays, which are universally optimal (Cheng, Citation1980) under the main effect model. When the baseline parameterisation is under consideration, this idea leads to the minimum K-aberration criterion (Mukerjee & Tang, Citation2012).

In the present paper, we consider how to select baseline designs when, in addition to the main effects, some two-factor interactions are also important. Knowledge of important two-factor interactions arise in many applications. For example, in robust parameter designs, the estimation of interactions between the control and noise factors is crucial for the experimental objectives. Clearly, the minimum K-aberration criterion is not appropriate in this situation, so we propose a modified criterion that sequentially minimises the contamination caused by the non-negligible effects. The modified criterion is then used to further distinguish a class of optimal designs, and an algorithm is given to find out the best designs of sixteen and twenty runs.

This paper is organised as follows. Section 2 introduces notation and provides the definitions of the basic concepts, including factorial effects, minimum aberration criterion, and D-optimality criterion. Section 3 presents an algorithm for searching for minimum aberration baseline designs among D-optimal designs, which is then applied to designs of sixteen and twenty runs. Section 4 is the concluding section.

2. Notation and definitions

2.1. Factorial effects

Consider a two-level factorial involving m factors $F_{1}, F_{2}, \dots, F_{m}$ . Let $τ_{g}$ denote the mean response at the level combination $g = (g_{1}, g_{2}, \dots, g_{m})$ , where $g_{i} = 0$ or 1 ( $i = 1, 2, \dots, m$ ). A factorial effect measures the impact on the mean response caused by the level changing of involved factor(s), and is defined by a treatment contrast. Let $G$ be the collection of all possible level combinations. Under the most commonly used orthogonal parmeterisation, for a subset $w = {i_{1}, i_{2}, \dots, i_{h}}$ of $S = {1, 2, \dots, m}$ , the h-factor interaction $F_{i_{1}} F_{i_{2}} \dots F_{i_{h}}$ (the main effect if h = 1) is (1) $β_{w} = \frac{1}{2^{m}} \sum_{g \in G} τ_{g} (- 1)^{\sum_{j = 1}^{h} g_{i_{j}}} .$ (1) We let $β_{ϕ} = 2^{- m} \sum_{G} τ_{g}$ , the grand mean. In the present paper, we focus on the alternative baseline parameterisation. For convenience, here and after we will also denote $τ_{(1, 1, 0, \dots, 0)}$ by $τ_{12}$ , and similar notation applies to any other $τ_{g}$ . Under the baseline parameterisation, the main effect of $F_{i}$ is $θ_{i} = τ_{i} - τ_{ϕ}$ , and the two-factor interaction $F_{i} F_{j}$ is $θ_{i j} = τ_{i j} - τ_{i} - τ_{j} + τ_{ϕ}$ . More generally, for a subset $w = {i_{1}, i_{2}, \dots, i_{h}}$ of S, the h-factor interaction $F_{i_{1}} F_{i_{2}} \dots F_{i_{h}}$ under the baseline parameterisation is (2) $θ_{w} = \sum_{u \subseteq w} τ_{u} (- 1)^{| w | - | u |} .$ (2) where $| \cdot |$ stands for the cardinality of a set. We let $θ_{ϕ} = τ_{ϕ}$ . The main distinction between the orthogonal and baseline parameterisations is that the former defines the effects in an overall sense, while the later defines the effects in a way that the non-involved factors are kept at their baseline levels.

The baseline parameterisation arises naturally in the experiments in which each factor has a default or null state. For example, in a toxicological study, each factor is a toxin, and each treatment is a mix of several toxins. Then, absence and presence can be represented by levels 0 and 1, respectively. The baseline parameterisation is also more appropriate if only a few factors are allowed to change their settings. Consider a situation in which the experimenter wants to improve an industrial process by changing only a few factors' current setting. Let levels 0 and 1 be the current and new settings, respectively. In this case, the baseline effects are more relevant and useful to the experimenter.

2.2. Minimum aberration criteria

An N-run and m-factor design $D = [g_{i j}]$ with $g_{i j} = 0$ or 1 is represented by an $N \times m$ matrix in which a row corresponds to an experimental run and a column to a factor. Let Y be the vector of N observations. Under design D, the main effect model is $E (Y) = W_{1} θ_{1},$ where $W_{1} = [1 D]$ with $1$ being the all-ones vector, and $θ_{1} = (θ_{ϕ}, θ_{1}, \dots, θ_{m})$ . We assume as usual that all observations are uncorrelated and have a common variance. If the interactions cannot be ignored, the true model under D is (3) $E (Y) = W_{1} θ_{1} + W_{2} θ_{2} + \dots + W_{m} θ_{m},$ (3) where for $j = 2, \dots, m$ , $θ_{j}$ is the vector of all interactions involving j factors and $W_{j}$ is the corresponding matrix obtained by taking all j-column products from D. Let ${\hat{θ}}_{1} = (W_{1}^{T} W_{1})^{- 1} W_{1}^{T} Y$ be the least square estimator of $θ_{1}$ under the main effect model, which is biased under model (Equation3(3) $E (Y) = W_{1} θ_{1} + W_{2} θ_{2} + \dots + W_{m} θ_{m},$ (3) ), and the bias can be found by $bias ({\hat{θ}}_{1}, θ_{1}) = B_{2} θ_{2} + \dots + B_{m} θ_{m},$ where $B_{j} = (W_{1}^{T} W_{1})^{- 1} W_{1}^{T} W_{j}$ , $j = 2, \dots, m$ . The contribution of $θ_{j}$ to the bias is $B_{j} θ_{j}$ , where $θ_{j}$ is unknown and $B_{j}$ depends on the design. To minimise the bias in the estimation of main effects caused by the non-negligible j-factor interactions, Mukerjee and Tang (Citation2012) proposed the minimum K-aberration criterion, which selects designs by sequentially minimising $K_{j} = tr ({B_{j}^{*}}^{T} B_{j}^{*})$ , a size measure of $B_{j}^{*}$ , where $B_{j}^{*}$ is the matrix obtained by deleting the first row of $B_{j}$ . Following the same path, we consider the model that contains the intercept, all main effects and some two-factor interactions that are presumably important, as given by (4) $E (Y) = W θ,$ (4) where $θ$ contains all the main effects and the important two-factor interactions, and W is $W_{1}$ plus the corresponding columns of $W_{2}$ . If the effects outside this model cannot be ignored, the true model is (5) $E (Y) = W θ + W_{2}^{*} θ_{2}^{*} + W_{3} θ_{3} + \dots + W_{m} θ_{m},$ (5) where $θ_{2}^{*}$ contains the non-important two-factor interactions and $W_{2}^{*}$ is the corresponding matrix. Let $\hat{θ} = (W^{T} W)^{- 1} W^{T} Y$ be the least square estimator of $θ$ under model (Equation4(4) $E (Y) = W θ,$ (4) ), which is biased under model (Equation5(5) $E (Y) = W θ + W_{2}^{*} θ_{2}^{*} + W_{3} θ_{3} + \dots + W_{m} θ_{m},$ (5) ), and the bias can be found by $bias (\hat{θ}, θ) = C_{2} θ_{2} + \dots + C_{m} θ_{m},$ where $C_{2} = (W^{T} W)^{- 1} W^{T} W_{2}^{*}$ and $C_{j} = (W^{T} W)^{- 1} W^{T} W_{j}$ , for $j = 3, \dots, m$ . We now define a new criterion, called minimum Q-aberration criterion, which is used to select baseline designs under model (Equation4(4) $E (Y) = W θ,$ (4) ). Let $Q_{j} = tr (C_{j}^{T} C_{j})$ , $j = 2, \dots, m$ , and $Q (D) = (Q_{2}, \dots, Q_{m})$ , the Q-aberration of D. For any two competing designs D and $D^{'}$ , let s be the smallest integer such that D and $D^{'}$ have different $Q_{s}$ values. If D has smaller $Q_{s}$ value than $D^{'}$ , we say D has less Q-aberration than $D^{'}$ . A design is said to have minimum Q-aberration if there is no other design that has less Q-aberration than it.

Though similar, our approach is slightly different from that of Mukerjee and Tang (Citation2012). The situations considered in Mukerjee and Tang (Citation2012) are screening experiments and they therefore focussed on the estimation of main effects by excluding the intercept from consideration. Our situations are different. If we are able to specify some important two-factor interactions, then we are reasonably confident that the model containing the intercept, main effects and important two-factor interactions is approximately correct. This means that all the parameters in the specified model are important and should be estimated to the best extent possible.

2.3. Optimality criterion

When a model is postulated, the experimenter would like to find designs that enjoy certain optimality properties. The optimality criterion considered in this article is the D-efficiency. Consider model (Equation4(4) $E (Y) = W θ,$ (4) ), the D-efficiency criterion is to minimise $[\det (W^{T} W)]^{- 1 / p}$ , where p is the number of columns of W, which minimises the volume of the confidence region of $θ$ .

In a similar study, Ke and Tang (Citation2003) considered regular designs under the orthogonal parameterisation. Regular designs are guaranteed to have the full efficiency provided they can estimate the fitted model.

Let $E (Y) = X β$ be the counterpart model of model (Equation4(4) $E (Y) = W θ,$ (4) ) under the orthogonal parameterisation. That is, the column of X that is associated with $β_{i}$ is the ith column of 2D−1, and the column associated with $β_{i j}$ is the Hadamard product of the ith and jth columns of 2D−1. According to C. Y. Sun and Tang (Citation2020), the two models are equivalent. We have the following lemma, which is a special case of Theorem 3 of C. Y. Sun and Tang (Citation2020).

Lemma 2.1

Consider model (Equation4(4) $E (Y) = W θ,$ (4) ). If D is a baseline design such that X is orthogonal, then it is D-optimal among all competing designs. Moreover, such a D minimises $var ({\hat{θ}}_{w})$ if there is no $θ_{u}$ in the model such that w is a proper subset of u.

C. Y. Sun and Tang (Citation2020) considered a model that is more general than model (Equation4(4) $E (Y) = W θ,$ (4) ), and they call $θ_{w}$ a cap effect if there is no $θ_{u}$ in the model such that w is a proper subset of u. For example, under the main effect model, all main effect are cap effects. Under model (Equation4(4) $E (Y) = W θ,$ (4) ), the important two-factor interactions are cap effects, so are the main effects of those factors that are not involved in any important two-factor interaction. As indicated by C. Y. Sun and Tang (Citation2020), the cap effects should be the first in line to be tested for their significance when one seeks a simpler model in the analysis stage.

3. Searching for best baseline designs

In this section, we present an algorithm to search for minimum Q-aberration designs among the D-optimal designs under model (Equation4(4) $E (Y) = W θ,$ (4) ). The first two subsections introduce two necessary concepts, and the algorithm and results are given in the third subsection. The last subsection provides an illustrative example.

3.1. Design isomorphism

Under the orthogonal parameterisation, two designs are isomorphic if one can be obtained from the other by (i) row permutation, (ii) column permutation, (iii) level permutation, or any combination of these three. Mukerjee and Tang (Citation2012) suggest a different definition of isomorphism for baseline designs, which is similarly defined except for that level permutations are not allowed, since the two levels are not symmetric under the baseline parameterisation. To avoid ambiguity, we call the former the combinatorial isomorphism. Clearly, two designs are combinatorially isomorphic if they are isomorphic, but the converse is not true. A two-level orthogonal array is an $N \times m$ matrix with entries from a set of two symbols, such that for every two columns, all level-combinations appear equally often. A complete catalogue of combinatorially non-isomorphic two-level orthogonal arrays with $N \leq 20$ are available in D. X. Sun et al. (Citation2008). A more comprehensive catalogue of orthogonal arrays can be found at http://www.pietereendebak.nl/oapackage/series.html. Based on the catalogue given by D. X. Sun et al. (Citation2008), we will conduct a complete search on the class of baseline designs that are orthogonal arrays, called orthogonal baseline designs for convenience.

3.2. Non-isomorphic models

There are a huge number of models that are given by (Equation4(4) $E (Y) = W θ,$ (4) ). Among these models, many share the same structures. The graph theory is a convenient tool to deal with the model structure. For example, the models with $θ = (θ_{1}, θ_{12}, θ_{34})$ and $θ = (θ_{1}, θ_{12}, θ_{13})$ can be represented by the graphs in Figure (a,b), respectively. In such a graph, a vertex stands for a factor, and a line connects two vertices if their interaction is included in the model. Note that the factors (vertices) not involved in any important two-factor interaction do not appear in the graph. We say two models are isomorphic if one can be obtained from the other by relabelling the factors. Let k denote the number of two-factor interactions in model (Equation4(4) $E (Y) = W θ,$ (4) ). All non-isomorphic models for different values of $k \leq 3$ are given in Figures . The cases for $k \geq 4$ are not considered because of the large number of possible models, and also because k tends to be small in practice due to the effect sparsity principle (Wu & Hamada, Citation2011, pp. 173).

Figure 1. Model containing one interaction (k = 1).

Figure 2. Models containing two interactions (k = 2).

Figure 3. Models containing three interactions (k = 3).

For a given graph and a design matrix, there are many possible ways to assign the columns to the vertexes, and the resulting design efficiency and Q-aberration may be different. To conduct a complete search, all possible column-to-vertex assignments need to be considered, as we will see in the next subsection.

3.3. Algorithm and results

Suppose N experimental runs are allowed to study m factors under a model whose structure is given by a graph R. Let $D_{1}, \dots, D_{n}$ be the combinatorially non-isomorphic $N \times m$ orthogonal arrays in the catalogue given by D. X. Sun et al. (Citation2008), where each $D_{i}$ consists of 1 and $- 1$ . The algorithm proceeds as follows, starting with i = 1.

Set $D_{i} = (D_{i} + 1) / 2$ . If i = 1, set $D_{b e s t} = D_{1}$ and $Q_{b e s t} = Q (D_{b e s t})$ .
Switch the two levels 0 and 1 for a subset of factors.
For the resulting baseline design matrix, assign the columns to the vertexes of R.
Obtain the resulting model matrix W and compute the D-efficiency. Compare $D_{i}$ with $D_{b e s t}$ . If (i) $D_{i}$ has better D-efficiency, or (ii) $D_{i}$ has the same D-efficiency and less Q-aberration, then set $D_{b e s t} = D_{i}$ , and $Q_{b e s t} = Q (D_{b e s t})$ .
Go back to step 3 with another possible column-to-vertex assignment. When all possible assignments are considered, go back to step 2 with another possible subset. When all possible subsets are considered, go back to step 1 with $D_{i + 1}$ .

This algorithm finds a minimum Q-aberration design that is D-optimal among all orthogonal baseline designs. Such a design may not be unique, and $D_{b e s t}$ is the first one found by the algorithm. In our algorithm, some isomorphic designs may be considered more than once, but no orthogonal baseline design will be missed. If a complete catalogue of non-isomorphic orthogonal baseline designs is available, a more efficient algorithm can be presented.

We apply this algorithm to all models given by Figures – and the $D_{b e s t}$ for N = 16 and 20 are summarised in Tables . For N = 20, the tables only cover the designs with $m \leq 7$ , since the required computation increases rapidly when m>7. In each of these tables, the second, third, and the fourth columns indicate, which $D_{i}$ should be used, which two-factor interactions should be included in the model, and for which design columns the level switching should be conducted, respectively. The A-efficiency of $D_{b e s t}$ is also calculated for the readers' information, where the A-efficiency is $tr (W^{T} W)^{- 1}$ , but it is not used in the search algorithm.

Consider Lemma 2.1. For a given N, m, and a model structure, if there exists a baseline design such that X is orthogonal, then step 4 in the algorithm can be replaced by step $4^{'}$ below to save the computation. For example, when N = 16 and $k \leq 3$ , we use it to obtain Tables .

4'.

Obtain the counterpart model matrix X under the orthogonal parameterisation. If X is orthogonal and $D_{i}$ has less Q-aberration than $D_{b e s t}$ , then set $D_{b e s t} = D_{i}$ and $Q_{b e s t} = Q (D_{i})$ .

Finally, we note that by Lemma 2.1, all the designs given by Tables are D-optimal among all competing designs because the model matrix X has orthogonal columns. The designs in Tables are D-optimal among all 20-run orthogonal arrays as our search is complete.

3.4. An example

We consider a cake baking experiment in which the experimenter wants to improve a cake recipe. There are eight factors: baking time (Time), baking temperature (Temp), the number of eggs, and the amounts of baking powder, flour, sugar, milk (M) and butter. For each factor, there are two settings: the currently used setting and the new setting. The experimenter can only afford a sixteen-run design. Two two-factor interactions are important based on prior knowledge: the temperature-by-time and milk-by-time interactions. In this case, k = 2, N = 16, and m = 8, and the model has the structure 2(b). By Table , the experimenter should start with $D_{2}$ and set $D_{2} = (D_{2} + 1) / 2$ , and then switch the two levels 0 and 1 for columns 1 and 6. Next, the factors Time, Temp, and M have to be assigned to columns 6, 7, and 8 (or 6, 8, and 7), respectively. All the remaining factors can be randomly assigned to the other columns. By Lemma 2.1, this design guarantees D-optimality among all competing designs; and except for the main effects of Time, Temp, and M, each of the other effects can be estimated with a minimal variance. Among all optimal designs, this design also minimises the contamination to the estimation of $θ$ caused by non-negligible effects.

4. Concluding remarks

In our algorithm, we first use D-optimality, and then use the minimum aberration. One can also do it in a reversed order. In fact, the designs that have minimum Q-aberration among all competing designs are generally not orthogonal arrays. Examples are Rechtschaffner designs; see C. Y. Sun and Tang (Citation2020) for details. In our search algorithm, the D-optimality can also be replaced by the A-optimality if one wishes, which is to minimise the A-efficiency and thus minimises $\sum_{w} var ({\hat{θ}}_{w})$ . One possible future work is to develop an efficient algorithm that allows us to obtain more designs without complete search. Li et al. (Citation2014) considered this problem for main effects models. Some of the ideas in that paper should be useful for the situations where some two-factor interactions are important.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by Natural Sciences and Engineering Research Council of Canada [Discovery Grant].

Notes on contributors

Anqi Chen

Anqi Chen, is currently studying biostatistics, working towards her PhD at Simon Fraser University. She obtained her BSc and MSc in 2017 and 2019, respectively, from the same institution.

Cheng-Yu Sun

Cheng-Yu, a PhD student in statistics, is working towards his PhD at Simon Fraser University. His research interest is in experimental design, and has published one paper prior to this one.

Boxin Tang

Boxin Tang, a professor of statistics at Simon Fraser University, conducts research in the area of experimental design. He is an elected Fellow of ASA and IMS, and has published more than 60 papers in refereed journals.

References

Banerjee, T., & Mukerjee, R. (2008). Optimal factorial designs for cDNA microarray experiments. The Annals of Applied Statistics, 2(1), 366–385. https://doi.org/10.1214/07-AOAS144
Web of Science ®Google Scholar
Cheng, C. S. (1980). Orthogonal arrays with variable numbers of symbols. The Annals of Statistics, 8(2), 447–453. https://doi.org/10.1214/aos/1176344964
Web of Science ®Google Scholar
Fries, A., & Hunter, W. G. (1980). Minimum aberration 2k−p designs. Technometrics, 22(4), 601–608. https://doi.org/10.1080/00401706.1980.10486210
Web of Science ®Google Scholar
Ke, W., & Tang, B. (2003). Selecting 2m−p designs using a minimum aberration criterion when some two-factor interactions are important. Technometrics, 45(4), 352–360. https://doi.org/10.1198/004017003000000186
Web of Science ®Google Scholar
Kerr, K. F. (2006). Efficient 2k factorial designs for blocks of size 2 with microarray applications. Journal of Quality Technology, 38(4), 309–318. https://doi.org/10.1080/00224065.2006.11918620
Web of Science ®Google Scholar
Li, P., Miller, A., & Tang, B. (2014). Algorithmic search for baseline minimum aberration designs. Journal of Statistical Planning and Inference, 149, 172–182. https://doi.org/10.1016/j.jspi.2014.02.009
Web of Science ®Google Scholar
Mukerjee, R., & Tang, B. (2012). Optimal fractions of two-level factorials under a baseline parameterization. Biometrika, 99(1), 71–84. https://doi.org/10.1093/biomet/asr071
Web of Science ®Google Scholar
Sun, D. X., Li, W., & Ye, K. Q. (2008). Algorithmic construction of catalogs of non-isomorphic two-level orthogonal designs for economic run sizes. Statistics and Applications, 6, 141–155.
Google Scholar
Sun, C. Y., & Tang, B. (2020). Relationship between orthogonal and baseline parameterizations and its applications to design constructions. Statistica Sinica. Accepted.
Web of Science ®Google Scholar
Tang, B., & Deng, L. Y. (1999). Minimum G2-aberration for nonregular fractional factorial designs. Annals of Statistics, 27(6), 1914–1926. https://doi.org/10.1214/aos/1017939244
Web of Science ®Google Scholar
Wu, C. J., & Hamada, M. S. (2011). Experiments: planning, analysis, and optimization (Vol. 552). John Wiley & Sons.
Google Scholar
Yang, Y. H., & Speed, T. (2002). Design issues for cDNA microarray experiments. Nature Reviews Genetics, 3(8), 579–588. https://doi.org/10.1038/nrg863
PubMed Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Selecting baseline designs using a minimum aberration criterion when some two-factor interactions are important

ABSTRACT

1. Introduction