Full article: Bayesian-inspired minimum contamination designs under a double-pair conditional effect model

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

In two-level fractional factorial designs, conditional main effects can provide insights by which to analyze factorial effects and facilitate the de-aliasing of fully aliased two-factor interactions. Conditional main effects are of particular interest in situations where some factors are nested within others. Most of the relevant literature has focused on the development of data analysis tools that use conditional main effects, while the issue of optimal factorial design for a given linear model involving conditional main effects has been largely overlooked. Mukerjee, Wu and Chang [Statist. Sinica 27 (2017) 997–1016] established a framework by which to optimize designs under a conditional effect model. Although theoretically sound, their results were limited to a single pair of conditional and conditioning factors. In this paper, we extend the applicability of their framework to double pairs of conditional and conditioning factors by providing the corresponding parameterization and effect hierarchy. We propose a minimum contamination-based criterion by which to evaluate designs and develop a complementary set theory to facilitate the search of minimum contamination designs. The catalogues of 16- and 32-run minimum contamination designs are provided. For five to twelve factors, we show that all 16-run minimum contamination designs under the conditional effect model are also minimum aberration according to Fries and Hunter [Technometrics 22 (1980) 601–608].

Keywords:

1. Introduction

Factorial designs have been widely used in industry and academia in the past decades. Two-level fractional factorial designs have proven particularly effective in situations in which the purpose of experiments is to screen out inactive factors. Researchers have dedicated considerable effort to the evaluation of two-level fractional factorial designs. Fries and Hunter (Citation1980) proposed a model robust criterion named minimum aberration tailored specifically to two-level regular fractional factorial designs. The minimum aberration criterion minimizes the wordlength patterns of two-level regular designs in a sequential manner from lower-order factorial effects to higher-order ones. It is established for design selection under the assumption that lower-order effects are more important than higher-order effects and effects of the same order are equally important. The minimum aberration criterion has since been adaptive to accommodate nonregular fractional factorial designs (Cheng et al., Citation2002; Tang & Deng, Citation1999; Xu & Wu, Citation2001) and multi-stratum factorial designs (Chang, Citation2022; Chang & Cheng, Citation2018). All the aforementioned minimum aberration criteria were developed under an orthogonal parameterization of factorial effects (Cheng, Citation2014, chapter 6). Accordingly, a defining word of length four, say $F_{1} F_{2} F_{3} F_{4}$ , creates three pairs of fully aliased two-factor interactions as follows: $F_{1} F_{2} = F_{3} F_{4}$ , $F_{1} F_{3} = F_{2} F_{4}$ and $F_{1} F_{4} = F_{2} F_{3}$ . The two-factor interactions in each pair are completely mixed up and cannot be estimated simultaneously. Interested readers may refer to Mukerjee and Wu (Citation2006), Cheng (Citation2014) and Wu and Hamada (Citation2021) for further details.

In some practical situations, it is preferable to investigate a two-factor interaction via two conditional main effects, with each one conditionally defined according to another factor of a specific level. For example, the two-factor interaction $F_{1} F_{2}$ can be decomposed as the difference between a conditional main effect $F_{1}$ conditioned on the low level of $F_{2}$ and that conditioned on the high level of $F_{2}$ . Sliding level experiments in engineering are structured in this way (Wu & Hamada, Citation2021, p. 343), where the interest is on the conditional main effects conditioned on the slid factors. Mukerjee et al. (Citation2017) reported on an industrial experiment involving motor and speed as two factors. The objective of the experiment was to assess the comparison of the motors separately at each speed; therefore, the conditional main effects of the motors conditioned on each level of speed are of particular interest. Other intriguing examples pertaining to the use of conditional main effects are outlined in Wu (Citation2015) and Wu (Citation2018).

It is known, also mentioned in Wu (Citation2015), that for 2 two-level factors $F_{1}$ and $F_{2}$ , the main effect $F_{1}$ in conjunction with the interaction $F_{1} F_{2}$ spans the same vector space as that spanned by the conditional main effects $F_{1}$ respectively conditioned on the two levels of $F_{2}$ . Mukerjee et al. (Citation2017) referred to $F_{1}$ and $F_{2}$ as a pair of conditional and conditioning factors. To deal with a single pair of conditional and conditioning factors, Mukerjee et al. (Citation2017) proposed a model in which the main effect of $F_{1}$ and the two-factor interaction $F_{1} F_{2}$ are replaced with the two associated conditional main effects. This can be viewed as an alternative to the aforementioned orthogonal parameterization, under which the defining relation $I = F_{1} F_{2} F_{3} F_{4}$ produces partially aliased effects (neither fully aliased nor orthogonal). In this paper, the model proposed by Mukerjee et al. (Citation2017) is referred to as a conditional effect model. The fact that the parameterization of a single-pair conditional effect model differs from that of a model tailored to the minimum aberration necessitates a new aberration criterion applicable to conditional effect models. Mukerjee et al. (Citation2017) proposed a minimum aberration criterion as well as a strategy by which to search for designs under a single-pair conditional effect model. Apart from Mukerjee et al. (Citation2017), an alternative approach by the use of the indicator function in Pistone and Wynn (Citation1996) is given by Sabbaghi (Citation2020), who developed an algebra for conditional effect models. From the data analysis aspect, Mak and Wu (Citation2019) proposed a bi-level variable selection of conditional main effects in observational data using a penalty function with two layers: the outer one controlling between-group selection, and the inner one controlling within-group selection.

It is important to consider that the parameterization under a condition effect model destroys the fully aliased relationship between two-factor interactions. Thus, one application of conditional main effects involves the de-aliasing of two-factor interactions in regular fractional factorial designs of resolution four. This involves identifying significant but aliased two-factor interactions and then transforming the model to the corresponding conditional effect model. Afterwards, the de-aliasing strategies in Wu (Citation2015), Su and Wu (Citation2017), Chang (Citation2019) and Lawson (Citation2020) can then be used to facilitate subsequent data analysis.

In the current study, we consider two-level factorial designs in conjunction with conditional effect models involving two pairs of conditional and conditioned factors. The remaining factors, not involving the two pairs of conditional and conditioning factors, are called traditional factors. Wu and Hamada (Citation2021, p. 347) described an experiment involving the sealing of a light bulb, in which the outcomes were determined mainly by two pairs of conditional and conditioning factors $(H, G)$ and $(J, I)$ , respectively. The corresponding linear model comprised the conditional main effects of H and J respectively conditioned on G and I. In the current study, we extend the work of Mukerjee et al. (Citation2017) to double-pair conditional effect models. The minimum aberration was based on the effect hierarchy principle (Wu & Hamada, Citation2021, p. 168); however, it is difficult to artificially argue an order among the effects in a conditional effect model. In accordance with Mukerjee et al. (Citation2017), we adopt the Bayesian approach proposed in Mitchell et al. (Citation1995), later popularized by Kerr (Citation2001), Joseph (Citation2006), Joseph and Delaney (Citation2007), Ai et al. (Citation2009), Joseph et al. (Citation2009), Kang and Joseph (Citation2009), Chang and Cheng (Citation2018) and Chang (Citation2019, Citation2022), to derive an order of factorial effects based on their prior variances. Once this order has been established, we then define a criterion for design evaluation by sequentially minimizing the bias (contamination) caused by lower-order interactions to higher-order interactions. We refer to the proposed criterion as the minimum contamination criterion. We provide the catalogues of 16-run and 32-run minimum contamination designs for various factor numbers at the end of this paper.

The remainder of this paper is organized as follows. Section 2 introduces the parametrization and a Bayesian-inspired hierarchical order of effects under a double-pair conditional effect model. Some sufficient conditions for a design to be universally optimal under the conditional effect model involving only main effects are given in Section 3. Section 4 presents a new minimum contamination criterion defined according to the Bayesian-inspired hierarchical order. In addition, we develop a complementary set theory to guide the search for minimum contamination designs involving a large number of factors. An efficient computational procedure is developed in Section 5 to search for minimum contamination regular/nonregular designs for an arbitrary number of factors. Numerical examples and a real experiment are discussed. Conclusions are drawn in Section 6.

2. Conditional effect model and Bayesian-inspired effect hierarchy

We give the details regarding the parameterization under a double-pair conditional effect model. We adopt the Bayesian approach in Mitchell et al. (Citation1995) to derive an effect order, which serves as the building block for the minimum contamination criterion introduced in Section 4.

2.1. Conditional effect model

A double-pair conditional effect model is a linear model with reparameterization using two pairs of conditional and conditioning factors. Consider a $2^{n}$ full factorial design with n ( $\geq 4$ ) factors $F_{1}, \dots, F_{n}$ , each at levels 0 and 1. Without loss of generality, let $F_{1}, F_{2}$ be one pair of conditional and conditioned factors and $F_{3}, F_{4}$ be the other pair. The main effect and interaction effects involving $F_{1}$ (respectively, $F_{3}$ ) are defined conditionally on each fixed level of $F_{2}$ (respectively, $F_{4}$ ). Define Ω as the set of $ν = 2^{n}$ binary n-tuples representing the $2^{n}$ treatment combinations of the full factorial design. For $(i_{1}, \dots, i_{n}) \in Ω$ , let $τ (i_{1} \dots i_{n})$ be the treatment effect of treatment combination $i_{1} \dots i_{n}$ . Under a linear model, $τ (i_{1} \dots i_{n})$ is the expectation of the response measured at the treatment combination $i_{1} \dots i_{n}$ . Similarly, we write $θ (i_{1} \dots i_{n})$ for the factorial effect $F_{1}^{i_{1}} \dots F_{n}^{i_{n}}$ using the conventional orthogonal parameterization (Cheng, Citation2014, chapter 6) when $i_{1} \dots i_{n}$ is nonnull, and $θ (0 \dots 0)$ for the grand mean. With n = 3 for illustration, $\begin{aligned} θ (100) & = \frac{1}{8} {τ (100) + τ (110) + τ (101) + τ (111) - τ (000) - τ (010) - τ (001) - τ (011)}, \\ θ (110) & = \frac{1}{8} {τ (000) + τ (110) + τ (001) + τ (111) - τ (100) - τ (101) - τ (010) - τ (011)}, \\ θ (111) & = \frac{1}{8} {τ (100) + τ (010) + τ (001) + τ (111) - τ (000) - τ (110) - τ (101) - τ (011)} \end{aligned}$ separately represent the main effect of $F_{1}$ , the two-factor interaction of $F_{1}$ and $F_{2}$ , and the three-factor interaction of $F_{1}, F_{2}, F_{3}$ . Each $θ (i_{1} i_{2} i_{3})$ can be interpreted using the mean responses measured at different levels of factors. For example, the main effect $θ (100)$ is the average of the difference between the mean responses measured at the levels 1 and 0 of $F_{1}$ . Let $τ$ and $θ$ be $ν \times 1$ vectors with elements $τ (i_{1} \dots i_{n})$ and $θ (i_{1} \dots i_{n})$ arranged in the lexicographic order, respectively, where for n = 3, we have $θ = (θ (000), θ (001), θ (010), θ (011), θ (100), θ (101), θ (110), θ (111))^{^{⊤}} .$ Then, the linear model using the orthogonal parameterization under the full factorial design is given by (1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) where ⊗ represents the Kronecker product and $H^{\otimes n}$ denotes the n-fold Kronecker product of $H = (\begin{array}{cc} 1 & 1 \\ 1 & - 1 \end{array}),$ a Hadamard matrix of order two. When interpreted under the linear model, the columns of $H$ respectively correspond to the grand mean and a contrast of the mean responses measured at the two levels of the factor. Since a $2^{n}$ full factorial design has a cross-product structure, the associated matrix of the grand mean and contrasts can be obtained via the n-fold Kronecker product of $H$ as in (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ). We refer to the model in (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ) as a traditional model under the full factorial design.

Let $H (0) = (1, 1)$ and $H (1) = (1, - 1)$ be the top and bottom rows of $H$ , respectively. Emphasizing $F_{1}$ and $F_{3}$ , we express Equation (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ) as (2) $\begin{aligned} θ = ν^{- 1} (\begin{array}{l} H (0) \otimes H \otimes H (0) \otimes H \\ H (1) \otimes H \otimes H (0) \otimes H \\ H (0) \otimes H \otimes H (1) \otimes H \\ H (1) \otimes H \otimes H (1) \otimes H \end{array}) H^{\otimes (n - 4)} τ . \end{aligned}$ (2) Let $β (j_{1} \dots j_{n})$ be the factorial effect $F_{1}^{i_{1}} \dots F_{n}^{i_{n}}$ under a conditional effect model with two pairs conditional and conditioning factors $F_{1}, F_{2}$ and $F_{3}, F_{4}$ , respectively. Denote the vector with the ν elements $β (j_{1} \dots j_{n})$ 's by $β$ in the same lexicographic order as $θ$ . Under the double-pair conditional effect model, the factorial effects involving $F_{1}$ and $F_{3}$ are defined conditionally on the levels of $F_{2}$ and $F_{4}$ , respectively. Thus in (Equation2(2) $\begin{aligned} θ = ν^{- 1} (\begin{array}{l} H (0) \otimes H \otimes H (0) \otimes H \\ H (1) \otimes H \otimes H (0) \otimes H \\ H (0) \otimes H \otimes H (1) \otimes H \\ H (1) \otimes H \otimes H (1) \otimes H \end{array}) H^{\otimes (n - 4)} τ . \end{aligned}$ (2) ), $H$ is replaced with $\sqrt{2} I_{2}$ whenever $H (1)$ precedes it. We can reparametrize $θ$ by $β$ with (3) $\begin{aligned} β = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ, \end{aligned}$ (3) where $W = (\begin{array}{l} H (0) \otimes H \otimes H (0) \otimes H \\ H (1) \otimes \sqrt{2} I_{2} \otimes H (0) \otimes H \\ H (0) \otimes H \otimes H (1) \otimes \sqrt{2} I_{2} \\ H (1) \otimes \sqrt{2} I_{2} \otimes H (1) \otimes \sqrt{2} I_{2} \end{array}),$ in which $I_{2}$ is the identity matrix of order two. The model in (Equation3(3) $\begin{aligned} β = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ, \end{aligned}$ (3) ) is called a double-pair conditional effect model in this paper. The factorial effects involving $F_{1}$ and $F_{3}$ are referred to as conditional (factorial) effects, while those not involving them are referred to as unconditional (factorial) effects. We cluster $β$ into various groups of vectors representing unconditional and conditional factorial effects. Define $\begin{aligned} Ω_{0 l} & = {(j_{1}, \dots, j_{n}) : j_{1} = j_{3} = 0, and l of j_{2}, j_{4}, \dots, j_{n} equal 1}, \\ Ω_{1 l} & = {(j_{1}, \dots, j_{n}) : j_{1} = 1, j_{2} = 0, 1, j_{3} = 0, and l - 1 of j_{4}, \dots, j_{n} equal 1} \\ \cup {(j_{1}, \dots, j_{n}) : j_{3} = 1, j_{4} = 0, 1, j_{1} = 0, and l - 1 of j_{2}, j_{5}, \dots, j_{n} equal 1}, \\ Ω_{2 l} & = {(j_{1}, \dots, j_{n}) : j_{1} = j_{3} = 1, j_{2} = 0, 1, j_{4} = 0, 1, and l - 2 of j_{5}, \dots, j_{n} equal 1}, \end{aligned}$ where $1 \leq l \leq n - 2$ . It is apparent that $β (j_{1} \dots j_{n}) = θ (j_{1} \dots j_{n})$ if $(j_{1}, \dots, j_{n}) \in Ω_{0 l}$ . Let $β_{s l}$ be the vector with elements $β (j_{1} \dots j_{n})$ , where $(j_{1}, \dots, j_{n}) \in Ω_{s l}$ . Later we will see that the effects associated with the same $Ω_{s l}$ have the same importance using the Bayesian-inspired effect hierarchy introduced in the next subsection.

2.2. Bayesian-inspired effect hierarchy

Chipman et al. (Citation1997) proposed a Bayesian variable selection for designed experiments with complex aliasing. Rather than independence prior distribution on the factorial effects, Chipman et al. (Citation1997) used a hierarchical prior that is consistent with the effect heredity principle (Wu & Hamada, Citation2021, p. 169). A basic idea of their approach is to assign a larger prior variance to a more important factorial effect. This idea is compatible with the Bayesian (functional) prior distribution derived by Mitchell et al. (Citation1995), Kerr (Citation2001), Joseph (Citation2006), Joseph and Delaney (Citation2007), Ai et al. (Citation2009), Joseph et al. (Citation2009), Kang and Joseph (Citation2009), Chang and Cheng (Citation2018) and Chang (Citation2019, Citation2022). With the derived prior variances, one can readily define an effect hierarchical order and an aberration-like criterion for design evaluation.

In this paper, we adopt the Bayesian approach in Mitchell et al. (Citation1995), who set up a functional prior by regarding $τ$ as a realization of a stationary Gaussian random function. Then the prior distribution of factorial effects is induced by the relation in (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ). When applied to (Equation3(3) $\begin{aligned} β = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ, \end{aligned}$ (3) ), the Bayesian approach induces an effect hierarchy of the $β (j_{1} \dots j_{n})$ 's via a prior specification on $τ$ in terms of a zero-mean Gaussian random function such that $cov (τ) = σ^{2} R^{\otimes n}$ , where $σ^{2} > 0$ and the $2 \times 2$ matrix $R$ has diagonal elements 1 and off-diagonal elements ρ, $0 < ρ < 1$ . This covariance structure is equivalent to Equation (4) of Joseph (Citation2006). It follows by expanding $R^{\otimes n}$ that the correlation of $τ (i_{1} \dots i_{n})$ and $τ (j_{1} \dots j_{n})$ is $\prod_{l : 1 \leq l \leq n, i_{l} \neq j_{l}} ρ,$ which is equal to $ρ^{\sum_{l : 1 \leq l \leq n, i_{l} \neq j_{l}}}$ and only depends on the total number of components where $(i_{1} \dots i_{n})$ and $(j_{1} \dots j_{n})$ differ. Thus, such a covariance structure can be interpreted as treating every factor equally, which is reasonable since there is usually no knowledge about the importance of the factors at the experimentation stage. In conjunction with (Equation3(3) $\begin{aligned} β = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ, \end{aligned}$ (3) ), the prior covariance matrix of $β$ is given by $\begin{aligned} cov (β) & = ν^{- 2} {W \otimes H^{\otimes (n - 4)}} cov (τ) {W \otimes H^{\otimes (n - 4)}}^{⊤} \\ = σ^{2} ν^{- 2} {W R^{\otimes 4} W^{^{⊤}}} \otimes {H R H}^{\otimes (n - 4)} . \end{aligned}$ The following result gives the variances of the $β (j_{1} \dots j_{n})$ 's.

Theorem 2.1

For a $(j_{1}, \dots, j_{n}) \in Ω_{s l}$ , we have $v a r (β (j_{1} \dots j_{n})) = σ^{2} ν^{- 1} (1 + ρ)^{n - l - s} (1 - ρ)^{l} .$

Proof.

By the identity $cov (β) = σ^{2} ν^{- 2} {W R^{\otimes 4} W^{^{⊤}}} \otimes {H R H}^{\otimes (n - 4)}$ , one can easily verify that $H R H = 2 diag (1 + ρ, 1 - ρ)$ , $W = (\begin{array}{cccc} H^{\otimes 2} & H^{\otimes 2} & H^{\otimes 2} & H^{\otimes 2} \\ \sqrt{2} I_{2} \otimes H & \sqrt{2} I_{2} \otimes H & - \sqrt{2} I_{2} \otimes H & - \sqrt{2} I_{2} \otimes H \\ \sqrt{2} H \otimes I_{2} & - \sqrt{2} H \otimes I_{2} & \sqrt{2} H \otimes I_{2} & - \sqrt{2} H \otimes I_{2} \\ 2 I_{2}^{\otimes 2} & - 2 I_{2}^{\otimes 2} & - 2 I_{2}^{\otimes 2} & 2 I_{2}^{\otimes 2} \end{array}),$ and $R^{\otimes 4} = (\begin{array}{cccc} R^{\otimes 2} & ρ R^{\otimes 2} & ρ R^{\otimes 2} & ρ^{2} R^{\otimes 2} \\ ρ R^{\otimes 2} & R^{\otimes 2} & ρ^{2} R^{\otimes 2} & ρ R^{\otimes 2} \\ ρ R^{\otimes 2} & ρ^{2} R^{\otimes 2} & R^{\otimes 2} & ρ R^{\otimes 2} \\ ρ^{2} R^{\otimes 2} & ρ R^{\otimes 2} & ρ R^{\otimes 2} & R^{\otimes 2} \end{array}) .$ $W R^{\otimes 4} W^{^{⊤}}$ is a $16 \times 16$ matrix, which can be regarded as a $4 \times 4$ block-matrix with each block being a $4 \times 4$ matrix. Denote the $(i, j)$ th block-matrix of $W R^{\otimes 4} W^{^{⊤}}$ by $[W R^{\otimes 4} W^{^{⊤}}]_{i j}$ . Then by calculation, we get $[W R^{\otimes 4} W^{^{⊤}}]_{i j} = 0$ when $i \neq j$ ; for i = j, we obtain $[W R^{\otimes 4} W^{^{⊤}}]_{11} = 4 (1 + ρ)^{2} (H R H)^{\otimes 2}$ , $[W R^{\otimes 4} W^{^{⊤}}]_{22} = 8 (1 - ρ^{2}) R \otimes (H R H)$ , $[W R^{\otimes 4} W^{^{⊤}}]_{33} = 8 (1 - ρ^{2}) (H R H) \otimes R$ and $[W R^{\otimes 4} W^{^{⊤}}]_{44} = 16 (1 - ρ)^{2} R \otimes R$ .

The diagonal elements of $(H R H)^{\otimes 2}$ , denoted by $diag ((H R H)^{\otimes 2})$ , can be obtained as $4 (1 + ρ)^{2}, 4 (1 - ρ) (1 + ρ), 4 (1 - ρ) (1 + ρ), 4 (1 - ρ)^{2}$ . We also have $diag (R \otimes (H R H)) = 2 (1 + ρ, 1 - ρ, 1 + ρ, 1 - ρ)$ , $diag ((H R H) \otimes R) = 2 (1 + ρ, 1 + ρ, 1 - ρ, 1 - ρ)$ and $diag (R \otimes R) = (1, 1, 1, 1)$ . Thus, we get $diag ([W R^{\otimes 4} W^{^{⊤}}]_{11}) = 16 ((1 + ρ)^{4}, (1 - ρ) (1 + ρ)^{3}, (1 - ρ) (1 + ρ)^{3}, (1 - ρ)^{2} (1 + ρ)^{2})$ , $diag ([W R^{\otimes 4} W^{^{⊤}}]_{22}) = 16 ((1 + ρ)^{2} (1 - ρ), (1 + ρ) (1 - ρ)^{2}, (1 + ρ)^{2} (1 - ρ), (1 + ρ) (1 - ρ)^{2})$ , $diag ([W R^{\otimes 4} W^{^{⊤}}]_{33}) = 16 ((1 + ρ)^{2} (1 - ρ), (1 + ρ)^{2} (1 - ρ), (1 + ρ) (1 - ρ)^{2}, (1 + ρ) (1 - ρ)^{2})$ and $diag ([W R^{\otimes 4} W^{^{⊤}}]_{44}) = 16 ((1 - ρ)^{2}, (1 - ρ)^{2}, (1 - ρ)^{2}, (1 - ρ)^{2})$ .

Since the variances of $β (j_{1} \dots j_{n})$ 's are only related to the diagonal elements of $cov (β)$ , one can easily check for a $(j_{1}, \dots, j_{n}) \in Ω_{s l}$ , $var (β (j_{1} \dots j_{n})) = σ^{2} ν^{- 1} (1 + ρ)^{n - l - s} (1 - ρ)^{l}$ based on the above calculation.

Let $V_{s l} = var (β (j_{1} \dots j_{n}))$ for $(j_{1}, \dots, j_{n}) \in Ω_{s l}$ . From Theorem 2.1, it is clear that $V_{0 l} > V_{1 l} > V_{2 l}$ for $2 \leq l \leq n - 2$ . Because $V_{2 l} / V_{0, l + 1} = 1 / (1 - ρ^{2}) > 1$ for all $0 < ρ < 1$ , we have (4) $\begin{aligned} V_{01} > V_{11} > V_{02} > V_{12} > V_{22} > V_{03} > V_{13} > V_{23} > \dots > V_{0, n - 2} > V_{1, n - 2} > V_{2, n - 2} . \end{aligned}$ (4) In view of (Equation4(4) $\begin{aligned} V_{01} > V_{11} > V_{02} > V_{12} > V_{22} > V_{03} > V_{13} > V_{23} > \dots > V_{0, n - 2} > V_{1, n - 2} > V_{2, n - 2} . \end{aligned}$ (4) ), we define the following effect hierarchy under the conditional effect model (Equation3(3) $\begin{aligned} β = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ, \end{aligned}$ (3) ) as follows. The unconditional main effects have the largest variance $V_{01}$ and are the most important, while the conditional main effects with variance $V_{11}$ are positioned next; then come the unconditional two-factor interactions ( $V_{02}$ ), followed by the one-pair conditional two-factor interactions ( $V_{12}$ ), then two-pair conditional two-factor interactions ( $V_{22}$ ), and so on. This effect hierarchy order is not surprising since a conditional main effect is proportional to the average of a unconditional main effect and two-factor interaction, resulting in a variance in-between.

3. Universally optimal designs for main effect model

As mentioned in Mukerjee et al. (Citation2017), a justifiable criterion for design evaluation is to identify a class of designs which ensure optimal inference on the $β (j_{1} \dots j_{n})$ 's corresponding to $Ω_{01}$ and $Ω_{11}$ in the absence of all interactions. Then, in order to possess model robustness, among these designs we find one which sequentially minimizes a suitably defined measure of bias caused by successive interactions in the effect hierarchy. In this section, we connect the conditional effect model with the traditional model. Then we provide some sufficient conditions for a design to be universally optimal under a main-effect conditional effect model.

The connection between conditional effects $β$ and traditional factorial effects $θ$ can be established by (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ) and (Equation3(3) $\begin{aligned} β = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ, \end{aligned}$ (3) ) as follows: $\begin{aligned} β & = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ \\ = ν^{- 1} W \otimes H^{\otimes (n - 4)} H^{\otimes n} θ \\ = ν^{- 1} {W H^{\otimes 4}} \otimes {H H}^{\otimes (n - 4)} θ . \end{aligned}$ By using the fact $H H = 2 I_{2}$ and $W H^{\otimes 4} = 4 (\begin{array}{cccc} H^{\otimes 2} H^{\otimes 2} & 0 & 0 & 0 \\ 0 & 0 & \sqrt{2} I_{2} \otimes H H^{\otimes 2} & 0 \\ 0 & \sqrt{2} H \otimes I_{2} H^{\otimes 2} & 0 & 0 \\ 0 & 0 & 0 & 2 I_{2}^{\otimes 2} H^{\otimes 2} \end{array}),$ we obtain $β = (\begin{array}{cccc} I_{2}^{\otimes (n - 2)} & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} H \otimes I_{2} \otimes I_{2}^{\otimes (n - 4)} & 0 \\ 0 & \frac{1}{\sqrt{2}} I_{2} \otimes H \otimes I_{2}^{\otimes (n - 4)} & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{2} H^{\otimes 2} \otimes I_{2}^{\otimes (n - 2)} \end{array}) θ,$ which implies $θ = (\begin{array}{cccc} I_{2}^{\otimes (n - 2)} & 0 & 0 & 0 \\ 0 & 0 & \frac{1}{\sqrt{2}} H \otimes I_{2} \otimes I_{2}^{\otimes (n - 4)} & 0 \\ 0 & \frac{1}{\sqrt{2}} I_{2} \otimes H \otimes I_{2}^{\otimes (n - 4)} & 0 & 0 \\ 0 & 0 & 0 & \frac{1}{2} H^{\otimes 2} \otimes I_{2}^{\otimes (n - 2)} \end{array}) β$ because $H^{- 1} = (1 / 2) H$ . This yields $\begin{aligned} θ (0 j_{2} 0 j_{4} j_{5} \dots j_{n}) & = β (0 j_{2} 0 j_{4} j_{5} \dots j_{n}), \\ θ (1 j_{2} 0 j_{4} j_{5} \dots j_{n}) & = \frac{1}{\sqrt{2}} {β (100 j_{4} j_{5} \dots j_{n}) + δ (j_{2}) β (110 j_{4} j_{5} \dots j_{n})}, \\ θ (0 j_{2} 1 j_{4} j_{5} \dots j_{n}) & = \frac{1}{\sqrt{2}} {β (0 j_{2} 10 j_{5} \dots j_{n}) + δ (j_{4}) β (0 j_{2} 11 j_{5} \dots j_{n})}, \\ θ (1 j_{2} 1 j_{4} j_{5} \dots j_{n}) & = \frac{1}{2} {β (1010 j_{5} \dots j_{n}) + δ (j_{4}) β (1011 j_{5} \dots j_{n}) \\ + δ (j_{2}) β (1110 j_{5} \dots j_{n}) + δ (j_{2}) δ (j_{4}) β (1111 j_{5} \dots j_{n})}, \end{aligned}$ where $δ (j) = - 2 j + 1$ . In view of $θ$ , the first of above identities shows that $2^{n - 2} / 2^{n} = 1 / 4$ of the $θ (j_{1} \dots j_{n})$ 's remain unconditional effects, while the other 3/4 of the $θ (j_{1} \dots j_{n})$ 's involve $F_{1}$ and $F_{3}$ and hence are a combination of the conditional effects. These equations uncover the connection between factorial effects under the traditional model and under the conditional effect model. Take n = 5 for example. We have $θ (10000) = {β (10000) + β (11000)} / \sqrt{2}$ . Recall that $β (10000)$ and $β (11000)$ are the conditional main effects of $F_{1}$ conditioned on levels 0 and 1 of $F_{2}$ , respectively. Thus, the unconditional main effect of $F_{1}$ is proportional to the average of the two conditional main effects of $F_{1}$ . Likewise, we have $θ (11000) = {β (10000) - β (11000)} / \sqrt{2}$ , which means that the unconditional two-factor interaction of $F_{1}$ and $F_{2}$ is proportional to the difference between the two conditional main effects of $F_{1}$ .

Consider an N-run design represented by the $N \times n$ design matrix $D$ with elements 1 (high level) and $- 1$ (low level). Denote the corresponding $N \times 2^{n}$ full model matrix under (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ) by $X$ , each column corresponding to one $θ (j_{1} \dots j_{n})$ . Note that $X$ can be obtained by deleting the rows of $H^{\otimes n}$ that are not in $D$ . With the vector of responses $y$ , we have $E (y) = X θ$ . Each column of $X$ is represented by $x (j_{1} \dots j_{n})$ with $(j_{1}, \dots, j_{n}) \in Ω$ . The connection between $θ (j_{1} \dots j_{n})$ 's and $β (j_{1} \dots j_{n})$ 's suggests $\begin{aligned} z (0 j_{2} 0 j_{4} j_{5} \dots j_{n}) & = x (0 j_{2} 0 j_{4} j_{5} \dots j_{n}), \\ z (1 j_{2} 0 j_{4} j_{5} \dots j_{n}) & = \frac{1}{\sqrt{2}} {x (100 j_{4} j_{5} \dots j_{n}) + δ (j_{2}) x (110 j_{4} j_{5} \dots j_{n})}, \\ z (0 j_{2} 1 j_{4} j_{5} \dots j_{n}) & = \frac{1}{\sqrt{2}} {x (0 j_{2} 10 j_{5} \dots j_{n}) + δ (j_{4}) x (0 j_{2} 11 j_{5} \dots j_{n})}, \\ z (1 j_{2} 1 j_{4} j_{5} \dots j_{n}) & = \frac{1}{2} {x (1010 j_{5} \dots j_{n}) + δ (j_{4}) x (1011 j_{5} \dots j_{n}) \\ + δ (j_{2}) x (1110 j_{5} \dots j_{n}) + δ (j_{2}) δ (j_{4}) x (1111 j_{5} \dots j_{n})}, \end{aligned}$ where $δ (j) = - 2 j + 1$ . Let $Z_{s l}$ and $X_{s l}$ consist of the $z (j_{1} \dots j_{n})$ 's and $x (j_{1} \dots j_{n})$ 's respectively, where $(j_{1}, \dots, j_{n}) \in Ω_{s l}$ . Then the conditional effect model under $D$ can be represented by (5) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + \sum_{s = 0}^{2} \sum_{l = 1}^{n - 2} Z_{s l} β_{s l} . \end{aligned}$ (5) We follow the convention that the random observational errors are uncorrelated and homogeneous with equal variance.

3.1. Universally optimal designs

If all interactions are absent, then the model (Equation5(5) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + \sum_{s = 0}^{2} \sum_{l = 1}^{n - 2} Z_{s l} β_{s l} . \end{aligned}$ (5) ) reduces to (6) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + Z_{01} β_{01} + Z_{11} β_{11}, \end{aligned}$ (6) consisting of only unconditional and conditional main effects. In the following, we present a theorem which gives some requirements for a design to be universally optimal under model (Equation6(6) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + Z_{01} β_{01} + Z_{11} β_{11}, \end{aligned}$ (6) ).

Theorem 3.1

Suppose an N-run design $D$ satisfies

(i)	$D$ is an orthogonal array of strength two;
(ii)	all eight triples of symbols occur equally often when $D$ is projected onto $F_{1}, F_{2}, F_{j}$ , $j \in {4, 5, \dots, n}$ ;
(iii)	all eight triples of symbols occur equally often when $D$ is projected onto $F_{3}, F_{4}, F_{j}$ , $j \in {2, 5, \dots, n}$ ;
(iv)	all sixteen triples of symbols occur equally often when $D$ is projected onto $F_{1}, F_{2}, F_{3}, F_{4}$ .

Then

D

is universally optimal among all N-run designs for inference on

β_{01}

and

β_{11}

under model (Equation6(6) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + Z_{01} β_{01} + Z_{11} β_{11}, \end{aligned}$ (6) ).

Proof.

Let $Z_{1} = (Z_{01}, Z_{11})$ . Denote the information matrix of $β_{01}$ and $β_{11}$ under model (Equation6(6) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + Z_{01} β_{01} + Z_{11} β_{11}, \end{aligned}$ (6) ) by $M$ . Note that $M$ can be obtained by the Schur complement $M = Z_{1}^{^{⊤}} {I_{N} - z (0 \dots 0) [z (0 \dots 0)^{^{⊤}} z (0 \dots 0)]^{- 1} z (0 \dots 0)^{^{⊤}}} Z_{1},$ which can be simplified as $M = Z_{1}^{^{⊤}} {I_{N} - \frac{1}{N} 1_{N} 1_{N}^{^{⊤}}} Z_{1}$ because $z (0 \dots 0) = 1_{N}$ . Because $Z_{1}^{^{⊤}} Z_{1} - M$ is nonnegative definite, we have (7) $\begin{aligned} tr [M] \leq tr [Z_{1}^{^{⊤}} Z_{1}] = N (n - 2) + 4 N = N (n + 2) \end{aligned}$ (7) for every N-run design. Under the conditions (i),…,(iv), it is easy to verify that $Z_{1}^{^{⊤}} 1_{N} = 0$ and $Z_{1}^{^{⊤}} Z_{1} = N I_{n + 2}$ . Thus $M = N I_{n + 2}$ and $tr [M]$ reaches the upper bound in (Equation7(7) $\begin{aligned} tr [M] \leq tr [Z_{1}^{^{⊤}} Z_{1}] = N (n - 2) + 4 N = N (n + 2) \end{aligned}$ (7) ). The result now follows from Kiefer (Citation1975).

By Theorem 3.1, a necessary condition for universally optimal designs is $N \geq 16$ . Therefore, if n = 4, the model (Equation5(5) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + \sum_{s = 0}^{2} \sum_{l = 1}^{n - 2} Z_{s l} β_{s l} . \end{aligned}$ (5) ) can only involve the two pairs of conditional and conditioning factors $(F_{1}, F_{2})$ and $(F_{3}, F_{4})$ . Then the universally optimal design is exactly the $2^{4}$ full factorial design. Thus, to avoid trivialities, we let $n \geq 5$ in the discussion of design selection in the next section.

4. Minimum contamination and complementary set theory

The designs meeting the conditions (i),…,(iv) of Theorem 3.1 are universally optimal under model (Equation6(6) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + Z_{01} β_{01} + Z_{11} β_{11}, \end{aligned}$ (6) ). In addition, ${\hat{β}}_{h 1} = N^{- 1} Z_{h 1}^{^{⊤}} y$ is the best linear unbiased estimator of $β_{h 1}$ , h = 0, 1. However, nonnegligible interactions may exist and ${\hat{β}}_{h 1}$ is no longer unbiased in this case. We revert back to the model (Equation5(5) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + \sum_{s = 0}^{2} \sum_{l = 1}^{n - 2} Z_{s l} β_{s l} . \end{aligned}$ (5) ), which includes all interactions, to assess the impact of possible presence of interactions on ${\hat{β}}_{h 1}$ . Under model (Equation5(5) $\begin{aligned} E (y) = z (0 \dots 0) β (0 \dots 0) + \sum_{s = 0}^{2} \sum_{l = 1}^{n - 2} Z_{s l} β_{s l} . \end{aligned}$ (5) ), ${\hat{β}}_{h 1}$ has bias $N^{- 1} \sum_{s = 0}^{2} \sum_{l = 2}^{n - 2} Z_{h 1}^{^{⊤}} Z_{s l} β_{s l}$ . The matrix $N^{- 1} \sum_{s = 0}^{2} \sum_{l = 2}^{n - 2} Z_{h 1}^{^{⊤}} Z_{s l}$ is referred to as an alias matrix in Wu and Hamada (Citation2021, p. 419). A reasonable measure of the bias in ${\hat{β}}_{h 1}$ caused by the interactions, as in Tang and Deng (Citation1999), is $K_{s l} (h) = N^{- 2} tr [Z_{h 1}^{^{⊤}} Z_{s l} Z_{s l}^{^{⊤}} Z_{h 1}] = N^{- 2} tr [X_{h 1}^{^{⊤}} X_{s l} X_{s l}^{^{⊤}} X_{h 1}],$ where the last equality holds because $X_{s l}$ is an orthogonal transform of $Z_{s l}$ . Based on the effect hierarchy in (Equation4(4) $\begin{aligned} V_{01} > V_{11} > V_{02} > V_{12} > V_{22} > V_{03} > V_{13} > V_{23} > \dots > V_{0, n - 2} > V_{1, n - 2} > V_{2, n - 2} . \end{aligned}$ (4) ), one should minimize the bias successively in order of priority. Thus, we define a minimum contamination design as the one which minimizes the terms of (8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) in a sequential manner from left to right. In (Equation8(8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) ), $K_{s l} (0)$ appears before $K_{s l} (1)$ because the contamination or bias in ${\hat{β}}_{01}$ is deemed more severe than in ${\hat{β}}_{11}$ . The concept of minimizing contamination due to the existence of interactions is not new. We note that Cheng and Tang (Citation2005) used this idea to develop a general theory for minimum aberration.

The minimum contamination criterion in (Equation8(8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) ) induces a ranking of designs of the same run size. It is time consuming, however, to find the minimum contamination design via complete search using (Equation8(8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) ) if n is large. A useful technique of design construction is via complementary designs. Tang and Wu (Citation1996) provided identities related to the wordlength pattern of a regular two-level design to that of its complementary design. Suen et al. (Citation1997) extended these identities to regular $s^{n - p}$ designs. Cheng (Citation2014, p. 179) reviewed the design construction using complementary designs.

We now focus on regular designs under the conditional effect model due to their nice properties and popularity among practitioners. Let $Δ_{r}$ be the set of nonnull $r \times 1$ binary vectors. All operations with these vectors are over the finite field GF(2). Regarding the notation, we do not apply bold font style to these binary vectors to distinguish them from the vectors with the elements belonging to real numbers. A regular design in $N = 2^{r}$ (r<n) runs is given by n distinct vectors $b_{1}, \dots, b_{n}$ from $Δ_{r}$ such that the matrix $B = (b_{1}, \dots, b_{n})$ has full row rank. The design consists of the N treatment combinations of the form $a^{^{⊤}} B$ , where $a \in Δ_{r} \cup {0}$ .

In the following, we define some useful quantities to represent $K_{s l} (h)$ . Let $A_{l}^{(1)}$ be the number of ways of choosing l out of $b_{2}, b_{4}, \dots, b_{n}$ such that the sum of the chosen l equals 0; $A_{l}^{(21)}$ be the number of ways of choosing l out of $b_{4}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${b_{1}, b_{1} + b_{2}}$ ; $A_{l}^{(22)}$ be the number of ways of choosing l out of $b_{2}, b_{5}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${b_{3}, b_{3} + b_{4}}$ ; $A_{l}^{(2)} = A_{l}^{(21)} + A_{l}^{(22)}$ ; $A_{l}^{(31)}$ be the number of ways of choosing l out of $b_{4}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${0, b_{2}}$ ; $A_{l}^{(32)}$ be the number of ways of choosing l out of $b_{2}, b_{5}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${0, b_{4}}$ ; $A_{l}^{(3)} = A_{l}^{(31)} + A_{l}^{(32)}$ ; $A_{l}^{(42)}$ be the number of ways of choosing l out of $b_{2}, b_{5}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${b_{1} + b_{3}, b_{1} + b_{3} + b_{4}}$ ; $A_{l}^{(43)}$ be the number of ways of choosing l out of $b_{5}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${b_{1} + b_{3}, b_{1} + b_{3} + b_{4}}$ ; $A_{l}^{(52)}$ be the number of ways of choosing l out of $b_{5}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${b_{1} + b_{2} + b_{3}, b_{1} + b_{2} + b_{3} + b_{4}}$ ; $A_{l}^{(7)}$ be the number of ways of choosing l out of $b_{5}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${b_{1} + b_{3}, b_{1} + b_{2} + b_{3}, b_{1} + b_{3} + b_{4}, b_{1} + b_{2} + b_{3} + b_{4}}$ ; $A_{l}^{(8)}$ be the number of ways of choosing l out of $b_{5}, \dots, b_{n}$ such that the sum of the chosen l is in the set ${b_{1}, b_{3}, b_{1} + b_{2}, b_{1} + b_{4}, b_{2} + b_{3}, b_{3} + b_{4}, b_{1} + b_{2} + b_{4}, b_{2} + b_{3} + b_{4}}$ . The next result, with the proof deferred to the appendix, gives expressions for $K_{s l} (h)$ in terms of the quantities just introduced.

Theorem 4.1

For $2 \leq l \leq n - 2$ , we have

(a)	$K_{0 l} (0) = (l + 1) A_{l + 1}^{(1)} + (n - l - 1) A_{l - 1}^{(1)}$ ;
(b)	$K_{0 l} (1) = A_{l - 1}^{(2)} + A_{l}^{(2)}$ ;
(c)	$K_{1 l} (0) = (n - l - 1) A_{l - 2}^{(2)} + A_{l - 1}^{(2)} + l A_{l}^{(2)}$ ;
(d)	$K_{1 l} (1) = 2 A_{l - 1}^{(3)} + 2 {A_{l - 1}^{(42)} + A_{l - 2}^{(43)} + A_{l - 1}^{(52)}}$ ;
(e)	$K_{2 l} (0) = 2 A_{l - 2}^{(7)} + (n - l - 1) A_{l - 3}^{(7)} + (l - 1) A_{l - 1}^{(7)}$ ;
(f)	$K_{2 l} (1) = 2 A_{l - 2}^{(8)}$ .

In view of Theorem 4.1, sequential minimization of K is equivalent to that of the terms of $A = {A_{3}^{(1)}, A_{2}^{(2)}, A_{1}^{(42)} + A_{1}^{(52)}, A_{1}^{(7)}, A_{4}^{(1)}, A_{3}^{(2)}, \dots}$ , which is reduced to $A = {A_{3}^{(1)}, A_{2}^{(2)}, A_{1}^{(7)}, A_{4}^{(1)}, A_{3}^{(2)}, \dots}$ because $F_{1}, F_{2}, F_{3}, F_{4}$ form a complete factorial, implying $A_{1}^{(42)} + A_{1}^{(52)} = A_{1}^{(7)}$ .

We now develop a complementary set theory for the first four terms in the sequence A. Let $\tilde{T}$ be the complement of ${b_{2}, b_{4}, \dots, b_{n}}$ in $Δ_{r}$ ; $A_{l} (\tilde{T})$ be the number of ways of choosing l members of $\tilde{T}$ such that the sum of the chosen l equals 0. Let $T_{12} = \tilde{T} ∖ {b_{1}, b_{1} + b_{2}}$ ; $T_{34} = \tilde{T} ∖ {b_{3}, b_{3} + b_{4}}$ ; $A_{l}^{(12)} (T_{12})$ be the number of ways of choosing l members of $T_{12}$ such that the sum of the chosen l is in ${b_{1}, b_{1} + b_{2}}$ ; $A_{l}^{(34)} (T_{34})$ be the number of ways of choosing l members of $T_{34}$ such that the sum of the chosen l is in ${b_{3}, b_{3} + b_{4}}$ .

Theorem 4.2

Let $c_{j}$ , $j = 1, \dots, 5$ , be constants irrelevant to designs and $T = Δ_{r} ∖ {b_{5}, \dots, b_{n}}$ . Define $H_{i} (\cdot, \cdot)$ as Equation (2) in Mukerjee and Wu (Citation2001). We have

(a)	$A_{3}^{(1)} = c_{1} - A_{3} (\tilde{T})$ ;
(b)	$A_{4}^{(1)} = c_{2} + A_{3} (\tilde{T}) + A_{4} (\tilde{T})$ ;
(c)	$A_{2}^{(2)} = c_{3} + A_{2}^{(12)} (T_{12}) + A_{2}^{(34)} (T_{34})$ ;
(d)	$A_{1}^{(7)} = B_{1} + B_{2} + B_{3} + B_{4}$ , where $B_{1} = c_{41} + H_{1} ({b_{1} + b_{3}}, T)$ if $b_{1} + b_{3} = b_{j}$ for some $j \in {5, \dots, n}$ and zero otherwise; $B_{2} = c_{42} + H_{1} ({b_{1} + b_{2} + b_{3}}, T)$ if $b_{1} + b_{2} + b_{3} = b_{j}$ for some $j \in {5, \dots, n}$ and zero otherwise; $B_{3} = c_{43} + H_{1} ({b_{1} + b_{3} + b_{4}}, T)$ if $b_{1} + b_{3} + b_{4} = b_{j}$ for some $j \in {5, \dots, n}$ and zero otherwise; $B_{4} = c_{44} + H_{1} ({b_{1} + b_{2} + b_{3} + b_{4}}, T)$ if $b_{1} + b_{2} + b_{3} + b_{4} = b_{j}$ for some $j \in {5, \dots, n}$ and zero otherwise. $c_{4 j}$ 's are constants for every design.

Proof.

Parts (a) and (b) are evident from Tang and Wu (Citation1996).

For (c), note that $A_{2}^{(21)} = H_{2} ({b_{1}}, {b_{4}, \dots, b_{n}}) + H_{2} ({b_{1} + b_{2}}, {b_{4}, \dots, b_{n}})$ , which can be simplified as $A_{2}^{(21)} = c + H_{2} ({b_{1}}, {b_{2}, b_{1} + b_{2}} \cup T_{12}) + H_{2} ({b_{1} + b_{2}}, {b_{1}, b_{2}} \cup T_{12})$ by Lemmas 1 and 3 in Mukerjee and Wu (Citation2001), where c is a constant for every design. Because the design is an orthogonal array of strength two, we have $H_{2} ({b_{1}}, {b_{2}, b_{1} + b_{2}} \cup T_{12}) = 1 + H_{2} ({b_{1}}, T_{12})$ and $H_{2} ({b_{1} + b_{2}}, {b_{1}, b_{2}} \cup T_{12}) = 1 + H_{2} ({b_{1} + b_{2}}, T_{12})$ . Hence $A_{2}^{(21)} = c + 2 + H_{2} ({b_{1}}, T_{12}) + H_{2} ({b_{1} + b_{2}}, T_{12}) = c + 2 + A_{2}^{(12)} (T_{12})$ . Similarly, $A_{2}^{(22)} = c^{'} + 2 + A_{2}^{(34)} (T_{34})$ , where $c^{'}$ is a constant for every design. Therefore, we have $A_{2}^{(2)} = c_{3} + A_{2}^{(12)} (T_{12}) + A_{2}^{(34)} (T_{34})$ by letting $c_{3} = c + c^{'} + 4$ .

For (d), note that $A_{1}^{(7)} = H_{1} ({b_{1} + b_{3}}, {b_{5}, \dots, b_{n}}) + H_{1} ({b_{1} + b_{3} + b_{4}}, {b_{5}, \dots, b_{n}}) + H_{1} ({b_{1} + b_{2} + b_{3}}, {b_{5}, \dots, b_{n}}) + H_{1} ({b_{1} + b_{2} + b_{3} + b_{4}}, {b_{5}, \dots, b_{n}})$ . Let $F = Δ_{r} ∖ {b_{1} + b_{3}, b_{5}, \dots, b_{n}}$ . If $b_{1} + b_{3} \neq b_{j}$ for $j = 5, \dots, n$ , then $H_{1} ({b_{1} + b_{3}}, {b_{5}, \dots, b_{n}}) = 0$ . If $b_{1} + b_{3} = b_{j}$ for some $j \in {5, \dots, n}$ , then $H_{1} ({b_{1} + b_{3}}, {b_{5}, \dots, b_{n}}) = c_{41} + H_{1} ({b_{1} + b_{3}}, F)$ by Lemmas 1 and 3 in Mukerjee and Wu (Citation2001), where $c_{41}$ is a constant for every design. Since $b_{1} + b_{3} = b_{j}$ for some $j \in {5, \dots, n}$ , we have F = T and $H_{1} ({b_{1} + b_{3}}, F) = H_{1} ({b_{1} + b_{3}}, T)$ . Thus $H_{1} ({b_{1} + b_{3}}, {b_{5}, \dots, b_{n}}) = B_{1}$ . Similarly, we have $H_{1} ({b_{1} + b_{2} + b_{3}}, T) = B_{2}$ , $H_{1} ({b_{1} + b_{3} + b_{4}}, T) = B_{3}$ and $H_{1} ({b_{1} + b_{2} + b_{3} + b_{4}}, T) = B_{4}$ . So, $A_{1}^{(7)} = B_{1} + B_{2} + B_{3} + B_{4}$ .

Theorem 4.2 provides a way to evaluate designs using the sequence A, and equivalently sequence K, with the number of factors $n = (N - 1) + 2 - \tilde{t} = N - 1 - \tilde{t}$ , where $\tilde{t}$ is the cardinality of $\tilde{T}$ . Once $\tilde{T}$ is constructed, one can quickly get ${b_{2}, b_{4}, \dots, b_{n}}$ by the identity $Δ_{r} = \tilde{T} \cup {b_{2}, b_{4}, \dots, b_{n}}$ , and construct the design ${b_{1}, b_{2}, b_{3}, b_{4}, \dots, b_{n}}$ . Constructing $\tilde{T}$ with minimum contamination given a large $\tilde{t}$ is usually time-consuming. It is more practical to find a small $\tilde{T}$ with minimum contamination, leading to a large n. Thus, Theorem 4.2 helps find large minimum contamination designs. For example, consider $\tilde{t} = 5$ . The set $\tilde{T} = {α_{1}, α_{2}, α_{3}, α_{4}, α_{1} + α_{2}}$ can be verified to have maximal $A_{3} (\tilde{T}) = 1$ , where $α_{1}, α_{2}, α_{3}, α_{4}$ are four linearly independent vectors from $Δ_{r}$ . Moreover, assigning $α_{1} = b_{1}$ , $α_{2} = b_{3} + b_{4}$ , $α_{3} = b_{3}$ and $α_{4} = b_{1} + b_{2}$ results in minimal $A_{2}^{(12)} (T_{12}) = A_{2}^{(34)} (T_{34}) = 0$ . Thus, we have $\tilde{T} = {b_{1}, b_{3} + b_{4}, b_{3}, b_{1} + b_{2}, b_{1} + b_{3} + b_{4}}$ and construct a minimum contamination design with the number of factors N−6, which equals 10, 26, 58 for 16-, 32- and 64-run designs.

5. Efficient design search and examples

Finding minimum contamination designs using (Equation8(8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) ) is a daunting task for even moderate run size and number of factors. This section presents an extension of a searching procedure given by Mukerjee et al. (Citation2017) to the current setting and provides examples for illustration.

5.1. A procedure for efficient design search

The minimum contamination criterion (Equation8(8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) ) can be applied to regular and nonregular designs, but requires heavy computation of $K_{s l} (h) = N^{- 2} tr [X_{h 1}^{^{⊤}} X_{s l} X_{s l}^{^{⊤}} X_{h 1}]$ . By noting that $X_{s l} X_{s l}^{^{⊤}}$ is reminiscent of minimum moment aberration in Xu (Citation2003), Mukerjee et al. (Citation2017) developed an efficient computational procedure for $K_{s l} (h)$ . We now extend this procedure for computing $X_{s l} X_{s l}^{^{⊤}}$ to double-pair conditional effect models. For $0 \leq c \leq n - 2$ , let $Q_{0} (c) = 1$ , $Q_{1} (c) = 2 c - (n - 4)$ , and (9) $\begin{aligned} Q_{l} (c) = l^{- 1} {[2 c - (n - 4)] Q_{l - 1} (c) - (n - l - 2) Q_{l - 2} (c)}, \end{aligned}$ (9) where $2 \leq l \leq n - 2$ . Write $\tilde{D}$ for the subarray given by the last n−4 columns of $D$ (i.e. only consisting of traditional factors). For $1 \leq u, w \leq N$ , let $c_{u w}$ be the number of positions where the uth and wth rows of $\tilde{D}$ have the same entry, and $q_{s l} (u, w)$ be the $(u, w)$ th element of $X_{s l} X_{s l}^{^{⊤}}$ . Denote the $(u, j)$ th element of $D$ by $d_{u j}$ . Then the following result holds.

Theorem 5.1

For $1 \leq u, w \leq N$ and $2 \leq l \leq n - 2$ , we have

(a)	$q_{0 l} (u, w) = (d_{u 2} d_{w 2} d_{u 4} d_{w 4}) Q_{l - 2} (c_{u, w}) + (d_{u 2} d_{w 2} + d_{u 4} d_{w 4}) Q_{l - 1} (c_{u, w}) + Q_{l} (c_{u, w})$ ;
(b)	$q_{1 l} (u, w) = (d_{u 1} d_{w 1} + d_{u 1} d_{w 1} d_{u 2} d_{w 2} + d_{u 3} d_{w 3} + d_{u 3} d_{w 3} d_{u 4} d_{w 4}) Q_{l - 1} (c_{u, w})$ ;
(c)	$q_{2 l} (u, w) = d_{u 1} d_{w 1} d_{u 3} d_{w 3} (1 + d_{u 2} d_{w 2} + d_{u 4} d_{w 4} + d_{u 2} d_{w 2} d_{u 4} d_{w 4}) Q_{l - 2} (c_{u, w})$ .

Proof.

For $2 \leq l \leq n - 2$ , let $Σ^{(l)}$ be the sum over binary tuples $j_{5} \dots j_{n}$ such that l of $j_{5}, \dots, j_{n}$ equal 1. We have $\begin{aligned} q_{0 l} (u, w) & = Σ^{(l)} x (u; 0000 j_{5} \dots j_{n}) x (w; 0000 j_{5} \dots j_{n}) \\ + Σ^{(l - 1)} x (u; 0100 j_{5} \dots j_{n}) x (w; 0100 j_{5} \dots j_{n}) \\ + Σ^{(l - 1)} x (u; 0001 j_{5} \dots j_{n}) x (w; 0001 j_{5} \dots j_{n}) \\ + Σ^{(l - 2)} x (u; 0101 j_{5} \dots j_{n}) x (w; 0101 j_{5} \dots j_{n}) \\ = (d_{u 2} d_{w 2} d_{u 4} d_{w 4}) Ψ_{l - 2} (c_{u, w}) + (d_{u 2} d_{w 2} + d_{u 4} d_{w 4}) Ψ_{l - 1} (c_{u, w}) + Ψ_{l} (c_{u, w}), \end{aligned}$ where $Ψ_{l} (u, w) = Σ^{(l)} \prod_{s = 5}^{n} (d_{u} d_{w})^{j_{s}}$ . Similarly, we have $\begin{aligned} q_{1 l} (u, w) & = (d_{u 1} d_{w 1} + d_{u 1} d_{w 1} d_{u 2} d_{w 2} + d_{u 3} d_{w 3} + d_{u 3} d_{w 3} d_{u 4} d_{w 4}) Ψ_{l - 1} (c_{u, w}), \\ q_{2 l} (u, w) & = d_{u 1} d_{w 1} d_{u 3} d_{w 3} (1 + d_{u 2} d_{w 2} + d_{u 4} d_{w 4} + d_{u 2} d_{w 2} d_{u 4} d_{w 4}) Ψ_{l - 2} (c_{u, w}) . \end{aligned}$ The result will follow if $Ψ_{l} (u, w) = Q_{l} (c_{u w})$ . It is clear that $Ψ_{0} (u, w) = 1$ and $Ψ_{1} (u, w) = c_{u w} + (- 1) (n - 4 - c_{u w}) = 2 c_{u w} - (n - 4)$ . It remains to show $Ψ_{l} (u, w)$ satisfies the recursion relation (Equation9(9) $\begin{aligned} Q_{l} (c) = l^{- 1} {[2 c - (n - 4)] Q_{l - 1} (c) - (n - l - 2) Q_{l - 2} (c)}, \end{aligned}$ (9) ).

Let $Φ (ξ) = \prod_{j = 5}^{n} (1 + ξ d_{u j} d_{w j})$ and let $Φ_{l} (ξ)$ be the lth derivative of $Φ (ξ)$ . Note that $Ψ_{l} (u, w) = Φ_{l} (0) / l!$ . Differentiation of $\log Φ (ξ)$ yields $\begin{aligned} Φ_{1} (ξ) & = (\sum_{j = 5}^{n} \frac{d_{u j} d_{w j}}{1 + ξ d_{u j} d_{w j}}) Φ (ξ) \\ = (\frac{c_{u w}}{1 + ξ} - \frac{(n - 4) - c_{u w}}{1 - ξ}) Φ (ξ), \end{aligned}$ that is, $(1 - ξ^{2}) Φ_{1} (ξ) = {2 c_{u w} - (n - 4) (1 + ξ)} Φ (ξ)$ . Differentiating this l−1 and taking $ξ = 0$ , we get $Φ_{l} (0) = [2 c_{u w} - (n - 4)] Φ_{l - 1} (0) - (l - 1) (n - l - 2) Φ_{l - 2} (0) .$ This leads to (Equation9(9) $\begin{aligned} Q_{l} (c) = l^{- 1} {[2 c - (n - 4)] Q_{l - 1} (c) - (n - l - 2) Q_{l - 2} (c)}, \end{aligned}$ (9) ) by using $Ψ_{l} (u, w) = Φ_{l} (0) / l!$ .

With the help of Theorem 5.1 and suggested by Mukerjee et al. (Citation2017), an algorithm is provided as follows.

Search for minimum contamination designs by first listing of all nonisomorphic regular designs for given run size $N (\geq 16)$ and number of factors $n (\geq 5)$ .
For each nonisomorphic regular design, permute its columns such that the resulting design satisfies the conditions in Theorem 3.1. Let the first four columns represent the two pairs of conditional and conditioned factors, that is, $F_{1}, F_{2}$ and $F_{3}, F_{4}$ .
Calculate the criterion in (Equation8(8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) ) by using Theorem 5.1, and hence find a minimum contamination design.

This procedure mostly consumes affordable computational time. For N = 16 and n = 10, for example, it takes around 3.94 minutes to find a minimum contamination design on a desktop with 3.8 GHz CPU and 64 GB of RAM.

5.2. Examples

We apply the algorithm presented in Section 5.1 to 16- and 32-run designs. Afterwards, the light bulb experiment (Wu & Hamada, Citation2021, p. 347) mentioned in Section 1 is revisited.

For N = 16 and 32, a list of all nonisomorphic regular designs are given in the catalogues in Chen et al. (Citation1993). Table 1 exhibits the results for N = 16 and $5 \leq n \leq 12$ . In the table, the numbers 1,2,4,8 represent basic factors in a design. The other numbers represent added factors. For example, for n = 5, if the five factors are denoted by A, B, C, D, E, then the minimum contamination design is the one with the defining relation E = ABCD because 15 = 1 + 2 + 4 + 8. We can see that all minimum contamination designs under conditional effect models are also minimum aberration under traditional models. The finding supports using minimum aberration designs under traditional models to perform subsequent de-aliasing analysis in Su and Wu (Citation2017). Table 2 exhibits the results for N = 32 and $6 \leq n \leq 18$ . Same as Table 1, the numbers 1,2,4,8,16 represent basic factors in a design. The other numbers represent added factors. For example, for n = 6, if the five factors are denoted by A, B, C, D, E, F, then the minimum contamination design is the one with the defining relation F = ABCDE because 31 = 1 + 2 + 4 + 8 + 16. The R codes for generating these designs are attached to the supplementary material.

Table 2. Regular minimum contamination designs for N = 32.

Display Table

The light bulb experiment mentioned in Wu and Hamada (Citation2021, p. 347) studied a light bulb sealing process performed to improve a cosmetic problem that was frequently occurring (Taguchi, Citation1987, Section 17.6). The outcomes were determined mainly by six two-level traditional factors A, B, C, D, E, F and two pairs of conditional and conditioning factors

(H, G)

and

(J, I)

, respectively. A 16-run regular fractional factorial design is to be conducted to study this light bulb sealing process. With the use of the algorithm in Section 5.1, we obtain the minimum contamination design with the design matrix given in Table (the column indices also provided in Table ). The first four columns are assigned to the two pairs of conditional and conditioning factors

(H, G)

and

(J, I)

, respectively; the remaining columns are assigned to the six traditional factors. This

2^{10 - 6}

design has the defining generators B = HI, C = AH, A = GI, D = HJ, E = AIJ and F = AHIJ. The corresponding K-sequence in (Equation8) has

2 \times 3 \times (10 - 2 + 1) = 42

elements given as follows:

\begin{aligned} K & = {9, 10, 17, 4, 2, 0, 28, 16, 21, 12, 12, 6, 35, 16, 54, 16, 30, 18, 28, 12, 18, 24, 40, 20, \\ 19, 6, 17, 4, 30, 12, 0, 4, 1, 0, 12, 6, 1, 0, 0, 0, 2, 2} . \end{aligned}

We also evaluate the design provided by Taguchi (Citation1987, Section 17.6) (see Table 7.13 of Wu and Hamada (Citation2021, p.347)) by the minimum contamination criterion in (Equation8). The resulting K-sequence is given by

\begin{aligned} K & = {15, 7, 11, 6, 3, 0, 20, 19, 30, 10, 10, 6, 35, 18, 38, 22, 31, 18, 32, 10, 32, 18, 40, 20, \\ 13, 7, 15, 4, 29, 12, 4, 3, 2, 0, 14, 6, 1, 0, 0, 0, 1, 2} . \end{aligned}

The first element of the K-sequence of the design in Table is 9, smaller than that of the design provided by Taguchi (Citation1987, Section 17.6). Thus, the proposed algorithm found a better design in terms of the minimization of contamination caused by interactions under the double-pair conditional effect model.

6. Concluding remarks

This paper extends the work of Mukerjee et al. (Citation2017) to two pairs of conditional and conditioning factors. The conditional effect model in (Equation3(3) $\begin{aligned} β = ν^{- 1} W \otimes H^{\otimes (n - 4)} τ, \end{aligned}$ (3) ) is an orthogonal reparameterization of the traditional model in (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ). Such a reparameterization introduces a new effect hierarchy order of factorial effects, resulting in a different design evaluation from the minimum aberration due to Fries and Hunter (Citation1980). We note that alternative reparameterization of (Equation1(1) $\begin{aligned} τ = H^{\otimes n} θ (or equivalently, θ = ν^{- 1} H^{\otimes n} τ), \end{aligned}$ (1) ) is required if the topic of interest does not depend on conditional effects. For example, Yang and Speed (Citation2002) proposed to define the factorial effects with reference to natural baseline levels of the factors, referred to as baseline parameterization. Later, Mukerjee and Tang (Citation2012), Mukerjee and Huda (Citation2016) and Sun and Tang (Citation2022) discussed design optimality and construction under the baseline parameterization.

Table 1. Regular minimum contamination designs for N = 16.

Display Table

Table 3. Sixteen-run minimum contamination design with ten factors for the light bulb experiment.

Download CSV Display Table

Mukerjee et al. (Citation2017) claimed that, in practice, the number of conditional and conditioned pairs seldom exceeds two. On the other hand, the effect hierarchy order in (Equation4(4) $\begin{aligned} V_{01} > V_{11} > V_{02} > V_{12} > V_{22} > V_{03} > V_{13} > V_{23} > \dots > V_{0, n - 2} > V_{1, n - 2} > V_{2, n - 2} . \end{aligned}$ (4) ) is only irrelevant to the value of r when the number of pairs is not greater than two. When dealing with more than two pairs, the effect hierarchy order is a nontrivial function of r and deriving useful results can be exceedingly difficult. Alternatively, we note that the minimum contamination criterion in (Equation8(8) $\begin{aligned} K = {K_{02} (0), K_{02} (1), K_{12} (0), K_{12} (1), K_{22} (0), K_{22} (1), K_{03} (0), K_{03} (1), \dots} \end{aligned}$ (8) ) can be applied to regular as well as nonregular designs; however, this paper focuses on regular designs owing to their popularity and theoretical underpinnings. The application of this framework to nonregular designs is left for future research. Another interesting future direction suggested by a reviewer is to involve qualitative four-level conditional/conditioning factors. It is known that a qualitative four-level factor can be decomposed into three orthogonal main effect components with two levels each (Wu & Hamada, Citation2021, chapter 7). However, our theory cannot be directly applied since the three two-level main effect components do not share the same properties as a real two-level factor. The covariance structure for the main effect components is different from that for two-level factors; see Joseph et al. (Citation2009) for a discussion. The resulting effect hierarchy order of conditional factorial effects may be complicated with vague interpretations. Since this topic requires a nontrivial extension, we leave it for future research.

Supplemental material

Supplemental Material

Download (6.5 KB)

Acknowledgments

We thank the Editor and two Reviewers for their constructive comments and suggestions, which have helped us to improve the article.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

We gratefully acknowledge funding from Academia Sinica [Grant Number AS-CDA-111-M05] and from National Science and Technology Council [Grant Number 111-2118-M-001-001-MY3].

References

Ai, M. Y., Kang, L., & Joseph, V. R. (2009). Bayesian optimal blocking of factorial designs. Journal of Statistical Planning Inference, 139(9), 3319–3328. https://doi.org/10.1016/j.jspi.2009.03.008
Web of Science ®Google Scholar
Chang, M. C. (2019). De-aliasing in two-level factorial designs: A Bayesian approach. Journal of Statistical Planning and Inference, 203, 82–90. https://doi.org/10.1016/j.jspi.2019.03.002
Web of Science ®Google Scholar
Chang, M. C. (2022). A unified framework for minimum aberration. Statistica Sinica, 32(1), 251–270.
Web of Science ®Google Scholar
Chang, M. C., & Cheng, C. S. (2018). A Bayesian approach to the selection of two-level multi-stratum factorial designs. Annals of Statistics, 46(4), 1779–1806.
Web of Science ®Google Scholar
Chen, J., Sun, D. X., & Wu, C. F. J. (1993). A catalogue of two-level and three-level fractional factorial designs with small runs. International Statistical Review, 61(1), 131–145. https://doi.org/10.2307/1403599
Web of Science ®Google Scholar
Cheng, C. S. (2014). Theory of factorial design: Single- and multi-stratum experiments. CRC Press.
Google Scholar
Cheng, C. S., Deng, L. Y., & Tang, B. (2002). Generalized minimum aberration and design efficiency for nonregular fractional factorial designs. Statistica Sinica, 12(4), 991–1000.
Web of Science ®Google Scholar
Cheng, C. S., & Tang, B. (2005). A general theory of minimum aberration and its applications. Annals of Statistics, 33(2), 944–958. https://doi.org/10.1214/009053604000001228
Web of Science ®Google Scholar
Chipman, H., Hamada, M., & Wu, C. F. J. (1997). A bayesian variable-selection approach for analyzing designed experiments with complex aliasing. Technometrics, 39(4), 372–381. https://doi.org/10.1080/00401706.1997.10485156
Web of Science ®Google Scholar
Fries, A., & Hunter, W. G. (1980). Minimum aberration 2k−p designs. Technometrics, 22(4), 601–608.
Web of Science ®Google Scholar
Joseph, V. R. (2006). A Bayesian approach to the design and analysis of fractionated experiments. Technometrics, 48(2), 219–229. https://doi.org/10.1198/004017005000000652
Web of Science ®Google Scholar
Joseph, V. R., Ai, M., & Wu, C. F. J. (2009). Bayesian-inspired minimum aberration two- and four-level designs. Biometrika, 96(1), 95–106. https://doi.org/10.1093/biomet/asn062
Web of Science ®Google Scholar
Joseph, V. R., & Delaney, J. D. (2007). Functionally induced priors for the analysis of experiments. Technometrics, 49(1), 1–11. https://doi.org/10.1198/004017006000000372
Web of Science ®Google Scholar
Kang, L., & Joseph, V. R. (2009). Bayesian optimal single arrays for robust parameter design. Technometrics, 51(3), 250–261. https://doi.org/10.1198/tech.2009.08057
Web of Science ®Google Scholar
Kerr, M. K. (2001). Bayesian optimal fractional factorials. Statistica Sinica, 11(3), 605–630.
Web of Science ®Google Scholar
Kiefer, J. (1975). Construction and optimality of generalized Youden designs. In A survey of statistical design and linear models (pp. 333–353). North-Holland.
Google Scholar
Lawson, J. (2020). Comparison of conditional main effects analysis to the analysis of follow-up experiments for separating confounded two-factor interaction effects in 2IVk−p fractional factorial experiments. Quality and Reliability Engineering International, 36(4), 1454–1472. https://doi.org/10.1002/qre.v36.4
Web of Science ®Google Scholar
Mak, S., & Wu, C. F. J. (2019). cmenet: A new method for bi-level variable selection of conditional main effects. Journal of the American Statistical Association, 114(526), 844–856. https://doi.org/10.1080/01621459.2018.1448828
Web of Science ®Google Scholar
Mitchell, T. J., Morris, M. D., & Ylvisaker, D. (1995). Two-level fractional factorials and Bayesian prediction. Statistica Sinica, 15(2), 559–573.
Google Scholar
Mukerjee, R., & Huda, S. (2016). Approximate theory-aided robust efficient factorial fractions under baseline parameterization. Annals of the Institute of Statistical Mathematics, 68(4), 787–803. https://doi.org/10.1007/s10463-015-0509-x
Web of Science ®Google Scholar
Mukerjee, R., & Tang, B. (2012). Optimal fractions of two-level factorials under a baseline parametrization. Biometrika, 99(1), 71–84. https://doi.org/10.1093/biomet/asr071
Web of Science ®Google Scholar
Mukerjee, R., & Wu, C. F. J. (2001). Minimum aberration designs for mixed factorials in terms of complementary sets. Statistica Sinica, 11(1), 225–239.
Web of Science ®Google Scholar
Mukerjee, R., & Wu, C. F. J. (2006). A modern theory of factorial designs. Springer.
Google Scholar
Mukerjee, R., Wu, C. F. J., & Chang, M. C. (2017). Two-level minimum aberration designs under a conditional model with a pair of conditional and conditioning factors. Statistica Sinica, 27(3), 997–1016.
Web of Science ®Google Scholar
Pistone, G., & Wynn, H. P. (1996). Generalised confounding with gröbner bases. Biometrika, 83(3), 653–666. https://doi.org/10.1093/biomet/83.3.653
Web of Science ®Google Scholar
Sabbaghi, A. (2020). An algebra for the conditional main effect parameterization. Statistica Sinica, 30(2), 903–924.
Web of Science ®Google Scholar
Su, H., & Wu, C. F. J. (2017). CME analysis: A new method for unraveling aliased effects in two-level fractional factorial experiments. Journal of Quality Technology, 49(1), 1–10. https://doi.org/10.1080/00224065.2017.11918181
Web of Science ®Google Scholar
Suen, C. Y., Chen, H., & Wu, C. F. J. (1997). Some identities on qn−m designs with application to minimum aberration designs. Annals of Statistics, 25(3), 1176–1188. https://doi.org/10.1214/aos/1069362743
Web of Science ®Google Scholar
Sun, C. Y., & Tang, B. (2022). Relationship between orthogonal and baseline parameterizations and its applications to design constructions. Statistica Sinica, 32(1), 239–250.
Web of Science ®Google Scholar
Taguchi, G. (1987). System of experimental design: Engineering methods to optimize quality and minimize costs. UNIPUB/Kraus International Publications.
Google Scholar
Tang, B., & Deng, L. Y. (1999). Minimum G2-aberration for nonregular fractional factorial designs. Annals of Statistics, 27(6), 1914–1926.
Web of Science ®Google Scholar
Tang, B., & Wu, C. F. J. (1996). Characterization of minimum aberration 2n−k designs in terms of their complementary designs. Annals of Statistics, 27(6), 1914–1926.
Google Scholar
Wu, C. F. J. (2015). Post-Fisherian experimentation: From physical to virtual. Journal of the American Statistical Association, 110(510), 612–620. https://doi.org/10.1080/01621459.2014.914441
Web of Science ®Google Scholar
Wu, C. F. J. (2018). A fresh look at effect aliasing and interactions: Some new wine in old bottles. Annals of the Institute of Statistical Mathematics, 70(2), 249–268. https://doi.org/10.1007/s10463-018-0646-0
Web of Science ®Google Scholar
Wu, C. F. J., & Hamada, M. (2021). Experiments: Planning, analysis, and optimization (3rd ed.). Wiley Series.
Google Scholar
Xu, H. (2003). Minimum moment aberration for nonregular designs and supersaturated designs. Statistica Sinica, 13(3), 691–708.
Web of Science ®Google Scholar
Xu, H., & Wu, C. F. J. (2001). Generalized minimum aberration for asymmetrical fractional factorial designs. Annals of Statistics, 29(2), 1066–1077.
Web of Science ®Google Scholar
Yang, Y. H., & Speed, T. P. (2002). Design issues for cDNA microarray experiments. Nature Reviews Genetics, 3(8), 579–588. https://doi.org/10.1038/nrg863
PubMed Web of Science ®Google Scholar

Appendix. Proof of Theorem 4.1

In this proof, a traditional factorial effect is represented by a word, i.e. a subset of

{1, \dots, n}

. For two words

W_{1}

and

W_{2}

, we define

W_{1} △ W_{2}

to be

(W_{1} \cup W_{2}) ∖ (W_{1} \cap W_{2})

. Note that

K_{s l} (h) = N^{- 2} tr [X_{h 1}^{^{⊤}} X_{s l} X_{s l}^{^{⊤}} X_{h 1}]

, which is the sum of squared entries of

N^{- 2} X_{h 1}^{^{⊤}} X_{s l}

. Because the design is regular, each squared entry is either one or zero according to whether the corresponding effects are aliased.

Part (a) is evident from Tang and Deng (Citation1999) except that the number of factors considered in the computation is n−2 (exclude $F_{1}$ and $F_{3}$ ). So we have $K_{0 l} (0) = (l + 1) A_{l + 1}^{(1)} + (n - l - 1) A_{l - 1}^{(1)}$ .

For (b), let $S_{l}$ be the set of all words of length l not containing any word involving 1 and 3. Let $S_{l 2}$ be a subset of $S_{l}$ and 2 belongs to each word in $S_{l 2}$ . Then ${1} △ W$ , $W \in S_{l 2}$ , is of the form ${1, 2} \cup (W ∖ {2})$ , where $(W ∖ {2})$ is of length l−1. Similarly, ${1} △ W$ , $W \in (S ∖ S_{l 2})$ , is of the form ${1} \cup W$ , where W is of length l. Similar argument can be made when the roles of $F_{1}$ and $F_{3}$ , $F_{2}$ and $F_{4}$ are interchanged, respectively. By the definition of $A_{l}^{(21)}$ and $A_{l}^{(22)}$ , we obtain $K_{0 l} (1) = A_{l - 1}^{(2)} + A_{l}^{(2)}$ .

For (c), first consider ${2} △ ({1} \cup W)$ and ${2} △ ({1, 2} \cup W)$ , where W runs through all the words not involving $F_{1}, F_{2}, F_{3}$ and has length l−1. It is equivalent to consider ${1, 2} \cup W$ and ${1} \cup W$ for such W's. This yields $A_{l - 1}^{(21)}$ in $K_{1 l} (0)$ . Next we consider ${j} △ ({1} \cup W)$ and ${j} △ ({1, 2} \cup W)$ , where $j = 4, \dots, n$ and W runs through all the words not involving $F_{1}, F_{2}, F_{3}$ and has length l−1. By Tang and Deng (Citation1999), this yields $(n - l - 1) A_{l - 2}^{(21)} + l A_{l}^{(21)}$ in $K_{1 l} (0)$ . Similar argument can be made when the roles of $F_{1}$ and $F_{3}$ , $F_{2}$ and $F_{4}$ are interchanged, respectively. By the definition of $A_{l}^{(21)}$ and $A_{l}^{(22)}$ , we obtain $K_{1 l} (0) = (n - l - 1) A_{l - 2}^{(2)} + A_{l - 1}^{(2)} + l A_{l}^{(2)}$ .

For (d), first consider ${1} △ ({1} \cup W)$ , ${1} △ ({1, 2} \cup W)$ , ${1, 2} △ ({1} \cup W)$ and ${1, 2} △ ({1, 2} \cup W)$ , where W runs through all the words not involving $F_{1}, F_{2}, F_{3}$ and has length l−1. It is equivalent to consider W, ${2} \cup W$ , ${2} \cup W$ and W for such W's. This yields $2 A_{l - 1}^{(31)}$ in $K_{1 l} (1)$ . Next consider ${1} △ ({3} \cup W)$ , ${1} △ ({3, 4} \cup W)$ , ${1, 2} △ ({3} \cup W)$ and ${1, 2} △ ({3, 4} \cup W)$ , where W runs through all the words not involving $F_{1}, F_{3}, F_{4}$ and has length l−1. For such W's, ${1} △ ({3} \cup W)$ and ${1} △ ({3, 4} \cup W)$ yield $A_{l - 1}^{(42)}$ in $K_{1 l} (1)$ ; ${1, 2} △ ({3} \cup W)$ and ${1, 2} △ ({3, 4} \cup W)$ yield $A_{l - 2}^{(43)} + A_{l - 1}^{(52)}$ in $K_{1 l} (1)$ . Similar argument can be made when the roles of $F_{1}$ and $F_{3}$ , $F_{2}$ and $F_{4}$ are interchanged, respectively. By the definition of $A_{l}^{(3)}$ , we obtain $K_{1 l} (1) = 2 A_{l - 1}^{(3)} + 2 {A_{l - 1}^{(42)} + A_{l - 2}^{(43)} + A_{l - 1}^{(52)}}$ .

For (e), first consider ${2} △ ({1, 3} \cup W)$ , ${2} △ ({1, 3, 4} \cup W)$ , ${2} △ ({1, 2, 3} \cup W)$ , ${2} △ ({1, 2, 3, 4} \cup W)$ and ${4} △ ({1, 3} \cup W)$ , ${4} △ ({1, 3, 4} \cup W)$ , ${4} △ ({1, 2, 3} \cup W)$ , ${4} △ ({1, 2, 3, 4} \cup W)$ , where W runs through all the words not involving $F_{1}, F_{2}, F_{3}, F_{4}$ and has length l−2. This yields $2 A_{l - 2}^{(7)}$ in $K_{2 l} (0)$ . Next consider ${j} △ ({1, 3} \cup W)$ , ${j} △ ({1, 3, 4} \cup W)$ , ${j} △ ({1, 2, 3} \cup W)$ , ${j} △ ({1, 2, 3, 4} \cup W)$ for $j = 5, \dots, n$ . By Tang and Deng (Citation1999), this yields $(n - l - 1) A_{l - 3}^{(7)} + (l - 1) A_{l - 1}^{(7)}$ . So we obtain $K_{2 l} (0) = 2 A_{l - 2}^{(7)} + (n - l - 1) A_{l - 3}^{(7)} + (l - 1) A_{l - 1}^{(7)}$ .

For (f), consider ${j} △ ({1, 3} \cup W)$ , ${j} △ ({1, 3, 4} \cup W)$ , ${j} △ ({1, 2, 3} \cup W)$ , ${j} △ ({1, 2, 3, 4} \cup W)$ for j = 1, 3, and ${i, j} △ ({1, 3} \cup W)$ , ${i, j} △ ({1, 3, 4} \cup W)$ , ${i, j} △ ({1, 2, 3} \cup W)$ , ${i, j} △ ({1, 2, 3, 4} \cup W)$ for $(i, j) = (1, 2), (3, 4)$ , where W runs through all the words not involving $F_{1}, F_{2}, F_{3}, F_{4}$ and has length l−2. This yields $2 A_{l - 2}^{(8)}$ . So we obtain $K_{2 l} (1) = 2 A_{l - 2}^{(8)}$ .

Bayesian-inspired minimum contamination designs under a double-pair conditional effect model

Abstract

1. Introduction

2. Conditional effect model and Bayesian-inspired effect hierarchy

2.1. Conditional effect model

2.2. Bayesian-inspired effect hierarchy

3. Universally optimal designs for main effect model

3.1. Universally optimal designs

4. Minimum contamination and complementary set theory

5. Efficient design search and examples

5.1. A procedure for efficient design search

5.2. Examples

Table 2. Regular minimum contamination designs for N = 32.

6. Concluding remarks

Table 1. Regular minimum contamination designs for N = 16.

Table 3. Sixteen-run minimum contamination design with ten factors for the light bulb experiment.

Supplemental Material

Acknowledgments

Disclosure statement

References

Appendix. Proof of Theorem 4.1

Information for

Open access

Opportunities

Help and information

Bayesian-inspired minimum contamination designs under a double-pair conditional effect model

Abstract

1. Introduction

2. Conditional effect model and Bayesian-inspired effect hierarchy

2.1. Conditional effect model

2.2. Bayesian-inspired effect hierarchy

3. Universally optimal designs for main effect model

3.1. Universally optimal designs

4. Minimum contamination and complementary set theory

5. Efficient design search and examples

5.1. A procedure for efficient design search

5.2. Examples

Table 2. Regular minimum contamination designs for N = 32.

6. Concluding remarks

Table 1. Regular minimum contamination designs for N = 16.

Table 3. Sixteen-run minimum contamination design with ten factors for the light bulb experiment.

Supplemental Material

Acknowledgments

Disclosure statement

Additional information

Funding

References

Appendix. Proof of Theorem 4.1

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date