371
Views
0
CrossRef citations to date
0
Altmetric
Articles

Bayesian-inspired minimum contamination designs under a double-pair conditional effect model

ORCID Icon
Pages 336-349 | Received 01 Mar 2023, Accepted 15 Aug 2023, Published online: 30 Aug 2023

Abstract

In two-level fractional factorial designs, conditional main effects can provide insights by which to analyze factorial effects and facilitate the de-aliasing of fully aliased two-factor interactions. Conditional main effects are of particular interest in situations where some factors are nested within others. Most of the relevant literature has focused on the development of data analysis tools that use conditional main effects, while the issue of optimal factorial design for a given linear model involving conditional main effects has been largely overlooked. Mukerjee, Wu and Chang [Statist. Sinica 27 (2017) 997–1016] established a framework by which to optimize designs under a conditional effect model. Although theoretically sound, their results were limited to a single pair of conditional and conditioning factors. In this paper, we extend the applicability of their framework to double pairs of conditional and conditioning factors by providing the corresponding parameterization and effect hierarchy. We propose a minimum contamination-based criterion by which to evaluate designs and develop a complementary set theory to facilitate the search of minimum contamination designs. The catalogues of 16- and 32-run minimum contamination designs are provided. For five to twelve factors, we show that all 16-run minimum contamination designs under the conditional effect model are also minimum aberration according to Fries and Hunter [Technometrics 22 (1980) 601–608].

1. Introduction

Factorial designs have been widely used in industry and academia in the past decades. Two-level fractional factorial designs have proven particularly effective in situations in which the purpose of experiments is to screen out inactive factors. Researchers have dedicated considerable effort to the evaluation of two-level fractional factorial designs. Fries and Hunter (Citation1980) proposed a model robust criterion named minimum aberration tailored specifically to two-level regular fractional factorial designs. The minimum aberration criterion minimizes the wordlength patterns of two-level regular designs in a sequential manner from lower-order factorial effects to higher-order ones. It is established for design selection under the assumption that lower-order effects are more important than higher-order effects and effects of the same order are equally important. The minimum aberration criterion has since been adaptive to accommodate nonregular fractional factorial designs (Cheng et al., Citation2002; Tang & Deng, Citation1999; Xu & Wu, Citation2001) and multi-stratum factorial designs (Chang, Citation2022; Chang & Cheng, Citation2018). All the aforementioned minimum aberration criteria were developed under an orthogonal parameterization of factorial effects (Cheng, Citation2014, chapter 6). Accordingly, a defining word of length four, say F1F2F3F4, creates three pairs of fully aliased two-factor interactions as follows: F1F2=F3F4, F1F3=F2F4 and F1F4=F2F3. The two-factor interactions in each pair are completely mixed up and cannot be estimated simultaneously. Interested readers may refer to Mukerjee and Wu (Citation2006), Cheng (Citation2014) and Wu and Hamada (Citation2021) for further details.

In some practical situations, it is preferable to investigate a two-factor interaction via two conditional main effects, with each one conditionally defined according to another factor of a specific level. For example, the two-factor interaction F1F2 can be decomposed as the difference between a conditional main effect F1 conditioned on the low level of F2 and that conditioned on the high level of F2. Sliding level experiments in engineering are structured in this way (Wu & Hamada, Citation2021, p. 343), where the interest is on the conditional main effects conditioned on the slid factors. Mukerjee et al. (Citation2017) reported on an industrial experiment involving motor and speed as two factors. The objective of the experiment was to assess the comparison of the motors separately at each speed; therefore, the conditional main effects of the motors conditioned on each level of speed are of particular interest. Other intriguing examples pertaining to the use of conditional main effects are outlined in Wu (Citation2015) and Wu (Citation2018).

It is known, also mentioned in Wu (Citation2015), that for 2 two-level factors F1 and F2, the main effect F1 in conjunction with the interaction F1F2 spans the same vector space as that spanned by the conditional main effects F1 respectively conditioned on the two levels of F2. Mukerjee et al. (Citation2017) referred to F1 and F2 as a pair of conditional and conditioning factors. To deal with a single pair of conditional and conditioning factors, Mukerjee et al. (Citation2017) proposed a model in which the main effect of F1 and the two-factor interaction F1F2 are replaced with the two associated conditional main effects. This can be viewed as an alternative to the aforementioned orthogonal parameterization, under which the defining relation I=F1F2F3F4 produces partially aliased effects (neither fully aliased nor orthogonal). In this paper, the model proposed by Mukerjee et al. (Citation2017) is referred to as a conditional effect model. The fact that the parameterization of a single-pair conditional effect model differs from that of a model tailored to the minimum aberration necessitates a new aberration criterion applicable to conditional effect models. Mukerjee et al. (Citation2017) proposed a minimum aberration criterion as well as a strategy by which to search for designs under a single-pair conditional effect model. Apart from Mukerjee et al. (Citation2017), an alternative approach by the use of the indicator function in Pistone and Wynn (Citation1996) is given by Sabbaghi (Citation2020), who developed an algebra for conditional effect models. From the data analysis aspect, Mak and Wu (Citation2019) proposed a bi-level variable selection of conditional main effects in observational data using a penalty function with two layers: the outer one controlling between-group selection, and the inner one controlling within-group selection.

It is important to consider that the parameterization under a condition effect model destroys the fully aliased relationship between two-factor interactions. Thus, one application of conditional main effects involves the de-aliasing of two-factor interactions in regular fractional factorial designs of resolution four. This involves identifying significant but aliased two-factor interactions and then transforming the model to the corresponding conditional effect model. Afterwards, the de-aliasing strategies in Wu (Citation2015), Su and Wu (Citation2017), Chang (Citation2019) and Lawson (Citation2020) can then be used to facilitate subsequent data analysis.

In the current study, we consider two-level factorial designs in conjunction with conditional effect models involving two pairs of conditional and conditioned factors. The remaining factors, not involving the two pairs of conditional and conditioning factors, are called traditional factors. Wu and Hamada (Citation2021, p. 347) described an experiment involving the sealing of a light bulb, in which the outcomes were determined mainly by two pairs of conditional and conditioning factors (H,G) and (J,I), respectively. The corresponding linear model comprised the conditional main effects of H and J respectively conditioned on G and I. In the current study, we extend the work of Mukerjee et al. (Citation2017) to double-pair conditional effect models. The minimum aberration was based on the effect hierarchy principle (Wu & Hamada, Citation2021, p. 168); however, it is difficult to artificially argue an order among the effects in a conditional effect model. In accordance with Mukerjee et al. (Citation2017), we adopt the Bayesian approach proposed in Mitchell et al. (Citation1995), later popularized by Kerr (Citation2001), Joseph (Citation2006), Joseph and Delaney (Citation2007), Ai et al. (Citation2009), Joseph et al. (Citation2009), Kang and Joseph (Citation2009), Chang and Cheng (Citation2018) and Chang (Citation2019Citation2022), to derive an order of factorial effects based on their prior variances. Once this order has been established, we then define a criterion for design evaluation by sequentially minimizing the bias (contamination) caused by lower-order interactions to higher-order interactions. We refer to the proposed criterion as the minimum contamination criterion. We provide the catalogues of 16-run and 32-run minimum contamination designs for various factor numbers at the end of this paper.

The remainder of this paper is organized as follows. Section 2 introduces the parametrization and a Bayesian-inspired hierarchical order of effects under a double-pair conditional effect model. Some sufficient conditions for a design to be universally optimal under the conditional effect model involving only main effects are given in Section 3. Section 4 presents a new minimum contamination criterion defined according to the Bayesian-inspired hierarchical order. In addition, we develop a complementary set theory to guide the search for minimum contamination designs involving a large number of factors. An efficient computational procedure is developed in Section 5 to search for minimum contamination regular/nonregular designs for an arbitrary number of factors. Numerical examples and a real experiment are discussed. Conclusions are drawn in Section 6.

2. Conditional effect model and Bayesian-inspired effect hierarchy

We give the details regarding the parameterization under a double-pair conditional effect model. We adopt the Bayesian approach in Mitchell et al. (Citation1995) to derive an effect order, which serves as the building block for the minimum contamination criterion introduced in Section 4.

2.1. Conditional effect model

A double-pair conditional effect model is a linear model with reparameterization using two pairs of conditional and conditioning factors. Consider a 2n full factorial design with n (4) factors F1,,Fn, each at levels 0 and 1. Without loss of generality, let F1,F2 be one pair of conditional and conditioned factors and F3,F4 be the other pair. The main effect and interaction effects involving F1 (respectively, F3) are defined conditionally on each fixed level of F2 (respectively, F4). Define Ω as the set of ν=2n binary n-tuples representing the 2n treatment combinations of the full factorial design. For (i1,,in)Ω, let τ(i1in) be the treatment effect of treatment combination i1in. Under a linear model, τ(i1in) is the expectation of the response measured at the treatment combination i1in. Similarly, we write θ(i1in) for the factorial effect F1i1Fnin using the conventional orthogonal parameterization (Cheng, Citation2014, chapter 6) when i1in is nonnull, and θ(00) for the grand mean. With n = 3 for illustration, θ(100)=18{τ(100)+τ(110)+τ(101)+τ(111)τ(000)τ(010)τ(001)τ(011)},θ(110)=18{τ(000)+τ(110)+τ(001)+τ(111)τ(100)τ(101)τ(010)τ(011)},θ(111)=18{τ(100)+τ(010)+τ(001)+τ(111)τ(000)τ(110)τ(101)τ(011)}separately represent the main effect of F1, the two-factor interaction of F1 and F2, and the three-factor interaction of F1,F2,F3. Each θ(i1i2i3) can be interpreted using the mean responses measured at different levels of factors. For example, the main effect θ(100) is the average of the difference between the mean responses measured at the levels 1 and 0 of F1. Let τ and θ be ν×1 vectors with elements τ(i1in) and θ(i1in) arranged in the lexicographic order, respectively, where for n = 3, we have θ=(θ(000),θ(001),θ(010),θ(011),θ(100),θ(101),θ(110),θ(111)).Then, the linear model using the orthogonal parameterization under the full factorial design is given by (1) τ=Hnθ(or equivalently, θ=ν1Hnτ),(1) where ⊗ represents the Kronecker product and Hn denotes the n-fold Kronecker product of H=(1111),a Hadamard matrix of order two. When interpreted under the linear model, the columns of H respectively correspond to the grand mean and a contrast of the mean responses measured at the two levels of the factor. Since a 2n full factorial design has a cross-product structure, the associated matrix of the grand mean and contrasts can be obtained via the n-fold Kronecker product of H as in (Equation1). We refer to the model in (Equation1) as a traditional model under the full factorial design.

Let H(0)=(1,1) and H(1)=(1,1) be the top and bottom rows of H, respectively. Emphasizing F1 and F3, we express Equation (Equation1) as (2) θ=ν1(H(0)HH(0)HH(1)HH(0)HH(0)HH(1)HH(1)HH(1)H)H(n4)τ.(2) Let β(j1jn) be the factorial effect F1i1Fnin under a conditional effect model with two pairs conditional and conditioning factors F1,F2 and F3,F4, respectively. Denote the vector with the ν elements β(j1jn)'s by β in the same lexicographic order as θ. Under the double-pair conditional effect model, the factorial effects involving F1 and F3 are defined conditionally on the levels of F2 and F4, respectively. Thus in (Equation2), H is replaced with 2I2 whenever H(1) precedes it. We can reparametrize θ by β with (3) β=ν1WH(n4)τ,(3) where W=(H(0)HH(0)HH(1)2I2H(0)HH(0)HH(1)2I2H(1)2I2H(1)2I2),in which I2 is the identity matrix of order two. The model in (Equation3) is called a double-pair conditional effect model in this paper. The factorial effects involving F1 and F3 are referred to as conditional (factorial) effects, while those not involving them are referred to as unconditional (factorial) effects. We cluster β into various groups of vectors representing unconditional and conditional factorial effects. Define Ω0l={(j1,,jn):j1=j3=0, and l of j2,j4,,jn equal 1},Ω1l={(j1,,jn):j1=1,j2=0,1,j3=0,and l1 of j4,,jn equal 1}{(j1,,jn):j3=1,j4=0,1,j1=0,and l1 of j2,j5,,jn equal 1},Ω2l={(j1,,jn):j1=j3=1,j2=0,1,j4=0,1,and l2 of j5,,jn equal 1},where 1ln2. It is apparent that β(j1jn)=θ(j1jn) if (j1,,jn)Ω0l. Let βsl be the vector with elements β(j1jn), where (j1,,jn)Ωsl. Later we will see that the effects associated with the same Ωsl have the same importance using the Bayesian-inspired effect hierarchy introduced in the next subsection.

2.2. Bayesian-inspired effect hierarchy

Chipman et al. (Citation1997) proposed a Bayesian variable selection for designed experiments with complex aliasing. Rather than independence prior distribution on the factorial effects, Chipman et al. (Citation1997) used a hierarchical prior that is consistent with the effect heredity principle (Wu & Hamada, Citation2021, p. 169). A basic idea of their approach is to assign a larger prior variance to a more important factorial effect. This idea is compatible with the Bayesian (functional) prior distribution derived by Mitchell et al. (Citation1995), Kerr (Citation2001), Joseph (Citation2006), Joseph and Delaney (Citation2007), Ai et al. (Citation2009), Joseph et al. (Citation2009), Kang and Joseph (Citation2009), Chang and Cheng (Citation2018) and Chang (Citation2019Citation2022). With the derived prior variances, one can readily define an effect hierarchical order and an aberration-like criterion for design evaluation.

In this paper, we adopt the Bayesian approach in Mitchell et al. (Citation1995), who set up a functional prior by regarding τ as a realization of a stationary Gaussian random function. Then the prior distribution of factorial effects is induced by the relation in (Equation1). When applied to (Equation3), the Bayesian approach induces an effect hierarchy of the β(j1jn)'s via a prior specification on τ in terms of a zero-mean Gaussian random function such that cov(τ)=σ2Rn, where σ2>0 and the 2×2 matrix R has diagonal elements 1 and off-diagonal elements ρ, 0<ρ<1. This covariance structure is equivalent to Equation (4) of Joseph (Citation2006). It follows by expanding Rn that the correlation of τ(i1in) and τ(j1jn) is l:1ln,iljlρ,which is equal to ρl:1ln,iljl and only depends on the total number of components where (i1in) and (j1jn) differ. Thus, such a covariance structure can be interpreted as treating every factor equally, which is reasonable since there is usually no knowledge about the importance of the factors at the experimentation stage. In conjunction with (Equation3), the prior covariance matrix of β is given by cov(β)=ν2{WH(n4)}cov(τ){WH(n4)}=σ2ν2{WR4W}{HRH}(n4).The following result gives the variances of the β(j1jn)'s.

Theorem 2.1

For a (j1,,jn)Ωsl, we have var(β(j1jn))=σ2ν1(1+ρ)nls(1ρ)l.

Proof.

By the identity cov(β)=σ2ν2{WR4W}{HRH}(n4), one can easily verify that HRH=2diag(1+ρ,1ρ), W=(H2H2H2H22I2H2I2H2I2H2I2H2HI22HI22HI22HI22I222I222I222I22),and R4=(R2ρR2ρR2ρ2R2ρR2R2ρ2R2ρR2ρR2ρ2R2R2ρR2ρ2R2ρR2ρR2R2).WR4W is a 16×16 matrix, which can be regarded as a 4×4 block-matrix with each block being a 4×4 matrix. Denote the (i,j)th block-matrix of WR4W by [WR4W]ij. Then by calculation, we get [WR4W]ij=0 when ij; for i = j, we obtain [WR4W]11=4(1+ρ)2(HRH)2, [WR4W]22=8(1ρ2)R(HRH), [WR4W]33=8(1ρ2)(HRH)R and [WR4W]44=16(1ρ)2RR.

The diagonal elements of (HRH)2, denoted by diag((HRH)2), can be obtained as 4(1+ρ)2,4(1ρ)(1+ρ),4(1ρ)(1+ρ),4(1ρ)2. We also have diag(R(HRH))=2(1+ρ,1ρ,1+ρ,1ρ), diag((HRH)R)=2(1+ρ,1+ρ,1ρ,1ρ) and diag(RR)=(1,1,1,1). Thus, we get diag([WR4W]11)=16((1+ρ)4,(1ρ)(1+ρ)3,(1ρ)(1+ρ)3,(1ρ)2(1+ρ)2), diag([WR4W]22)=16((1+ρ)2(1ρ),(1+ρ)(1ρ)2,(1+ρ)2(1ρ),(1+ρ)(1ρ)2), diag([WR4W]33)=16((1+ρ)2(1ρ),(1+ρ)2(1ρ),(1+ρ)(1ρ)2,(1+ρ)(1ρ)2) and diag([WR4W]44)=16((1ρ)2,(1ρ)2,(1ρ)2,(1ρ)2).

Since the variances of β(j1jn)'s are only related to the diagonal elements of cov(β), one can easily check for a (j1,,jn)Ωsl, var(β(j1jn))=σ2ν1(1+ρ)nls(1ρ)l based on the above calculation.

Let Vsl=var(β(j1jn)) for (j1,,jn)Ωsl. From Theorem 2.1, it is clear that V0l>V1l>V2l for 2ln2. Because V2l/V0,l+1=1/(1ρ2)>1 for all 0<ρ<1, we have (4) V01>V11>V02>V12>V22>V03>V13>V23>>V0,n2>V1,n2>V2,n2.(4) In view of (Equation4), we define the following effect hierarchy under the conditional effect model (Equation3) as follows. The unconditional main effects have the largest variance V01 and are the most important, while the conditional main effects with variance V11 are positioned next; then come the unconditional two-factor interactions (V02), followed by the one-pair conditional two-factor interactions (V12), then two-pair conditional two-factor interactions (V22), and so on. This effect hierarchy order is not surprising since a conditional main effect is proportional to the average of a unconditional main effect and two-factor interaction, resulting in a variance in-between.

3. Universally optimal designs for main effect model

As mentioned in Mukerjee et al. (Citation2017), a justifiable criterion for design evaluation is to identify a class of designs which ensure optimal inference on the β(j1jn)'s corresponding to Ω01 and Ω11 in the absence of all interactions. Then, in order to possess model robustness, among these designs we find one which sequentially minimizes a suitably defined measure of bias caused by successive interactions in the effect hierarchy. In this section, we connect the conditional effect model with the traditional model. Then we provide some sufficient conditions for a design to be universally optimal under a main-effect conditional effect model.

The connection between conditional effects β and traditional factorial effects θ can be established by (Equation1) and (Equation3) as follows: β=ν1WH(n4)τ=ν1WH(n4)Hnθ=ν1{WH4}{HH}(n4)θ.By using the fact HH=2I2 and WH4=4(H2H2000002I2HH2002HI2H2000002I22H2),we obtain β=(I2(n2)0000012HI2I2(n4)0012I2HI2(n4)0000012H2I2(n2))θ,which implies θ=(I2(n2)0000012HI2I2(n4)0012I2HI2(n4)0000012H2I2(n2))βbecause H1=(1/2)H. This yields θ(0j20j4j5jn)=β(0j20j4j5jn),θ(1j20j4j5jn)=12{β(100j4j5jn)+δ(j2)β(110j4j5jn)},θ(0j21j4j5jn)=12{β(0j210j5jn)+δ(j4)β(0j211j5jn)},θ(1j21j4j5jn)=12{β(1010j5jn)+δ(j4)β(1011j5jn)+δ(j2)β(1110j5jn)+δ(j2)δ(j4)β(1111j5jn)},where δ(j)=2j+1. In view of θ, the first of above identities shows that 2n2/2n=1/4 of the θ(j1jn)'s remain unconditional effects, while the other 3/4 of the θ(j1jn)'s involve F1 and F3 and hence are a combination of the conditional effects. These equations uncover the connection between factorial effects under the traditional model and under the conditional effect model. Take n = 5 for example. We have θ(10000)={β(10000)+β(11000)}/2. Recall that β(10000) and β(11000) are the conditional main effects of F1 conditioned on levels 0 and 1 of F2, respectively. Thus, the unconditional main effect of F1 is proportional to the average of the two conditional main effects of F1. Likewise, we have θ(11000)={β(10000)β(11000)}/2, which means that the unconditional two-factor interaction of F1 and F2 is proportional to the difference between the two conditional main effects of F1.

Consider an N-run design represented by the N×n design matrix D with elements 1 (high level) and 1 (low level). Denote the corresponding N×2n full model matrix under (Equation1) by X, each column corresponding to one θ(j1jn). Note that X can be obtained by deleting the rows of Hn that are not in D. With the vector of responses y, we have E(y)=Xθ. Each column of X is represented by x(j1jn) with (j1,,jn)Ω. The connection between θ(j1jn)'s and β(j1jn)'s suggests z(0j20j4j5jn)=x(0j20j4j5jn),z(1j20j4j5jn)=12{x(100j4j5jn)+δ(j2)x(110j4j5jn)},z(0j21j4j5jn)=12{x(0j210j5jn)+δ(j4)x(0j211j5jn)},z(1j21j4j5jn)=12{x(1010j5jn)+δ(j4)x(1011j5jn)+δ(j2)x(1110j5jn)+δ(j2)δ(j4)x(1111j5jn)},where δ(j)=2j+1. Let Zsl and Xsl consist of the z(j1jn)'s and x(j1jn)'s respectively, where (j1,,jn)Ωsl. Then the conditional effect model under D can be represented by (5) E(y)=z(00)β(00)+s=02l=1n2Zslβsl.(5) We follow the convention that the random observational errors are uncorrelated and homogeneous with equal variance.

3.1. Universally optimal designs

If all interactions are absent, then the model (Equation5) reduces to (6) E(y)=z(00)β(00)+Z01β01+Z11β11,(6) consisting of only unconditional and conditional main effects. In the following, we present a theorem which gives some requirements for a design to be universally optimal under model (Equation6).

Theorem 3.1

Suppose an N-run design D satisfies

(i)

D is an orthogonal array of strength two;

(ii)

all eight triples of symbols occur equally often when D is projected onto F1,F2,Fj, j{4,5,,n};

(iii)

all eight triples of symbols occur equally often when D is projected onto F3,F4,Fj, j{2,5,,n};

(iv)

all sixteen triples of symbols occur equally often when D is projected onto F1,F2,F3,F4.

Then D is universally optimal among all N-run designs for inference on β01 and β11 under model (Equation6).

Proof.

Let Z1=(Z01,Z11). Denote the information matrix of β01 and β11 under model (Equation6) by M. Note that M can be obtained by the Schur complement M=Z1{INz(00)[z(00)z(00)]1z(00)}Z1,which can be simplified as M=Z1{IN1N1N1N}Z1 because z(00)=1N. Because Z1Z1M is nonnegative definite, we have (7) tr[M]tr[Z1Z1]=N(n2)+4N=N(n+2)(7) for every N-run design. Under the conditions (i),…,(iv), it is easy to verify that Z11N=0 and Z1Z1=NIn+2. Thus M=NIn+2 and tr[M] reaches the upper bound in (Equation7). The result now follows from Kiefer (Citation1975).

By Theorem 3.1, a necessary condition for universally optimal designs is N16. Therefore, if n = 4, the model (Equation5) can only involve the two pairs of conditional and conditioning factors (F1,F2) and (F3,F4). Then the universally optimal design is exactly the 24 full factorial design. Thus, to avoid trivialities, we let n5 in the discussion of design selection in the next section.

4. Minimum contamination and complementary set theory

The designs meeting the conditions (i),…,(iv) of Theorem 3.1 are universally optimal under model (Equation6). In addition, βˆh1=N1Zh1y is the best linear unbiased estimator of βh1, h = 0, 1. However, nonnegligible interactions may exist and βˆh1 is no longer unbiased in this case. We revert back to the model (Equation5), which includes all interactions, to assess the impact of possible presence of interactions on βˆh1. Under model (Equation5), βˆh1 has bias N1s=02l=2n2Zh1Zslβsl. The matrix N1s=02l=2n2Zh1Zsl is referred to as an alias matrix in Wu and Hamada (Citation2021, p. 419). A reasonable measure of the bias in βˆh1 caused by the interactions, as in Tang and Deng (Citation1999), is Ksl(h)=N2tr[Zh1ZslZslZh1]=N2tr[Xh1XslXslXh1],where the last equality holds because Xsl is an orthogonal transform of Zsl. Based on the effect hierarchy in (Equation4), one should minimize the bias successively in order of priority. Thus, we define a minimum contamination design as the one which minimizes the terms of (8) K={K02(0),K02(1),K12(0),K12(1),K22(0),K22(1),K03(0),K03(1),}(8) in a sequential manner from left to right. In (Equation8), Ksl(0) appears before Ksl(1) because the contamination or bias in βˆ01 is deemed more severe than in βˆ11. The concept of minimizing contamination due to the existence of interactions is not new. We note that Cheng and Tang (Citation2005) used this idea to develop a general theory for minimum aberration.

The minimum contamination criterion in (Equation8) induces a ranking of designs of the same run size. It is time consuming, however, to find the minimum contamination design via complete search using (Equation8) if n is large. A useful technique of design construction is via complementary designs. Tang and Wu (Citation1996) provided identities related to the wordlength pattern of a regular two-level design to that of its complementary design. Suen et al. (Citation1997) extended these identities to regular snp designs. Cheng (Citation2014, p. 179) reviewed the design construction using complementary designs.

We now focus on regular designs under the conditional effect model due to their nice properties and popularity among practitioners. Let Δr be the set of nonnull r×1 binary vectors. All operations with these vectors are over the finite field GF(2). Regarding the notation, we do not apply bold font style to these binary vectors to distinguish them from the vectors with the elements belonging to real numbers. A regular design in N=2r (r<n) runs is given by n distinct vectors b1,,bn from Δr such that the matrix B=(b1,,bn) has full row rank. The design consists of the N treatment combinations of the form aB, where aΔr{0}.

In the following, we define some useful quantities to represent Ksl(h). Let Al(1) be the number of ways of choosing l out of b2,b4,,bn such that the sum of the chosen l equals 0; Al(21) be the number of ways of choosing l out of b4,,bn such that the sum of the chosen l is in the set {b1,b1+b2}; Al(22) be the number of ways of choosing l out of b2,b5,,bn such that the sum of the chosen l is in the set {b3,b3+b4}; Al(2)=Al(21)+Al(22); Al(31) be the number of ways of choosing l out of b4,,bn such that the sum of the chosen l is in the set {0,b2}; Al(32) be the number of ways of choosing l out of b2,b5,,bn such that the sum of the chosen l is in the set {0,b4}; Al(3)=Al(31)+Al(32); Al(42) be the number of ways of choosing l out of b2,b5,,bn such that the sum of the chosen l is in the set {b1+b3,b1+b3+b4}; Al(43) be the number of ways of choosing l out of b5,,bn such that the sum of the chosen l is in the set {b1+b3,b1+b3+b4}; Al(52) be the number of ways of choosing l out of b5,,bn such that the sum of the chosen l is in the set {b1+b2+b3,b1+b2+b3+b4}; Al(7) be the number of ways of choosing l out of b5,,bn such that the sum of the chosen l is in the set {b1+b3,b1+b2+b3,b1+b3+b4,b1+b2+b3+b4}; Al(8) be the number of ways of choosing l out of b5,,bn such that the sum of the chosen l is in the set {b1,b3,b1+b2,b1+b4,b2+b3,b3+b4,b1+b2+b4,b2+b3+b4}. The next result, with the proof deferred to the appendix, gives expressions for Ksl(h) in terms of the quantities just introduced.

Theorem 4.1

For 2ln2, we have

(a)

K0l(0)=(l+1)Al+1(1)+(nl1)Al1(1);

(b)

K0l(1)=Al1(2)+Al(2);

(c)

K1l(0)=(nl1)Al2(2)+Al1(2)+lAl(2);

(d)

K1l(1)=2Al1(3)+2{Al1(42)+Al2(43)+Al1(52)};

(e)

K2l(0)=2Al2(7)+(nl1)Al3(7)+(l1)Al1(7);

(f)

K2l(1)=2Al2(8).

In view of Theorem 4.1, sequential minimization of K is equivalent to that of the terms of A={A3(1),A2(2),A1(42)+A1(52),A1(7),A4(1),A3(2),}, which is reduced to A={A3(1),A2(2),A1(7),A4(1),A3(2),}because F1,F2,F3,F4 form a complete factorial, implying A1(42)+A1(52)=A1(7).

We now develop a complementary set theory for the first four terms in the sequence A. Let T~ be the complement of {b2,b4,,bn} in Δr; Al(T~) be the number of ways of choosing l members of T~ such that the sum of the chosen l equals 0. Let T12=T~{b1,b1+b2}; T34=T~{b3,b3+b4}; Al(12)(T12) be the number of ways of choosing l members of T12 such that the sum of the chosen l is in {b1,b1+b2}; Al(34)(T34) be the number of ways of choosing l members of T34 such that the sum of the chosen l is in {b3,b3+b4}.

Theorem 4.2

Let cj, j=1,,5, be constants irrelevant to designs and T=Δr{b5,,bn}. Define Hi(,) as Equation (2) in Mukerjee and Wu (Citation2001). We have

(a)

A3(1)=c1A3(T~);

(b)

A4(1)=c2+A3(T~)+A4(T~);

(c)

A2(2)=c3+A2(12)(T12)+A2(34)(T34);

(d)

A1(7)=B1+B2+B3+B4, where B1=c41+H1({b1+b3},T) if b1+b3=bj for some j{5,,n} and zero otherwise; B2=c42+H1({b1+b2+b3},T) if b1+b2+b3=bj for some j{5,,n} and zero otherwise; B3=c43+H1({b1+b3+b4},T) if b1+b3+b4=bj for some j{5,,n} and zero otherwise; B4=c44+H1({b1+b2+b3+b4},T) if b1+b2+b3+b4=bj for some j{5,,n} and zero otherwise. c4j's are constants for every design.

Proof.

Parts (a) and (b) are evident from Tang and Wu (Citation1996).

For (c), note that A2(21)=H2({b1},{b4,,bn})+H2({b1+b2},{b4,,bn}), which can be simplified as A2(21)=c+H2({b1},{b2,b1+b2}T12)+H2({b1+b2},{b1,b2}T12) by Lemmas 1 and 3 in Mukerjee and Wu (Citation2001), where c is a constant for every design. Because the design is an orthogonal array of strength two, we have H2({b1},{b2,b1+b2}T12)=1+H2({b1},T12) and H2({b1+b2},{b1,b2}T12)=1+H2({b1+b2},T12). Hence A2(21)=c+2+H2({b1},T12)+H2({b1+b2},T12)=c+2+A2(12)(T12). Similarly, A2(22)=c+2+A2(34)(T34), where c is a constant for every design. Therefore, we have A2(2)=c3+A2(12)(T12)+A2(34)(T34) by letting c3=c+c+4.

For (d), note that A1(7)=H1({b1+b3},{b5,,bn})+H1({b1+b3+b4},{b5,,bn})+H1({b1+b2+b3},{b5,,bn})+H1({b1+b2+b3+b4},{b5,,bn}). Let F=Δr{b1+b3,b5,,bn}. If b1+b3bj for j=5,,n, then H1({b1+b3},{b5,,bn})=0. If b1+b3=bj for some j{5,,n}, then H1({b1+b3},{b5,,bn})=c41+H1({b1+b3},F) by Lemmas 1 and 3 in Mukerjee and Wu (Citation2001), where c41 is a constant for every design. Since b1+b3=bj for some j{5,,n}, we have F = T and H1({b1+b3},F)=H1({b1+b3},T). Thus H1({b1+b3},{b5,,bn})=B1. Similarly, we have H1({b1+b2+b3},T)=B2, H1({b1+b3+b4},T)=B3 and H1({b1+b2+b3+b4},T)=B4. So, A1(7)=B1+B2+B3+B4.

Theorem 4.2 provides a way to evaluate designs using the sequence A, and equivalently sequence K, with the number of factors n=(N1)+2t~=N1t~, where t~ is the cardinality of T~. Once T~ is constructed, one can quickly get {b2,b4,,bn} by the identity Δr=T~{b2,b4,,bn}, and construct the design {b1,b2,b3,b4,,bn}. Constructing T~ with minimum contamination given a large t~ is usually time-consuming. It is more practical to find a small T~ with minimum contamination, leading to a large n. Thus, Theorem 4.2 helps find large minimum contamination designs. For example, consider t~=5. The set T~={α1,α2,α3,α4,α1+α2} can be verified to have maximal A3(T~)=1, where α1,α2,α3,α4 are four linearly independent vectors from Δr. Moreover, assigning α1=b1, α2=b3+b4, α3=b3 and α4=b1+b2 results in minimal A2(12)(T12)=A2(34)(T34)=0. Thus, we have T~={b1,b3+b4,b3,b1+b2,b1+b3+b4} and construct a minimum contamination design with the number of factors N−6, which equals 10, 26, 58 for 16-, 32- and 64-run designs.

5. Efficient design search and examples

Finding minimum contamination designs using (Equation8) is a daunting task for even moderate run size and number of factors. This section presents an extension of a searching procedure given by Mukerjee et al. (Citation2017) to the current setting and provides examples for illustration.

5.1. A procedure for efficient design search

The minimum contamination criterion (Equation8) can be applied to regular and nonregular designs, but requires heavy computation of Ksl(h)=N2tr[Xh1XslXslXh1]. By noting that XslXsl is reminiscent of minimum moment aberration in Xu (Citation2003), Mukerjee et al. (Citation2017) developed an efficient computational procedure for Ksl(h). We now extend this procedure for computing XslXsl to double-pair conditional effect models. For 0cn2, let Q0(c)=1, Q1(c)=2c(n4), and (9) Ql(c)=l1{[2c(n4)]Ql1(c)(nl2)Ql2(c)},(9) where 2ln2. Write D~ for the subarray given by the last n−4 columns of D (i.e. only consisting of traditional factors). For 1u,wN, let cuw be the number of positions where the uth and wth rows of D~ have the same entry, and qsl(u,w) be the (u,w)th element of XslXsl. Denote the (u,j)th element of D by duj. Then the following result holds.

Theorem 5.1

For 1u,wN and 2ln2, we have

(a)

q0l(u,w)=(du2dw2du4dw4)Ql2(cu,w)+(du2dw2+du4dw4)Ql1(cu,w)+Ql(cu,w);

(b)

q1l(u,w)=(du1dw1+du1dw1du2dw2+du3dw3+du3dw3du4dw4)Ql1(cu,w);

(c)

q2l(u,w)=du1dw1du3dw3(1+du2dw2+du4dw4+du2dw2du4dw4)Ql2(cu,w).

Proof.

For 2ln2, let Σ(l) be the sum over binary tuples j5jn such that l of j5,,jn equal 1. We have q0l(u,w)=Σ(l)x(u;0000j5jn)x(w;0000j5jn)+Σ(l1)x(u;0100j5jn)x(w;0100j5jn)+Σ(l1)x(u;0001j5jn)x(w;0001j5jn)+Σ(l2)x(u;0101j5jn)x(w;0101j5jn)=(du2dw2du4dw4)Ψl2(cu,w)+(du2dw2+du4dw4)Ψl1(cu,w)+Ψl(cu,w),where Ψl(u,w)=Σ(l)s=5n(dudw)js. Similarly, we have q1l(u,w)=(du1dw1+du1dw1du2dw2+du3dw3+du3dw3du4dw4)Ψl1(cu,w),q2l(u,w)=du1dw1du3dw3(1+du2dw2+du4dw4+du2dw2du4dw4)Ψl2(cu,w).The result will follow if Ψl(u,w)=Ql(cuw). It is clear that Ψ0(u,w)=1 and Ψ1(u,w)=cuw+(1)(n4cuw)=2cuw(n4). It remains to show Ψl(u,w) satisfies the recursion relation (Equation9).

Let Φ(ξ)=j=5n(1+ξdujdwj) and let Φl(ξ) be the lth derivative of Φ(ξ). Note that Ψl(u,w)=Φl(0)/l!. Differentiation of logΦ(ξ) yields Φ1(ξ)=(j=5ndujdwj1+ξdujdwj)Φ(ξ)=(cuw1+ξ(n4)cuw1ξ)Φ(ξ),that is, (1ξ2)Φ1(ξ)={2cuw(n4)(1+ξ)}Φ(ξ). Differentiating this l−1 and taking ξ=0, we get Φl(0)=[2cuw(n4)]Φl1(0)(l1)(nl2)Φl2(0).This leads to (Equation9) by using Ψl(u,w)=Φl(0)/l!.

With the help of Theorem 5.1 and suggested by Mukerjee et al. (Citation2017), an algorithm is provided as follows.

  1. Search for minimum contamination designs by first listing of all nonisomorphic regular designs for given run size N(16) and number of factors n(5).

  2. For each nonisomorphic regular design, permute its columns such that the resulting design satisfies the conditions in Theorem 3.1. Let the first four columns represent the two pairs of conditional and conditioned factors, that is, F1,F2 and F3,F4.

  3. Calculate the criterion in (Equation8) by using Theorem 5.1, and hence find a minimum contamination design.

This procedure mostly consumes affordable computational time. For N = 16 and n = 10, for example, it takes around 3.94 minutes to find a minimum contamination design on a desktop with 3.8 GHz CPU and 64 GB of RAM.

5.2. Examples

We apply the algorithm presented in Section 5.1 to 16- and 32-run designs. Afterwards, the light bulb experiment (Wu & Hamada, Citation2021, p. 347) mentioned in Section 1 is revisited.

For N = 16 and 32, a list of all nonisomorphic regular designs are given in the catalogues in Chen et al. (Citation1993). Table 1 exhibits the results for N = 16 and 5n12. In the table, the numbers 1,2,4,8 represent basic factors in a design. The other numbers represent added factors. For example, for n = 5, if the five factors are denoted by A, B, C, D, E, then the minimum contamination design is the one with the defining relation E = ABCD because 15 = 1 + 2 + 4 + 8. We can see that all minimum contamination designs under conditional effect models are also minimum aberration under traditional models. The finding supports using minimum aberration designs under traditional models to perform subsequent de-aliasing analysis in Su and Wu (Citation2017). Table 2 exhibits the results for N = 32 and 6n18. Same as Table 1, the numbers 1,2,4,8,16 represent basic factors in a design. The other numbers represent added factors. For example, for n = 6, if the five factors are denoted by A, B, C, D, E, F, then the minimum contamination design is the one with the defining relation F = ABCDE because 31 = 1 + 2 + 4 + 8 + 16. The R codes for generating these designs are attached to the supplementary material.

Table 2. Regular minimum contamination designs for N = 32.

The light bulb experiment mentioned in Wu and Hamada (Citation2021, p. 347) studied a light bulb sealing process performed to improve a cosmetic problem that was frequently occurring (Taguchi, Citation1987, Section 17.6). The outcomes were determined mainly by six two-level traditional factors A, B, C, D, E, F and two pairs of conditional and conditioning factors (H,G) and (J,I), respectively. A 16-run regular fractional factorial design is to be conducted to study this light bulb sealing process. With the use of the algorithm in Section 5.1, we obtain the minimum contamination design with the design matrix given in Table  (the column indices also provided in Table ). The first four columns are assigned to the two pairs of conditional and conditioning factors (H,G) and (J,I), respectively; the remaining columns are assigned to the six traditional factors. This 2106 design has the defining generators B = HI, C = AH, A = GI, D = HJ, E = AIJ and F = AHIJ. The corresponding K-sequence in (Equation8) has 2×3×(102+1)=42 elements given as follows: K={9,10,17,4,2,0,28,16,21,12,12,6,35,16,54,16,30,18,28,12,18,24,40,20,19,6,17,4,30,12,0,4,1,0,12,6,1,0,0,0,2,2}.We also evaluate the design provided by Taguchi (Citation1987, Section 17.6) (see Table 7.13 of Wu and Hamada (Citation2021, p.347)) by the minimum contamination criterion in (Equation8). The resulting K-sequence is given by K={15,7,11,6,3,0,20,19,30,10,10,6,35,18,38,22,31,18,32,10,32,18,40,20,13,7,15,4,29,12,4,3,2,0,14,6,1,0,0,0,1,2}.The first element of the K-sequence of the design in Table  is 9, smaller than that of the design provided by Taguchi (Citation1987, Section 17.6). Thus, the proposed algorithm found a better design in terms of the minimization of contamination caused by interactions under the double-pair conditional effect model.

6. Concluding remarks

This paper extends the work of Mukerjee et al. (Citation2017) to two pairs of conditional and conditioning factors. The conditional effect model in (Equation3) is an orthogonal reparameterization of the traditional model in (Equation1). Such a reparameterization introduces a new effect hierarchy order of factorial effects, resulting in a different design evaluation from the minimum aberration due to Fries and Hunter (Citation1980). We note that alternative reparameterization of (Equation1) is required if the topic of interest does not depend on conditional effects. For example, Yang and Speed (Citation2002) proposed to define the factorial effects with reference to natural baseline levels of the factors, referred to as baseline parameterization. Later, Mukerjee and Tang (Citation2012), Mukerjee and Huda (Citation2016) and Sun and Tang (Citation2022) discussed design optimality and construction under the baseline parameterization.

Table 1. Regular minimum contamination designs for N = 16.

Table 3. Sixteen-run minimum contamination design with ten factors for the light bulb experiment.

Mukerjee et al. (Citation2017) claimed that, in practice, the number of conditional and conditioned pairs seldom exceeds two. On the other hand, the effect hierarchy order in (Equation4) is only irrelevant to the value of r when the number of pairs is not greater than two. When dealing with more than two pairs, the effect hierarchy order is a nontrivial function of r and deriving useful results can be exceedingly difficult. Alternatively, we note that the minimum contamination criterion in (Equation8) can be applied to regular as well as nonregular designs; however, this paper focuses on regular designs owing to their popularity and theoretical underpinnings. The application of this framework to nonregular designs is left for future research. Another interesting future direction suggested by a reviewer is to involve qualitative four-level conditional/conditioning factors. It is known that a qualitative four-level factor can be decomposed into three orthogonal main effect components with two levels each (Wu & Hamada, Citation2021, chapter 7). However, our theory cannot be directly applied since the three two-level main effect components do not share the same properties as a real two-level factor. The covariance structure for the main effect components is different from that for two-level factors; see Joseph et al. (Citation2009) for a discussion. The resulting effect hierarchy order of conditional factorial effects may be complicated with vague interpretations. Since this topic requires a nontrivial extension, we leave it for future research.

Supplemental material

Supplemental Material

Download (6.5 KB)

Acknowledgments

We thank the Editor and two Reviewers for their constructive comments and suggestions, which have helped us to improve the article.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

We gratefully acknowledge funding from Academia Sinica [Grant Number AS-CDA-111-M05] and from National Science and Technology Council [Grant Number 111-2118-M-001-001-MY3].

References

  • Ai, M. Y., Kang, L., & Joseph, V. R. (2009). Bayesian optimal blocking of factorial designs. Journal of Statistical Planning Inference, 139(9), 3319–3328. https://doi.org/10.1016/j.jspi.2009.03.008
  • Chang, M. C. (2019). De-aliasing in two-level factorial designs: A Bayesian approach. Journal of Statistical Planning and Inference, 203, 82–90. https://doi.org/10.1016/j.jspi.2019.03.002
  • Chang, M. C. (2022). A unified framework for minimum aberration. Statistica Sinica, 32(1), 251–270.
  • Chang, M. C., & Cheng, C. S. (2018). A Bayesian approach to the selection of two-level multi-stratum factorial designs. Annals of Statistics, 46(4), 1779–1806.
  • Chen, J., Sun, D. X., & Wu, C. F. J. (1993). A catalogue of two-level and three-level fractional factorial designs with small runs. International Statistical Review, 61(1), 131–145. https://doi.org/10.2307/1403599
  • Cheng, C. S. (2014). Theory of factorial design: Single- and multi-stratum experiments. CRC Press.
  • Cheng, C. S., Deng, L. Y., & Tang, B. (2002). Generalized minimum aberration and design efficiency for nonregular fractional factorial designs. Statistica Sinica, 12(4), 991–1000.
  • Cheng, C. S., & Tang, B. (2005). A general theory of minimum aberration and its applications. Annals of Statistics, 33(2), 944–958. https://doi.org/10.1214/009053604000001228
  • Chipman, H., Hamada, M., & Wu, C. F. J. (1997). A bayesian variable-selection approach for analyzing designed experiments with complex aliasing. Technometrics, 39(4), 372–381. https://doi.org/10.1080/00401706.1997.10485156
  • Fries, A., & Hunter, W. G. (1980). Minimum aberration 2k−p designs. Technometrics, 22(4), 601–608.
  • Joseph, V. R. (2006). A Bayesian approach to the design and analysis of fractionated experiments. Technometrics, 48(2), 219–229. https://doi.org/10.1198/004017005000000652
  • Joseph, V. R., Ai, M., & Wu, C. F. J. (2009). Bayesian-inspired minimum aberration two- and four-level designs. Biometrika, 96(1), 95–106. https://doi.org/10.1093/biomet/asn062
  • Joseph, V. R., & Delaney, J. D. (2007). Functionally induced priors for the analysis of experiments. Technometrics, 49(1), 1–11. https://doi.org/10.1198/004017006000000372
  • Kang, L., & Joseph, V. R. (2009). Bayesian optimal single arrays for robust parameter design. Technometrics, 51(3), 250–261. https://doi.org/10.1198/tech.2009.08057
  • Kerr, M. K. (2001). Bayesian optimal fractional factorials. Statistica Sinica, 11(3), 605–630.
  • Kiefer, J. (1975). Construction and optimality of generalized Youden designs. In A survey of statistical design and linear models (pp. 333–353). North-Holland.
  • Lawson, J. (2020). Comparison of conditional main effects analysis to the analysis of follow-up experiments for separating confounded two-factor interaction effects in 2IVk−p fractional factorial experiments. Quality and Reliability Engineering International, 36(4), 1454–1472. https://doi.org/10.1002/qre.v36.4
  • Mak, S., & Wu, C. F. J. (2019). cmenet: A new method for bi-level variable selection of conditional main effects. Journal of the American Statistical Association, 114(526), 844–856. https://doi.org/10.1080/01621459.2018.1448828
  • Mitchell, T. J., Morris, M. D., & Ylvisaker, D. (1995). Two-level fractional factorials and Bayesian prediction. Statistica Sinica, 15(2), 559–573.
  • Mukerjee, R., & Huda, S. (2016). Approximate theory-aided robust efficient factorial fractions under baseline parameterization. Annals of the Institute of Statistical Mathematics, 68(4), 787–803. https://doi.org/10.1007/s10463-015-0509-x
  • Mukerjee, R., & Tang, B. (2012). Optimal fractions of two-level factorials under a baseline parametrization. Biometrika, 99(1), 71–84. https://doi.org/10.1093/biomet/asr071
  • Mukerjee, R., & Wu, C. F. J. (2001). Minimum aberration designs for mixed factorials in terms of complementary sets. Statistica Sinica, 11(1), 225–239.
  • Mukerjee, R., & Wu, C. F. J. (2006). A modern theory of factorial designs. Springer.
  • Mukerjee, R., Wu, C. F. J., & Chang, M. C. (2017). Two-level minimum aberration designs under a conditional model with a pair of conditional and conditioning factors. Statistica Sinica, 27(3), 997–1016.
  • Pistone, G., & Wynn, H. P. (1996). Generalised confounding with gröbner bases. Biometrika, 83(3), 653–666. https://doi.org/10.1093/biomet/83.3.653
  • Sabbaghi, A. (2020). An algebra for the conditional main effect parameterization. Statistica Sinica, 30(2), 903–924.
  • Su, H., & Wu, C. F. J. (2017). CME analysis: A new method for unraveling aliased effects in two-level fractional factorial experiments. Journal of Quality Technology, 49(1), 1–10. https://doi.org/10.1080/00224065.2017.11918181
  • Suen, C. Y., Chen, H., & Wu, C. F. J. (1997). Some identities on qn−m designs with application to minimum aberration designs. Annals of Statistics, 25(3), 1176–1188. https://doi.org/10.1214/aos/1069362743
  • Sun, C. Y., & Tang, B. (2022). Relationship between orthogonal and baseline parameterizations and its applications to design constructions. Statistica Sinica, 32(1), 239–250.
  • Taguchi, G. (1987). System of experimental design: Engineering methods to optimize quality and minimize costs. UNIPUB/Kraus International Publications.
  • Tang, B., & Deng, L. Y. (1999). Minimum G2-aberration for nonregular fractional factorial designs. Annals of Statistics, 27(6), 1914–1926.
  • Tang, B., & Wu, C. F. J. (1996). Characterization of minimum aberration 2n−k designs in terms of their complementary designs. Annals of Statistics, 27(6), 1914–1926.
  • Wu, C. F. J. (2015). Post-Fisherian experimentation: From physical to virtual. Journal of the American Statistical Association, 110(510), 612–620. https://doi.org/10.1080/01621459.2014.914441
  • Wu, C. F. J. (2018). A fresh look at effect aliasing and interactions: Some new wine in old bottles. Annals of the Institute of Statistical Mathematics, 70(2), 249–268. https://doi.org/10.1007/s10463-018-0646-0
  • Wu, C. F. J., & Hamada, M. (2021). Experiments: Planning, analysis, and optimization (3rd ed.). Wiley Series.
  • Xu, H. (2003). Minimum moment aberration for nonregular designs and supersaturated designs. Statistica Sinica, 13(3), 691–708.
  • Xu, H., & Wu, C. F. J. (2001). Generalized minimum aberration for asymmetrical fractional factorial designs. Annals of Statistics, 29(2), 1066–1077.
  • Yang, Y. H., & Speed, T. P. (2002). Design issues for cDNA microarray experiments. Nature Reviews Genetics, 3(8), 579–588. https://doi.org/10.1038/nrg863

Appendix. Proof of Theorem 4.1

In this proof, a traditional factorial effect is represented by a word, i.e. a subset of {1,,n}. For two words W1 and W2, we define W1W2 to be (W1W2)(W1W2). Note that Ksl(h)=N2tr[Xh1XslXslXh1], which is the sum of squared entries of N2Xh1Xsl. Because the design is regular, each squared entry is either one or zero according to whether the corresponding effects are aliased.

Part (a) is evident from Tang and Deng (Citation1999) except that the number of factors considered in the computation is n−2 (exclude F1 and F3). So we have K0l(0)=(l+1)Al+1(1)+(nl1)Al1(1).

For (b), let Sl be the set of all words of length l not containing any word involving 1 and 3. Let Sl2 be a subset of Sl and 2 belongs to each word in Sl2. Then {1}W, WSl2, is of the form {1,2}(W{2}), where (W{2}) is of length l−1. Similarly, {1}W, W(SSl2), is of the form {1}W, where W is of length l. Similar argument can be made when the roles of F1 and F3, F2 and F4 are interchanged, respectively. By the definition of Al(21) and Al(22), we obtain K0l(1)=Al1(2)+Al(2).

For (c), first consider {2}({1}W) and {2}({1,2}W), where W runs through all the words not involving F1,F2,F3 and has length l−1. It is equivalent to consider {1,2}W and {1}W for such W's. This yields Al1(21) in K1l(0). Next we consider {j}({1}W) and {j}({1,2}W), where j=4,,n and W runs through all the words not involving F1,F2,F3 and has length l−1. By Tang and Deng (Citation1999), this yields (nl1)Al2(21)+lAl(21) in K1l(0). Similar argument can be made when the roles of F1 and F3, F2 and F4 are interchanged, respectively. By the definition of Al(21) and Al(22), we obtain K1l(0)=(nl1)Al2(2)+Al1(2)+lAl(2).

For (d), first consider {1}({1}W), {1}({1,2}W), {1,2}({1}W) and {1,2}({1,2}W), where W runs through all the words not involving F1,F2,F3 and has length l−1. It is equivalent to consider W, {2}W, {2}W and W for such W's. This yields 2Al1(31) in K1l(1). Next consider {1}({3}W), {1}({3,4}W), {1,2}({3}W) and {1,2}({3,4}W), where W runs through all the words not involving F1,F3,F4 and has length l−1. For such W's, {1}({3}W) and {1}({3,4}W) yield Al1(42) in K1l(1); {1,2}({3}W) and {1,2}({3,4}W) yield Al2(43)+Al1(52) in K1l(1). Similar argument can be made when the roles of F1 and F3, F2 and F4 are interchanged, respectively. By the definition of Al(3), we obtain K1l(1)=2Al1(3)+2{Al1(42)+Al2(43)+Al1(52)}.

For (e), first consider {2}({1,3}W), {2}({1,3,4}W), {2}({1,2,3}W), {2}({1,2,3,4}W) and {4}({1,3}W), {4}({1,3,4}W), {4}({1,2,3}W), {4}({1,2,3,4}W), where W runs through all the words not involving F1,F2,F3,F4 and has length l−2. This yields 2Al2(7) in K2l(0). Next consider {j}({1,3}W), {j}({1,3,4}W), {j}({1,2,3}W), {j}({1,2,3,4}W) for j=5,,n. By Tang and Deng (Citation1999), this yields (nl1)Al3(7)+(l1)Al1(7). So we obtain K2l(0)=2Al2(7)+(nl1)Al3(7)+(l1)Al1(7).

For (f), consider {j}({1,3}W), {j}({1,3,4}W), {j}({1,2,3}W), {j}({1,2,3,4}W) for j = 1, 3, and {i,j}({1,3}W), {i,j}({1,3,4}W), {i,j}({1,2,3}W), {i,j}({1,2,3,4}W) for (i,j)=(1,2),(3,4), where W runs through all the words not involving F1,F2,F3,F4 and has length l−2. This yields 2Al2(8). So we obtain K2l(1)=2Al2(8).