192
Views
1
CrossRef citations to date
0
Altmetric
Research Article

On the existence and generation of non- trivial BIBDs

&
Received 31 Mar 2023, Accepted 04 Oct 2023, Published online: 02 Nov 2023

Abstract

A balanced incomplete block design (BIBD) is an efficient tool to infer parameters of a linear model. With a BIBD v treatments are compared in b blocks of size k. A variety of algebraic methods were developed to determine BIBDs depending on the parameter pair (v,k). Despite the progress in the field, there are many pairs (v,k) without an algebraic solution. For such pairs it is solely known, that a trivial BIBD exists, which is commonly large. In this study we present new results on the existence and generation of BIBDs which are smaller than the trivial ones. Such so-called non- trivial BIBDs exist if a trivial BIBDs is not elementary. With a recursive method of BIBD- generation it is shown that for v126, 3kv/2, and (v,k) (8, 3), there exist non- trivial BIBDs. The same is shown for v1000 and 22kv/2. Numerical programs were written for the generation of new non- trivial BIBDs, which delivered smaller BIBDs than the recursive method for 25v40. Commonly used R-programs like “crossdes” and “ibd” failed to determine such BIBDs.

1. Introduction

Balanced incomplete block designs (BIBDs) are well-tried tools in experimental practise. A BIBD can be defined as a v×b incidence matrix X, where v is the number of treatments and b the number of blocks, see e.g. Colbourn and Dinitz (Citation2006). For v=7 and b=7, an incidence matrix of a BIBD is X=(1110000100110001010101000011001100100101100100101).

The conditions for X to represent a BIBD are: In each row i, there are r unities, i.e. j=1bxi,j=r, in each column j, there are k unities, i.e. i=1vxi,j=k, and for each pair i and i′ of rows, there are λ unities in common, i.e. j=1bxi,jxi,j=λ. In the following, these conditions will be called r-, k-, and λ- conditions. With the example we have r=3, k=3, and λ=1. A compressed form of the incidence matrix is (111223423536454765767), where column j consists of the treatments occurring in block j. For example, the first block consists of treatments (1,2,4).

A BIBD is characterized by the quintuple (v,k,b,r,λ). The well-known necessary conditions for the existence of a BIBD are: (1) bv(1) (2) v r=b k(2) (3) λ(v1)=r(k1).(3)

Parameters b,r, and λ fulfilling the necessary conditions for given v and k are called admissible.

It is known, that for each pair (v,k) the so- called trivial BIBD exists. A BIBD is called trivial if it’s incidence matrix is identical to the set of all (vk) possible v-tuples (column vectors) with k unities and vk zeros. The appropriate parameters of a trivial BIBD are (4) bt=(vk), rt=(v1k1) and λt=(v2k2).(4)

A BIBD for a pair (v, k) is called elementary if it cannot be split into at least two BIBDs for this (v, k). For many pairs (v, k) it is known, that the trivial BIBD is not elementary and can be reduced, i.e. there exist smaller BIBDs. Therefore, trivial BIBDs are also known as unreduced BIBDs.

For the example above we have (bt,rt,λt)=(35,15,5). The blocks of the trivial BIBD are

(1,2,3), (1,2,4), (1,2,5), (1,2,6), (1,2,7), (1,3,4), (1,3,5), (1,3,6), (1,3,7), (1,4,5), (1,4,6), (1,4,7), (1,5,6), (1,5,7), (1,6,7), (2,3,4), (2,3,5), (2,3,6), (2,3,7), (2,4,5), (2,4,6), (2,4,7), (2,5,6), (2,5,7), (2,6,7), (3,4,5), (3,4,6), (3,4,7), (3,5,6), (3,5,7), (3,6,7), (4,5,6), (4,5,7), (4,6,7), and (5,6,7).

This trivial BIBD can be partitioned into three elementary BIBDs, see Rasch and Herrendörfer (Citation1985). The first one (printed with bold digits) was given above. The second one (printed with italic digits) has also parameters b=7, r=3, λ=1. The set of the residual 21 of the 35 blocks is an elementary BIBD with b=21, r=9, λ=3.

The first author of Rasch, Teuscher, and Verdooren (Citation2016) observed that there is a trivial BIBD, which cannot be reduced: for (v, k) = (8, 3) there is no BIBD which is smaller than the trivial one. He formulated the conjecture: For 3kv/2, the case (v, k) = (8, 3) is the only one where the trivial BIBD is elementary.

This conjecture is relevant in the given context. If it was wrong, there were other pairs (v, k) for which the BIBD were elementary and attempts to generate a non- trivial BIBD were useless. If the conjecture was true, such attempts were reasonable.

While a proof of the conjecture is certainly difficult, a disproof would be complete, if a pair (v, k) was found, for which the smallest admissible parameters (b,r,λ) are those of the trivial BIBD. With the sections two and three of this study, this possibility is refuted. In section two, results on the smallest admissible parameters are summarized. In section three it is shown that for 3kv/2 and (v,k) (8, 3) there are admissible parameters which are smaller than those of the trivial design.

Note, that the restriction kv/2 has a technical background. If a BIBD for kv/2 exists, one can generate the complementary BIBD by exchanging zeros and unities in the incidence matrix. This complementary BIBD has parameters v*=v, k*=vk, b*=b, r*=br, and λ*=b2r+λ. Then, k*=vkvv/2=v/2, i.e. complementary BIBDs ensure the existence of BIBDs for k>v/2. In this study, we concentrate without loss of generality on designs for kv/2. Therefore, the BIBDs for (v, k) = (8, 3) and (v, k) = (8, 5) are elementary.

It is a demanding algebraic task to construct BIBDs for given v and k, see e.g. Colbourn and Dinitz (Citation2006). (When we write about the construction of BIBDs we mean the construction of non- trivial BIBDs.) The priority aim is to find the smallest BIBDs. Several authors contributed results on the existence, construction, and nonexistence of BIBDs with specific parameters. For a list of literature, we refer to Rasch, Teuscher, and Verdooren (Citation2016). Rasch et al. (Citation2011) collected 21 available methods to construct BIBDs.

There are a lot of pairs (v,k), for which no construction method is known. For v50, these pairs can be found in of Rasch, Teuscher, and Verdooren (Citation2016). The smallest v with an unknown non- trivial BIBD is v=26. For v25, the smallest BIBDs are known, see Rasch et al. (Citation2011). These BIBDs can be generated with the software package CADEMO, see Rasch et al. (Citation1987) and Rasch, Guiard, and Nürnberg (Citation1990). Rasch and Verdooren (Citation2023) made these BIBDs publicly available.

When for a pair (v,k) no specific construction methods are known, one is legitimated to try numerical search procedures. Numerical local search programs were developed. Most popular are the R- procedures “crossdes” (https://CRAN.R-project.org/package=crossdes) by Sailer (Citation2005) and “ibd” (https://CRAN.R-project.org/package=ibd), basing on the work of Mandal, Gupta, and Parsad (Citation2014a, Citation2014b) and Mandal (Citation2015). Prestwich (Citation2003) and Rueda, Cotta, and Fernandez (Citation2009) have investigated other general numeric algorithms. The data of Prestwich (Citation2003), ranging up to v=31, were taken by the later authors as reference data to evaluate the efficiency of the new methods. As their authors reported, the programs and algorithms do not always produce valid BIBDs, insofar that the generated designs do not always hold the conditions on r, k, and λ. Particularly, several BIBDs for v25 cannot be found, although they exist. Furthermore, the convergence of “crossdes” and “ibd” is problematic for vb>1000, i.e. for v32.

Due to these restrictions, “crossdes” and “ibd” are not useful to generate new BIBDs. For unknown BIBDs with 26v31, vb3380 is valid and the programs do not converge. Therefore, for the generation of new BIBDs new concepts are needed.

In Appendix A, the recursive generation of BIBDs is introduced. It is shown, how to construct a BIBD for the pair (v,k) from the BIBDs for the pairs (v1,k) and (v1,k1). With the recursive method, relevant propositions about the existence of non- trivial BIBDs are found.

In Appendix B, a general search procedure is presented, which ensures the generation of valid BIBDs. The usefulness of the methods is summarized in section four.

2. The smallest admissible parameters

In this section, we present essential facts on the smallest admissible parameters for parameters (v,k). In the literature, we did not find these results, although they are partly known, see the programs of Mandal (Citation2015).

Theorem 1.

With (b*,r*,λ*)=(b,r,λ)GCD(b,r,λ), where (b,r,λ) is an admissible triple for (v,k), the smallest admissible triple is (5) (b, r, λ)=μ (b*,r*,λ*),(5) where μ is the smallest integer with μb*v. (GCD = Greatest Common Divisor; the division and multiplication have to be carried out element-wise.)

Proof

: From Equation(2) and Equation(3) it follows (6) br=vk, rλ=v1k1, and bλ=v(v1)k(k1),(6) i.e. the ratios between the parameters b, r, λ depend on v and k only. Let there be another set of admissible parameters b*, r*, and λ*. Then,

b*r*=br, r*λ*=rλ and b*λ*=bλ or

b*b=r*r, r*r=λ*λ and b*b=λ*λ, i.e. (7) b*b=r*r=λ*λ=c(7)

holds for some rational number c=m/n with natural numbers m and n. Hence, b*=mn b,  r*=mn r,  and  λ*=mn λ.

The smallest triple of natural numbers (b*, r*, λ*) is obtained for n=GCD(b,r,λ) and m=1. In case b*<v, the parameters do not satisfy condition Equation(1). Therefore, they have to be multiplied with μ, the smallest integer with μ b*v.

EquationEquation (5) shows how to obtain the smallest admissible parameters from admissible ones. The parameters of the trivial BIBD, mentioned in the introduction, are admissible. Hence, they could be used as a starting point. It is however well known that there are smaller admissible parameters:

Eliminating r of Equationequation (2) gives r=bkv. Eliminating λ of Equationequation (3) gives λ=r(k1)v1 and substitution of r results in λ=bk(k1)v(v1). The parameters b, r, and λ are admissible if b, r, and λ are integers for arbitrary v and k and if bv holds. This clearly is guaranteed with b=(v1)v. Therefore, (8) b=(v1)v,  r=(v1)k, and  λ=(k1)k(8) are admissible parameters. The smallest admissible parameters are determined by Equationequation (5).

Often, no BIBD exists for the smallest admissible parameters (b, r, λ), e.g. for (v,k)=(15, 5) and (22, 7). One could think that the next admissible parameters would be n(b, r, λ) with n=2,3, However, thinking so could hide the smallest BIBD. For example, for (v,k)=(34, 12) we have via Equation(8) admissible parameters (b,r,λ)=(33×34, 33×12, 11×12) and GCD(b,r,λ)=33×2. Hence, from theorem 1, we obtain (b*,r*,λ*)=(17, 6, 2) and because of 17<v=34, μ=2 and (b, r, λ)=(34, 12, 4). It is known that for these parameters no BIBD exists. The former idea would lead to the next potentially admissible parameters 2(34, 12, 4)=(68, 24, 8). However, with μ=3 we obtain μ(b*,r*,λ*)=(52, 18, 6). Theory shows, that for these parameters, a BIBD exists.

Theorem 2:

With (b,r,λ)=(v(v1),(v1)k,(k1)k) and (b*,r*,λ*)=(b,r,λ)GCD(b,r,λ), the ordered set of admissible parameters v and k is given by (μ+n) (b,r,λ)GCD(b,r,λ) with n=0, 1, 2, , were μ is the smallest integer with μb*v.

3. For which v and k are the smallest admissible parameters identical with those of the trivial BIBD?

Theorem 3:

For all 3kv/2, with exception of (v,k)=(8,3), an admissible triple (b,r,λ) exists which is smaller than that of the trivial BIBD (bt,rt,λt).

Proof:

We investigate btb, where b=(v1)v belongs via Equation(8) to an admissible parameter triple. If bt>b holds, the admissible parameters are smaller than the trivial ones. If btb holds, we check via theorem 1, whether there are admissible parameters smaller than bt and b. If there are none, the trivial BIBD is elementary.

The relation bt/b=1 is equivalent to (9) (k+1)(k+2)(v2)(vk)!=1.(9)

In the numerator, we have vk2 factors. In the denominator, we have vk1 factors larger than one. Hence, with m=vk2 and a=k+1, we are looking for all solutions of (10) i=0m1(a+i)(m+2)!=1.(10)

For m=1 we get a/6=1, i.e. a=6 and following Equation(8) we receive k+1=v2=6, i.e. k=5 and v=8.

For m=2 we get a(a+1)/4!=1, which has no integer solution.

For m=3 we consider a(a+1)(a+2)/5!, which is a monotone function in a. For a=3 it is a(a+1)(a+2)/5!=1/2. For a=4 it is one. For a=5 it is 7/4. Thus a=4 is the only solution with k+1=a and v2=a+2, i.e. k=3 and v=8.

For m>3 and a=3 we obtain 3×4××(m+2)(m+2)!=12. For m>3 and a=4 we obtain 4×5××(m+3)(m+2)!=m+36. Since m>3, this expression is larger than one. Due to monotonicity with respect to a, there is no solution of equ. Equation(9) for m>3.

With the same technique it can be shown that for the designs with (v,k)= (6, 3) and (7, 3), there are solutions with bt<v(v1). However, it is known that existing BIBDs are smaller than the trivial designs. For v=6 and k=3 (bt=20, rt=10, λt=4) the BIBD with b=10, r=5 and λ=2 exists and for v=7 and k=3 (bt=35, rt=15, λt=5), the BIBD with b=7, r=3 and λ=1 exists. □

3.1. Conclusion

The equation (11) GCD(bt,rt,λt)=GCD((vk),(v1k1),(v2k2))=1(11)

can hold only for (v,k)=(8,3) and (v,k)=(8,5).

4. The generation of new BIBDs

For v25, the smallest BIBDs are known and reported by Rasch et al. (Citation2011). For v>25, there are many (v, k) pairs for which it is not known if non-trivial BIBDs exist, see, e.g. of Rasch, Teuscher, and Verdooren (Citation2016). Numerical programs like “crossdes” or “ibd” do not allow the generation of missing BIBDs for two reasons. The first one is that the algorithms often do not terminate for v b>1000. The second reason is that they often do not generate valid BIBDs. To overcome the problems, two new approaches were used.

Table 1. Generated BIBDs. The parameter pairs (v,k) comprise those pairs with v40 and kv/2, for which non- trivial BIBDs were not known. The lengths of the trivial BIBDs are denoted with bt. The smallest length satisfying the necessary conditions Equation(1), Equation(2), and Equation(3) is denoted with b. The lengths of the BIBDs generated with two methods are denoted with b. The factor f in parentheses says that b is the f- fold of b.

The first one is a recursive method, which generates a BIBD for (v,k) from the BIBDs for (v1,k) and (v1,k1). The method is described in Appendix A. When a preceding BIBD is not known, it has to be generated from its predecessors. This may go back to the BIBDs for v*, where v* is a prime power, i.e. a prime number or a power of a prime number. For such a v*, the smallest BIBDs can be generated (at least theoretically; for large v*, there might be practical numerical problems).

With the recursive method, BIBDs for parameter pairs (v,k) with 26v1000 and 10kv/2 were generated. From 243994 generated BIBDs, only 282 were trivial. This appeared particularly, when the distance between v and v*, the largest prime power smaller than v, was large. The generation of a BIBD for v=v*+1, i.e. v is the successor of a prime power, never led to a trivial BIBD. The same is true up to v=v*+6, i.e. trivial BIBD appeared for vv*+7.

The results are summarized by the following theorem.

Theorem 4:

For 26v126, there exists a non- trivial BIBD for each pair (v,k). For all pairs (v,k) with 26v1000 and k22, non- trivial BIBDs exist.

For 26v40, the sizes of the BIBDs generated with the recursive method are presented in . The table documents those pairs (v,k), for which the existence of non-trivial BIBDs was not known.

shows that the BIBDs generated with the recursive method are considerable smaller than the trivial BIBDs, but mostly much larger than the smallest admissible designs.

In Appendix B, basic and advanced numerical methods are introduced. They were intended to reach three goals. The first goal was the generation of approximate BIBDs for parameters v, k, and the smallest admissible b. The generation is guaranteed not only for parameters with vb<1000, but for parameters with vb<100000. The second goal was to generate valid BIBDs with elementary methods. The third goal was the improvement of BIBDs with advanced methods, i.e. the generation of BIBDs smaller than those generated with the basic methods.

With the basic numerical methods, BIBDs were generated for the pairs (v,k) presented in . With the exception for (v,k)=(26,12), these BIBDs were smaller than those generated with the recursive method, partly drastically.

The application of the advanced numerical methods to the pairs (v,k) of is not completed yet. So far, for twenty two pairs it succeeded to generate BIBDs with f=1 and b=b, i.e. the smallest possible ones (marked by an asterisk in the last column of the table). We are optimistic to find BIBDs with a maximum factor f=2 for the remaining pairs (v,k) of . For example, the intermediate result for the pair (v,k)=(39,15) is f=4, while it was f=32 with the basic methods. With an enlarged effort, a further improvement is expected. Hence, the potential of the advanced methods is large but the needed running time is large as well.

5. Discussion

The new methods allowed a considerable progress in generating BIBDs. With the recursive method, non-trivial BIBDs were generated for v126. Since the generated BIBDs were often large, the recursive method is more of theoretical than of practical importance.

The basic numerical methods were applied to produce BIBDs for 26v40 whose existence was unknown so far. For several pairs (v,k), the advanced numerical methods were applied. The generated BIBDs were smaller than those generated with the recursive method. For twenty two pairs (v,k), a fully satisfying result was obtained: the generation of a BIBD for the smallest admissible parameters (f=1, b=b). For the other pairs, further improvements demand longwinded calculations.

As a starting point of our procedures, approximate BIBDs were generated for given (v,k) and the smallest admissible number of blocks b′ “Approximate” means that the deviations of the actual λi,j from the nominal λ are minimized, while the r- and k- conditions are satisfied. As a rule, the obtained differences between λi,j and λ are −1, 0, or 1. Sometimes, the generated design is a valid BIBD, i.e. all obtained λi,j are identical with λ.

With the basic tool 1, approximate BIBDs can be calculated also for inadmissible numbers of blocks b. An example is given, that the results can compete with those of the program “ibd”. For (v,k)=(8,4) and b=8 (with the smallest admissible parameter b=14) we obtained the incidence matrix X and the concurrence matrix C=XX: X=[1100001111011000011010101010110000001111001101100101010110110001],  C(X)=[4221212224221122224222111224221221224221112224222211224222121224].

The design is balanced with respect to k and r. The pairwise parameters λi,j vary between one and two (with mean λ¯=12/7). The A- and D- efficiency parameters were 0.991 and 0.996, respectively. The concurrence matrix obtained with “ibd” contained pairwise parameters λi,j between zero and four. The parameters of A- and D-efficiency were 0.933 and 0.971, respectively. Hence, our result is clearly more balanced.

Ignoring the problem of large running times, there was a simple method to generate a BIBD for admissible parameters (v,k,b,r,λ): One could generate all incidence matrices of dimension v×b with r unities and br zeros in each row and check each matrix, whether the k- and λ- conditions for a BIBD are fulfilled. (Without loss of generality, the first two rows and the first column could be fixed in advance: x1,1==x1,r=1, x2,1==x2,λ=1, x2,r+1==x2,2rλ=1, x1,3==x1,k=1. The other entries were zero. The entries of the v-th row can be calculated by xv,j=kn=1v1 xn,j.) When an admissible matrix is found, it is the incidence matrix of a BIBD. Otherwise, applying theorem 2, the next larger admissible parameters could be attempted. However, this method is so time demanding, that it is not realizable even for small designs.

When there is no practicable strategy to solve the problem as a whole, sequential strategies are needed. With our methods, and with the programs “crossdes” and “ibd”, the strategy is the subsequent increasement of admissible rows of the incidence matrix. Often, the concrete problem aroused that at a certain stage, it was not possible to generate the next admissible row. Hopefully, theoretical results on this problem are obtainable.

One could call this problem the inverse BIBD problem: We consider row vectors of length β with ϱ unities and βϱ zeros. How many such row vectors can be built, so that each pair of them has λ unities in common (there are λ columns with unities for both row vectors)?

Those parameters β, ϱ, and λ are of interest which coincide with the smallest admissible parameters b, r, and λ for a BIBD with (v,k). If a BIBD exists for these parameters, the answer on the formulated question would be: v. If theoretical results (accompanied by a construction method) on the inverse BIBD problem would, for example, deliver v2, v3, or v4 admissible row vectors, a doubling of the design and the use of our methods would probably lead to a BIBD. With our methods it even happened that with a doubling of a design, seven new admissible rows were generated.

Of particular interest is the inverse problem for b=v(v1), r=k(v1), and λ=k(k1). Often, these parameters are the smallest admissible ones (ten cases in ). If for them BIBDs would exist, we had another case were the necessary conditions Equation(1) are sufficient. Furthermore, the conjecture of Rasch, Teuscher, and Verdooren (Citation2016) could be proofed via theorem 3.

Generally, the numerical search for BIBDs is a very expensive procedure. On the other hand, a suggested design can be checked for being a valid BIBD by some matrix operations. This discrepancy reminds of the “P versus NP” problem, see Clapham and Nicholson (Citation2014). Since the effort of optimization procedures with integer variables increases exponentially with the number of variables, the numerical generation of BIBDs is an excellent candidate for solving the “P versus NP” problem. A closer look of specialists is emphasized.

Generated BIBDs and the programs for the generation of approximate designs and BIBDs will be sent to interested readers.

Acknowledgement

The authors are indebted to the reviewers for valuable hints.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

  • Abel, R. J. R., I. Bluskov, and M. Greig. 2001. Balanced incomplete block designs with block size 8. Journal of Combinatorial Designs 9 (4):233–68. doi:10.1002/jcd.1010.
  • Abel, R. J. R., I. Bluskov, and M. Greig. 2004. Balanced incomplete block designs with block size 9: Part III. Australasian Journal of Combinatorics 30:57–73.
  • Abel, R. J. R., and M. Greig. 1998. Balanced incomplete block designs with block size 7. Designs, Codes and Cryptography 13 (1):5–30. doi:10.1023/A:1008204220755.
  • Clapham, C., and J. Nicholson. 2014. The concise Oxford dictionary of mathematics. 5th ed. Oxford: Oxford University Press.
  • Colbourn, C. J., and J. H. Dinitz. (eds.) 2006. Handbook of combinatorial designs. 2nd ed. Boca Raton: Chapman and Hall.
  • Mandal, B. N. 2015. Linear integer programming approach to construction of balanced incomplete block designs. Communications in Statistics - Simulation and Computation 44 (6):1405–11. doi:10.1080/03610918.2013.821482.
  • Mandal, B. N., V. K. Gupta, and R. Parsad. 2014a. Efficient incomplete block designs through linear integer programming. American Journal of Mathematical and Management Sciences 33 (2):110–24. doi:10.1080/01966324.2014.901198.
  • Mandal, B. N., V. K. Gupta, and R. Parsad. 2014b. Balanced treatment incomplete block designs through integer programming. Communications in Statistics - Theory and Methods 46 (8):3728–37. doi:10.1080/03610926.2015.1071394.
  • Prestwich, S. 2003. A local search algorithm for balanced incomplete block designs. In CP 2003. LNCS, ed. F. Rossi, vol. 2833, 53–64. Heidelberg: Springer.
  • Rasch, D., V. Guiard, and G. Nürnberg. 1990. Present and planned future of the expert system CADEMO. In SOFTSTAT'89 Fortschritte der Statistik-Software 2, eds. F. Faulbaum, R. Haux, und K-H. Jöckel (Hrsg.), 332–9. Stuttgart: Gustav Fischer.
  • Rasch, D., V. Guiard, G. Nürnberg, P. E. Rudolph, and F. Teuscher. 1987. The expert system CADEMO. Computer Aided Design of Experiments and Modelling. Statistical Software Newsletter 13:107–14.
  • Rasch, D., and G. Herrendörfer. 1985. Experimental design: Sample size determination and block designs. Dordrecht: D. Reidel Publishing Company.
  • Rasch, D., J. Pilz, L. R. Verdooren, and A. Gebhardt. 2011. Optimal experimental design with R. Boca Raton: Chapman and Hall.
  • Rasch, D., F. Teuscher, and L. R. Verdooren. 2016. A conjecture about BIBDs. Communications in Statistics - Simulation and Computation 45 (5):1526–37. doi:10.1080/03610918.2014.955111.
  • Rasch, D., and L. R. Verdooren. 2023. Angewandte Statistik mit R für Agrarwissenschaften. Berlin: Springer.
  • Rueda, D. R., C. Cotta, and A. J. Fernandez. 2009. Finding balanced incomplete block designs with metaheuristics. In Proceedings of the 9th European Conference on evolutionary computation in combinatorial optimization. Berlin Heidelberg: Springer-Verlag. doi:10.1007/978-3-642-01009-5_14.
  • Sailer, O. 2005. crossdes: A package for design and randomization in crossover studies. Rnews 5(2):24–7.
  • Wolfram, S. 1999. The mathematica book. 4th ed. Cambridge: Wolfram Media/Cambridge University Press.
  • Zeidler, E. 2004. Oxford users’ guide to mathematics. Oxford: Oxford University Press.

Appendix A.

The construction of a BIBD for (v, k) from the BIBDs with (v-1, k) and (v-1, k-1)

Let X be the v×b incidence matrix, consisting of zeros and unities, of a BIBD with parameters (v,k,b,r,λ). We order the columns of this matrix so that the first row starts with zeros and ends with unities: X=[0 0 0  0 0 01 1 1  1 1 1X1X2], where X1 and X2 are certain zero- one matrices. Since the number of unities in the first row is r and the number of zeros is therefore br, X1 has dimension (v1)×(br) and X2 has dimension (v1)×r.

We now assume, that X1 and X2 are BIBDs. Since the sum of column elements in X is k, the sum of column elements in X1 is also k, while it is k1 in X2. Hence, X1 and X2 have parameters (v1,k,b1=br,r1,λ1) and (v1,k1,b2=r,r2,λ2), respectively.

Then, the parameters of the BIBDs are related by the following conditions: b=b1+b2 (A1) r=r1+r2  and(A1) λ=λ1+λ2.

Example A1

: Consider the smallest BIBD for v = 8 and k = 4. Following Rasch et al. (Citation2011), it has parameters (8,4,14,7,3). An ordered (with respect to the first row) incidence matrix is, e.g. X=[0000000111111111110000000111110011000110011010101010101010010110110100001111011000010101101101001001100111001100].

Ignoring the first row, the matrix X can be partitioned into matrices X1 and X2. The matrices X1=[1111000110011010101011001011001111001011010110011] and  X2=[0000111001100101010100110100110000110100101001100] are BIBDs with parameters (7,4,7,4,2) and (7,3,7,3,1), respectively (the sum of row elements is ri, the sum of column elements is ki, and each pair of rows has λi unities in common). Particularly, X1 is the complementary design to X2 (zeros and unities are interchanged).

Now, we change the viewpoint. With given BIBDs X1 with (v1,k,b1,r1,λ1) and X2 with (v1,k1,b2,r2,λ2) (not necessarily the smallest ones) we construct a BIBD X with (v,k,b,r,λ) in the following manner: X=[0 0 0  0 0 01 1 1  1 1 1X1X1X2X2], where the design X1 appears n1 times and the design X2 appears n2 times. The parameters of X result according to b=n1 b1+n2 b2, (A2) r=n1 r1+n2 r2,(A2) λ=n1 λ1+n2 λ2.

From the first row, we obtain the condition r=n2 b2, since there are n2 b2 unities. Furthermore, the first row has unities in common with other rows only on the right-hand side, namely n2 r2, i.e. λ=n2 r2. Hence there are to solve two equations: n2 b2=n1 r1+n2 r2 and n2 r2=n1 λ1+n2 λ2. Setting q=n1/n2, this gives (A3) b2r2=q r1 and(A3) r2λ2=q λ1.

From Equation(A1), Equation(2), v1=v2=v1, and k=k1=k2+1 we get (A4) q=b2r2r1=b2b2k2v2b1k1v1=b2(v1)b2k2b1k1=b2(v1k+1)b1k=b2(vk)b1 k.(A4)

Having canceled out common factors of the numerator and denominator of q, we obtain n1=numerator(q) and n2=denominator(q).

Example A2

: Consider BIBDs for v=26. From of Rasch, Teuscher, and Verdooren (Citation2016), the existence of non-trivial BIBD is unknown for k=11 and k=12. Available software like “crossdes” and “ibd” fail to generate BIBDs for these parameters. Since the smallest BIBDs for v=25 are known (Table 6.1 of Rasch et al. (Citation2011)), the recursive method can be applied. Afterwards it can be checked, whether the resulting BIBDs are trivial or shorter.

For the construction of the BIBD for v=26 and k=11 we use the BIBDs (25,11,300,132,55) and (25,10,40,16,6), i.e. b1=300 and b2=40. Following (A6) we obtain q=2/11 and b=1040, shown within .

Table A1. The construction of BIBDs for v=26 and k=11 and 12. The length of the smallest admissible design is b.

For the construction of the BIBD for v=26 and k=12 we use the BIBDs (25,12,50,24,11) and (25,11,300,132,55), i.e. b1=50 and b2=300. Following (A6) we obtain q=7/1 and b=650.

The constructed designs have parameters (26,11,1040,440,176) and (26,12,650,300,132). The first one is the eight-fold of the smallest admissible design (with length b’) and the second one is the two-fold of the smallest admissible design. Both are much shorter than the trivial BIBDs.

The described construction of a BIBD for given v and k assumes the availability of BIBDs for (v1,k) and (v1,k1). If this assumption is not fulfilled, the method is not directly applicable. Then, BIBDs for v2 are required, i.e. the method has to be used recursively. Fortunately, we have not to go backwards too much, since we may start with v*, a prime number or a power of a prime number. With such a v=v*, BIBDs can always be constructed, e.g. via method 6 of Rasch et al. (Citation2011). The lengths b of such BIBDs are at most b=v(v1). This length was used in the recursive procedure. Beside the BIBDs for a prime power v*, the BIBDs for k=7, 8,  and 9 are taken as known, following the theory of Abel and Greig (Citation1998), Abel, Bluskov, and Greig (Citation2001), and Abel, Bluskov, and Greig (Citation2004).

Appendix B:

Basic and advanced numerical methods

Advanced methods are introduced, which allow the generation of shorter BIBDs than generated with the basic program.

The basic numerical method

We describe the numerical method to generate a BIBD by means of an example: (v,k)=(15,5). We start with an incidence matrix X for the smallest admissible design, which has parameters (v,k,b,r,λ)=(15,5,21,7,2). X has to satisfies the r and k condition. Such a matrix is simply constructed: (B1) X=(111111100000000000000111111100000000000000111111100000000000000111111100000000000000111111100000000000000000000011111110000000000000011111110000000000000011111110000000000000011111110000000000000011111110000000000000000000001111111000000000000001111111000000000000001111111000000000000001111111000000000000001111111)(B1)

As it can be seen, the sums of row elements are seven and the sums of column elements are five. The λ- condition was not in the focus at this stage.

Definition B1

: Let b and r be admissible parameters for given (v,k). Then, a v×b incidence matrix X (consisting of zeros and unities) is called a design, when the sum of row elements is r for each row and when the sum of column elements is k for each column.

The appropriateness of design X is measured by the concurrence matrix C: (B2) C(X)=XXλ 1.(B2)

If X were a BIBD, we had (B3) XX=(rλλλλr λλλλ rλλλλr) and C(X)=(rλ0000rλ 0000 rλ0000rλ).(B3)

For the design Equation(B1), the concurrence matrix is (B4) C(X)=(555552222222222555552222222222555552222222222555552222222222555552222222222222225555522222222225555522222222225555522222222225555522222222225555522222222222222255555222222222255555222222222255555222222222255555222222222255555)(B4) with elements ci,j. The diagonal elements are admissibly ci,i=rλ=5. The non- diagonal elements are all different from zero. Since ci,j2 is a measure of the deviation from zero, the appropriateness of a design X can be quantified by the sum of squared deviations (B5) P(C(X))=i=2vj=1i1ci,j2,(B5)

which is the penalty function. The general aim of an algorithm is to suggest changes within X and to check whether a change decreases the penalty function.

Note that due to the symmetry of C(X), only elements below the diagonal found entrance into P(C(X)). Hence, there is no need to change row one of X. It is therefore called admissible.

Definition B2

: Row i2 of design X is called to be admissible, if ci,1=ci,2==ci,Na1=0 holds, where Na is the number of admissible rows, 1Nav.

For the initial design Equation(B1), we have Na=1. The aim is to increase Na.

The change of one entry, say xi,j, of X (substitution of a “1” by a “0”, or vice versa) would lead to a violation of the r and k  conditions. We allow only changes, which do not violate these conditions. Therefore, a second row, i, and a second column, j, have to be taken into account.

We are ready now to define the first basic algorithm. Often, it has to be applied several times. Then, the parameters will differ (with exception of v and k). When the algorithm is applied the first time, we have b=b, r=r, λ=λ, and Na=1.

Basic tool 1

For a pair (i,i) of rows of design X, Na<i<iv, determine the set M1 of the column numbers j, for which xi,j=1 and xi,j=0 hold. Then, determine the set M2 of the column numbers j, for which xi,j=0 and xi,j=1 hold. (The number of elements of M1 and M2 is rci,iλ.) Then, with X˜=X, put x˜i,j=0, x˜i,j=1, x˜i,j=1, and x˜i,j=0 for a pair (j,j), jM1 and jM2. With such X˜, determine C(X˜)=X˜X˜λ 1 and calculate the penalty parameter P, (B6) P(C(X˜))=i=Na+1vj=1i1[1000 if i=Na+1 1 if iNa+1 ]ci,j2.(B6)

If P(C(X˜)) is smaller than P(C(X)), set X=X˜. In case cNa+1,1=cNa+1,2==cNa+1,Na=0, Na is increased by one. The algorithm is applied to all pairs (i,i) and (j,j). The algorithm terminates, when no further reduction of the penalty P can be obtained. □

Ideally, the basic tool 1 terminates with penalty zero. Then, a BIBD was obtained. If the basic tool 1 did not terminate with penalty zero, it is finalized it by another run with penalty Equation(B5). The resulting design X can be taken as an approximation of a BIBD. If one wants to come near to C- or D- optimality of the design, one can use appropriate penalty terms.

However, the aim is to generate a BIBD. When the basic tool 1 was not successful, we have to enlarge the dimensions of the task by setting (b,r,λ):=2(b,r,λ). With our example, the concurrence matrix with the lowest penalty was (B7) C(X)=(500000000000011050000000000000005000000000110000500000000000000050000000011000005000000000000000500000000000000050000011000000005000000000000000500110000000000050000000000000005011001000000100500101010010101050100010010001005)(B7)

We compose a new design X by taking the design X and hang on another design X2 of length b, i.e. the row vectors of X are prolonged by the row vectors of X2. We abbreviate this merging with X=[X,X2]. The resulting concurrence matrix is C(X)=C(X)+C(X2), since the scalar products are additive.

Then the question is, how to choose X2. Taking X2=X has the advantage, that a zero in C(X) becomes also a zero in C(X), i.e. rows 1 to Na of X are admissible. The disadvantage is that a non-zero in C(X) becomes twice the non- zero in C(X). In example Equation(B7) we have c13,3(X)=1. To obtain a zero in C(X), i.e. c13,3(X)=0, we need a modification of X2 so that c13,3(X2)=1. Furthermore, we have c13,10(X)=1. To obtain c13,10(X)=0, we need c13,10(X2)=1. This modification is reached when X2 is taken as X, but with exchanged rows 3 and 10. Then, C(X2) results from C(X) by exchanging rows 3 and 10 and as well columns 3 and 10. The concurrence matrix of X is then (B8) C(X)=(100000000000002201000000000000000010000000000000000100000000000000001000000000220000010000000000000000100000000000000001000000220000000010000000000000000100000000000000001000000000000000010022000000000000100020002002000201002000200200020010)(B8)

Incidentally, by the exchange of rows 3 and 10 in X2, two more zeros in C(X) were generated: c14,3(X)=0 and c14,10(X)=0. This resulted from c14,3(X)=c14,10(X).

Visiting the concurrence matrix Equation(B8), one can see, that additionally exchanging rows 1 and 5 of X2 will lead to c14,1(X)=0, c15,1(X)=0, c14,5(X)=0, and c15,5(X)=0. When we still exchange rows 8 and 12 of X2, we obtained a BIBD X by defining X=[X,X2].

Basic tool 2

With the doubling of the design, as much as possible new admissible rows are generated.

Unfortunately, basic tool 1 does not often terminate with a comfortable situation given with the example Equation(B7). Such uncomfortable situation is given, when j=1NacNa+1,j0. Then, with the basic tool 1 but with an appropriate penalty term, row Na+1 of X must be brought into a shape, that pairs (j,j) ensure cNa+1,j(X)=cNa+1,j(X). The following two theorems are useful in this context.

Theorem B1

: Let X be a design. Then, the sum of all non- diagonal elements of each row and each column of C(X) is zero.

Proof:

Without loss of generality, we consider row one (for row i{2,,v}, rows i and one are interchanged). The assertion is then j=2vc1,j=0. By construction we have j=2vc1,j=j=2v(l=1bx1,lxj,lλ)=j=2vl=1bx1,lxj,l(v1)λ= =l=1bj=2vx1,lxj,l(v1)λ=l=1bx1,lj=2vxj,l(v1)λ= =l:x1,l=0x1,lj=2vxj,l+l:x1,l=1x1,lj=2vxj,l(v1)λ

The sums j=1vxj,l of the column elements equal k. Therefore, the sum j=2vxj,l is k, if x1,l=0 and k1, if x1,l=1. There are r columns with x1,l=1. Thus, j=2vc1,j=r(k1)(v1)λ holds. Since r(k1)=(v1)λ is a necessary condition for the existence of a BIBD and we apply only admissible parameters, the assertion is proofed. Due to symmetry of C, the result applies also to the columns.□

Theorem B2

: If solely the two last rows of X are not admissible (i.e. Na=v2) and if cv1,j{1, 0, 1} holds for j=1,,Na, then one can immediately define a design X2 so that X=[X,X2] is a BIBD.

Proof:

Following theorem B1 applied to columns j=1,,v2, cv1,j=cv,j holds. Thus, the last two rows of the concurrence matrix C have the following form: (B9) RowColumn12v2v1vv1cv1,1cv1,2cv1,v2rλcv1,vvcv1,1cv1,2cv1,v2cv,v1rλ(B9)

From theorem B1 we have j=1v2cv1,j+cv1,v=0 for row v1 and j=1v2(cv1,j)+cv,v1=0 for row v. Adding both equalities and regarding symmetry of C we get 2cv1,v=0, i.e. cv1,v=0.

Assume furthermore, that cv1,j{1, 0, 1} holds. Then, 0=j=1v2cv1,j=j:cv1,j=1cv1,j+j:cv1,j=1cv1,j=#(j: cv1,j=1)+#(j: cv1,j=1). This means, that in rows v1 and v, there is the same number, say n, of “-1” and “1”. Therefore, we may create n pairs (j, j) of columns, so that cv1,j=1 and cv1,j=1. The design X2 is generated from design X by exchanging all pairs (j, j) of rows. □

Conclusion B1

If Na=v1 rows of the design X are admissible, the v-th row is also admissible.

When the basic tool 2 ends with Na=v admissible rows, a BIBD was generated. If not, the basic tool 1 is started again with b:=2b, r:=2r, λ:=2λ, and the actual Na.

Advanced numerical methods

The basic numerical program often does not generate the smallest possible BIBDs. To obtain smaller BIBDs advanced methods are necessary. The improvements of the basic tools were caused by the exchange of four entries of a design X, which were the points of intersection of two rows and two columns. Advanced methods have to offer more than four changes. A straightforward approach would be to try, e.g. two rows and four columns. We experienced that the gain of such approaches was not substantial enough to compensate the increase of running time. Therefore, other approaches were used.

A view on the concurrence matrix Equation(B7) shows, that the non- diagonal elements of columns 2, 4, 6, 7, 9, and 11 are zero. We call them admissible columns, although they are caused by those rows of design X which satisfy the λ- condition with respect to all other rows. We reorder the rows of X, so that rows 2, 4, 6, 7, 9, and 11 are the first six rows of X afterwards.

Definition

Nc, 0NcNav, is called the number of admissible columns of the concurrence matrix C(X), if cNa+1,i=cNa+2,i==cv,i=0 holds for 1iNc.

Generally, the number of admissible columns Nc can be increased by another run of the basic tool 1 with the penalty (B10) P(C(X˜))=i=Na+1vj=1i1[106 if j<Nc+1103 if j=Nc+1 1 if j>Nc+1]ci,j2.(B10)

In case there was no admissible column, Nc=0 has to be taken.

We may now modify the basic tool 1 to improve the design X under the constraints that the first Nc columns of C(X) remain admissible. Analogously to the basic tool 1 we determine the sets M1 and M2 for rows i and i of X. The number of elements of M=M1M2 is NM=2(rci,iλ). We reorder the columns of design X, so that the columns with xi,j=1 and xi,j=0, i.e. jM1, are then the columns 1, 3, 5, etc… Analogously, the columns with xi,j=0 and xi,j=1, i.e. jM2, are then the columns 2, 4, 6, etc… Rows i and i of design X (which is now the reordered design) have the following principal shape then: (B11) RowColumn12NM1NMNM+1bi101001i010101(B11)

We now want to create a new design X˜ with few changes compared to design X which is potentially better. The first Nc columns of C(X˜) should remain admissible, i.e. ci,j=0 and ci,j=0 must hold for j=1, ,Nc. (Note however, that the latter equations ci,j=0 automatically hold due to theorem B1.) Furthermore, we demand j=1bxi,j=r. Hence, we have Nc+1 equations to hold. To exploit these equations, we have to introduce variables y1, y2,…, yNc+1. For example, this can be done by putting xi,j=yj and xi,j=1yj with j=1,,Nc+1. Generally, the set of columns, assigned to the variables, is My={j1,j2,,jNc+1}, where My is a set of size Nc+1 with MyM. (B12) RowColumn12Nc+1Nc+2Nc+3NMiy1y2yNc+1010i1y11y21yNc+1101(B12)

We have Nc+1 equations and Nc+1 variables. (We assume here, that the system of linear equations is of full rank. This is not always true. Then, the number of variables has to be reduced in an efficient way.)

Now, the system of equations can be solved. However, the solutions are yj=xi,j, i.e. we obtain the former entries of X. The reason is, that the design X satisfied the equations and that we did not really change the design X. Hence, we have to change some entries xi,j and xi,j with jM\My. The most elementary case is to change just one column and to put x˜i,j=1xi,j and x˜i,j=1xi,j. Such change was successful when the solutions yj are zeros or unities and when the penalty P(C(X˜)) is smaller than P(C(X)). For the exchange index j, all elements of M\My were applied.

A penalty to increase the number of admissible rows or admissible columns is (B13) P(C(X˜))=i=Na+1vj=Nc+1i1[1000 if i=Na+1 1 if iNa+1 ]×[1000 if j=Nc+1 1 if j>Nc+1 ]×ci,j2(B13)

From running time reasons, it is not realistic to apply all subsets MyM. We applied a grid by first taking My={1, 2,,Nc+1}, then My={1+Δ, 2+Δ,,Nc+1+Δ}, then My={1+2Δ, 2+2Δ,,Nc+1+2Δ}, etc., where Δ is a natural number with Nc+1Δ<NM.

The next modification is to suggest not only one changed column, but to suggest L>1 changed columns. This is done by generating vectors of zeros and unities of length L. With L=20, the generation of these vectors takes seconds. For L=40, the generation of these vectors takes more than a week on a normal computer. The potential of this modification is large, but may demand enormous running time.

The chance to find an admissible new row depends on the goodness of row Na+1. Ideally, there is only one non- zero entry cNa+1,j with jNa. In that case, one can try to improve row j, although row j is admissible. If there are more non- zero entries cNa+1,j, jJ, one can try to find sequentially new admissible rows jJ. Thus, an admissible row needs not to be fixed for all times.

It may happen, that the introduced methods fail to increase Na. For (v,k)=(45,22) with (b,r,λ)=(90,44,21), this appeared for Na=25 and Nc=24. The number of equations was Nc+1=25. However, the maximum number for L is limited by Nc+1+L NM. In that case, the average number of NM was 2(rλ)=46. Often, we could not generate vectors longer than L=21. In such cases it is not appropriate to handle pairs of rows, but solely row Na+1. In the described case we can add the equation c26,25=0. Then we may generate vectors longer than L=21. For each generated vector it must be checked, if the solution of the equations consists of zeros and unities.

If an admissible new row was found we have Na:=Na+1. Since we treated only one row, the resulting incidence matrix is no longer an admissible design (some k conditions are violated). Therefore, rows Na+1 to v must be renewed. This is done by a modified version of the initial generation of a design X. Then, the procedure is continued with a new run of the basic tool 1.

The optimization procedures of the software package “Mathematica” (Wolfram Citation1999) are known to be efficient (see Zeidler Citation2004). The optimization procedures NMaximize and NMinimize allow a variety of constraints, among them the restriction of the variables to be integers. Objective functions of different kind may be applied. Particularly, linear objective functions were treated effectively. In the given context, the objective functions are of quadratic or of forth degree. Because of xi,j{0,1}, we have xi,j=xi,j2=xi,j3=xi,j4 and the objective functions can be transformed into quasilinear ones. We checked whether the procedure NMinimize runs faster than our advanced methods. But this was not the case. This was also the case with the recently implemented Mathematica- method “Couenne”. Then the question arose, how to apply NMinimize with a strictly linear objective function.

We start with an incidence matrix obtained with the basic tool 1 and the advanced methods of moderate effort (L=20). Experience shows, that the first inadmissible row Na+1 of the incidence matrix has then entries cNa+1,j{1,0,1} for jNa in the concurrence matrix, Instead of minimizing j=1NacNa+1,j2, we demand cNa+1,j=0 for jNa now. We introduce variables yi, i=1,,Na+1+L, in row Na+1 like described above. Then we use the procedure Solve to solve Na+1 linear equations (cNa+1,j=0, j=1,,Na, and j=1bxNa+1,j=r) with respect to yL+1,,yL+Na+1. Denoting the solutions (depending on y1,,yL) with z1,,zNa+1, the side conditions are formulated as 0yj1 and yjIntegers, j=1,,L and 0zj1, j=1,,Na+1. (It is useful to program the inequalities in matrix form.)

In a strong sense, this is not an optimization problem, since it would be sufficient if the problem has a solution. Hence, to apply NMinimize, an auxiliary linear objective function was used: y1. This means that we are looking for the minimum of a free variable (although we are not really interested whether it is zero or one). Applied in this way, NMinimize works much more effective than the earlier described advanced methods.

The application of NMinimize to solve a system of conditions was also much more effective than the direct use of the Mathematica- procedures NSolve or FindInstance.

As expected, the effectiveness of the NMinimize- method turned out to be best when there was only one non-zero entry cNa+1,j for jNa and became worse with an increasing number of non-zero entries. Therefore, a modification is applied for Nu>1 non-zero entries. We reorder the rows of the incidence matrix, so that cNa+1,j0 for j{NaNu+1, NaNu+2,,Na}. Then, rows Na and Na+1 are exchanged. Thus, cNa+1,Na is the only non-zero entry in row Na+1 (while there are Nu1 non-zero entries in row Na). Now we apply the described method (to row Na+1). Having determined a solution rows Na and Na+1 are exchanged again. Thus, the number of non-zero entries in row Na+1 was reduced to Nu1. This technique is repeated until Nu=1 is reached. Then, the original technique can be used.

When this procedure succeeded, we generated a new admissible row. Checking the concurrence matrix of the obtained incidence matrix, we generally observe a deterioration of the fit of the remaining inadmissible rows. Furthermore, some k- conditions are violated (changes in a row were not compensated by changes in another row). Hence, we cannot start immediately with the treatment of the next inadmissible row. First we have to modify the inadmissible rows of the incidence matrix so that the r- and k- conditions are satisfied. This causes another deterioration of the fit of the remaining inadmissible rows.

Since the application of the NMinimize- technique is particularly effective when there are many zero entries in row Na+1 of the concurrence matrix and only a few 1 and 1 entries, the improvement of the actual incidence matrix is necessary. This can be realized by the application of the basic tool 1 and the earlier introduced advanced methods. (These methods are suited to improve the whole concurrence matrix as well as parts of it, while the NMinimize- technique is suited to improve certain cells of the concurrence matrix.)