![MathJax Logo](/templates/jsp/_style2/_tandf/pb2/images/math-jax.gif)
Abstract
A balanced incomplete block design (BIBD) is an efficient tool to infer parameters of a linear model. With a BIBD treatments are compared in
blocks of size
A variety of algebraic methods were developed to determine BIBDs depending on the parameter pair (v,k). Despite the progress in the field, there are many pairs (v,k) without an algebraic solution. For such pairs it is solely known, that a trivial BIBD exists, which is commonly large. In this study we present new results on the existence and generation of BIBDs which are smaller than the trivial ones. Such so-called non- trivial BIBDs exist if a trivial BIBDs is not elementary. With a recursive method of BIBD- generation it is shown that for
and
there exist non- trivial BIBDs. The same is shown for
and
Numerical programs were written for the generation of new non- trivial BIBDs, which delivered smaller BIBDs than the recursive method for
Commonly used R-programs like “crossdes” and “ibd” failed to determine such BIBDs.
1. Introduction
Balanced incomplete block designs (BIBDs) are well-tried tools in experimental practise. A BIBD can be defined as a incidence matrix
where
is the number of treatments and
the number of blocks, see e.g. Colbourn and Dinitz (Citation2006). For
and
an incidence matrix of a BIBD is
The conditions for to represent a BIBD are: In each row
there are
unities, i.e.
in each column
there are
unities, i.e.
and for each pair
and i′ of rows, there are
unities in common, i.e.
In the following, these conditions will be called
-,
-, and
- conditions. With the example we have
and
A compressed form of the incidence matrix is
where column j consists of the treatments occurring in block j. For example, the first block consists of treatments (1,2,4).
A BIBD is characterized by the quintuple The well-known necessary conditions for the existence of a BIBD are:
(1)
(1)
(2)
(2)
(3)
(3)
Parameters and
fulfilling the necessary conditions for given
and
are called admissible.
It is known, that for each pair the so- called trivial BIBD exists. A BIBD is called trivial if it’s incidence matrix is identical to the set of all
possible v-tuples (column vectors) with
unities and
zeros. The appropriate parameters of a trivial BIBD are
(4)
(4)
A BIBD for a pair (v, k) is called elementary if it cannot be split into at least two BIBDs for this (v, k). For many pairs (v, k) it is known, that the trivial BIBD is not elementary and can be reduced, i.e. there exist smaller BIBDs. Therefore, trivial BIBDs are also known as unreduced BIBDs.
For the example above we have The blocks of the trivial BIBD are
(1,2,3), (1,2,4), (1,2,5), (1,2,6), (1,2,7), (1,3,4), (1,3,5), (1,3,6), (1,3,7), (1,4,5), (1,4,6), (1,4,7), (1,5,6), (1,5,7), (1,6,7), (2,3,4), (2,3,5), (2,3,6), (2,3,7), (2,4,5), (2,4,6), (2,4,7), (2,5,6), (2,5,7), (2,6,7), (3,4,5), (3,4,6), (3,4,7), (3,5,6), (3,5,7), (3,6,7), (4,5,6), (4,5,7), (4,6,7), and (5,6,7).
This trivial BIBD can be partitioned into three elementary BIBDs, see Rasch and Herrendörfer (Citation1985). The first one (printed with bold digits) was given above. The second one (printed with italic digits) has also parameters
The set of the residual 21 of the 35 blocks is an elementary BIBD with
The first author of Rasch, Teuscher, and Verdooren (Citation2016) observed that there is a trivial BIBD, which cannot be reduced: for (v, k) = (8, 3) there is no BIBD which is smaller than the trivial one. He formulated the conjecture: For the case (v, k) = (8, 3) is the only one where the trivial BIBD is elementary.
This conjecture is relevant in the given context. If it was wrong, there were other pairs (v, k) for which the BIBD were elementary and attempts to generate a non- trivial BIBD were useless. If the conjecture was true, such attempts were reasonable.
While a proof of the conjecture is certainly difficult, a disproof would be complete, if a pair (v, k) was found, for which the smallest admissible parameters are those of the trivial BIBD. With the sections two and three of this study, this possibility is refuted. In section two, results on the smallest admissible parameters are summarized. In section three it is shown that for
and (v,k)
(8, 3) there are admissible parameters which are smaller than those of the trivial design.
Note, that the restriction has a technical background. If a BIBD for
exists, one can generate the complementary BIBD by exchanging zeros and unities in the incidence matrix. This complementary BIBD has parameters
and
Then,
i.e. complementary BIBDs ensure the existence of BIBDs for
In this study, we concentrate without loss of generality on designs for
Therefore, the BIBDs for (v, k) = (8, 3) and (v, k) = (8, 5) are elementary.
It is a demanding algebraic task to construct BIBDs for given and
see e.g. Colbourn and Dinitz (Citation2006). (When we write about the construction of BIBDs we mean the construction of non- trivial BIBDs.) The priority aim is to find the smallest BIBDs. Several authors contributed results on the existence, construction, and nonexistence of BIBDs with specific parameters. For a list of literature, we refer to Rasch, Teuscher, and Verdooren (Citation2016). Rasch et al. (Citation2011) collected 21 available methods to construct BIBDs.
There are a lot of pairs for which no construction method is known. For
these pairs can be found in of Rasch, Teuscher, and Verdooren (Citation2016). The smallest
with an unknown non- trivial BIBD is
For
the smallest BIBDs are known, see Rasch et al. (Citation2011). These BIBDs can be generated with the software package CADEMO, see Rasch et al. (Citation1987) and Rasch, Guiard, and Nürnberg (Citation1990). Rasch and Verdooren (Citation2023) made these BIBDs publicly available.
When for a pair no specific construction methods are known, one is legitimated to try numerical search procedures. Numerical local search programs were developed. Most popular are the R- procedures “crossdes” (https://CRAN.R-project.org/package=crossdes) by Sailer (Citation2005) and “ibd” (https://CRAN.R-project.org/package=ibd), basing on the work of Mandal, Gupta, and Parsad (Citation2014a, Citation2014b) and Mandal (Citation2015). Prestwich (Citation2003) and Rueda, Cotta, and Fernandez (Citation2009) have investigated other general numeric algorithms. The data of Prestwich (Citation2003), ranging up to
were taken by the later authors as reference data to evaluate the efficiency of the new methods. As their authors reported, the programs and algorithms do not always produce valid BIBDs, insofar that the generated designs do not always hold the conditions on
and
Particularly, several BIBDs for
cannot be found, although they exist. Furthermore, the convergence of “crossdes” and “ibd” is problematic for
i.e. for
Due to these restrictions, “crossdes” and “ibd” are not useful to generate new BIBDs. For unknown BIBDs with
is valid and the programs do not converge. Therefore, for the generation of new BIBDs new concepts are needed.
In Appendix A, the recursive generation of BIBDs is introduced. It is shown, how to construct a BIBD for the pair from the BIBDs for the pairs
and
With the recursive method, relevant propositions about the existence of non- trivial BIBDs are found.
In Appendix B, a general search procedure is presented, which ensures the generation of valid BIBDs. The usefulness of the methods is summarized in section four.
2. The smallest admissible parameters
In this section, we present essential facts on the smallest admissible parameters for parameters In the literature, we did not find these results, although they are partly known, see the programs of Mandal (Citation2015).
Theorem 1.
With where
is an admissible triple for
the smallest admissible triple is
(5)
(5)
where
is the smallest integer with
(GCD = Greatest Common Divisor; the division and multiplication have to be carried out element-wise.)
Proof
: From Equation(2)(2)
(2) and Equation(3)
(3)
(3) it follows
(6)
(6)
i.e. the ratios between the parameters b, r,
depend on v and k only. Let there be another set of admissible parameters
and
Then,
and
or
i.e.
(7)
(7)
holds for some rational number with natural numbers m and n. Hence,
The smallest triple of natural numbers () is obtained for
and
In case
the parameters do not satisfy condition Equation(1)
(1)
(1) . Therefore, they have to be multiplied with
the smallest integer with
□
EquationEquation (5)(5)
(5) shows how to obtain the smallest admissible parameters from admissible ones. The parameters of the trivial BIBD, mentioned in the introduction, are admissible. Hence, they could be used as a starting point. It is however well known that there are smaller admissible parameters:
Eliminating of Equationequation (2)
(2)
(2) gives
Eliminating
of Equationequation (3)
(3)
(3) gives
and substitution of
results in
The parameters b, r, and
are admissible if b, r, and
are integers for arbitrary
and
and if
holds. This clearly is guaranteed with
Therefore,
(8)
(8)
are admissible parameters. The smallest admissible parameters are determined by Equationequation (5)
(5)
(5) .
Often, no BIBD exists for the smallest admissible parameters e.g. for
and
One could think that the next admissible parameters would be
with
However, thinking so could hide the smallest BIBD. For example, for
we have via Equation(8)
(8)
(8) admissible parameters
and
Hence, from theorem 1, we obtain
and because of
and
It is known that for these parameters no BIBD exists. The former idea would lead to the next potentially admissible parameters
However, with
we obtain
Theory shows, that for these parameters, a BIBD exists.
Theorem 2:
With and
the ordered set of admissible parameters
and
is given by
with
were
is the smallest integer with
3. For which v and k are the smallest admissible parameters identical with those of the trivial BIBD?
Theorem 3:
For all with exception of
an admissible triple
exists which is smaller than that of the trivial BIBD
Proof:
We investigate where
belongs via Equation(8)
(8)
(8) to an admissible parameter triple. If
holds, the admissible parameters are smaller than the trivial ones. If
holds, we check via theorem 1, whether there are admissible parameters smaller than
and
If there are none, the trivial BIBD is elementary.
The relation is equivalent to
(9)
(9)
In the numerator, we have v2 factors. In the denominator, we have v
1 factors larger than one. Hence, with
and
we are looking for all solutions of
(10)
(10)
For we get
i.e.
and following Equation(8)
(8)
(8) we receive
i.e.
and
For we get
which has no integer solution.
For we consider
which is a monotone function in
For
it is
For
it is one. For
it is
Thus
is the only solution with
and
i.e.
and
For and
we obtain
For
and
we obtain
Since
this expression is larger than one. Due to monotonicity with respect to
there is no solution of equ. Equation(9)
(9)
(9) for
With the same technique it can be shown that for the designs with (6, 3) and (7, 3), there are solutions with
However, it is known that existing BIBDs are smaller than the trivial designs. For
and
(
the BIBD with
and
exists and for
and
(
the BIBD with
and
exists. □
3.1. Conclusion
The equation
(11)
(11)
can hold only for and
4. The generation of new BIBDs
For the smallest BIBDs are known and reported by Rasch et al. (Citation2011). For
there are many (v, k) pairs for which it is not known if non-trivial BIBDs exist, see, e.g. of Rasch, Teuscher, and Verdooren (Citation2016). Numerical programs like “crossdes” or “ibd” do not allow the generation of missing BIBDs for two reasons. The first one is that the algorithms often do not terminate for
The second reason is that they often do not generate valid BIBDs. To overcome the problems, two new approaches were used.
Table 1. Generated BIBDs. The parameter pairs comprise those pairs with
and
for which non- trivial BIBDs were not known. The lengths of the trivial BIBDs are denoted with
The smallest length satisfying the necessary conditions Equation(1)
(1)
(1) , Equation(2)
(2)
(2) , and Equation(3)
(3)
(3) is denoted with
The lengths of the BIBDs generated with two methods are denoted with
The factor
in parentheses says that
is the
- fold of
The first one is a recursive method, which generates a BIBD for from the BIBDs for
and
The method is described in Appendix A. When a preceding BIBD is not known, it has to be generated from its predecessors. This may go back to the BIBDs for
where
is a prime power, i.e. a prime number or a power of a prime number. For such a
the smallest BIBDs can be generated (at least theoretically; for large
there might be practical numerical problems).
With the recursive method, BIBDs for parameter pairs with
and
were generated. From 243994 generated BIBDs, only 282 were trivial. This appeared particularly, when the distance between
and
the largest prime power smaller than
was large. The generation of a BIBD for
i.e.
is the successor of a prime power, never led to a trivial BIBD. The same is true up to
i.e. trivial BIBD appeared for
The results are summarized by the following theorem.
Theorem 4:
For there exists a non- trivial BIBD for each pair
For all pairs
with
and
non- trivial BIBDs exist.
For the sizes of the BIBDs generated with the recursive method are presented in . The table documents those pairs
for which the existence of non-trivial BIBDs was not known.
shows that the BIBDs generated with the recursive method are considerable smaller than the trivial BIBDs, but mostly much larger than the smallest admissible designs.
In Appendix B, basic and advanced numerical methods are introduced. They were intended to reach three goals. The first goal was the generation of approximate BIBDs for parameters
and the smallest admissible
The generation is guaranteed not only for parameters with
but for parameters with
The second goal was to generate valid BIBDs with elementary methods. The third goal was the improvement of BIBDs with advanced methods, i.e. the generation of BIBDs smaller than those generated with the basic methods.
With the basic numerical methods, BIBDs were generated for the pairs presented in . With the exception for
these BIBDs were smaller than those generated with the recursive method, partly drastically.
The application of the advanced numerical methods to the pairs of is not completed yet. So far, for twenty two pairs it succeeded to generate BIBDs with
and
i.e. the smallest possible ones (marked by an asterisk in the last column of the table). We are optimistic to find BIBDs with a maximum factor
for the remaining pairs
of . For example, the intermediate result for the pair
is
while it was
with the basic methods. With an enlarged effort, a further improvement is expected. Hence, the potential of the advanced methods is large but the needed running time is large as well.
5. Discussion
The new methods allowed a considerable progress in generating BIBDs. With the recursive method, non-trivial BIBDs were generated for Since the generated BIBDs were often large, the recursive method is more of theoretical than of practical importance.
The basic numerical methods were applied to produce BIBDs for whose existence was unknown so far. For several pairs
the advanced numerical methods were applied. The generated BIBDs were smaller than those generated with the recursive method. For twenty two pairs
a fully satisfying result was obtained: the generation of a BIBD for the smallest admissible parameters
For the other pairs, further improvements demand longwinded calculations.
As a starting point of our procedures, approximate BIBDs were generated for given and the smallest admissible number of blocks b′ “Approximate” means that the deviations of the actual
from the nominal
are minimized, while the
- and
- conditions are satisfied. As a rule, the obtained differences between
and
are −1, 0, or 1. Sometimes, the generated design is a valid BIBD, i.e. all obtained
are identical with
With the basic tool 1, approximate BIBDs can be calculated also for inadmissible numbers of blocks An example is given, that the results can compete with those of the program “ibd”. For
and
(with the smallest admissible parameter
) we obtained the incidence matrix
and the concurrence matrix
The design is balanced with respect to and
The pairwise parameters
vary between one and two (with mean
). The A- and D- efficiency parameters were 0.991 and 0.996, respectively. The concurrence matrix obtained with “ibd” contained pairwise parameters
between zero and four. The parameters of A- and D-efficiency were 0.933 and 0.971, respectively. Hence, our result is clearly more balanced.
Ignoring the problem of large running times, there was a simple method to generate a BIBD for admissible parameters One could generate all incidence matrices of dimension
with
unities and
zeros in each row and check each matrix, whether the
- and
- conditions for a BIBD are fulfilled. (Without loss of generality, the first two rows and the first column could be fixed in advance:
The other entries were zero. The entries of the
-th row can be calculated by
) When an admissible matrix is found, it is the incidence matrix of a BIBD. Otherwise, applying theorem 2, the next larger admissible parameters could be attempted. However, this method is so time demanding, that it is not realizable even for small designs.
When there is no practicable strategy to solve the problem as a whole, sequential strategies are needed. With our methods, and with the programs “crossdes” and “ibd”, the strategy is the subsequent increasement of admissible rows of the incidence matrix. Often, the concrete problem aroused that at a certain stage, it was not possible to generate the next admissible row. Hopefully, theoretical results on this problem are obtainable.
One could call this problem the inverse BIBD problem: We consider row vectors of length with
unities and
zeros. How many such row vectors can be built, so that each pair of them has
unities in common (there are
columns with unities for both row vectors)?
Those parameters
and
are of interest which coincide with the smallest admissible parameters
and
for a BIBD with
If a BIBD exists for these parameters, the answer on the formulated question would be:
If theoretical results (accompanied by a construction method) on the inverse BIBD problem would, for example, deliver
or
admissible row vectors, a doubling of the design and the use of our methods would probably lead to a BIBD. With our methods it even happened that with a doubling of a design, seven new admissible rows were generated.
Of particular interest is the inverse problem for
and
Often, these parameters are the smallest admissible ones (ten cases in ). If for them BIBDs would exist, we had another case were the necessary conditions Equation(1)
(1)
(1) are sufficient. Furthermore, the conjecture of Rasch, Teuscher, and Verdooren (Citation2016) could be proofed via theorem 3.
Generally, the numerical search for BIBDs is a very expensive procedure. On the other hand, a suggested design can be checked for being a valid BIBD by some matrix operations. This discrepancy reminds of the “P versus NP” problem, see Clapham and Nicholson (Citation2014). Since the effort of optimization procedures with integer variables increases exponentially with the number of variables, the numerical generation of BIBDs is an excellent candidate for solving the “P versus NP” problem. A closer look of specialists is emphasized.
Generated BIBDs and the programs for the generation of approximate designs and BIBDs will be sent to interested readers.
Acknowledgement
The authors are indebted to the reviewers for valuable hints.
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
- Abel, R. J. R., I. Bluskov, and M. Greig. 2001. Balanced incomplete block designs with block size 8. Journal of Combinatorial Designs 9 (4):233–68. doi:10.1002/jcd.1010.
- Abel, R. J. R., I. Bluskov, and M. Greig. 2004. Balanced incomplete block designs with block size 9: Part III. Australasian Journal of Combinatorics 30:57–73.
- Abel, R. J. R., and M. Greig. 1998. Balanced incomplete block designs with block size 7. Designs, Codes and Cryptography 13 (1):5–30. doi:10.1023/A:1008204220755.
- Clapham, C., and J. Nicholson. 2014. The concise Oxford dictionary of mathematics. 5th ed. Oxford: Oxford University Press.
- Colbourn, C. J., and J. H. Dinitz. (eds.) 2006. Handbook of combinatorial designs. 2nd ed. Boca Raton: Chapman and Hall.
- Mandal, B. N. 2015. Linear integer programming approach to construction of balanced incomplete block designs. Communications in Statistics - Simulation and Computation 44 (6):1405–11. doi:10.1080/03610918.2013.821482.
- Mandal, B. N., V. K. Gupta, and R. Parsad. 2014a. Efficient incomplete block designs through linear integer programming. American Journal of Mathematical and Management Sciences 33 (2):110–24. doi:10.1080/01966324.2014.901198.
- Mandal, B. N., V. K. Gupta, and R. Parsad. 2014b. Balanced treatment incomplete block designs through integer programming. Communications in Statistics - Theory and Methods 46 (8):3728–37. doi:10.1080/03610926.2015.1071394.
- Prestwich, S. 2003. A local search algorithm for balanced incomplete block designs. In CP 2003. LNCS, ed. F. Rossi, vol. 2833, 53–64. Heidelberg: Springer.
- Rasch, D., V. Guiard, and G. Nürnberg. 1990. Present and planned future of the expert system CADEMO. In SOFTSTAT'89 Fortschritte der Statistik-Software 2, eds. F. Faulbaum, R. Haux, und K-H. Jöckel (Hrsg.), 332–9. Stuttgart: Gustav Fischer.
- Rasch, D., V. Guiard, G. Nürnberg, P. E. Rudolph, and F. Teuscher. 1987. The expert system CADEMO. Computer Aided Design of Experiments and Modelling. Statistical Software Newsletter 13:107–14.
- Rasch, D., and G. Herrendörfer. 1985. Experimental design: Sample size determination and block designs. Dordrecht: D. Reidel Publishing Company.
- Rasch, D., J. Pilz, L. R. Verdooren, and A. Gebhardt. 2011. Optimal experimental design with R. Boca Raton: Chapman and Hall.
- Rasch, D., F. Teuscher, and L. R. Verdooren. 2016. A conjecture about BIBDs. Communications in Statistics - Simulation and Computation 45 (5):1526–37. doi:10.1080/03610918.2014.955111.
- Rasch, D., and L. R. Verdooren. 2023. Angewandte Statistik mit R für Agrarwissenschaften. Berlin: Springer.
- Rueda, D. R., C. Cotta, and A. J. Fernandez. 2009. Finding balanced incomplete block designs with metaheuristics. In Proceedings of the 9th European Conference on evolutionary computation in combinatorial optimization. Berlin Heidelberg: Springer-Verlag. doi:10.1007/978-3-642-01009-5_14.
- Sailer, O. 2005. crossdes: A package for design and randomization in crossover studies. Rnews 5(2):24–7.
- Wolfram, S. 1999. The mathematica book. 4th ed. Cambridge: Wolfram Media/Cambridge University Press.
- Zeidler, E. 2004. Oxford users’ guide to mathematics. Oxford: Oxford University Press.
Appendix A.
The construction of a BIBD for (v, k) from the BIBDs with (v-1, k) and (v-1, k-1)
Let be the
incidence matrix, consisting of zeros and unities, of a BIBD with parameters
We order the columns of this matrix so that the first row starts with zeros and ends with unities:
where
and
are certain zero- one matrices. Since the number of unities in the first row is r and the number of zeros is therefore
has dimension
and
has dimension
We now assume, that and
are BIBDs. Since the sum of column elements in
is k, the sum of column elements in
is also k, while it is
in
Hence,
and
have parameters
and
respectively.
Then, the parameters of the BIBDs are related by the following conditions:
(A1)
(A1)
Example A1
: Consider the smallest BIBD for v = 8 and k = 4. Following Rasch et al. (Citation2011), it has parameters An ordered (with respect to the first row) incidence matrix is, e.g.
Ignoring the first row, the matrix X can be partitioned into matrices X1 and X2. The matrices
are BIBDs with parameters
and
respectively (the sum of row elements is
the sum of column elements is
and each pair of rows has
unities in common). Particularly,
is the complementary design to
(zeros and unities are interchanged).
Now, we change the viewpoint. With given BIBDs with
and
with
(not necessarily the smallest ones) we construct a BIBD
with
in the following manner:
where the design
appears
times and the design
appears
times. The parameters of
result according to
(A2)
(A2)
From the first row, we obtain the condition since there are
unities. Furthermore, the first row has unities in common with other rows only on the right-hand side, namely
i.e.
Hence there are to solve two equations:
and
Setting
this gives
(A3)
(A3)
From Equation(A1)(A1)
(A1) , Equation(2)
(2)
(2) ,
and
we get
(A4)
(A4)
Having canceled out common factors of the numerator and denominator of q, we obtain and
Example A2
: Consider BIBDs for From of Rasch, Teuscher, and Verdooren (Citation2016), the existence of non-trivial BIBD is unknown for
and
Available software like “crossdes” and “ibd” fail to generate BIBDs for these parameters. Since the smallest BIBDs for
are known (Table 6.1 of Rasch et al. (Citation2011)), the recursive method can be applied. Afterwards it can be checked, whether the resulting BIBDs are trivial or shorter.
For the construction of the BIBD for and
we use the BIBDs
and
i.e.
and
Following (A6) we obtain
and
shown within .
Table A1. The construction of BIBDs for and
and
The length of the smallest admissible design is
For the construction of the BIBD for and
we use the BIBDs
and
i.e.
and
Following (A6) we obtain
and
The constructed designs have parameters and
The first one is the eight-fold of the smallest admissible design (with length b’) and the second one is the two-fold of the smallest admissible design. Both are much shorter than the trivial BIBDs.
The described construction of a BIBD for given and
assumes the availability of BIBDs for
and
If this assumption is not fulfilled, the method is not directly applicable. Then, BIBDs for
are required, i.e. the method has to be used recursively. Fortunately, we have not to go backwards too much, since we may start with
a prime number or a power of a prime number. With such a
BIBDs can always be constructed, e.g. via method 6 of Rasch et al. (Citation2011). The lengths
of such BIBDs are at most
This length was used in the recursive procedure. Beside the BIBDs for a prime power
the BIBDs for
and 9 are taken as known, following the theory of Abel and Greig (Citation1998), Abel, Bluskov, and Greig (Citation2001), and Abel, Bluskov, and Greig (Citation2004).
Appendix B:
Basic and advanced numerical methods
Advanced methods are introduced, which allow the generation of shorter BIBDs than generated with the basic program.
The basic numerical method
We describe the numerical method to generate a BIBD by means of an example: We start with an incidence matrix
for the smallest admissible design, which has parameters
has to satisfies the
and
condition. Such a matrix is simply constructed:
(B1)
(B1)
As it can be seen, the sums of row elements are seven and the sums of column elements are five. The - condition was not in the focus at this stage.
Definition B1
: Let and
be admissible parameters for given
Then, a
incidence matrix
(consisting of zeros and unities) is called a design, when the sum of row elements is
for each row and when the sum of column elements is
for each column.
The appropriateness of design is measured by the concurrence matrix
(B2)
(B2)
If were a BIBD, we had
(B3)
(B3)
For the design Equation(B1)(B1)
(B1) , the concurrence matrix is
(B4)
(B4)
with elements
The diagonal elements are admissibly
The non- diagonal elements are all different from zero. Since
is a measure of the deviation from zero, the appropriateness of a design
can be quantified by the sum of squared deviations
(B5)
(B5)
which is the penalty function. The general aim of an algorithm is to suggest changes within and to check whether a change decreases the penalty function.
Note that due to the symmetry of only elements below the diagonal found entrance into
Hence, there is no need to change row one of
It is therefore called admissible.
Definition B2
: Row of design
is called to be admissible, if
holds, where
is the number of admissible rows
For the initial design Equation(B1)(B1)
(B1) , we have
The aim is to increase
The change of one entry, say of
(substitution of a “1” by a “0”, or vice versa) would lead to a violation of the
and
conditions. We allow only changes, which do not violate these conditions. Therefore, a second row,
and a second column,
have to be taken into account.
We are ready now to define the first basic algorithm. Often, it has to be applied several times. Then, the parameters will differ (with exception of and
). When the algorithm is applied the first time, we have
and
Basic tool 1
For a pair of rows of design
determine the set
of the column numbers
for which
and
hold. Then, determine the set
of the column numbers
for which
and
hold. (The number of elements of
and
is
) Then, with
put
and
for a pair
and
With such
determine
and calculate the penalty parameter
(B6)
(B6)
If is smaller than
set
In case
is increased by one. The algorithm is applied to all pairs
and
The algorithm terminates, when no further reduction of the penalty
can be obtained. □
Ideally, the basic tool 1 terminates with penalty zero. Then, a BIBD was obtained. If the basic tool 1 did not terminate with penalty zero, it is finalized it by another run with penalty Equation(B5)(B5)
(B5) . The resulting design
can be taken as an approximation of a BIBD. If one wants to come near to C- or D- optimality of the design, one can use appropriate penalty terms.
However, the aim is to generate a BIBD. When the basic tool 1 was not successful, we have to enlarge the dimensions of the task by setting With our example, the concurrence matrix with the lowest penalty was
(B7)
(B7)
We compose a new design by taking the design
and hang on another design
of length
i.e. the row vectors of
are prolonged by the row vectors of
We abbreviate this merging with
The resulting concurrence matrix is
since the scalar products are additive.
Then the question is, how to choose Taking
has the advantage, that a zero in
becomes also a zero in
i.e. rows 1 to
of
are admissible. The disadvantage is that a non-zero in
becomes twice the non- zero in
In example Equation(B7)
(B7)
(B7) we have
To obtain a zero in
i.e.
we need a modification of
so that
Furthermore, we have
To obtain
we need
This modification is reached when
is taken as
but with exchanged rows 3 and 10. Then,
results from
by exchanging rows 3 and 10 and as well columns 3 and 10. The concurrence matrix of
is then
(B8)
(B8)
Incidentally, by the exchange of rows 3 and 10 in two more zeros in
were generated:
and
This resulted from
Visiting the concurrence matrix Equation(B8)(B8)
(B8) , one can see, that additionally exchanging rows 1 and 5 of
will lead to
and
When we still exchange rows 8 and 12 of
we obtained a BIBD
by defining
Basic tool 2
With the doubling of the design, as much as possible new admissible rows are generated.
Unfortunately, basic tool 1 does not often terminate with a comfortable situation given with the example Equation(B7)(B7)
(B7) . Such uncomfortable situation is given, when
Then, with the basic tool 1 but with an appropriate penalty term, row
of
must be brought into a shape, that pairs
ensure
The following two theorems are useful in this context.
Theorem B1
: Let be a design. Then, the sum of all non- diagonal elements of each row and each column of
is zero.
Proof:
Without loss of generality, we consider row one (for row rows
and one are interchanged). The assertion is then
By construction we have
The sums of the column elements equal
Therefore, the sum
is
if
and
if
There are
columns with
Thus,
holds. Since
is a necessary condition for the existence of a BIBD and we apply only admissible parameters, the assertion is proofed. Due to symmetry of
the result applies also to the columns.□
Theorem B2
: If solely the two last rows of are not admissible (i.e.
) and if
holds for
then one can immediately define a design
so that
is a BIBD.
Proof:
Following theorem B1 applied to columns
holds. Thus, the last two rows of the concurrence matrix
have the following form:
(B9)
(B9)
From theorem B1 we have for row
and
for row
Adding both equalities and regarding symmetry of
we get
i.e.
Assume furthermore, that holds. Then,
This means, that in rows
and
there is the same number, say
of “-1” and “1”. Therefore, we may create
pairs
of columns, so that
and
The design
is generated from design
by exchanging all pairs
of rows. □
Conclusion B1
If rows of the design
are admissible, the
-th row is also admissible.
When the basic tool 2 ends with admissible rows, a BIBD was generated. If not, the basic tool 1 is started again with
and the actual
Advanced numerical methods
The basic numerical program often does not generate the smallest possible BIBDs. To obtain smaller BIBDs advanced methods are necessary. The improvements of the basic tools were caused by the exchange of four entries of a design which were the points of intersection of two rows and two columns. Advanced methods have to offer more than four changes. A straightforward approach would be to try, e.g. two rows and four columns. We experienced that the gain of such approaches was not substantial enough to compensate the increase of running time. Therefore, other approaches were used.
A view on the concurrence matrix Equation(B7)(B7)
(B7) shows, that the non- diagonal elements of columns 2, 4, 6, 7, 9, and 11 are zero. We call them admissible columns, although they are caused by those rows of design
which satisfy the
- condition with respect to all other rows. We reorder the rows of
so that rows 2, 4, 6, 7, 9, and 11 are the first six rows of
afterwards.
Definition
is called the number of admissible columns of the concurrence matrix
if
holds for
Generally, the number of admissible columns can be increased by another run of the basic tool 1 with the penalty
(B10)
(B10)
In case there was no admissible column, has to be taken.
We may now modify the basic tool 1 to improve the design under the constraints that the first
columns of
remain admissible. Analogously to the basic tool 1 we determine the sets
and
for rows
and
of
The number of elements of
is
We reorder the columns of design
so that the columns with
and
i.e.
are then the columns 1, 3, 5, etc… Analogously, the columns with
and
i.e.
are then the columns 2, 4, 6, etc… Rows
and
of design
(which is now the reordered design) have the following principal shape then:
(B11)
(B11)
We now want to create a new design with few changes compared to design
which is potentially better. The first
columns of
should remain admissible, i.e.
and
must hold for
(Note however, that the latter equations
automatically hold due to theorem B1.) Furthermore, we demand
Hence, we have
equations to hold. To exploit these equations, we have to introduce variables
…,
For example, this can be done by putting
and
with
Generally, the set of columns, assigned to the variables, is
where
is a set of size
with
(B12)
(B12)
We have equations and
variables. (We assume here, that the system of linear equations is of full rank. This is not always true. Then, the number of variables has to be reduced in an efficient way.)
Now, the system of equations can be solved. However, the solutions are i.e. we obtain the former entries of
The reason is, that the design
satisfied the equations and that we did not really change the design
Hence, we have to change some entries
and
with
The most elementary case is to change just one column and to put
and
Such change was successful when the solutions
are zeros or unities and when the penalty
is smaller than
For the exchange index
all elements of
were applied.
A penalty to increase the number of admissible rows or admissible columns is
(B13)
(B13)
From running time reasons, it is not realistic to apply all subsets We applied a grid by first taking
then
then
etc., where
is a natural number with
The next modification is to suggest not only one changed column, but to suggest changed columns. This is done by generating vectors of zeros and unities of length
With
the generation of these vectors takes seconds. For
the generation of these vectors takes more than a week on a normal computer. The potential of this modification is large, but may demand enormous running time.
The chance to find an admissible new row depends on the goodness of row Ideally, there is only one non- zero entry
with
In that case, one can try to improve row
although row
is admissible. If there are more non- zero entries
one can try to find sequentially new admissible rows
Thus, an admissible row needs not to be fixed for all times.
It may happen, that the introduced methods fail to increase For
with
this appeared for
and
The number of equations was
However, the maximum number for
is limited by
In that case, the average number of
was
Often, we could not generate vectors longer than
In such cases it is not appropriate to handle pairs of rows, but solely row
In the described case we can add the equation
Then we may generate vectors longer than
For each generated vector it must be checked, if the solution of the equations consists of zeros and unities.
If an admissible new row was found we have Since we treated only one row, the resulting incidence matrix is no longer an admissible design (some
conditions are violated). Therefore, rows
to
must be renewed. This is done by a modified version of the initial generation of a design
Then, the procedure is continued with a new run of the basic tool 1.
The optimization procedures of the software package “Mathematica” (Wolfram Citation1999) are known to be efficient (see Zeidler Citation2004). The optimization procedures NMaximize and NMinimize allow a variety of constraints, among them the restriction of the variables to be integers. Objective functions of different kind may be applied. Particularly, linear objective functions were treated effectively. In the given context, the objective functions are of quadratic or of forth degree. Because of we have
and the objective functions can be transformed into quasilinear ones. We checked whether the procedure NMinimize runs faster than our advanced methods. But this was not the case. This was also the case with the recently implemented Mathematica- method “Couenne”. Then the question arose, how to apply NMinimize with a strictly linear objective function.
We start with an incidence matrix obtained with the basic tool 1 and the advanced methods of moderate effort (). Experience shows, that the first inadmissible row
of the incidence matrix has then entries
for
in the concurrence matrix, Instead of minimizing
we demand
for
now. We introduce variables
in row
like described above. Then we use the procedure Solve to solve
linear equations (
and
) with respect to
Denoting the solutions (depending on
) with
the side conditions are formulated as
and
and
+1. (It is useful to program the inequalities in matrix form.)
In a strong sense, this is not an optimization problem, since it would be sufficient if the problem has a solution. Hence, to apply NMinimize, an auxiliary linear objective function was used: This means that we are looking for the minimum of a free variable (although we are not really interested whether it is zero or one). Applied in this way, NMinimize works much more effective than the earlier described advanced methods.
The application of NMinimize to solve a system of conditions was also much more effective than the direct use of the Mathematica- procedures NSolve or FindInstance.
As expected, the effectiveness of the NMinimize- method turned out to be best when there was only one non-zero entry for
and became worse with an increasing number of non-zero entries. Therefore, a modification is applied for
non-zero entries. We reorder the rows of the incidence matrix, so that
for
Then, rows
and
are exchanged. Thus,
is the only non-zero entry in row
(while there are
non-zero entries in row
). Now we apply the described method (to row
). Having determined a solution rows
and
are exchanged again. Thus, the number of non-zero entries in row
was reduced to
This technique is repeated until
is reached. Then, the original technique can be used.
When this procedure succeeded, we generated a new admissible row. Checking the concurrence matrix of the obtained incidence matrix, we generally observe a deterioration of the fit of the remaining inadmissible rows. Furthermore, some - conditions are violated (changes in a row were not compensated by changes in another row). Hence, we cannot start immediately with the treatment of the next inadmissible row. First we have to modify the inadmissible rows of the incidence matrix so that the
- and
- conditions are satisfied. This causes another deterioration of the fit of the remaining inadmissible rows.
Since the application of the NMinimize- technique is particularly effective when there are many zero entries in row of the concurrence matrix and only a few
and
entries, the improvement of the actual incidence matrix is necessary. This can be realized by the application of the basic tool 1 and the earlier introduced advanced methods. (These methods are suited to improve the whole concurrence matrix as well as parts of it, while the NMinimize- technique is suited to improve certain cells of the concurrence matrix.)