Full article: Bounded Hanoi

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

The classic Tower of Hanoi puzzle involves moving a set of disks on three pegs. The number of moves required for a given number of disks is easy to determine, but when the number of pegs is increased to four or more, this becomes more challenging. After 75 years, the answer for four pegs was resolved only recently, and this time complexity question remains open for five or more pegs. In this article, the space complexity, i.e., how many disks need to be accommodated on the pegs involved in the transfer, is considered for the first time. Suppose m disks are to be transferred from some peg L to another peg R using k intermediate work pegs of heights $j_{1}, \dots, j_{k}$ , then how large can m be? We denote this value by $H (j_{1}, \dots, j_{k})$ . If k = 1, as in the classic problem, the answer is easy: $H (j) = j + 1$ . We have the exact value for two work pegs, but so far only very partial results for three or more pegs. For example, $H (10!, 10!) = 26336386137601$ and $H (0!, 1!, 2!, \dots, 10!) = 16304749471397$ , but we still do not know the value for $H (1, 3, 3)$ .

MSC:

1 Introduction

It is often extremely hard to formally prove the optimality of an algorithm, even if it looks beyond doubt. A good example is the 4-peg Tower of Hanoi problem. The Tower of Hanoi (TOH) is a popular puzzle whose origin dates back to 1883, due to French mathematician Édouard Lucas [Citation6]. In its original version, there are three pegs on a board and m disks of different sizes are stacked on one peg in descending order of size. The puzzle requires us to move all the disks to another peg, while moving just one disk at a time, and never placing a larger disk on top of a smaller one. A simple recursive algorithm which runs in $2^{m} - 1$ steps is well known. Proving its optimality is not difficult, due to the obvious fact that when the largest disk moves from one peg to another, all the other m – 1 disks must be on the third peg.

A natural extension of TOH is to increase the number of pegs from three to four or more, whereupon the situation changes drastically. Its time complexity, certainly smaller than the above $2^{m} - 1$ steps, was first asked in the literature by Stewart [Citation8] in 1939, and two years later, Stewart and Frame independently proposed what they call “optimal” algorithms without proof. The two algorithms turned out to be equivalent [Citation9] and since then they have been known as the Frame–Stewart algorithm, and its optimality presented as a challenging open question. Indeed, as reported in [Citation7], Donald Knuth wrote in a letter to Martin Gardner in 1979 that he doubted “if anyone will ever resolve the conjecture; it is truly difficult.”

After some partial solutions (e.g., [Citation4]) and exhaustive checks up to some number of disks by computer [Citation3], the optimality was proved by Bousch [Citation1] in 2014 for four pegs. The solution for more pegs still remains open [Citation2]. Thus the time complexity of TOH has been a popular topic, but there has been no literature about its space complexity. Judging by the history of the time complexity, the space complexity could be equally nontrivial. In this article, we provide some preliminary results and give an exact solution for the case of four pegs. By convention, disks are initially located on the leftmost peg (denoted by L) and they have to move to the rightmost one (denoted by R). The other pegs between L and R we call the work pegs, and use k for the number of them. The height of a peg is the maximum number of disks that can be held on the peg. Pegs L and R have unbounded height. We regard the finite heights of the work pegs as the space complexity of the problem. For k = 1, i.e., for the standard 3-peg model, it is easily seen that a work peg of height m – 1 suffices to execute the optimal algorithm and it is also necessary for any algorithm. For k = 2, Frame–Stewart needs a height of approximately $m - \sqrt{2 m}$ for one work peg and $\sqrt{2 m}$ for the other, but this is not necessary in general.

This article shows we can indeed use substantially shorter pegs for k = 2, the heights of which can be described exactly by a simple formula. Let H(i, j) denote the maximum number of disks that can be moved from L to R using two work pegs of height $i \geq 0$ and $j \geq 0$ . Our main contribution is to prove $H (i, j) = 2 i j + i + j + 1.$

So for m disks, it is sufficient to take i and j equal to a rounded value of $\sqrt{m / 2}$ , much smaller than the height needed for the Frame–Stewart algorithm. The lower bound proof is relatively easy, just by giving a recursive algorithm. Its optimality (the upper bound part), however, needs a lot more work. As in the standard model with one work peg, a key step in the analysis is when the largest disk leaves L and goes to one of the work pegs or to R. The other work peg can hold several disks, so there are many possible subsequent moves.

For general k, we give a recursive formula for a seemingly close lower bound for $H (j_{1}, j_{2}, \dots, j_{k})$ . However, it is not likely that it can be presented as a closed-form formula, or that we can easily extend the upper bound proof for k = 2 to this general case. Surprisingly, if the successive work peg heights grow relatively slowly, satisfying $j_{1} = 1$ and $j_{i + 1} \leq {(j_{1} + \dots + j_{i} - 1)}^{2} + 1$ for $1 \leq i < k$ , then the tight bound can be given as a simple formula, namely we can prove $H (j_{1}, j_{2}, \dots, j_{k}) = {(j_{1} + \dots + j_{k})}^{2} + 1$ .

2 Preliminaries

All the problems we will consider involve transferring the set of disks $1, 2, \dots, m$ from an unbounded initial peg L, via a set of bounded work pegs W, to an unbounded final peg R. In this article, W consists of some number k of pegs of heights $j_{1}, \dots, j_{k}$ and we denote this storage structure as $W (j_{1}, \dots, j_{k})$ .

We define $H (j_{1}, \dots, j_{k})$ to be the maximum number of disks that can be moved from L to R using $W (j_{1}, \dots, j_{k})$ , e.g., $H (1) = 2$ .

A stack is any legal pile of disks, usually listed top-down, i.e., in order of increasing size. A disk move removes a disk from the top of the stack on one peg and places it on the top of the stack on another peg provided that it is smaller than the current top disk on that peg and the move does not produce a stack exceeding the height of that peg. The r-cone C_r is the stack of disks $(1, \dots, r)$ and, for $1 \leq r \leq s$ , the (r, s)-frustum $F_{s}^{r}$ is the stack (or set) $(r, r + 1, \dots, s)$ . We will find it convenient to extend these definitions such that C_r is the empty set when $r \leq 0$ , and $F_{s}^{r} = {t | t > 0 and r \leq t \leq s}$ is empty when r > s.

A configuration gives the contents of each peg. We denote a configuration C with the contents of L, R, and W being X, Y, and U, respectively, as $〈 X | U | Y 〉$ . When the contents of $W (j_{1}, j_{2})$ , say, are given in more detail as U₁, U₂, we write $〈 X | U_{1} : U_{2} | Y 〉$ . An empty set is denoted by “–”.

If a sequence σ of disk moves changes configuration C₁ to C₂, we write $C_{1} \overset{σ}{\Rightarrow} C_{2}$ or just $C_{1} \Rightarrow C_{2}$ when σ is unspecified. Since any sequence of disk moves is reversible, if $C_{1} \Rightarrow C_{2}$ then $C_{2} \Rightarrow C_{1}$ . So we may write $C_{1} \Leftrightarrow C_{2}$ when $C_{1} \Rightarrow C_{2}$ and we want to emphasise that this is an equivalence relation.

We make precise an obvious “monotonicity” property: if some sequence of disk moves is feasible, then so is the subsequence obtained by eliminating some of the disks.

Lemma 1

(Monotonicity Lemma). Let S be the subset of disks that remains after some eliminations. Let C₁ and C₂ be two configurations and $C'_{1} = C_{1} \cap S$ and $C'_{2} = C_{2} \cap S$ be the configurations corresponding to C₁ and C₂ but restricted to the disks in S. If $C_{1} \Rightarrow C_{2}$ then $C'_{1} \Rightarrow C'_{2}$ .

Proof.

For any sequence of moves σ, let $σ'$ be the sequence formed as follows: for any move $γ \in σ$ of disk d, if $d \in S$ then let $γ' = γ$ , and if $d \notin S$ then $γ'$ is the null move. Clearly, if $C_{1} \overset{σ}{\Rightarrow} C_{2}$ then $C'_{1} \overset{σ'}{\Rightarrow} C'_{2}$ . □

3 Single work peg

When there is just one work peg of bounded height, determining H(j) is fairly easy. The proof methods we introduce here will be used in later sections.

Theorem 2.

For all $j \geq 0, H (j) = j + 1$ .

Proof.

We first prove that $H (j) \geq j + 1$ . We show by induction on j and m that, for $m \leq j + 1$ , C_m can be transferred from L to R using a single work peg of height j. This holds trivially for m = 0, and also for j = 0 since one disk can be transferred directly from L to R. Suppose now that j > 0, $H (j - 1) \geq j$ , and $C_{m - 1}$ can be transferred from L to R. We want to show that if $m \leq j + 1$ then $〈 C_{m} | - | - 〉 \Rightarrow 〈 - | - | C_{m} 〉$ . The sequence we use below is not optimal in the number of moves, but is a better introduction to the more complicated transfers required later.

$〈 C_{m} | - | - 〉 \Rightarrow 〈 m | - | C_{m - 1} 〉$
Using the inductive hypothesis, move $C_{m - 1}$ from L to R leaving disk m on L.
$〈 m | - | C_{m - 1} 〉 \Rightarrow 〈 - | m | C_{m - 1} 〉$
Move disk m to W.
$〈 - | m | C_{m - 1} 〉 \Rightarrow 〈 C_{m - 1} | m | - 〉$
With space j – 1 remaining on W, by induction $C_{m - 1}$ can be moved back to L.
$〈 C_{m - 1} | m | - 〉 \Rightarrow 〈 - | - | C_{m} 〉$
Moving disk m to R, then $C_{m - 1}$ from L to R, finishes the transfer.

For the upper bound, we want to prove that if $〈 C_{m} | - | - 〉 \Rightarrow 〈 - | - | C_{m} 〉$ when W is a single peg of height j then $m \leq j + 1$ . Again we proceed by induction on j. The result holds for j = 0, since C₂ cannot be transferred from L to R without W. We assume as inductive hypothesis that $H (j - 1) \leq j$ , and consider a supposed transfer $〈 C_{m} | - | - 〉 \Rightarrow 〈 - | - | C_{m} 〉$ , where W is a peg of height j.

If disk m moves directly from L to R, then the other m – 1 disks must all be on W, so $m \leq j + 1$ as required. Supposing that m never moves directly between L and R, consider the configuration A immediately after disk m moves from L to W for the last time, so $A = 〈 - | m | C_{m - 1} 〉$ . Let B be the configuration immediately before the next time that m moves from W to R. So $B = 〈 C_{m - 1} | m | - 〉$ and m remains in W all the time between A and B. During this period $C_{m - 1}$ is transferred from R to L using only the remaining j – 1 positions on the work peg. This implies that $m - 1 \leq H (j - 1)$ , and $H (j - 1) \leq j$ by induction. Hence $m \leq j + 1$ , completing the proof. ■

4 Unit work pegs

Define $H U (k) = H (1, \dots, 1)$ where there are k work pegs each of height one. As seen above, $H U (1) = 2$ . To help the reader become familiar with the notation, we show the transfer of five disks in 25 moves using two unit work pegs. Here we merely use a superscript on the “ $\Rightarrow$ ” to give the number of disk moves involved, leaving the reader to determine the sequence in detail. A concatenation of stacks on one peg is separated by commas, e.g., $C_{2}, 4$ represents the cone {1, 2} on top of the disk 4. It should be noted that a cone on top of a suitably sized frustum will result in a larger cone, so $C_{r}, F_{s}^{r + 1} = C_{s}$ . $\begin{matrix} 〈 C_{5} | - : - | - 〉 \overset{5}{\Rightarrow} 〈 F_{5}^{4} | - : - | C_{3} 〉 \overset{3}{\Rightarrow} 〈 4 | - : 5 | C_{3} 〉 \overset{3}{\Rightarrow} 〈 C_{2}, 4 | - : 5 | 3 〉 \\ \overset{3}{\Rightarrow} 〈 C_{2}, 4 | - : - | 3, 5 〉 \overset{2}{\Rightarrow} 〈 4 | 1 : - | F_{3}^{2}, 5 〉 \overset{2}{\Rightarrow} 〈 2 | 1 : 4 | 3, 5 〉 \\ \overset{2}{\Rightarrow} 〈 C_{2} | 3 : 4 | 5 〉 \overset{2}{\Rightarrow} 〈 C_{2} | - : - | F_{5}^{3} 〉 \overset{3}{\Rightarrow} 〈 - | - : - | C_{5} 〉 \end{matrix}$

Lemma 4 shows that five is the maximum number of disks that can be transferred using two unit work pegs, but we do not know whether the number of moves used above can be reduced.

Since W contains only unit pegs, there is no point in disk moves between pegs in W and a configuration needs only define the set of at most k disks in W without specifying their positions. In this case of unit pegs, a frustum $F_{s}^{r}$ in W just refers to the set of disks ${r, r + 1, \dots, s}$ on $r - s + 1$ pegs without regard to their order.

The lower bound is provided by Lemma 3.

Lemma 3.

$H U (k) \geq k^{2} + 1$ for $k \geq 0$ .

Proof.

To prove the result, we show by induction on m and k that, for all $m \leq k^{2} + 1$ , C_m can be transferred from L to R using the set W of k unit work pegs. The result is true for $m \leq 1$ , for any $k \geq 0$ , since C₁ can be transferred without using W.

We assume the inductive hypothesis for $0 \leq k' < k$ and for all $m' < m$ , and consider the transfer of C_m from L to R using k unit work pegs where $1 < m \leq k^{2} + 1$ .

$〈 C_{m} | - | - 〉 \Rightarrow 〈 F_{m}^{m - k + 1} | - | C_{m - k} 〉$

Move $C_{m - k}$ from L to R. This is feasible, by induction on m since k > 0.

$〈 F_{m}^{m - k + 1} | - | C_{m - k} 〉 \Rightarrow 〈 - | F_{m}^{m - k + 1} | C_{m - k} 〉$

Move the (at most) k disks in the frustum $F_{m}^{m - k + 1}$ to the work pegs W.

$〈 - | F_{m}^{m - k + 1} | C_{m - k} 〉 \Rightarrow 〈 F_{m - 1}^{m - k + 1} | m | C_{m - k} 〉$

Move $F_{m - 1}^{m - k + 1}$ back to L, leaving disk m occupying one of the work pegs.

$〈 F_{m - 1}^{m - k + 1} | m | C_{m - k} 〉 \Rightarrow 〈 C_{m - 2 k + 1}, F_{m - 1}^{m - k + 1} | m | F_{m - k}^{m - 2 k + 2} 〉$

Move $C_{m - 2 k + 1}$ from the top of R back to L using the k – 1 empty work pegs, by induction on k, since $m - 2 k + 1 \leq k^{2} + 1 - 2 k + 1 = {(k - 1)}^{2} + 1$ .

$〈 C_{m - 2 k + 1}, F_{m - 1}^{m - k + 1} | m | F_{m - k}^{m - 2 k + 2} 〉 \Rightarrow 〈 C_{m - 2 k + 1}, F_{m - 1}^{m - k + 1} | F_{m - k}^{m - 2 k + 2}, m | - 〉$

Move the remaining k – 1 disks from R to the k – 1 empty work pegs.

$〈 C_{m - 2 k + 1}, F_{m - 1}^{m - k + 1} | F_{m - k}^{m - 2 k + 2}, m | - 〉 \Rightarrow 〈 C_{m - 2 k + 1}, F_{m - 1}^{m - k + 1} | - | F_{m - k}^{m - 2 k + 2}, m 〉$

Move disk m to R and place $F_{m - k}^{m - 2 k + 2}$ on top of this on R.

$〈 C_{m - 2 k + 1}, F_{m - 1}^{m - k + 1} | - | F_{m - k}^{m - 2 k + 2}, m 〉 \Rightarrow 〈 F_{m - 1}^{m - k + 1} | - | C_{m - 2 k + 1}, F_{m - k}^{m - 2 k + 2}, m 〉 = 〈 F_{m - 1}^{m - k + 1} | - | C_{m - k}, m 〉$

Move $C_{m - 2 k + 1}$ from the top of L to R, resulting in the cone $C_{m - k}$ . This is feasible using the k work pegs by induction on m.

$〈 F_{m - 1}^{m - k + 1} | - | C_{m - k}, m 〉 \Rightarrow 〈 C_{m - 1} | - | m 〉$

Move $C_{m - k}$ from the top of R back to L. This is feasible using the k work pegs by induction on m.

$〈 C_{m - 1} | - | m 〉 \Rightarrow 〈 - | - | C_{m} 〉$

Move $C_{m - 1}$ from L to R to complete the task. ■

For the upper bound:

Lemma 4.

$H U (k) \leq k^{2} + 1$ for $k \geq 0$ .

Proof.

The result is true for k = 0 since $H U (0) = 1$ . To prove the result by induction on k, we assume that $H U (k - 1) \leq {(k - 1)}^{2} + 1$ , where $k \geq 1$ , and prove that $H U (k) \leq k^{2} + 1$ .

Suppose the number of disks is m. We want to prove that if $〈 C_{m} | - | - 〉 \Rightarrow 〈 - | - | C_{m} 〉$ then $m \leq k^{2} + 1$ . If m can move directly from L to R, then $m \leq k + 1 \leq k^{2} + 1$ , as required.

Otherwise, as in the proof of Theorem 2, we focus on two critical stages in the sequence of moves. Let A be the configuration just after the last time that disk m moves from L to W, and B the configuration after which m moves from W to R the next time after A. Suppose $A = 〈 - | U_{1}, m | Y 〉$ and $B = 〈 X | U_{2}, m | - 〉$ . Let $Z = X \cap Y$ , so $| Z | = | X \cap Y | = | X | + | Y | - | X \cup Y | \geq 2 (m - k) - (m - 1) = m - 2 k + 1$ , since $| X | \geq m - k$ , $| Y | \geq m - k$ , and $| X \cup Y | \leq m - 1$ . The monotonicity lemma (Lemma 1) shows that the set of disks Z can be transferred from R to L using only k – 1 pegs in W. By the inductive hypothesis, $m - 2 k + 1 \leq H U (k - 1) \leq {(k - 1)}^{2} + 1$ , and so $m \leq k^{2} + 1$ , completing the proof. ■

Together, Lemmas 3 and 4 establish the precise value for $H U$ .

Theorem 5.

$H U (k) = k^{2} + 1$ for $k \geq 0$ .

We conclude this section with a simple consequence of this theorem.

Theorem 6.

$H (j_{1}, \dots, j_{k}) \leq {(\sum_{i = 1}^{k} j_{i})}^{2} + 1$ .

Proof.

It is easy to show that any work peg P of height a + b can be simulated by the pair of work pegs $P'$ and $P ″$ of heights a and b, respectively. If there are b or fewer disks on P, then they are stored on $P ″$ , otherwise the largest b disks are on $P ″$ and the rest on $P'$ .

Thence, $W (j_{1}, \dots, j_{k})$ can be simulated by $\sum_{i = 1}^{k} j_{i}$ unit pegs, and the result follows from Theorem 5. ■

5 Two work pegs

We show matching upper and lower bounds for H(i, j), the maximum number of disks that can be transferred from L to R using two work pegs P₁ and P₂ of heights i and j, respectively. A major difference from the case of unit pegs is that it might be advantageous to move the largest disk within W. This turns out not to be the case with just two pegs, but the possibility complicates the proof of the upper bound. We begin with the lower bound.

Lemma 7.

For all $i, j \geq 0, H (i, j) \geq 2 i j + i + j + 1$ .

Proof.

The proof is by induction on i and j. The result holds when either i = 0 or j = 0 since this corresponds to a single work peg and it is known that $H (i) = i + 1$ . To show the bound for H(i, j), we assume as inductive hypothesis that the result holds for all $(i', j')$ with $i' + j' < i + j$ . We may also assume that $i \geq 1, j \geq 1$ and consider the transfer of the cone C_m from L to R, where $m \leq 2 i j + i + j + 1$ . For this we again use induction, this time on m. If $m \leq 2 i j - i + j$ , then C_m can be transferred, since $m \leq H (i, j - 1)$ by induction on i + j. So we now assume that $2 i j - i + j < m \leq 2 i j + i + j + 1$ and that $H (i, j) \geq m - 1$ .

We show how C_m is transferred from L to R. Here we assume without loss of generality that $i \leq j$ . If j < i, then the same algorithm is used with the roles of P₁ and P₂ reversed.

$〈 C_{m} | - : - | - 〉 \Rightarrow 〈 F_{m}^{m - i} | - : - | C_{m - i - 1} 〉$

Transfer $C_{m - i - 1}$ from L to R. This is feasible by the inductive hypothesis on m.

$〈 F_{m}^{m - i} | - : - | C_{m - i - 1} 〉 \Rightarrow 〈 - | F_{m - 1}^{m - i} : m | C_{m - i - 1} 〉$

Move the frustum $F_{m - 1}^{m - i}$ to the work peg P₁, using work peg P₂, since $i \leq j + 1$ , and then move disk m to P₂.

$〈 - | F_{m - 1}^{m - i} : m | C_{m - i - 1} 〉 \Rightarrow 〈 F_{m - 1}^{m - i} | - : m | C_{m - i - 1} 〉$

Move $F_{m - 1}^{m - i}$ back to L, using the remaining capacity j – 1 of P₂ since $i \leq j$ .

$〈 F_{m - 1}^{m - i} | - : m | C_{m - i - 1} 〉 \Rightarrow 〈 C_{m - 2 i - 1}, F_{m - 1}^{m - i} | - : m | F_{m - i - 1}^{m - 2 i} 〉$

Move $C_{m - 2 i - 1}$ from the top of $C_{m - i - 1}$ on R back to L. This is feasible using the remaining capacity $(i, j - 1)$ of work pegs, since $m - 2 i - 1 \leq 2 i j + i + j + 1 - 2 i - 1 = 2 i (j - 1) + i + j \leq H (i, j - 1)$ , by the inductive hypothesis.

$〈 C_{m - 2 i - 1}, F_{m - 1}^{m - i} | - : m | F_{m - i - 1}^{m - 2 i} 〉 \Rightarrow 〈 C_{m - 2 i - 1}, F_{m - 1}^{m - i} | - : - | F_{m - i - 1}^{m - 2 i}, m 〉$

Disk m is inserted underneath a frustum of size i, just like a reversal of steps 2 and 3 above.

$〈 C_{m - 2 i - 1}, F_{m - 1}^{m - i} | - : - | F_{m - i - 1}^{m - 2 i}, m 〉 \Rightarrow 〈 C_{m - 1} | - : - | m 〉$

Move $C_{m - 2 i - 1}$ from the top of L to R and then move the resulting $C_{m - i - 1}$ from the top of R back to L. These operations are feasible using the now-empty work pegs, by induction on m.

$〈 C_{m - 1} | - : - | m 〉 \Rightarrow 〈 - | - : - | C_{m} 〉$

Move $C_{m - 1}$ from L to R to complete the task. ■

In the algorithm that provided the lower bound, the largest disk moved from L to W, then remained on the same peg in W and finally moved from W to R. We now have to deal with the possibility that the largest disk may move between pegs in W. In preparation, we prove two lemmas which establish useful properties of sequences that satisfy a requirement, such as moving some set of disks from one peg to another during some interval.

The first lemma shows, under some conditions, that if a disk d can move from W to R, then there is a sequence whose net effect is only to remove d from W and to insert it into the present stack on R. In the rest of this section, we often consider sequences of moves such that the largest disk m remains in W throughout. We write “ $\overset{\Rightarrow}{-}$ ” for the operation of such a sequence.

For a configuration $C = 〈 - | U_{1} : U_{2} | V 〉$ where m is in W, we say disk $d \in U_{1} \cup U_{2}$ can reach R if $C \overset{\Rightarrow}{-} 〈 T | U_{1}^{'} : U_{2}^{'} | V' 〉$ where $d \in V'$ . Furthermore, disk d in U₁ can shift to R if $C \overset{\Rightarrow}{-} 〈 - | U_{1} ∖ {d} : U_{2} | V \cup {d} 〉$ , i.e., the only difference made by the sequence is to remove disk d from its work peg and insert it into place on R. The definition for $d \in U_{2}$ is similar.

Lemma 8.

While m remains in W throughout, for any configuration with L empty, if d is the largest (other than m) of the set of disks in W that can reach R, then d can shift to R.

Before the proof, let us see an example to illustrate the lemma. Let $W = (3, 3)$ and $C = 〈 - | 3, 4, 5 : - - 7 | 1, 2, 6 〉$ . To help the reader, we use hyphens to represent the spare capacity on work pegs. Here, m = 7, and d = 5 is the disk of interest. Using the spare capacity in P₂, stacks of height three can be moved from one peg to another. So $〈 - | 3, 4, 5 : - - 7 | 1, 2, 6 〉 \overset{\Rightarrow}{-} 〈 1, 2, 6 | 3, 4, 5 : - - 7 | - 〉 \overset{\Rightarrow}{-} 〈 1, 2, 6 | - - - : - - 7 | 3, 4, 5 〉 .$

We see that d can indeed reach R. The lemma constructs a sequence of moves from C which can shift d to R. For example, $\begin{matrix} 〈 - | 3, 4, 5 : - - 7 | 1, 2, 6 〉 \overset{\Rightarrow}{-} 〈 1, 2 | 3, 4, 5 : - - 7 | 6 〉 \overset{\Rightarrow}{-} 〈 1, 2 | - - - : 3, 4, 7 | 5, 6 〉 \\ \overset{\Rightarrow}{-} 〈 1, 2 | - 3, 4 : - - 7 | 5, 6 〉 \overset{\Rightarrow}{-} 〈 - | - 3, 4 : - - 7 | 1, 2, 5, 6 〉 . \end{matrix}$

Proof.

Let $C = 〈 - | U_{1} : U_{2} | V 〉$ where $m \in W$ , and suppose that $d \in U_{1}$ is the largest disk (other than m) in W that can reach R. (There may be larger disks in W that cannot reach R.) Suppose that σ is a sequence of moves starting from C up to but not including the first move in which d reaches R. We assume that m remains in W throughout σ.

Let Z be the subset of disks in V that are larger than d. It may be that during σ some disks in Z move out of R. We show that such moves can be eliminated. Consider the sequence σ₁ that is σ but omitting all moves of disks in Z. By the monotonicity lemma, σ₁ is well-defined: all moves in σ₁ are possible because they are between configurations in which L and W hold subsets of their contents at the corresponding step of σ. Disks in Z remain on R and cannot obstruct any other disks or the move of d to R. Let σ₂ be the sequence consisting of σ₁ followed by the move of d to R, followed by the reversal of σ₁ also omitting any moves of d. By monotonicity again (disk d has been omitted), this reversal is valid, and σ₂ has the required property of returning all disks to their position in C except for d, which has been inserted in R. The case with d initially in U₂ is similar. ■

A simple, but useful, lemma finally prepares us for our upper bound result.

Lemma 9.

If $〈 C_{m} | - | - 〉 \Leftrightarrow 〈 - | - | C_{m} 〉$ , then $〈 U_{1} | - | V_{1} 〉 \Leftrightarrow 〈 U_{2} | - | V_{2} 〉$ for any U₁,V₁,U₂,V₂ with disjoint unions $U_{1} \dot{\cup} V_{1} = U_{2} \dot{\cup} V_{2} = C_{m}$ .

Proof.

By monotonicity, for any $r \leq m$ , cone C_r can be moved between L and R. The stacks U₁ and V₁ can be merged into C_m by forming $C_{1}, \dots, C_{m}$ successively. This increasing set of cones is made by shuttling partial cones between L and R, collecting the disks from U₁ and V₁ in increasing order. The same holds for U₂ and V₂. ■

Now we can prove our upper bound for two work pegs.

Lemma 10.

For all $i, j \geq 0, H (i, j) \leq 2 i j + i + j + 1$ .

Proof.

As in Lemma 7, the proof is by induction on i and j, and then on m. The result holds when either i = 0 or j = 0, since this corresponds to a single work peg and $H (i) = i + 1$ . To show the bound for H(i, j), we assume as inductive hypothesis that the result holds for all $(i', j')$ with $i' + j' < i + j$ . We may also assume that $i \geq 1$ and $j \geq 1$ .

For our inductive proof, let $m = H (i, j)$ . By the definition of H(i, j), $〈 C_{m} | - | - 〉 \overset{σ}{\Rightarrow} 〈 - | - | C_{m} 〉,$ for some sequence of moves σ. We focus on one particular segment $σ'$ of σ during which the largest disk m remains in W. Let $σ'$ begin at configuration $A = 〈 - | m : X_{1} | Y_{1} 〉$ , when m has just moved for the last time from L to W, onto P₁ say, and finish at $B = 〈 Y_{2} | m : X_{2} | - 〉$ or $〈 Y_{2} | X_{2} : m | - 〉$ , the next time that m is about to move from W to R.

Disk m is on L just before the start of $σ'$ and on R just after the end of $σ'$ , and by Lemma 7, $m = H (i, j) \geq 2 i j + i + j + 1 > i + j + 1$ since i > 0 and j > 0. Hence disk m cannot move directly from L to R, since the other m – 1 disks exceed the total capacity of W which is i + j. Therefore $σ'$ is well-defined.

Recall that $\overset{\Rightarrow}{-}$ is used for a sequence where the largest disk m remains in W throughout.

We first consider the case where disk m remains on P₁ throughout $σ'$ , so $A = 〈 - | m : X_{1} | Y_{1} 〉 \overset{σ^{'}}{\begin{matrix} \Rightarrow \\ - \end{matrix}} B = 〈 Y_{2} | m : X_{2} | - 〉 .$

Since each of X₁ and X₂ are on P₂ at some time, $| X_{1} \cup X_{2} | \leq | X_{1} | + | X_{2} | \leq 2 j$ . Let $Z = {1, \dots, m - 1} ∖ (X_{1} \cup X_{2})$ . Then, by the monotonicity lemma, we may ignore disks in $X_{1} \cup X_{2}$ to show that $〈 - | m : - | Z 〉 \overset{σ^{'}'}{\begin{matrix} \Rightarrow \\ - \end{matrix}} 〈 Z | m : - | - 〉,$

for some sequence $σ ″$ , and where $| Z | \geq m - 1 - 2 j$ . Since disk m remains on P₁ throughout $σ ″$ , we have $m - 1 - 2 j \leq H (i - 1, j)$ . Hence by induction, $\begin{matrix} H (i, j) = m \leq H (i - 1, j) + 1 + 2 j \leq (2 (i - 1) j + (i - 1) + j + 1) + 1 + 2 j \\ = 2 i j + i + j + 1, \end{matrix}$ as required for this case. The case where m moves to P₂ at time A, and remains there throughout $σ'$ , is similar.

Now we consider the case where disk m does move between work pegs during $σ'$ . The above argument fails because a set Z of size $H (i, j - 1)$ might be transferred if disk m can be moved to P₂, and $H (i, j - 1) > H (i - 1, j)$ if i < j. We used the fact that $| X_{1} | \leq j$ , while for the general case we want to have $| X_{1} | \leq \min (i, j)$ .

For disk m to move from P₁ to P₂, each disk on P₂ in configuration A has to be removed from W, either to L directly or else via R. For the latter, any such disk can reach R and so we need Lemma 8.

Suppose that $D = (d_{1}, \dots, d_{r})$ is the set of disks in decreasing order of size that are on P₂ and can reach R. By Lemma 8, d₁ can be shifted to R, resulting in configuration $C'$ . Now d₂ can still reach R starting from $C'$ (by first reversing the sequence shifting d₁ if necessary), and is now the largest disk on P₂ that can reach R. By Lemma 8, d₂ can be shifted to R, and so on, until all of D has been shifted. The result is a configuration $A' = 〈 - | m : X'_{1} | Y'_{1} 〉$ where no disk in $X'_{1}$ can reach R.

Suppose $A ″ = 〈 U_{1} | m : - | V_{1} 〉$ is the configuration before the first move of m from P₁ to P₂ during $σ'$ . Then $A' = 〈 - | m : X'_{1} | Y'_{1} 〉 \overset{\Leftrightarrow}{-} A \overset{\Rightarrow}{-} A ″ = 〈 U_{1} | m : - | V_{1} 〉,$ where $X'_{1} \subseteq U_{1}$ , since no disk in $X'_{1}$ can reach R.

By monotonicity, if we consider only disks in $X'_{1} \cup {m}$ and only moves between L and W and within W, we may ignore peg R completely and derive $〈 - | m : X'_{1} | \overset{\Rightarrow}{-} 〈 X'_{1} | m : - | .$

Therefore, $| X'_{1} | \leq H (i - 1) = i$ and so $| X'_{1} | \leq \min (i, j)$ .

By a symmetric argument, if $B = 〈 Y_{2} | m : X_{2} | - 〉$ then $B' = 〈 Y'_{2} | m : X'_{2} | - 〉 \overset{\Rightarrow}{-} B$ for some $X'_{2}, Y'_{2}$ where $| X'_{2} | \leq \min (i, j)$ , and similarly if m is on P₂ in B.

As in the cases where m never switches work pegs, we define $Z = {1, \dots, m - 1} ∖ (X'_{1} \cup X'_{2})$ , but now we have the stronger inequality $| Z | \geq m - 1 - 2 \min (i, j)$ . Again by the monotonicity lemma, we may ignore disks in $X_{1}^{'} \cup X_{2}^{'}$ , and so $〈 - | m : - | Z 〉 \overset{σ^{'}'}{\begin{matrix} \Rightarrow \\ - \end{matrix}} 〈 Z | m : - | - 〉,$ for some sequence $σ ″$ , and where $| Z | \geq m - 1 - 2 \min (i, j)$ .

What remains to be proved is that moving m between P₁ and P₂ cannot achieve the transfer of more than $\max (H (i - 1, j), H (i, j - 1))$ disks.

Let $r = 1 + \max (H (i - 1, j), H (i, j - 1))$ . It is sufficient to show that there can be no sequence τ such that $〈 - | m : - | C_{r} 〉 \overset{τ}{\begin{matrix} \Rightarrow \\ - \end{matrix}} 〈 C_{r} | m : - | - 〉 .$

It is not relevant here whether m is on P₁ or P₂ in these configurations. Any such sequence τ consists of segments where m remains on the same work peg, separated by moves of m between pegs. We focus on the positions of disk r at each such move of m. Clearly r is on L or R at these moves. Suppose r switches between L and R during one of these segments $τ'$ , so $F = 〈 U | m : - | V 〉 \overset{τ'}{\begin{matrix} \Rightarrow \\ - \end{matrix}} 〈 V' | m : - | U' 〉 = G,$ where $r \in U$ and $r \in U'$ , or similarly with $r \in V$ and $r \in V'$ .

Since $r - 1 = \max (H (i - 1, j), H (i, j - 1))$ , we may use the method in Lemma 9 to collect $U ∖ {r}$ and V into $C_{r - 1}$ on L, and $U' ∖ {r}$ and $V'$ into $C_{r - 1}$ on R. Hence $\begin{matrix} F \overset{\Leftrightarrow}{-} 〈 C_{r - 1}, r | m : - | - 〉 = 〈 C_{r} | m : - | - 〉 \\ \overset{\Leftrightarrow}{-} 〈 - | m : - | C_{r} 〉 = 〈 - | m : - | C_{r - 1}, r 〉 \overset{\Leftrightarrow}{-} G . \end{matrix}$

This gives a contradiction since at most $H (i - 1, j)$ disks can be transferred from L to R while m remains on P₁, and $r > H (i - 1, j)$ , and similarly where m is on P₂ and $r > H (i, j - 1)$ . So cone C_r cannot be transferred from L to R while m remains on P₁ nor while m remains on P₂ and, for any move of m between P₁ and P₂, r is either on L or on R.

Therefore $| Z | \leq \max (H (i - 1, j), H (i, j - 1))$ , and hence, by induction, $\begin{matrix} H (i, j) = m \leq | Z | + 2 \min (i, j) + 1 \\ \leq \max (H (i - 1, j), H (i, j - 1)) + 2 \min (i, j) + 1 \\ \leq \max (2 (i - 1) j, 2 i (j - 1)) + i + j + 2 \min (i, j) + 1 \\ = 2 i j + i + j + 1 + 2 \max (- i, - j) + 2 \min (i, j) \\ = 2 i j + i + j + 1. ■ \end{matrix}$

Together, Lemmas 7 and 10 establish

Theorem 11.

For all $i, j \geq 0, H (i, j) = 2 i j + i + j + 1$ .

6 More work pegs

When there are three or more work pegs, we have some partial results. Theorem 12 gives a simple formula similar to that in Theorem 11. Theorem 14 gives an improved lower bound in the form of a recursive formula, but there does not seem to be an equivalent explicit formula. Theorem 15 gives an exact result when work peg heights are sufficiently slowly growing. Finally, when we restrict our attention to the case of three work pegs, we have an improved algorithm in Theorem 17 which could even be optimal.

We begin with the easy lower bound which is an explicit polynomial formula proved in much the same way as Lemma 7.

Theorem 12.

For all $k \geq 0$ and nonnegative $j_{1}, \dots, j_{k}$ , $H (j_{1}, \dots, j_{k}) \geq 2 \sum_{u < v} j_{u} j_{v} + \sum_{w = 1}^{k} j_{w} + 1.$

Proof.

For any $k \geq 0$ , the proof by induction on $j_{1}, \dots, j_{k}$ is analogous to that in Lemma 7. The base of the induction holds when j_w = 0 for $1 \leq w \leq k$ , and a single disk can be transferred directly from L to R. Suppose without loss of generality that j_k is the height of a largest work peg. Let $m = 2 \sum_{u < v} j_{u} j_{v} + \sum_{w = 1}^{k} j_{w} + 1 and s = \sum_{w = 1}^{k - 1} j_{w} .$

For ease of notation we define the frusta, $F^{(i)}$ , where $(F^{(1)}, \dots, F^{(k - 1)}) = F_{m - 1}^{m - s}$ and $| F^{(i)} | = j_{i}$ , so $F^{(1)} = F_{m - s + j_{1} - 1}^{m - s}$ , etc.

Following a similar sequence to that used in the proof of Lemma 7, we show that C_m can be transferred from L to R using $W (j_{1}, \dots, j_{k})$ , establishing that $H (j_{1}, \dots, j_{k}) \geq m$ as required.

$〈 C_{m} | - : \dots : - | - 〉 \Rightarrow 〈 F_{m}^{m - s} | - : \dots : - | C_{m - s - 1} 〉$
The cone $C_{m - s - 1}$ is transferred from L to R.
$〈 F_{m}^{m - s} | - : \dots : - | C_{m - s - 1} 〉 \Rightarrow 〈 - | F^{(1)} : \dots : F^{(k - 1)} : m | C_{m - s - 1} 〉$
Using P_k, the s disks in $F_{m - 1}^{m - s}$ are transferred to fill $P_{1}, \dots, P_{k - 1}$ with $F^{(1)}, \dots, F^{(k - 1)}$ , respectively, and disk m is moved from L to P_k.
$〈 - | F^{(1)} : \dots : F^{(k - 1)} : m | C_{m - s - 1} 〉 \Rightarrow 〈 F_{m - 1}^{m - s} | - : \dots : - : m | C_{m - s - 1} 〉$
The remaining capacity $j_{k} - 1$ of P_k is sufficient to transfer the contents of all the other work pegs back to L.
$〈 F_{m - 1}^{m - s} | - : \dots : - : m | C_{m - s - 1} 〉 \Rightarrow 〈 C_{m - 2 s - 1}, F_{m - 1}^{m - s} | - : \dots : - : m | F_{m - s - 1}^{m - 2 s} 〉$
$C_{m - 2 s - 1}$ is transferred back from the top of R to L, since, as verified below, $m - 2 s - 1 \leq H (j_{1}, \dots, j_{k - 1}, j_{k} - 1)$ .
$〈 C_{m - 2 s - 1}, F_{m - 1}^{m - s} | - : \dots : - : m | F_{m - s - 1}^{m - 2 s} 〉 \Rightarrow 〈 C_{m - 2 s - 1}, F_{m - 1}^{m - s} | - : \dots : - | F_{m - s - 1}^{m - 2 s}, m 〉$
Disk m is inserted beneath the frustum of size s on R, just like a reversal of the procedure in steps 2 and 3 which extracted m from beneath $F_{m - 1}^{m - s}$ on L.
$〈 C_{m - 2 s - 1}, F_{m - 1}^{m - s} | - : \dots : - | F_{m - s - 1}^{m - 2 s}, m 〉 \Rightarrow 〈 - | - : \dots : - | C_{m} 〉$
The final stages are as in the proof of Lemma 7. The cone $C_{m - 2 s - 1}$ is transferred from L to R, producing $C_{m - s - 1}$ . This is then transferred back to L, producing $C_{m - 1}$ , which is finally moved to R, finishing the process.

Finally we check that $m - 2 s - 1 \leq H (j_{1}, \dots, j_{k - 1}, j_{k} - 1)$ : $\begin{matrix} m - 2 s - 1 = 2 \sum_{1 \leq u < v \leq k} j_{u} j_{v} + \sum_{w = 1}^{k} j_{w} + 1 - 2 s - 1 \\ = 2 \sum_{1 \leq u < v \leq k - 1} j_{u} j_{v} + 2 (j_{k} - 1) s + \sum_{w = 1}^{k - 1} j_{w} + (j_{k} - 1) + 1 \\ \leq H (j_{1}, \dots, j_{k - 1}, j_{k} - 1), \end{matrix}$ by induction. ■

The corollary gives a polynomial formula when the work pegs are of equal height.

Corollary 13.

If $j_{1} = \dots = j_{k} = j$ , then $H (j_{1}, \dots, j_{k}) \geq k (k - 1) j^{2} + k j + 1$ .

For two work pegs, we showed in the proof of Lemma 10 that no move within W of the largest disk could be useful. This is not the case when there are more than two pegs. Theorem 12 gives $H (1, 1, 2) \geq 15$ , but, as we will see in Theorem 14, this bound can be improved to 17. The reader may wish to explore this case now, starting with a transfer of C₁₃ from L to R.

Improved multipeg algorithm

We outline the transfer of m disks with work space $W (j_{1}, \dots, j_{k})$ . In the course of our recursive algorithms, we will encounter the work spaces W(0) and $W ()$ , where there is either a single work peg with zero height or no work peg at all. In these cases, just a single disk can be transferred, i.e., $H (0) = H () = 1$ .

The work pegs should be sorted in order of height, so we assume $j_{1} \leq \dots \leq j_{k}$ . In the first stage of the algorithm, an appropriately sized cone C_s is transferred from L to R recursively. $〈 C_{m} | - : \dots : - | - 〉 \Rightarrow 〈 F_{m}^{s + 1} | - : \dots : - | C_{s} 〉 .$

In the second stage, the longest work peg P_k is loaded with m_k disks by using all the other work pegs. We continue in this way loading $m_{k - 1}, \dots, m_{1}$ disks on each of $P_{k - 1}, \dots, P_{1}$ in turn, using all the currently empty work pegs. We will choose each $m_{i} \leq H (j_{1}, \dots, j_{i - 1})$ , and choose s so that $m - s = \sum_{i = 1}^{k} m_{i}$ . We define frusta $F^{(1)}, \dots, F^{(k)}$ as in Theorem 12, but now with $| F^{(i)} | = m_{i}$ , and $(F^{(k)}, \dots, F^{(1)}) = F_{m}^{s + 1}$ . The effect of this second stage is therefore $〈 F_{m}^{s + 1} | - : \dots : - | C_{s} 〉 \Rightarrow 〈 - | F^{(1)} : \dots : F^{(k)} | C_{s} 〉,$ and L is empty after these transfers. The transfer of disks from L to P₁ has no other work peg to help. Therefore $m_{1} = H () = 1$ and m is the only disk on P₁.

In the third stage of the algorithm, disk m remains in W but all other disks in W are returned to L, clearing pegs $P_{2}, P_{3}, \dots, P_{k}$ in succession. At most j₁ disks can be moved back from P₂ since there is only free space $j_{1} - 1$ on P₁. We choose therefore, $m_{2} = \min (j_{2}, j_{1})$ . From now on there is more than one work peg to assist the transfer of disks from P_i to L, but one of these must contain disk m. It might seem best for disk m to be on a longest peg $P_{i - 1}$ at this time, giving total work space $W (j_{1}, \dots, j_{i - 2}, j_{i - 1} - 1)$ for this transfer. This is true when only two work pegs are used, but we know, for example, that $H (1, 2, 3) > H (2, 2, 2)$ , so with work space $W (2, 2, 3)$ it would be better to put m on P₁. To account for this, it is convenient to define the function K.( ∗ ) $\begin{matrix} K (j_{1}, \dots, j_{k}) = \max (H (j_{1} - 1, j_{2}, \dots, j_{k}), H (j_{1}, j_{2} - 1, \dots, j_{k}), \dots, \\ H (j_{1}, j_{2}, \dots, j_{k - 1} - 1, j_{k}), H (j_{1}, j_{2}, \dots, j_{k - 1}, j_{k} - 1)) . \end{matrix}$ ( ∗ )

We use our multipeg algorithm recursively and so can move up to $K (j_{1}, \dots, j_{i - 1})$ disks back from P_i to L, since at this time disk m is the only disk in $W (j_{1}, \dots, j_{i - 1})$ and can be moved freely to any work peg. We choose $m_{i} = \min (j_{i}, K (j_{1}, \dots, j_{i - 1}))$ for $1 \leq i \leq k$ . Remember here that the parameters for K and H may need sorting again. After these transfers back to L from pegs $P_{2}, \dots, P_{k}$ , we have the configuration $〈 F_{m - 1}^{s + 1} | - : \dots : - : m | C_{s} 〉$ , where $s = m - \sum_{i = 1}^{k} m_{i}$ .

In the fourth stage, we transfer the top $K (j_{i}, \dots, j_{k - 1}, j_{k}) = m'$ disks from R to L reaching configuration $〈 C_{m'}, F_{m - 1}^{s + 1} | - : \dots : - : m | F_{s}^{m' + 1} 〉$ using the same method.

Note that the second and third stages taken together achieve the following, $〈 F_{m}^{s + 1} | - : \dots : - | C_{s} 〉 \Rightarrow 〈 F_{m - 1}^{s + 1} | - : \dots : - : m | C_{s} 〉,$ extracting disk m from under a stack of $m - s - 1$ disks on L. The fifth stage of the algorithm reverses such a sequence in order to insert disk m under the $F_{s}^{m' + 1}$ on R. This is possible if $s - m' \leq m - s - 1$ , i.e., $m - \sum_{i = 1}^{k} m_{i} - m' \leq \sum_{i = 1}^{k} m_{i} - 1$ . We choose $m = 2 \sum_{i = 1}^{k} m_{i} + m' - 1$ . Then $〈 C_{m'}, F_{m - 1}^{s + 1} | - : \dots : - : m | F_{s}^{m' + 1} 〉 \Rightarrow 〈 C_{m'}, F_{m - 1}^{s + 1} | - : \dots : - | F_{s}^{m' + 1}, m 〉 .$

The final stages of the algorithm are smaller versions of previous transfers. $\begin{matrix} 〈 C_{m'}, F_{m - 1}^{s + 1} | - : \dots : - | F_{s}^{m' + 1}, m 〉 \Rightarrow 〈 F_{m - 1}^{s + 1} | - : \dots : - | C_{s}, m 〉 \\ \Rightarrow 〈 C_{m - 1} | - : \dots : - | m 〉 \Rightarrow 〈 - | - : \dots : - | C_{m} 〉 . \end{matrix}$

The last transition is by induction on m.

This algorithm yields the following lower bound for multipeg Hanoi. Recall that if we choose $m = 2 \sum_{i = 1}^{k} m_{i} + m' - 1$ , then m disks can be transferred.

Theorem 14

. For $0 < j_{1} \leq j_{2} \leq \dots \leq j_{k}$ , and K as defined above in ( $*$ ), $H (j_{1}, \dots, j_{k}) \geq 2 \sum_{i = 1}^{k} m_{i} + K (j_{1}, \dots, j_{k}) - 1,$ where $m_{i} = \min (j_{i}, K (j_{1}, \dots, j_{i - 1})$ for $1 \leq i \leq k$ .

For a general set of work pegs an explicit exact formula seems unlikely, but we can give a special case for which this can be shown.

In Theorem 14 we define $m_{i} = \min (j_{i}, K (j_{1}, \dots, j_{i - 1}))$ for $1 \leq i \leq k$ . For $W (j_{1}, \dots, j_{k})$ we say that( ∗∗ ) $peg P_{i} is tight if m_{i} = j_{i} .$ ( ∗∗ )

For example, in $W (1, 1, 2)$ , P₃ is tight because $2 \leq H (0, 1) \leq K (1, 1)$ , P₂ is tight because $1 \leq H (0) = K (1)$ , and P₁ is tight because $1 \leq H () = 1$ .

For the case where all pegs are tight the recursion becomes $H (j_{1}, \dots, j_{k}) \geq 2 \sum_{i = 1}^{k} j_{i} + K (j_{1}, \dots, j_{k}) - 1.$

Recall that the function K is designed to optimize the position in W of the largest element during a major transfer from L to R. We will see that placing this on P_k achieves optimality when all pegs are tight, so $K (j_{1}, \dots, j_{k}) = H (j_{1}, \dots, j_{k} - 1)$ . It is then easy to check that all pegs are tight in $W (j_{1}, \dots, j_{k} - 1)$ , even after any necessary re-sorting.

Now we can verify the lower bound $H (j_{1}, \dots, j_{k}) \geq {(\sum_{i = 1}^{k} j_{i})}^{2} + 1$ , since $\begin{matrix} H (j_{1}, \dots, j_{k}) \geq 2 \sum_{i = 1}^{k} j_{i} + H (j_{1}, \dots, j_{k - 1}, j_{k} - 1) - 1 \\ \geq 2 \sum_{i = 1}^{k} j_{i} + {(- 1 + \sum_{i = 1}^{k} j_{i})}^{2} + 1 - 1 = {(\sum_{i = 1}^{k} j_{i})}^{2} + 1. \end{matrix}$

Theorem 6 provides the matching upper bound, giving the following result.

Theorem 15.

If all pegs are tight in $W (j_{1}, \dots, j_{k})$ , i.e., $j_{i} \leq K (j_{1}, \dots, j_{i - 1})$ for $1 \leq i \leq k$ , then $H (j_{1}, \dots, j_{k}) = {(\sum_{i = 1}^{k} j_{i})}^{2} + 1.$

The example $H (0!, 1!, \dots, 10!)$ , mentioned in our abstract, illustrates this theorem. Here, when $j_{i} = (i - 1)!$ , all pegs are tight. As in the example $W (1, 1, 2)$ above, P₁, P₂, P₃ of heights $0!, 1!, 2!$ , respectively, are tight. So P₄, of height $3! = 6$ , is also tight, as $H (0!, 1!, 2! - 1) = {(0! + 1! + 2! - 1)}^{2} + 1 = 10$ . (Since $2!$ for P₃ is tight, so is $2! - 1$ .) It is easily seen that this continues for subsequent work pegs.

There is more work to be done for the general case, as the following example shows.

Example 16.

For $W (1, 2, 3)$ , Theorem 14 gives $m_{1} = m_{2} = 1$ and $m_{3} = 3$ , and $H (1, 2, 3) \geq 10 + H (1, 2, 2) - 1 = 35$ . (We assume here that $H (1, 2, 2) = 26$ and will return to this later.) However,the following sequence shows that $H (1, 2, 3) \geq 37$ . $\begin{matrix} 〈 C_{37} | - : - - : - - - | - 〉 \Rightarrow 〈 F_{37}^{32} | - : - - : - - - | C_{31} 〉 \\ \overset{see below}{\Rightarrow} 〈 F_{36}^{32} | - : - - : - - 37 | C_{31} 〉 \Rightarrow 〈 C_{26}, F_{36}^{32} | - : - - : - - 37 | F_{31}^{27} 〉 \\ \overset{similarly}{\Rightarrow} 〈 C_{26}, F_{36}^{32} | - : - - : - - - | F_{31}^{27}, 37 〉 \Rightarrow 〈 F_{36}^{32} | - : - - : - - - | C_{31}, 37 〉 \\ \Rightarrow 〈 C_{36} | - : - - : - - - | 37 〉 \Rightarrow 〈 - | - : - - : - - - | C_{37} 〉 . \end{matrix}$

We have found some sequences of moves much easier to follow using a reduced notation. The sequence labeled “see below” moves the largest disk to P₃ from under a stack of five disks on L. Peg R is not involved. We denote the initial configuration before this sequence as $abcdef | - : - - : - - - | 〉$ , where, as in the proof of Lemma 10, peg R is ignored. The required sequence is then $\begin{matrix} 〈 abcdef | - : - - : - - - | \Rightarrow 〈 - | f : d e : abc | \Rightarrow 〈 d | f : - e : abc | \Rightarrow 〈 abd | f : - e : - - c | \\ \Rightarrow 〈 abd | - : c e : - - f | \Rightarrow 〈 d | - : c e : abf | \Rightarrow 〈 c | d : - e : abf | \\ \Rightarrow 〈 abc | d : - e : - - f | \Rightarrow 〈 abc | e : - d : - - f | \Rightarrow 〈 - | e : c d : abf | \\ \Rightarrow 〈 cde | - : - - : abf | \Rightarrow 〈 abcde | - : - - : - - f | . \end{matrix}$

The sequence labeled “similarly” moves disk 37 from P₃ to below $F_{31}^{27}$ on R. This is just like the reverse of the sequence that moves 37 from the bottom of $F_{37}^{32}$ to P₃, which we gave in detail above.

In Theorem 14, the biggest disk m moves from P₁ to P₂, then from P₂ and P₃ and so on, one by one, but in the above example m moves from P₁ to P₃ directly. Such a “jump” of the biggest disk also allows us to show that $H (1, 2, 2) \geq 26$ . We can generalize this jumping strategy in the form of an algorithm, but there is unlikely to be a succinct specification of m₁ through m_k based on that. We restrict ourselves therefore to the case of three work pegs and present an algorithm which may be optimal.

Jump algorithm

This algorithm follows a similar scheme to that in Theorem 14 but makes use of the tactic introduced in Example 16 where the largest disk may jump directly from P₁ to P₃. The heart of the algorithm lies in the choice of the values for m₂ and m₃.

The heights of pegs P₁,P₂,P₃ are j₁,j₂,j₃, where $j_{1} \leq j_{2} \leq j_{3}$ , and we describe the transfer of C_m from L to R. As in the algorithm for Theorem 14, in the first stage some suitable cone C_s is transferred recursively from L to R. In the second stage, frusta of sizes m₃, m₂ and m₁ are loaded successively from L onto pegs P₃, P₂, and P₁. We choose $m_{1} = 1$ with two alternatives plans for m₂ and m₃ $\begin{matrix} \begin{matrix} either & Plan (A), & m_{2} = j_{1}, & m_{3} = \min (j_{3}, H (j_{1}, j_{2} - 1) + j_{1}) \end{matrix} \\ \begin{matrix} or & Plan (B), & m_{2} = j_{1} + 1, & m_{3} = \min (j_{3}, H (j_{1} - 1, j_{2} - 1) + j_{1}) . \end{matrix} \end{matrix}$

We will use the plan that gives the larger value for $m_{2} + m_{3}$ , though note that Plan (B) is only feasible when $j_{2} > j_{1}$ . We can verify in this case that the two plans have equal performance if $j_{3} = 2 j_{1} j_{2} + j_{2}$ , while Plan (A) is better when j₃ is greater than this, and worse when less. The parameter s in the first stage is chosen so that $s + m_{1} + m_{2} + m_{3} = m$ , and therefore peg L is emptied in the second stage.

Stage 3 now begins to return all disks except for the largest to L. The sequences now become more intricate and we rely on the growing expertise of the reader to interpolate the details. We extend the alphabetic notation introduced in Example 16 to a more advanced level in which lower case letters denote single disks, while upper case letters denote frusta of given sizes.

Plan (A): with $m_{1} = 1$ , m₂ = j₁, $m_{3} = \min (j_{3}, H (j_{1}, j_{2} - 1) + j_{1})$ .

In the first stage, $〈 ABCDeFg | - : - : - | - 〉 \Rightarrow 〈 CDeFg | - : - : - | A B 〉,$ where $| A | = K (j_{1}, j_{2}, j_{3})$ and $B = m_{2} + m_{3}$ . Now AB remains on R and the second stage is to move g to W from below $C, D, e, F$ on L. Here, $| C | = \min (H (j_{1}, j_{2} - 1), j_{3} - j_{1}), | D | = j_{1} - 1$ , and $| F | = j_{1}$ . We see that $| CDe | = m_{3}$ as required.

As in Example 16, we will just show the contents of L and the work pegs. Note that in the second sequence F is transferred from P₂ to L using the space $j_{1} - 1$ on P₁, and in the third sequence the transfer of C from P₃ to L is possible since $| C | \leq H (j_{1}, j_{2} - 1)$ . $\begin{matrix} 〈 CDeFg | - : - : - | \Rightarrow 〈 - | g : F : CDe | \Rightarrow 〈 F | - : g : CDe | \Rightarrow 〈 C F | - : g : D e | \\ \Rightarrow 〈 C F | D e : - : g | \Rightarrow 〈 C F | - : e : D g | \Rightarrow 〈 F | - : e : CDg | \\ \Rightarrow 〈 e F | - : - : CDg | \Rightarrow 〈 CDeF | - : - : g | . \end{matrix}$

The third stage is to achieve $〈 CDeF | - : - : g | A B 〉 \Rightarrow 〈 ACDeF | - : - : g | B 〉 \Rightarrow 〈 ACDeF | - : - : - | B g 〉,$ i.e., to transfer A back to L, and then to insert g below B on R. Note that $| A | = K (j_{1}, j_{2}, j_{3})$ , so the transfer of A to L is possible with an appropriate positioning of disk g in W. Since $| B | = m_{2} + m_{3}$ , the insertion of g below B has the same pattern as the reversal of the second stage, extracting g from below CDeF. The final stages are just as in the algorithm for Theorem 14, so A moves to R, then AB moves back to L, and the rest is recursion.

Plan (B): with $m_{1} = 1, m_{2} = j_{1} + 1, m_{3} = \min (j_{3}, H (j_{1} - 1, j_{2} - 1) + j_{1})$ .

This alternative is only available when $j_{2} > j_{1}$ and proceeds in a similar way, but with a few more steps. $〈 ABCDeFghi | - : - : - | - 〉 \Rightarrow 〈 CDeFghi | - : - : - | A B 〉,$ where $| A | = K (j_{1}, j_{2}, j_{3}), B = m_{2} + m_{3}, | C | = \min (H (j_{1} - 1, j_{2} - 1), j_{3} - j_{1})$ , and $| D | = | F | = j_{1} - 1$ . The size of C is chosen to allow it to be transferred between L and P₃ when P₁ and P₂ each hold a single disk. Now $\begin{matrix} 〈 CDeFghi | - : - : - | \Rightarrow 〈 - | i : Fgh : CDe | \Rightarrow 〈 CFg | i : h : D e | \\ \Rightarrow 〈 CFg | - : Deh : i | \Rightarrow 〈 CFg | e : h : D i | \Rightarrow 〈 F g | e : h : CDi | \\ \Rightarrow 〈 g | e : F h : CDi | \Rightarrow 〈 - | g : eFh : CDi | \Rightarrow 〈 CeF | g : h : D i | \\ \Rightarrow 〈 CeF | - : h : Dgi | \Rightarrow 〈 CeF | h : g : D i | \Rightarrow 〈 e F | h : g : CDi | \\ \Rightarrow 〈 h | - : eFg : CDi | \Rightarrow 〈 CDeFgh | - : - : i | . \end{matrix}$

The last stages are the same as for Plan (A).

Theorem 17.

For $j_{1} \leq j_{2} \leq j_{3}$ , let $\begin{matrix} m_{A} = 1 + j_{1} + \min (j_{3}, H (j_{1}, j_{2} - 1) + j_{1}) and \\ m_{B} = 1 + \min (j_{2}, j_{1} + 1) + \min (j_{3}, H (j_{1} - 1, j_{2} - 1) + j_{1}) . \end{matrix}$

Then $H (j_{1}, j_{2}, j_{3}) \geq 2 \max (m_{A}, m_{B}) + K (j_{1}, j_{2}, j_{3}) - 1$ .

Proof.

In Plan (A), m₂ = j₁ and $m_{3} = \min (j_{3}, H (j_{1}, j_{2} - 1) + j_{1})$ , so $m_{1} + m_{2} + m_{3} = m_{A}$ , and C_m is transferred from L to R where $m = | ABCDeFg | = K (j_{1}, j_{2}, j_{3}) + (m_{A} - 1) + m_{A},$ as required.

In Plan (B), $m_{2} = j_{1} + 1 = \min (j_{2}, j_{1} + 1)$ and $m_{3} = \min (j_{3}, H (j_{1} - 1, j_{2} - 1) + j_{1})$ , so $m_{1} + m_{2} + m_{3} = m_{B}$ , and $m = | ABCDeFghi | = K (j_{1}, j_{2}, j_{3}) + (m_{B} - 1) + m_{B} .$

■

Theorem 17 yields the promised bound of $H (1, 2, 2) = 26$ , the example illustrated in . The upper bound is given by Theorem 6. Plan (B) uses $(m_{1}, m_{2}, m_{3}) = (1, 2, 2)$ for a matching lower bound of $10 + K (1, 2, 2) - 1 = 26$ , since $K (1, 2, 2) = H (1, 1, 2) = 17$ .

Fig. 1 A tower of 26 disks being transferred using work pegs of heights 2, 1, and 2.

Theorems 12, 14, and 17 give successive improvements to the lower bounds, but at the cost of more complicated algorithms. How much do these extra complications buy for us? As a rough indication, we show below a comparison of $H_{1} (j, j, j), H_{2} (j, j, j)$ , and $H_{3} (j, j, j)$ , for the results of Theorems 12, 14, and 17, respectively.

7 Conclusion

We have begun an investigation into the space complexity of the generalized Tower of Hanoi problem, where the number of work pegs provided to transfer a set of disks from the initial peg to the final peg is increased from the single peg used in the classic problem.

We pay little attention to the time complexity, the number of moves, in our algorithms. As may be expected from even our first example, transferring five disks using two unit pegs and 25 moves, the time complexity is large compared with that for unbounded pegs, where only 13 moves are needed. Lemma 7 shows that 25 disks can be transferred using two work pegs of height three. With unbounded pegs this requires precisely 577 moves [Citation5], but our algorithm takes over 2,000,000 moves, as estimated by a simple Mathematica program. It remains to be seen whether there are interesting trade-offs between time complexity and space complexity results for Hanoi problems.

We have exact results for $H (j_{1}, \dots, j_{k})$ when $k \leq 2$ , for $H U (k)$ , i.e., k pegs of height 1, and for the case where all the pegs are tight (defined in ( $* *$ ) in the previous section). Theorem 14 gives a fairly close bound for $H (j_{1}, \dots, j_{k})$ , but we know this is not exact. Indeed, Theorem 17 gives better lower bounds in the case of k = 3, and the algorithm given there could even be optimal.

There are some very small cases for the general problem where gaps remain. By ad hoc methods, we know exact values for all cases where the total work space is at most six. However, Theorem 17 shows only that $H (1, 3, 3) \geq 48$ , while our best upper bound so far invokes Theorem 6 to prove that $H (1, 3, 3) \leq H U (7) = 50$ . Surely this gap can be closed!

Table 1 Lower bounds for $H (j, j, j)$ from Theorems 12, 14, and 17.

Display Table

Acknowledgment

We thank the reviewers for their valuable suggestions. The research of Kazuo Iwama is supported by KAKENHI, Ministry of Education, Japan. The research of Mike Paterson is supported by the Centre for Discrete Mathematics and its Applications (DIMAP).

Additional information

Notes on contributors

Kazuo Iwama

KAZUO IWAMA received his B.E., M.E., and Ph.D. degrees from Kyoto University. After retirement from the School of Informatics at Kyoto, he was with the Research Institute for Mathematical Sciences and the Academic Center for Computing and Media Studies, Kyoto University, and is currently with NTHU in Taiwan. Most of this work was done when he was with the Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic, from February to June 2020.

Mike Paterson

MIKE PATERSON (Ph.D., FRS) took degrees in mathematics at Cambridge and rose to fame as the co-inventor of Sprouts with John Conway. He evolved from president of the Trinity Mathematical Society to president of the European Association for Theoretical Computer Science, and migrated from MIT to the University of Warwick, where he has been in the Computer Science department for 50 years.

References

Bousch, T. (2014). La quatrième tour de Hanoï. Bull. Belg. Math. Soc. Simon Stevin. 21(5): 895–912.
Google Scholar
Bousch, T., Hinz, A. M., Klavzar, S., Parisse, D., Petr, C., Stockmeyer, P. K. (2019). A note on the Frame–Stewart conjecture. Discrete Math. Algorithms Appl. 11(4): 1950049, 4 pp.
Web of Science ®Google Scholar
Chen, X., Shen, J. (2004). Explorations in 4-peg tower of Hanoi. Tech. Rep. TR–04–10. Ottawa, Canada: Carleton University.
Google Scholar
Chen, X., Shen, J. (2012). On the Frame–Stewart conjecture about the towers of Hanoi. SIAM J. Comput. 33(3): 584–589.
Google Scholar
Korf, R. E., Felner, A. (2007). Recent progress in heuristic search: A case study of the four-peg towers of Hanoi problem. In: Sangal, R., Mehta, H., Bagga, R. K., eds. Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI’07). San Francisco, CA: Morgan Kaufmann Publishers Inc., pp. 2324–2329.
Google Scholar
Lucas, É. (1893). Récréations Mathématiques, 2nd ed. Paris, France: Gauthier-Villars. Reprinted several times by Albert Blanchard, 2, 1893.
Google Scholar
Lunnon, W. F. (1986). Correspondence: The Reve’s puzzle. Computer J. 29(5): 478. DOI: https://doi.org/10.1093/comjnl/29.5.478.
Web of Science ®Google Scholar
Stewart, B. M. (1939). Problem 3918 (k-peg tower of Hanoi). Amer. Math. Monthly. 46(363): 216–217.
Google Scholar
Stewart, B. M., Frame, J. S. (1941). Solution to advanced problem 3819. Amer. Math. Monthly. 48(3): 216–219. DOI: https://doi.org/10.2307/2304268.
Google Scholar

Bounded Hanoi

Abstract

1 Introduction

2 Preliminaries

3 Single work peg

4 Unit work pegs

5 Two work pegs

6 More work pegs

Improved multipeg algorithm

Jump algorithm

7 Conclusion

Table 1 Lower bounds for $H (j, j, j)$ from Theorems 12, 14, and 17.

Acknowledgment

Notes on contributors

Kazuo Iwama

Mike Paterson

References

Information for

Open access

Opportunities

Help and information

Bounded Hanoi

Abstract

1 Introduction

2 Preliminaries

3 Single work peg

4 Unit work pegs

5 Two work pegs

6 More work pegs

Improved multipeg algorithm

Jump algorithm

7 Conclusion

Table 1 Lower bounds for H(j,j,j) from Theorems 12, 14, and 17.

Acknowledgment

Additional information

Notes on contributors

Kazuo Iwama

Mike Paterson

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 1 Lower bounds for $H (j, j, j)$ from Theorems 12, 14, and 17.