Full article: A diluted version of the problem of the existence of the Hofstadter sequence

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

We investigate the conditions on an integer sequence $f (n)$ , $n \in N$ , with $f (1) = 0$ , such that the sequence $q (n)$ , computed recursively via $q (n) = q (n - q (n - 1)) + f (n)$ , with $q (1) = 1$ , exists. We prove that $f (n)$ is ‘slow’, that is, $f (n + 1) - f (n) \in {0, 1}$ , $n \geq 1$ , is a sufficient but not necessary condition for the existence of sequence q. Sequences q defined in this way typically display non-trivial dynamics: in particular, they are generally aperiodic with no obvious patterns. We discuss and illustrate this behavior with some examples.

Keywords:

2020 Mathematics Subject Classifications:

1. Motivation

In his 1979 book [Citation3], author D.R. Hofstadter mentions an integer sequence, $q_{h} (n)$ , defined, for n>2, by (1) $q_{h} (n) = q_{h} (n - q_{h} (n - 1)) + q_{h} (n - q_{h} (n - 2)) with q_{h} (1) = q_{h} (2) = 1.$ (1) In order for $q_{h} (n)$ to exist for all positive integers n, that is, for $n \in N$ , it must be true, again for all $n \in N$ , that $1 \leq q_{h} (n) \leq n$ , since only then does the right-hand side of (Equation1(1) $q_{h} (n) = q_{h} (n - q_{h} (n - 1)) + q_{h} (n - q_{h} (n - 2)) with q_{h} (1) = q_{h} (2) = 1.$ (1) ) refer to terms that are already known: $q_{h} (n)$ is undefined for $n \leq 0$ . Should it happen that $q_{h} (n) > n$ for some n, the sequence is said to die at n. Although direct computation shows that $q_{h} (n)$ exists for $1 \leq n \leq 3 \times 10^{10}$ [Citation5], the existence of $q_{h} (n)$ for all positive n is still an open question.

The sequence $q_{h} (n)$ is non-monotonic, aperiodic and its dynamical behavior appears to be complex. In [Citation9] for instance, small scale ‘chaotic’ behavior is described, with some order apparent at larger scales, and several statistical properties of the sequence are also investigated.

This complex behavior has inspired several authors to study variations of Hofstadter's original recursion. For instance, Tanny [Citation11] considers $T (n) = T (n - 1 - T (n - 1)) + T (n - 2 - T (n - 2)) with T (0) = T (1) = T (2) = 1,$ for n>2 [Citation6]. Another variant is considered in [Citation1], in which the authors investigate $V (n) = V (n - V (n - 1)) + V (n - V (n - 4)) with V (1) = V (2) = V (3) = V (4) = 1$ for n>4 [Citation7]. Both these variants give rise to sequences that are monotonic, unlike $q_{h} (n)$ , and hit every positive integer.

A different approach is followed in [Citation2], in which the recursion formula in (Equation1(1) $q_{h} (n) = q_{h} (n - q_{h} (n - 1)) + q_{h} (n - q_{h} (n - 2)) with q_{h} (1) = q_{h} (2) = 1.$ (1) ) is retained but different initial conditions are used. For certain initial conditions, ‘eventually quasi-polynomial’ solutions can be found. For a given integer m>0 and a given fixed set of polynomials $p_{i} (n)$ , $i = 0, \dots, m - 1$ , a quasi-polynomial function of n, $h (n)$ , say, is defined by $h (n) = p_{i} (n)$ , where $i = n mod m$ . If this holds only for n greater than some positive integer $n_{0}$ say, then $h (n)$ is said to be eventually quasi-polynomial. In [Citation2], families of eventually quasi-polynomial solutions to (2) $r (n) = r (n - r (n - 1)) + r (n - r (n - 2)),$ (2) with suitable initial conditions, are constructed. As an example of such a solution with m = 5 and where the $p_{i} (n)$ are of degree at most 1, we give $\begin{array}{llllll} i & 0 & 1 & 2 & 3 & 4 \\ p_{i} (n) & 2 & n - 4 & 5 & n - 5 & n - 6, \end{array}$

along with the initial conditions $r (3), \dots, r (12) = 1, 1, 3, 5, 1, 4, 7, 6, 4, 9$ . (Values of $r (1)$ and $r (2)$ are not needed to compute $r (n), n > 2$ .) For all $n > n_{0} = 12$ here, $r (n)$ satisfying (Equation2(2) $r (n) = r (n - r (n - 1)) + r (n - r (n - 2)),$ (2) ) is then given by the quasi-polynomial above.

In this paper, we also consider a variation on the original problem, one which consists of an infinite number of variants rather than just a single case. In particular, we discuss a ‘diluted’ version of the problem of the existence of $q_{h} (n)$ . In order to describe this version, we first need the notation $(a (n))_{n \in N} := a (1), a (2), \dots$ for an infinite sequence. We use this notation in formal contexts, such as in definitions and lemmas, but where it it is clear what is intended, we just write a to mean the whole sequence. Naturally, when the nth term is meant, we write $a (n)$ . We also need the following definitions:

Definition 1.1

Slow sequence

If integer sequence $(a (n))_{n \in N}$ obeys $a (n + 1) - a (n) \in {0, 1}$ for $n \in N$ , then the sequence is said to be ‘slow’.

Definition 1.2

Property $S_{0}$

If integer sequence $(a (n))_{n \in N}$ is slow and additionally $a (1) = 0$ , then it is said to have property $S_{0}$ .

We drop the quotation marks around the word ‘slow’ from now on.

The problem that we consider is the question of the existence of sequences q defined recursively by (3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) for $n \in N$ , n>1, and where $f (n)$ has property $S_{0}$ (so in particular $f (1) = 0$ ). All sequences in the paper start from index 1 and we apply the conditions $q (1) = 1$ and $f (1) = 0$ strictly throughout, even though $f (1)$ is never needed to compute the sequence q. Computations suggest that, under these conditions, the sequence q corresponding to any f with property $S_{0}$ exists for all $n \in N$ .

We think of sequences generated by (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) as ‘diluted’ Hofstadter sequences because of the similarity of their definition to that of the original $q_{h}$ -sequence. In particular, in (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ), one ‘difficult’ – nested – term is retained, $q (n - q (n - 1))$ , but the second nested term is replaced by $f (n)$ , which is under our control.

Clearly the existence of the sequence q is also equivalent to the inequality $1 \leq q (n) \leq n$ holding for all $n \in N$ . If this is the case, then we simply say that q exists. The lower bound on $q (n)$ here is easy: by (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ), for any n for which q exists, $q (n)$ is the sum of a positive integer and a non-negative one, and so is also greater than or equal to unity.

The upper bound on q, however, is less obvious. In what follows, after some preliminaries, we give a proof of the following theorem.

Theorem 1.3

For all sequences $(f (n))_{n \in N}$ having property $S_{0}$ , the corresponding sequence $(q (n))_{n \in N}$ with $q (1) = 1$ and with $q (n)$ , for n>1, computed from (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ), exists for all $n \in N$ .

The proof of this theorem is the main result of the paper, but the recursion (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) can also be viewed as a non-linear (in the light of the nested term), non-autonomous (because of $f (n)$ ) dynamical system, discrete in both time and space. Hence, having proved the existence of q for all f with property $S_{0}$ , we make some observations about the dynamics of sequences q, which are also interesting.

2. Preliminaries

The first term of all sequences in this paper has index 1. We start by giving the following definitions:

Two sequences $(a (n))_{n \in N}$ and $(b (n))_{n \in N}$ differ if there exists $n^{'} \in N$ such that $a (n^{'}) \neq b (n^{'})$ ; otherwise, they are equal.
A sequence $(q (n))_{n \in N}$ obeying (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) exists if and only if $q (n)$ is defined for all $n \in N$ ; or, equivalently, if $1 \leq q (n) \leq n$ for all $n \in N$ .
A sequence $Q (f) := (q (n))_{n \in N}$ is the sequence corresponding to sequence $(f (n))_{n \in N}$ if $(q (n))_{n \in N}$ exists and is defined by (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ).
A sequence $F (q) := (f (n))_{n \in N}$ is the sequence corresponding to sequence $(q (n))_{n \in N}$ if $(q (n))_{n \in N}$ exists and $f (n)$ is defined by $f (1) = 0$ and $f (n) = q (n) - q (n - q (n - 1))$ for n>1.

Where an explicit expression for

f (n)

is available, for example

f (n) = ⌊ n / 2 ⌋

, we may write

Q (⌊ n / 2 ⌋)

Where we need to refer explicitly to the first few terms of a sequence, $a (n)$ say, we list the values starting from $a (1)$ – for example $a = (1, 2, 4, 4)$ means $a (1) = 1$ , $a (2) = 2$ , $a (3) = a (4) = 4$ .

Let us define $S$ as the set of all sequences having property $S_{0}$ : that is $S := {(f (n))_{n \in N} : f (1) = 0, f (i + 1) - f (i) \in {0, 1} for i \in N} .$ We will also need the set of finite sequences with property $S_{0}$ : $S_{m} := {{(f (n))}_{n = 1, \dots, m} : f (1) = 0, f (i + 1) - f (i) \in {0, 1} for i = 1, \dots, m - 1} .$ It is easily established that the cardinality of $S$ is the same as that of the real numbers. To do so, let $f \in S$ and consider the sequence of differences $(f (i + 1) - f (i))_{i \in N}$ , which will be a sequence consisting of the symbols 0 and 1. Interpreting this as a binary decimal, we see that each distinct $f \in S$ gives rise to a distinct real number $x \in [0, 1]$ .

One important subset of $S$ , with same cardinality, is $G := {(f (n))_{n \in N} = ⌊ αn ⌋ with α \in [0, 1)} .$

It is easy to see (i) that $G \subset S$ , and that in fact (ii) $G ⊊ S$ . Property (i) comes directly from the definition $⌊ x ⌋ := max {k \in Z : k \leq x}$ , from which we deduce that, for $α \in [0, 1]$ , $⌊ α (n + 1) ⌋ - ⌊ αn ⌋$ is either 0 or 1. For property (ii), consider the sequence $a (1), \dots, a (4) = 0, 0, 1, 2$ , which are the first four terms of a sequence in $S$ . Now assume that these terms are generated by $a (n) = ⌊ αn ⌋$ for some α. Then $a (2) = 0$ implies that $α \in [0, 1 / 2)$ , but $a (4) = 2$ implies that $α \in [1 / 2, 3 / 4)$ , leading to a contradiction. Hence, there are sequences in $S$ that are not in $G$ .

Furthermore, the condition that successive terms in a sequence $f (n)$ differ by at most unity turns out to be sufficient but not necessary for the corresponding $(q (n))_{n \in N}$ to exist. There are examples where f increases by more than unity, and where it is not monotonic, but for which $Q (f)$ nonetheless exists. Consider, for instance, (a) $f_{a} = (0, 0, 2, 2, 4, 4, \dots)$ , for which $q_{a} = Q (f_{a}) = (1, 1, 3, 3, 5, 5, \dots)$ and (b) $f_{b} = (0, 1, 0, 1, 0, 1, \dots)$ , for which $q_{b} = Q (f_{b}) = (1, 2, 1, 2, 1, 2, \dots)$ . Statements (a) and (b) are proved in

Lemma 2.1

If $f_{a} (n) = 2 ⌊ \frac{n - 1}{2} ⌋ = n - \frac{3 + (- 1)^{n}}{2}, then q_{a} (n) = n - \frac{1 + (- 1)^{n}}{2}$ and if $f_{b} (n) = \frac{1 + (- 1)^{n}}{2}, then q_{b} (n) = \frac{3 + (- 1)^{n}}{2}$ for all $n \in N .$

Proof.

These just boil down to computation.

For $q_{a}$ , we have $n - q_{a} (n - 1) = {\begin{cases} 1 & n is even \\ 2 & n is odd, \end{cases}$ so $q_{a} (n - q_{a} (n - 1)) = 1$ for all n. Hence, $q_{a} (n) - q_{a} (n - q_{a} (n - 1)) = q_{a} (n) - 1 = n - \frac{3 + (- 1)^{n}}{2} = f_{a} (n)$ , as claimed.

For $f_{b}$ , we have $n - q_{b} (n - 1) = {\begin{cases} n - 1 & n is even \\ n - 2 & n is odd . \end{cases}$ Hence, $n - q_{b} (n - 1)$ is odd, and so $q_{b} (n - q_{b} (n - 1)) = 1$ , both for all n. Therefore, $q_{b} (n) - q_{b} (n - q_{b} (n - 1)) = q_{b} (n) - 1 = \frac{1 + (- 1)^{n}}{2} = f_{b} (n)$ .

Remark 2.2

Example (b) is in fact a special case of $If m \in N and f (n) = (n - 1) mod m, then q (n) = f (n) + 1 = ((n - 1) mod m) + 1.$

On the other hand, it is easy to find examples of $f \notin S$ for which $Q (f)$ does not exist – for instance, $f = (0, 2, 2)$ , for which $q (1) = 1, q (2) = 3$ and so $q (3) = q (0) + f (3)$ which is undefined.

We also prove

Lemma 2.3

If f and $f^{'}$ are different sequences, and $Q (f)$ and $Q (f^{'})$ both exist, then $Q (f) \neq Q (f^{'})$ .

Proof.

Let $f (n) = f^{'} (n)$ for $1 \leq n \leq k$ but $f (k + 1) \neq f^{'} (k + 1)$ . Write $q = Q (f)$ , $q^{'} = Q (f^{'})$ . Now, $q (n) = q^{'} (n)$ for $1 \leq n \leq k$ and $q (k + 1) = q (k + 1 - q (k)) + f (k + 1)$ , $q^{'} (k + 1) = q^{'} (k + 1 - q^{'} (k)) + f^{'} (k + 1)$ . Also, $k + 1 - q (k) = k + 1 - q^{'} (k)$ , but $f (k + 1) \neq f^{'} (k + 1)$ . Hence $q (k + 1) \neq q^{'} (k + 1)$ and so the sequences q, $q^{'}$ differ.

We remark here that the recursion (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) can be used in either direction. That is, given any sequence f for which $q = Q (f)$ exists, sequence q can obviously be computed, term-by-term, in order of increasing index, from (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ); and, given any sequence q obeying $1 \leq q (n) \leq n$ for all n, (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) can be used to compute $f (n)$ for all n. A case where computing f given q gives an interesting result is

Lemma 2.4

Let sequence $q (n)$ obey (i) $1 \leq q (n) \leq n$ and (ii) $q (n + 1) - q (n) \in {0, 1}$ , both for $n \in N$ , and let $q (1) = 1$ . Define $f (n) = q (n) - q (n - q (n - 1))$ for $n \geq 2$ with $f (1) = 0$ . Then, for $n \geq 1$ , $f (n + 1) - f (n) \in {- 1, 0, 1}$ .

Proof.

Throughout the proof, we assume integer $n \geq 2$ . Assumption (i) tells us that q exists and (ii), that q is slow. By the definition of $f (n)$ , $f (n + 1) - f (n) = \underset{:= A (n)}{\underset{⏟}{q (n + 1) - q (n)}} - \underset{:= B (n)}{\underset{⏟}{[q (n + 1 - q (n)) - q (n - q (n - 1))]}} .$ Assumption (ii) implies immediately that $A (n) \in {0, 1}$ .

Now define $ℓ (n) := n - q (n - 1)$ : by (i), $1 \leq ℓ (n) \leq n - 1$ . Then $ℓ (n + 1) - ℓ (n) = 1 - (q (n) - q (n - 1)) \in {0, 1}$ , again by (ii). Hence, $B (n)$ is either $q (ℓ (n)) - q (ℓ (n)) = 0$ , or $q (ℓ (n) + 1) - q (ℓ (n)) \in {0, 1}$ . Therefore, $B (n) \in {0, 1}$ . Thus we have $A (n) - B (n) \in {- 1, 0, 1}$ and the lemma is proven.

Note that the converse of Lemma 2.4 is not true: if $f (n + 1) - f (n) \in {- 1, 0, 1}$ then q is not necessarily slow: a counterexample is given by $f_{b}$ in Lemma 2.1.

Finally, we point out that cases in which $f \in S$ gives rise to a monotonic sequence q appear to be rare – a few examples, each of which is also slow, are given in Section 4.

3. The main theorem

We now give a proof of Theorem 1.3. We need several lemmas, the first of which is

Lemma 3.1

The shift property

Let $(f (n))_{n \in N}$ , with $f (1) = 0$ , be any sequence of integers for which the corresponding sequence $(q (n))_{n \in N} = Q (f)$ , with $q (1) = 1$ , exists. Define $(f^{'} (n))_{n \in N} := {\begin{cases} 0 & n = 1 \\ f (n - 1) & n > 1 \end{cases} and (q^{'} (n))_{n \in N} := {\begin{cases} 1 & n = 1 \\ q (n - 1) & n > 1 \end{cases} .$ Then we have that $(A) q = Q (f)$ implies $q^{'} = Q (f^{'})$ and $(B) f = F (q)$ implies $f^{'} = F (q^{'})$ .

Proof.

A ⇒ B. We use strong induction and the fact that both q and $q^{'}$ obey (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ).

First, $q (1) = q^{'} (1) = 1$ by definition, and $q^{'} (2) = q^{'} (2 - q^{'} (1)) + f^{'} (2) = 1 + f (1) = 1$ . Hence, $q^{'} (2) = q (1)$ .

Next, assume that (4) $q^{'} (n) = q (n - 1) for n = 3, \dots, k with k > 3.$ (4) Now, by assumption (Equation4(4) $q^{'} (n) = q (n - 1) for n = 3, \dots, k with k > 3.$ (4) ), we have that $q^{'} (k) = q (k - 1)$ . For the inductive step $k \mapsto k + 1$ , we have that $\begin{aligned} q^{'} (k + 1) - q (k) & = q^{'} (k + 1 - q^{'} (k)) - q (k - q (k - 1)) \\ = q^{'} (k + 1 - q (k - 1)) - q (k - q (k - 1)) = q^{'} (m + 1) - q (m), \end{aligned}$ where $m = k - q (k - 1)$ . We have $2 \leq m + 1 \leq k$ , since q exists, and so, by (Equation4(4) $q^{'} (n) = q (n - 1) for n = 3, \dots, k with k > 3.$ (4) ), $q^{'} (m + 1) = q (m)$ and hence $q^{'} (k + 1) = q (k)$ .

B ⇒ A. This is straightforward. By definition, we have $f^{'} (1) = 0$ . Then, by B, we have $q^{'} (1) = 1$ and $q^{'} (2) = q (1) = 1$ . Hence, by (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ), $f^{'} (2) = q^{'} (2) - q^{'} (2 - q^{'} (1)) = 0$ . Then, letting $n \geq 3$ , we have $f^{'} (n) = q^{'} (n) - q^{'} (n - q^{'} (n - 1)) = q (n - 1) - q (n - 1 - q (n - 2)) = f (n - 1),$ as claimed. Here we have used B and (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) for the second-last and last equalities respectively.

We will also need the following two special cases of $f \in S$ , each of which, unusually, leads to a slow sequence q:

Lemma 3.2

Let $f_{0} (n) = 0$ and $f_{1} (n) = n - 1$ , both for all $n \in N$ , and define sequences $q_{0} = Q (f_{0})$ , $q_{1} = Q (f_{1})$ . Then $q_{0} (n) = 1$ and $q_{1} (n) = n$ , also for all n.

Proof.

For $q_{0} (n)$ , (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) gives $q_{0} (n) = q_{0} (n - q_{0} (n - 1))$ since $f_{0} (n) = 0$ . By induction, with base case $q_{0} (1) = 1$ , we make the assumption that for some k>1 we have $q_{0} (k) = 1$ . Then, by the recursion formula, we have $q_{0} (k + 1) = q_{0} (k + 1 - q_{0} (k)) = q_{0} (k) = 1$ , as claimed.

For $q_{1} (n)$ , (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) gives $q_{1} (n) = q_{1} (n - q_{1} (n - 1)) + n - 1$ since $f_{1} (n) = n - 1$ . With $q_{1} (1) = 1$ , we assume that $q_{1} (k) = k$ for some k>1. Then $q_{1} (k + 1) = q_{1} (k + 1 - q_{1} (k)) + k = q_{1} (1) + k = k + 1$ and the proof is complete.

In the remainder of this section, f will always have property $S_{0}$ and we will assume that f is such that $Q (f)$ exists. We introduce the handy notation ${j : k}$ , with $k \geq j$ , to designate the set of successive integers ${j, j + 1, \dots, k}$ .

We now consider the question: for a particular n, given that $f (n) = i \in {0 : n - 1}$ , what are the possible values of $q (n)$ ? That is, we investigate the sets ${q (n) |_{f (n) = i}}$ , where f ranges over $S_{n}$ . The lower bound on each of these sets is given by

Lemma 3.3

If $Q (f)$ for $f \in S$ exists, then (i) for $n \in N$ and $i = 0, \dots, n - 1$ , $min ({q (n) |_{f (n) = i}}) = i + 1$ ; and (ii) a sequence f that gives rise to this minimum value is ${(f (k))}_{k = 1, \dots n} = (\underset{n - i zeroes}{\underset{⏟}{0, \dots, 0,}} 1, 2, \dots, i) .$

Proof.

For (i), if n = 1 then, by the initial conditions, $f (1) = 0$ and $q (1) = 1 = f (1) + 1$ ; and so the claimed lower bound holds. Therefore, fix n>1 and recall that $q (n) = q (n - q (n - 1)) + f (n) = q (n - q (n - 1)) + i$ . Then, provided that q exists, $1 \leq q (k) \leq k$ for $1 \leq k \leq n$ . In particular $1 \leq q (n - 1) \leq n - 1$ and so $1 \leq n - q (n - 1) \leq n - 1$ . Recall now that we are considering $f (n) = i$ . Hence, $q (n) = q (j) + i$ , where $1 \leq j \leq n - 1$ , so $1 \leq q (j) \leq n - 1$ . Therefore, $q (n) \geq i + 1$ , and this gives the required minimum.

For (ii), we use Lemmas 3.1 and 3.2. In order that $f (n) = i$ , we must have $i \leq n - 1$ , otherwise f cannot have property $S_{0}$ . If $f (n) = n - 1$ , then, by Lemma 3.2, $q (n) = n$ can be achieved by choosing $f = (0, 1, \dots, n - 1)$ . Otherwise, $f (n) = i < n - 1$ and in this case, the shift property of Lemma 3.1 applied $(n - i - 1)$ times gives $q (n) = i + 1$ .

We have easily found the minimum of the sets ${q (n) |_{f (n) = i}}$ , but the corresponding maximum is not so straightforward. There is, however, a way round the problem. We require the definition of two collections of finite sets of integers, $T$ and $U$ , both of which have the same triangular structure. Taking $T$ first, this is the collection of sets $T_{i, n}$ , where $n \in N$ and $i = 0, \dots, n - 1$ , which can be pictured as a triangular arrangement with n indexing rows and i indexing the position within row n. The definition of $T$ is

Definition 3.4

$T$ is the collection of sets $T_{i, n} := {q (n) : q = Q (f), f \in S such that f (n) = i},$ with $n \in N$ and $0 \leq i \leq n - 1$ .

By Lemma 3.2 we have (5) $T_{0, n} = {1} (a) and T_{n - 1, n} = {n} (b),$ (5) for $n \in N$ . For (a), if $f \in S$ and $f (n) = 0$ , then it must be that $f (i) = 0$ for $1 \leq i < n$ and so the first part of Lemma 3.2 applies. For (b), if $f \in S$ , the only way that $f (n) = n - 1$ is if $f (i) = i - 1$ for $1 \leq i < n$ , so the second part of Lemma 3.2 applies.

We have not succeeded in computing $T$ explicitly for all valid i and n. In principle, $T$ can be computed by finding all the values that $q (n)$ can take given that $f \in S$ and $f (n) = i$ . This would in turn require knowledge of all the values that $q (n - q (n - 1))$ can take (and then adding $i = f (n)$ to them). In practice, though, it seems that we have insufficient knowledge of the sequences $q = Q (f)$ as f ranges over $S$ .

However, we can explicitly construct a collection of sets, $U$ , and the sets in $U$ will turn out to contain the corresponding sets in $T$ . This is sufficient to prove Theorem 1.3. The collection $U$ consists of the sets (6) $U_{i, n} := {\begin{cases} {1} & i = 0 \\ {i + 1 : n} & i = 1, \dots, n - 1. \end{cases}$ (6) for $n \in N$ .

We now prove

Lemma 3.5

Let collections of sets $T$ and $U$ be as in Definition 3.4 and Equation (Equation6(6) $U_{i, n} := {\begin{cases} {1} & i = 0 \\ {i + 1 : n} & i = 1, \dots, n - 1. \end{cases}$ (6) ) respectively. Then $T_{i, n} \subset U_{i, n}$ for $n \in N$ and $i = 0, \dots, n - 1$ .

Proof.

First, note that for i = 0, (Equation5(5) $T_{0, n} = {1} (a) and T_{n - 1, n} = {n} (b),$ (5) )(a) applies and gives $T_{0, k} = {1} = U_{0, k}$ . Furthermore, for i= k, (Equation5(5) $T_{0, n} = {1} (a) and T_{n - 1, n} = {n} (b),$ (5) )(b) applies, giving $T_{k - 1, k} = {k} = U_{k - 1, k}$ . Both of these are true for all positive k.

The rest of the proof proceeds by strong induction, starting from $T_{0, 1} = {1} = U_{0, 1}$ . We assume that $T_{i, n} \subset U_{i, n}$ for all allowed values of i and $n = 1, \dots, k$ , which assumption we refer to as (H). We then study what happens when $k \mapsto k + 1$ . The lemma having been proved for i = 0 and i = k, it only remains to study the case $1 \leq i \leq k - 1$ .

Let us therefore consider $T_{i, k + 1}$ with $1 \leq i \leq k - 1$ . Equation (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) now reads $q (k + 1) = q (k + 1 - q (k)) + i$ since, by definition, $f (k + 1) = i$ . We first need the set of possible values of $q (k) |_{f (k + 1) = i}$ , which, by (H), is $T_{i - 1, k} \cup T_{i, k} \subset U_{i - 1, k} \cup U_{i, k} = {i : k}$ . Only sets with indices $(i - 1, k)$ and $(i, k)$ are allowed because f has property $S_{0}$ : $f (k + 1) = i$ can only be true if $f (k) = i - 1$ or $f (k) = i$ . Thus, $k + 1 - q (k) \in {1 : k - i + 1}$ . Note that $2 \leq k - i + 1 \leq k$ , since $1 \leq i \leq k - 1$ , so $q (k + 1 - q (k))$ is defined only in terms of q with argument strictly less than k + 1. Hence, informally put, $q (k + 1 - q (k)) \in$ {union of rows 1 to k−i + 1 of $T}$ ; that is, using assumption (H) and (Equation6(6) $U_{i, n} := {\begin{cases} {1} & i = 0 \\ {i + 1 : n} & i = 1, \dots, n - 1. \end{cases}$ (6) ), (7) ${q (k + 1 - q (k)) |}_{f (k + 1) = i} \in ⋃_{n = 1}^{k - i + 1} ⋃_{j = 0}^{n - 1} T_{j, n} \subset ⋃_{n = 1}^{k - i + 1} ⋃_{j = 0}^{n - 1} U_{j, n} = {1 : k - i + 1} .$ (7) Therefore, for $1 \leq i \leq k - 1$ , we have $q (k + 1) \in {i + 1 : k + 1}$ and so $T_{i, k + 1} \subset {i + 1 : k + 1}$ . Hence, from (Equation6(6) $U_{i, n} := {\begin{cases} {1} & i = 0 \\ {i + 1 : n} & i = 1, \dots, n - 1. \end{cases}$ (6) ), we have $T_{i, k + 1} \subset U_{i, k + 1}$ and this completes the proof.

Finally, we are in a position to prove Theorem 1.3.

Proof of Theorem 1.3

An immediate consequence of Lemma 3.5 is that for $n \in N$ , $q (n) \in ⋃_{j = 0}^{n - 1} U_{j, n} = {1 : n}$ . Hence, $1 \leq q (n) \leq n$ for all $n \in N$ and Theorem 1.3 is proven.

We return to $T$ . In words, $T_{i, n}$ is the set of all values that $q (n)$ attains in practice, given that $f (n)$ , as f ranges over $S$ , is equal to i. Clearly, since f has property $S_{0}$ , $0 \leq i = f (n) \leq n - 1$ . It is important to note that $T$ is not equal to $U$ because approximations were made in the proof of Lemma 3.5 in order to compute the values that $q (n - q (n - 1))$ can assume in principle. In particular, the unions in (Equation7(7) ${q (k + 1 - q (k)) |}_{f (k + 1) = i} \in ⋃_{n = 1}^{k - i + 1} ⋃_{j = 0}^{n - 1} T_{j, n} \subset ⋃_{n = 1}^{k - i + 1} ⋃_{j = 0}^{n - 1} U_{j, n} = {1 : k - i + 1} .$ (7) ) are typically over more – and larger – sets $U_{i, n}$ than necessary. These approximations almost always overestimate, and never underestimate $U_{i, n}$ , giving a collection of sets $U$ such that those in $T$ are contained within the corresponding sets in $U$ : that is $T_{i, n} \subset U_{i, n}$ .

As stated earlier, $T$ can be visualized as a triangular array of sets – see Figure . This was computed by brute force. For instance, the last row corresponds to n = 8 and was computed by generating all $2^{7}$ sequences $f \in S_{8}$ , finding the sequences q corresponding to each and noting the different values of $q (8) |_{f (8) = i}$ , for $0 \leq i \leq 7$ . As an example, the fourth row of Figure implies that, for instance,

If $f (4) = 0$ , $q (4)$ can only be 1, that is, $T_{0, 4} = {1}$ . This is obvious since the sequence f, $f \in S_{4}$ , can only be $(0, 0, 0, 0)$ . See Lemma 3.2.
Now suppose that $f (4) = 1$ . In this case, the allowed sequences f are $(0, 0, 0, 1)$ , $(0, 0, 1, 1)$ and $(0, 1, 1, 1)$ . Using (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) to compute $q (4)$ in each case, we find that the first two give $q (4) = 2$ and the third one, $q (4) = 3$ . Thus, $T_{1, 4} = {2, 3}$ (cf. $U_{1, 4} = {2, 3, 4}$ ).

Figure 1. The first eight rows of the triangular array of sets $T$ . Here, i and n index the sets horizontally and vertically respectively, with $i = 0, \dots, n - 1$ . The set of consecutive integers $k, \dots, l$ is written ${k : l}$ and as ${k}$ when l = k.

Figure 1. The first eight rows of the triangular array of sets T. Here, i and n index the sets horizontally and vertically respectively, with i=0,…,n−1. The set of consecutive integers k,…,l is written {k:l} and as {k} when l = k.

4. Examples of sequences q that are slow

In Lemma 3.2, we gave the examples $f (n) = 0$ and $f (n) = n - 1$ , both of which lead to sequences q that are slow. We are aware of three other examples, which we now present.

The first example is given in

Lemma 4.1

$If f (n) = ⌊ \frac{n + 2}{4} ⌋ then q (n) = ⌊ \frac{n + 2}{2} ⌋ .$

Proof.

Let $\tilde{q} (x) = \frac{x}{2} + \frac{3 + \cos πx}{4}$ for $x \in R$ . Then it can easily be verified that $\tilde{q} (n) = ⌊ \frac{n + 2}{2} ⌋$ for $n \in N$ . Furthermore, direct calculation gives $\tilde{q} (x) - \tilde{q} (x - \tilde{q} (x - 1)) = \frac{1}{8} (2 x + 1 + \cos πx) - \frac{1}{4} \sin (\frac{π (2 x + 1 + \cos πx)}{4}) := \tilde{f} (x) .$ Finally, it is straightforward to check that, for $n \in N$ , $\tilde{f} (n) = ⌊ \frac{n + 2}{4} ⌋$ and this proves the lemma.

Remark 4.2

We have proved more here, viz. that $\tilde{q} (x)$ , $\tilde{f} (x)$ as given above obey $\tilde{q} (x) = \tilde{q} (x - \tilde{q} (x - 1)) + \tilde{f} (x)$ for all $x \in R$ : a continuous solution to (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ). The same is true of $f_{0} (x) = 0$ , which implies that $q_{0} (x) = 1$ , and $f_{1} (x) = x - 1$ , which implies that $q_{1} (x) = x$ , both for $x \in R$ – an extension of Lemma 3.2.

For the second example, define (8) $δ (n) := {\begin{cases} 1, & n = 0 \\ 0, & otherwise, \end{cases}$ (8) for $n \in Z$ . Another instance of a slow sequence q is found when $f (n) = 1 - δ (n - 1)$ : in fact, $Q (1 - δ (n - 1)) = 1, 2, 2, 3, 3, 3, 4, \dots$ , where each positive integer k occurs k times. This sequence can also be written $For k \in N, q (n) = k for h (k) \leq n \leq h (k + 1) - 1, with h (k) := \frac{k^{2} - k + 2}{2} .$ We summarize these observations in the following lemma:

Lemma 4.3

Let $f (n) = 1 - δ (n - 1) = (0, 1, 1, \dots)$ . Then $q (n) = (1, 2, 2, 3, 3, 3, 4, \dots)$ , where each positive integer appears in turn, with integer k occurring k times in succession. That is, $q (n) = k for h (k) \leq n \leq h (k + 1) - 1,$ where $k \in N$ and $h (k) = \frac{k^{2} - k + 2}{2}$ .

This sequence [Citation8] is a special case of a generalized Golomb triangular recursion [Citation4]. A proof of Lemma 4.3 by S.W. Golomb can be found in the ‘Links’ section of [Citation8].

Remark 4.4

This result can also be written as $f (n) = 1 - δ (n - 1) \Rightarrow q (n) = ⌊ \frac{1}{2} + \sqrt{2 n - \frac{7}{4}} ⌋ =: w (n),$ as can be easily shown by considering $\sqrt{2 h (k) - 7 / 4} = k - 1 / 2$ for $k \in N$ . However, we lack a direct proof of Lemma 4.3 starting from this expression.

Remark 4.5

This result relates immediately to the fact that $T_{1, n} ⊊ U_{1, n}$ : we have explicitly constructed $T_{1, n} = {2 : w (n)}$ above, from which it is clear that $T_{1, n} ⊊ U_{1, n}$ for n>2. See the second column from the left in Figure .

The third example of a slow sequence q is especially interesting and involves γ, the reciprocal of the golden ratio:

Theorem 4.6

Let $γ := \frac{\sqrt{5} - 1}{2}$ , so that $γ^{2} + γ - 1 = 0$ . Then $q (n) = 1 + ⌊ γ (n - 1) ⌋$ obeys (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) with $f (n) = ⌊ γ^{2} n ⌋$ .

Since $γ \approx 0.618 < 1$ , clearly $q (n)$ as defined here is slow, and, furthermore, $q (1) = 1$ .

In the course of the proof of the theorem, we need a few small results:

Footnote¹ $γ^{2} + γ = 1$ , which implies that $2 γ^{2} + 3 γ = 2 + γ$ , $γ^{- 1} = 1 + γ$ , $γ^{2} (γ + 2) = 1$ .
$x = ⌊ x ⌋ + {x}$ , $x \in R$ , where ${x}$ is the fractional part of x
$⌊ - x ⌋ = - ⌊ x ⌋ - 1$ , $x \notin Z$
$⌊ m + x ⌋ = m + ⌊ x ⌋$ for $m \in N$
${γ^{2} n} = 1 - {γn}$ .

Item (i) comes from the given definition of γ and (ii)–(iv) are from the definition of the floor function. For (v), start from (ii) with

x = γ^{2} n

, which gives

{γ^{2} n} = γ^{2} n - ⌊ γ^{2} n ⌋

. Now use (i) to replace

γ^{2}

with

1 - γ

, giving

{γ^{2} n} = n - γn - ⌊ n - γn ⌋ = - γn - ⌊ - γn ⌋

, where (iv) has been used. Then (iii) gives

{γ^{2} n} = - γn + ⌊ γn ⌋ + 1 = 1 - {γn}

, using (ii).

Proof of Theorem 4.6

We first show that the theorem is equivalent to identity (Equation9(9) $θ (n) := ⌊ γ + γ ⌊ γ^{2} (n - 1) ⌋ ⌋ + ⌊ γ^{2} n ⌋ + ⌊ γ^{2} (n + 1) ⌋ = n - 1 for n \in N .$ (9) ) below, then we prove the identity itself.

Starting from $q (n) = 1 + ⌊ γ (n - 1) ⌋$ , clearly $q (1) = 1$ . It is convenient to shift n by 1, giving $q (n + 1) = 1 + ⌊ γn ⌋$ , which we will show obeys (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) for $n \in N$ . When this is substituted in $q (n + 1) - q (n + 1 - q (n))$ we have $q (n + 1) - q (n + 1 - q (n)) = ⌊ γn ⌋ - ⌊ γ (n - 1) - γ ⌊ γ (n - 1) ⌋ ⌋ .$ Now, using (i), (iii) and (iv), we find that $q (n + 1) - q (n + 1 - q (n)) = n - 1 - ⌊ γ^{2} n ⌋ - ⌊ γ + γ ⌊ γ^{2} (n - 1) ⌋ ⌋ .$ Hence, Theorem 4.6 holds if the right-hand side of the above expression is equal to $f (n + 1) = ⌊ γ^{2} (n + 1) ⌋$ for $n \in N$ , that is, if (9) $θ (n) := ⌊ γ + γ ⌊ γ^{2} (n - 1) ⌋ ⌋ + ⌊ γ^{2} n ⌋ + ⌊ γ^{2} (n + 1) ⌋ = n - 1 for n \in N .$ (9) To prove (Equation9(9) $θ (n) := ⌊ γ + γ ⌊ γ^{2} (n - 1) ⌋ ⌋ + ⌊ γ^{2} n ⌋ + ⌊ γ^{2} (n + 1) ⌋ = n - 1 for n \in N .$ (9) ), we consider separately three cases which are demarcated according to $Case A: {γ^{2} n} \in (0, γ^{2}); Case B: {γ^{2} n} \in (γ^{2}, γ); Case C: {γ^{2} n} \in (γ, 1) .$ For n>1, these intervals are all open because γ and $γ^{2} = (3 - \sqrt{5}) / 2$ are irrational. Hence, we first dispose of the case n = 1, where we find $θ (1) = ⌊ γ + 0 ⌋ + ⌊ γ^{2} ⌋ + ⌊ 2 γ^{2} ⌋ = 0 = n - 1,$ since $γ^{2} < 1 / 2$ .

We assume that $n \geq 2$ throughout the rest of this proof. Proving cases A–C for $n \geq 2$ amounts primarily to rearranging $θ (n)$ into a first-degree polynomial in n, plus the floor of a bounded function of n.

Case A. Note that ${γ^{2} n} \in (0, γ^{2})$ implies that $⌊ γ^{2} (n - 1) ⌋ = ⌊ γ^{2} (n) ⌋ - 1$ and $⌊ γ^{2} (n + 1) ⌋ = ⌊ γ^{2} (n) ⌋$ . Thus, in case A we have $θ (n) = ⌊ γ ⌊ γ^{2} n ⌋ ⌋ + 2 ⌊ γ^{2} n ⌋ =: θ_{A} (n) .$ Consider the first term in $θ_{A} (n)$ : $⌊ γ ⌊ γ^{2} n ⌋ ⌋ = ⌊ γ ⌊ (1 - γ) n ⌋ ⌋ = ⌊ γn - γ (⌊ γn ⌋ + 1) ⌋ = ⌊ γn - γ (γn - {γn} + 1) ⌋,$ where we have used (i), (iv), (iii) and then (ii). Replacing the first $γn$ with $(1 - γ^{2}) n$ , we get $⌊ γ ⌊ γ^{2} n ⌋ ⌋ = n + ⌊ - 2 γ^{2} n + γ {γn} - γ ⌋ = n - 1 - ⌊ 2 γ^{2} n - γ {γn} + γ ⌋,$ using (iii). We can now use (ii) and (v) to obtain $⌊ γ ⌊ γ^{2} n ⌋ ⌋ =$ $n - 1 - ⌊ 2 (⌊ γ^{2} n ⌋ + {γ^{2} n}) + γ (1 - {γn}) ⌋ = n - 1 - ⌊ 2 (⌊ γ^{2} n ⌋ + {γ^{2} n}) + γ {γ^{2} n} ⌋ .$ Hence, using (iv), $θ_{A} (n) = n - 1 - ⌊ 2 (⌊ γ^{2} n ⌋ + {γ^{2} n}) + γ {γ^{2} n} ⌋ + 2 ⌊ γ^{2} n ⌋ = n - 1 - ⌊ (γ + 2) {γ^{2} n} ⌋,$ and $θ_{A} (n)$ is now in the required form.

Since $0 < {γ^{2} n} < γ^{2}$ , we have, in case A, $0 < (γ + 2) {γ^{2} n} < γ^{2} (γ + 2) = 1$ , using (i), and so $⌊ (γ + 2) {γ^{2} n} ⌋ = 0$ . Therefore, $θ_{A} (n) = n - 1$ .

Case B. For case B, we have $γ^{2} < {γ^{2} n} < γ = 1 - γ^{2}$ and this implies that $⌊ γ^{2} (n - 1) ⌋ = ⌊ γ^{2} n ⌋ = ⌊ γ^{2} (n + 1) ⌋$ . Hence, in case B $θ (n) = ⌊ γ + γ ⌊ γ^{2} n ⌋ ⌋ + 2 ⌊ γ^{2} n ⌋ =: θ_{B} (n) .$ Since $2 ⌊ γ^{2} n ⌋$ is an integer, we immediately have $θ_{B} (n) = ⌊ γ + (γ + 2) ⌊ γ^{2} n ⌋ ⌋$ . Now, $(γ + 2) ⌊ γ^{2} n ⌋ = (γ + 2) (γ^{2} n - {γ^{2} n}) = n - (γ + 2) {γ^{2} n},$ by (ii) and (i). Hence, (10) $θ_{B} (n) = n + ⌊ γ - (γ + 2) {γ^{2} n} ⌋ = n - 1 - ⌊ (γ + 2) {γ^{2} n} - γ ⌋,$ (10) using (iii). Now, by the definition of case B, $γ^{2} (γ + 2) < (γ + 2) {γ^{2} n} < γ (γ + 2) or 1 < (γ + 2) {γ^{2} n} < γ + 1.$ Hence, $⌊ (γ + 2) {γ^{2} n} - γ ⌋ = 0$ and $θ_{B} (n) = n - 1$ .

Case C. Case C is similar to case B. We now have $γ < {γ^{2} n} < 1$ , giving $⌊ γ^{2} (n - 1) ⌋ = ⌊ γ^{2} n ⌋$ but $⌊ γ^{2} (n + 1) ⌋ = ⌊ γ^{2} n ⌋ + 1$ . Hence, in case C $θ (n) = ⌊ γ + γ ⌊ γ^{2} n ⌋ ⌋ + 2 ⌊ γ^{2} n ⌋ + 1 =: θ_{C} (n) .$ From the definition of $θ_{B} (n)$ , we see immediately that $θ_{C} (n) = θ_{B} (n) + 1$ and so, from (Equation10(10) $θ_{B} (n) = n + ⌊ γ - (γ + 2) {γ^{2} n} ⌋ = n - 1 - ⌊ (γ + 2) {γ^{2} n} - γ ⌋,$ (10) ), we have $θ_{C} (n) = n - ⌊ (γ + 2) {γ^{2} n} - γ ⌋$ . Hence, for case C, $γ (γ + 2) < (γ + 2) {γ^{2} n} < (γ + 2) or 1 < (γ + 2) {γ^{2} n} - γ < 2.$ Therefore, $⌊ (γ + 2) {γ^{2} n} - γ ⌋ = 1$ and so $θ_{C} (n) = n - 1$ , concluding the proof of the theorem.

5. Heuristic and experimental results

In this section we present heuristic plausibility arguments to give simple descriptions of the behavior of q for certain sequences f. As we shall see, computation suggests that the actual behavior of q follows our predictions closely. We consider three types of sequence f.

5.1. $f (n) = ⌊ αn ⌋$ , $α \in (0, 1)$

The following lemma summarizes what we can say about the behavior of $q (n)$ in this case.

Lemma 5.1

Let $f (n) = ⌊ αn ⌋$ with $α \in (0, 1)$ . Assume that $q (n) = an + ϵ (n)$ , where $a \in (0, 1)$ is a constant, and that $lim_{n \to \infty} ϵ (n) / n = 0$ . Then $a = \sqrt{α}$ .

Proof.

Substituting $q (n) = an + ϵ (n)$ in Equation (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) gives $an + ϵ (n) = a (n - a (n - 1) - ϵ (n - 1)) + ϵ (n - a (n - 1) - ϵ (n - 1)) + αn - {αn},$ and solving for $ϵ (n)$ gives $ϵ (n) = - a^{2} (n - 1) - aϵ (n - 1) + ϵ (n - a (n - 1) - ϵ (n - 1)) + αn - {αn} .$ Dividing both sides by n and letting $n \to \infty$ gives $a^{2} = α$ and so (11) $q (n) = \sqrt{α} n + o (n) .$ (11)

This turns out to be a good approximation. For an example, see Figure , which shows $q (n) - n / \sqrt{2}$ when $f (n) = ⌊ n / 2 ⌋$ . For $n \leq 160000$ , we find from the data used to produce this figure that $| q (n) - n / \sqrt{2} | < 75.5$ .

Theorem 4.6 is a particular case of Equation (Equation11(11) $q (n) = \sqrt{α} n + o (n) .$ (11) ) with $α = γ^{2}$ .

5.2. $f (n) = a for n > n^{'}$

The case $f (n)$ tends to a constant $a \in N$ as $n \to \infty$ is interesting, and is a special case of a nested recursion studied in [Citation10]. We argue by proposing the ansatz $q (x) = \sqrt{2 ax} - b$ , $x \in R$ , and then deducing the $f (x)$ that this implies. Starting from the asymptotic expansion: $f (x) = q (x) - q (x - q (x - 1)) \sim a + \frac{\sqrt{2 a} (a - 2 b)}{4 \sqrt{x}} + \frac{a (a - 2 b - 2)}{4 x} + O (x^{- \frac{3}{2}}),$ we let $b = a / 2$ in order to eliminate the second term. This gives $f (x) \sim a - \frac{a}{2 x} + O (x^{- \frac{3}{2}}) .$ This short calculation suggests that, for $a, n \in N$ , (12) $f (n) = a - ⌊ δ_{1} (n) ⌋ ⟹ q (n) = \sqrt{2 an} - a / 2 + δ_{2} (n),$ (12) where $δ_{1} (1) = a$ and $δ_{i} (n) \to 0$ as $n \to \infty$ , i = 1, 2.

This appears to work surprisingly well – see Figure , which compares q when $f (n) = ⌊ 5 - 5 / \sqrt{n} ⌋)$ , so that $lim_{n \to \infty} f (n) = 4$ , with $s (n) := \sqrt{8 n} - 2$ for $n = 1, \dots, 10^{5}$ . Over this range, we find that $| q (n) - s (n) | < 2$ .

Figure 2. Plot of $q (n)$ , black dots, for $f (n) = ⌊ 5 - 5 / \sqrt{n} ⌋$ , so that $f (n) \to 4$ as $n \to \infty$ . According to the argument in Section 5.2, we expect that $q (n) \sim \sqrt{8 n} - 2 := s (n)$ , this curve being plotted as a continuous black line. The approximation appears to hold remarkably well, the two plots being almost exactly superimposed. Inset: enlargement of the region $75000 \leq n \leq 80000$ .

Figure 2. Plot of q(n), black dots, for f(n)=⌊5−5/n⌋, so that f(n)→4 as n→∞. According to the argument in Section 5.2, we expect that q(n)∼8n−2:=s(n), this curve being plotted as a continuous black line. The approximation appears to hold remarkably well, the two plots being almost exactly superimposed. Inset: enlargement of the region 75000≤n≤80000.

Figure 3. Plot of $q (n) - n / \sqrt{2}$ for $n = 1, \dots, 160000$ , with $f (n) = ⌊ \frac{n}{2} ⌋$ . Two pairs of self-similar regions $a_{1}, a_{2}$ and $b_{1}, b_{2}$ are shown – see text.

Figure 3. Plot of q(n)−n/2 for n=1,…,160000, with f(n)=⌊n2⌋. Two pairs of self-similar regions a1,a2 and b1,b2 are shown – see text.

Other examples were tried: $f (n) = ⌊ a - aexp (- bn) ⌋$ , $f (n) = ⌊ a - a / n^{b} ⌋$ and $f (n) = ⌊ an ⌋$ if $n < n_{0}$ and $⌊ a n_{0} ⌋$ otherwise. In all cases, a, b were chosen so as to ensure that $f (n)$ had property $S_{0}$ , and every time, (Equation12(12) $f (n) = a - ⌊ δ_{1} (n) ⌋ ⟹ q (n) = \sqrt{2 an} - a / 2 + δ_{2} (n),$ (12) ) gave an excellent approximation to $q (n)$ : in these cases at least, it is the asymptotic behavior of $f (n)$ that determines the asymptotic behavior of $q (n)$ .

5.3. $f (n) = ⌊ c_{1} n^{p_{1}} + c_{2} n^{p_{2}} + \dots ⌋$ with $p_{i} \in (0, 1)$

Arguing in reverse as in the previous case, we can obtain useful approximations to $q (n)$ when $f (n)$ is the sum of fractional powers of n, at least in certain special cases. We use the ansatz $q (x) = a x^{p} + b$ and calculate the asymptotic expansion for $\begin{aligned} f (x) & = q (x) - q (x - q (x - 1)) \sim ap [b x^{p - 1} + \frac{b^{2} (1 - p)}{2} x^{p - 2} + \dots \\ + a x^{2 p - 1} + a (b (1 - p) - p) x^{2 p - 2} + \dots \\ + \frac{a^{2} (1 - p)}{2} x^{3 p - 2} + \frac{a^{2} (1 - p) (b (2 - p) - 2 p)}{2} x^{3 p - 3} + \dots] . \end{aligned}$ Note that the powers of x here are p−i, $i \in N$ and ip−j, $i \geq 2$ , $j \geq i - 1$ . We cannot order the powers of x unless a value of p is specified. Letting, for example, a = 1, $b = 1 / 2$ and $p = 3 / 4$ , we have $f (x) = 3 x^{\frac{1}{2}} / 4 + 3 x^{\frac{1}{4}} / 32 + 5 / 128 + O (x^{- \frac{1}{4}})$ and $q (x) = x^{\frac{3}{4}} + 1 / 2$ . Then, with $f (n) = ⌊ 3 n^{\frac{1}{2}} / 4 + 3 n^{\frac{1}{4}} / 32 + 5 / 128 ⌋$ , which has property $S_{0}$ , we generate $q (n)$ via (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) and compare this with our approximation $q^{'} (n) := n^{\frac{3}{4}} + 1 / 2$ . The agreement is good, with $- 21 \leq q (n) - q^{'} (n) \leq 3$ for $1 \leq n \leq N = 160000$ , where $q (N) = 7990$ .

5.4. Dynamics

We close this section with some figures illustrating examples of the dynamics of q for particular sequences f.

An interesting case is $q = Q (⌊ n / 2 ⌋)$ , shown in Figure . This shows the de-trended sequence $q (n) - n / \sqrt{2}$ – see (Equation11(11) $q (n) = \sqrt{α} n + o (n) .$ (11) ) – for $1 \leq n \leq 160000$ . At first sight, no obvious patterns appear in the plot, but in fact there are several regions of exact self-similarity, even in this limited range of n. Specifically, among others, we observe

$q (i + 69568) - q (i) = 49192 for i \in {9235 : 27465}$ (regions $a_{1}, a_{2}$ ); and
$q (i + 107616) - q (i) = 76096 for i \in {91 : 44577}$ (regions $b_{1}, b_{2}$ ).

Our final figure gives an intuitive idea of what happens if we make a small perturbation. It is tempting to describe the behavior of many of the sequences $Q (f)$ as ‘chaotic’. However, since (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) is discrete in time and space, there is no sense in which the system can be infinitesimally perturbed; perhaps the best we can do is to compute sequences q, $q_{1}$ , where $q_{1}$ is computed identically to q – using (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) in the usual way – except that we artificially adjust a single term, $q (n_{1})$ say, for some small integer $n_{1}$ . The result of such a computation, with $q = Q (⌊ n / 2 ⌋)$ , $q_{1} = Q (⌊ n / 2 ⌋ + δ (n - 16))$ , so that $q_{1} (16) = 1 + q (16)$ , is shown in Figure . This figure is a plot of the difference $q (n) - q_{1} (n)$ versus $\log_{2} (n)$ , the scaling bringing out the approximate periodicity visible on a logarithmic scale. We see regions where $q (n) - q_{1} (n) = 0$ exactly, interspersed with regions where this difference is far from zero: the memory of the perturbation seems to persist indefinitely.

Figure 4. Plot of the difference of $q (n) = Q (⌊ n / 2 ⌋)$ and $q_{1} (n) = Q (⌊ n / 2 ⌋ + δ (n - 16))$ . The idea is that $q_{1} (n)$ is a slightly perturbed version of $q (n)$ . The effect of the perturbation persists, at least until $n = 2^{19}$ .

6. Conclusions and further work

We have investigated the problem of the conditions on an integer sequence $f (n)$ , $n \in N$ , with $f (1) = 0$ , such that the sequence $q (n)$ , with $q (1) = 1$ , computed from $q (n) = q (n - q (n - 1)) + f (n)$ , exists. We think of the sequence q as a solution to this difference equation. We have proved that if $f (n + 1) - f (n) \in {0, 1}$ , $n \geq 1$ , then the solution q exists, the existence of q being exactly equivalent to the inequality $1 \leq q (n) \leq n$ holding for all $n \in N$ .

We have defined $S$ as the set of semi-infinite slow sequences (that is, sequences with differences between successive terms equal to 0 or 1)Footnote². In Lemma 2.1 examples were given of sequences $f \notin S$ but for which q nonetheless exists – the condition that f be slow is sufficient but not necessary – and this suggests that we define a second set of semi-infinite integer sequences, $F$ , which is the set of all sequences f with $f (1) = 0$ such that the corresponding sequence q exists. Clearly, $S ⊊ F$ .

It would be fruitful to investigate the structure of $F$ further. The question naturally arises as to whether the proof of Theorem 1.3 could be extended to include more sequences f. A better characterization of $F$ might be a useful step on the way to settling the question of the existence of the Hofstadter sequence, this problem being the original motivation for our work. In fact, the question would have been settled by Theorem 1.3, were the sequence $c (n) := q_{h} (n - q_{h} (n - 2))$ , with $q_{h}$ defined by (Equation1(1) $q_{h} (n) = q_{h} (n - q_{h} (n - 1)) + q_{h} (n - q_{h} (n - 2)) with q_{h} (1) = q_{h} (2) = 1.$ (1) ), to be slow. It is not – in fact $c = (1, 2, 2, 2, 3, 3, 3, 3, 3, 4, 5, 4, 5, \dots)$ .

Solutions arising from $f \in S$ typically appear to display non-trivial dynamics: in particular, they are generally aperiodic and display no obvious patterns. For some special sequences f, for instance $f (n) = ⌊ αn ⌋$ with $α \in [0, 1)$ , the average behavior appears to be well-defined, however, and we give a heuristic argument for this.

Less typical seem to be sequences $f \in S$ for which the corresponding solution is also monotonic, and we construct exact solutions in the cases of which we are aware.

Both from the point of view of the existence of solutions and from the study of their dynamics, the difference Equation (Equation3(3) $q (n) = q (n - q (n - 1)) + f (n) with q (1) = 1,$ (3) ) appears to be an interesting subject for further study, even in its own right.

Acknowledgments

The Authors would like to acknowledge helfpul discussions with Dr Peter Gallagher and the valuable comments of the anonymous referees of this journal.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 (i), (ii) etc would be clearer labels than i, ii

2 It would be useful to add "and with first term equal to zero" here, i for consistency with the original definition of S.

References

B. Balamohan, A. Kuznetsov, and S. Tanny, On the behavior of a variant of Hofstadter's Q-sequence, J. Integer Seq. 10 (2007), Article 07.7.1.
Google Scholar
N. Fox, Quasipolynomial solutions to the Hofstadter Q-recurrence, Integers 16 (2016), paper A68, 6 pp.
Google Scholar
D.R. Hofstadter, Gödel, Escher, Bach: An Eternal Golden Braid, Basic Books, Springer, New York, 1979. ISBN 0-465-026850.
Google Scholar
A. Isgur, V. Kuznetsov, and S.M. Tanny, A combinatorial approach for solving certain nested recursions with non-slow solutions, J. Differ. Equ. Appl. 19(4) (2013), pp. 604–615. https://doi.org/10.1080/10236198.2012.662967.
Google Scholar
OEIS Foundation Inc., Entry A005185 in the On-Line Encyclopedia of Integer Sequences, 2024; Available at https://oeis.org/A005185 (in particular the reference to a computation by M. Eric Carr (2 July 2023)).
Google Scholar
OEIS Foundation Inc., Entry A006949 in the On-Line Encyclopedia of Integer Sequences, 2024; Available at https://oeis.org/A006949.
Google Scholar
OEIS Foundation Inc., Entry A063882 in the On-Line Encyclopedia of Integer Sequences, 2024; Available at https://oeis.org/A063882.
Google Scholar
OEIS Foundation Inc., Entry A002024 in the On-Line Encyclopedia of Integer Sequences, 2024; Available at https://oeis.org/A002024.
Google Scholar
K. Pinn, Order and chaos in Hofstadter's Q(n) sequence, Complexity 4(3) (1999), pp. 41–46.
Google Scholar
M. Sunohara and S.M. Tanny, On the solution space of the Golomb recursion, J. Differ. Equ. Appl. 24(8) (2018), pp. 1273–1294. https://doi.org/10.1080/10236198.2018.1471471.
Web of Science ®Google Scholar
S.M. Tanny, A well-behaved cousin of the Hofstadter sequence, Discrete Math. 105(1–3) (1992), pp. 227–239.
Web of Science ®Google Scholar

A diluted version of the problem of the existence of the Hofstadter sequence

Abstract

1. Motivation

Slow sequence

Property $S_{0}$

2. Preliminaries

3. The main theorem

The shift property

4. Examples of sequences q that are slow