Full article: Aspects of convergence of random walks on finite volume homogeneous spaces

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

We investigate three aspects of weak* convergence of the n-step distributions of random walks on finite volume homogeneous spaces $G / Γ$ of semisimple real Lie groups. First, we look into the obvious obstruction to the upgrade from Cesàro to non-averaged convergence: periodicity. We give examples where it occurs and conditions under which it does not. In a second part, we prove convergence towards Haar measure with exponential speed from almost every starting point. Finally, we establish a strong uniformity property for the Cesàro convergence towards Haar measure for uniquely ergodic random walks.

Keywords:

2010 Mathematics Subject Classifications:

1. Introduction

Let G be a real Lie group and Γ a lattice in G, that is, a discrete subgroup of G such that the homogeneous space $X = G / Γ$ admits a G-invariant Borel probability measure $m_{X}$ . This measure $m_{X}$ is unique and we refer to it as the (normalized) Haar measure on X. A good example to have in mind is $G = {SL}_{d} (R)$ and $Γ = {SL}_{d} (Z)$ .

The objects of study in this paper are random walks on X, given by probability measures µ on G: A step corresponds to randomly choosing a group element $g \in G$ according to µ and then moving from the current location $X ∋ x$ to gx. Starting at $x_{0} \in X$ , the distribution of the location after n steps is given by the convolution (1) $μ^{* n} * δ_{x_{0}},$ (1) which is the push-forward of the product measure $μ^{\otimes n} \otimes δ_{x_{0}}$ under the multiplication map $G^{n} \times X ∋ (g_{n}, \dots, g_{1}, x) \mapsto g_{n} \dots g_{1} x \in X$ .

The broader context in which the study of these random walks originated is that of subgroup actions on homogeneous spaces. After Ratner's treatment of the rigidity and asymptotic properties of unipotent actions in her celebrated series of articles [Citation21–24], a new approach was needed to understand the dynamics of non-unipotent actions. Passing from a deterministic to a probabilistic point of view turned out to be a particularly fruitful angle. Still, understanding the long-term behaviour of random walks on homogeneous spaces and the limiting behaviour of the n-step distributions (Equation1(1) $μ^{* n} * δ_{x_{0}},$ (1) ) is a notoriously difficult problem. Major contributions to this line of study were made e.g. by Eskin–Margulis in their work on non-divergence [Citation15], and by Benoist–Quint in their breakthrough series of articles [Citation4,Citation6–8]. We reproduce one of the main results of [Citation8] as motivating example. For the statement, recall that a probability measure ν on X is called homogeneous if there exists a closed subgroup H of G and a point $x \in X$ such that $supp (ν) = Hx$ is a closed orbit and ν is H-invariant.

Theorem 1.1

Benoist–Quint [Citation8]

Let µ be a compactly supported probability measure on G. Denote by $S$ and $G$ the closed subsemigroup and subgroup of G generated by $supp (μ)$ , respectively, and suppose that the Zariski closure of $Ad (G)$ in $Aut (g)$ is Zariski connected, semisimple, and has no compact factors. Then for every $x_{0} \in X$ there is a homogeneous probability measure $ν_{x_{0}}$ on X with $supp (ν_{x_{0}}) = \bar{S x_{0}} = \bar{G x_{0}}$ and such that (2) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (2) as $n \to \infty$ in the weak* topology.

Here the weak* convergence (Equation2(2) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (2) ) more explicitly means that for every compactly supported continuous function $f \in C_{c} (X)$ we have $\frac{1}{n} \sum_{k = 0}^{n - 1} \int_{X} f d (μ^{* k} * δ_{x_{0}}) = \frac{1}{n} \sum_{k = 0}^{n - 1} \int_{G^{k}} f (g_{k} \dots g_{1} x_{0}) d μ^{\otimes k} (g_{1}, \dots, g_{k}) ⟶ \int_{X} f d ν_{x_{0}}$ as $n \to \infty$ . Recently, it was shown by Bénard–de Saxcé [Citation3] that the compact support assumption on µ in Theorem 1.1 can be relaxed to a finite first moment assumption; see Remark 2.7. Another recent generalization of the theorem above in joint work of the author with Sert and Shi [Citation19] replaces the algebraic assumption on the support of µ by a certain expansion condition, which allows for cases in which µ is e.g. supported on a parabolic subgroup of a semisimple group.

Some questions left open by Theorem 1.1 are listed by Benoist–Quint at the end of their survey [Citation5]. A major one is the following.

Question 1.2

In the setting of Theorem 1.1, is it also true that (3) $μ^{* n} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (3) as $n \to \infty$ ?

Answers are available only in special cases: Breuillard [Citation11] established (Equation3(3) $μ^{* n} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (3) ) for certain measures µ supported on unipotent subgroups, Buenger [Citation12] proved it for some sparse solvable measures, and in previous work the author dealt with the case of spread out measures [Citation18]. Very recently, Bénard [Citation2] observed that (Equation3(3) $μ^{* n} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (3) ) holds for aperiodic measures µ under the assumption that µ has two convolution powers which are not mutually singular.

The purpose of this article is to discuss three (largely independent) aspects of random walk convergence related to Theorem 1.1 and Question 1.2, mainly having in mind the case that G is a semisimple real Lie group. We are going to use the following terminology.

Definition 1.3

Let ν be a probability measure on X and $x_{0} \in X$ . We say that the random walk on X given by µ converges to ν on average (resp. converges to ν) from the starting point $x_{0}$ if $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x_{0}} \to ν$ (resp. $μ^{* n} * δ_{x_{0}} \to ν$ ) as $n \to \infty$ in the weak* topology.

Convergence on average is also commonly referred to as Cesàro convergence. We use the two terms interchangeably.

The article is organized as follows.

In Section 2, we look into the obvious obstruction to the upgrade from Cesàro convergence to (non-averaged) convergence: periodicity. We show in Example 2.1 how (Equation3(3) $μ^{* n} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (3) ) can fail when $x_{0}$ has finite orbit under $S$ . Using a product construction, we can also produce a counterexample in which the orbit closure $\bar{S x_{0}}$ has positive dimension (Example 2.2). In both cases, the periodic behaviour occurs at the level of the connected components of the orbit closure. As it turns out, this is no coincidence: If, in the setting of Theorem 1.1, the orbit closure $\bar{S x_{0}}$ is connected, there can be no periodicity (Theorem 2.5) and we can show that the Cesàro convergence (Equation2(2) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (2) ) also holds along arithmetic progressions (Corollary 2.8).

In Section 3, we establish effective convergence of random walks to the normalized Haar measure $m_{X}$ for typical starting points $x_{0}$ : When $supp (μ)$ generates a Zariski dense subgroup of a semisimple real Lie group G without compact factors, for any fixed $L^{2}$ -function f on X the convergence $\int_{X} f d (μ^{* n} * δ_{x_{0}}) \overset{n \to \infty}{⟶} \int_{X} f d m_{X}$ not only holds but is in fact exponentially fast for $m_{X}$ -almost every $x_{0} \in X$ (Theorem 3.2, Proposition 3.4). The proof relies on an $L^{2}$ -spectral gap of the convolution operator $π (μ) : f \mapsto (x \mapsto \int_{G} f (gx) d μ (g))$ acting on measurable functions on X. Taking into account regularity of the function f, the above can be further strengthened to the statement that almost every $x \in X$ is exponentially generic (Definition 3.12): Up to a constant factor depending on derivatives of f, the exponential speed of convergence holds uniformly over all compactly supported smooth functions (Theorem 3.13). Key to this upgrade are the definition of suitable Sobolev norms and a functional analytic argument involving relative traces, first exploited in a dynamical context by Einsiedler–Margulis–Venkatesh [Citation13].

Finally, in Section 4 we prove that convergence on average to $m_{X}$ happens locally uniformly in $x_{0}$ in a strong way when the random walk is uniquely ergodic and admits a Lyapunov function (Theorem 4.13). For example, this is the case when G is a connected semisimple real algebraic group and $supp (μ)$ generates a non-discrete Zariski dense subgroup, and also in the setup of Simmons–Weiss [Citation27], which has connections to Diophantine approximation problems on fractals. To this end, we introduce the new concept of $(K_{n})_{n}$ -uniform recurrence (Definition 4.10), which refines recurrence properties of random walks previously studied in [Citation6,Citation15].

1.1. Standing assumptions & notation

As many of our arguments work in greater generality, in the remainder of the article we will relax the assumptions stated at the beginning of this introduction. The following setup shall be in place whenever nothing else is specified: G is a locally compact σ-compact metrizable group acting ergodically on a locally compact σ-compact metrizable space X endowed with a G-invariant probability measure $m_{X}$ ; and µ is a Borel probability measure on G.

2. Periodicity

In this section, we start with two simple counterexamples to (Equation3(3) $μ^{* n} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (3) ), which illustrate ways in which a random walk may exhibit periodic behaviour (Section 2.1). Analysing these examples for their common feature, we are led to a simple condition ensuring aperiodicity, stated and proved in Section 2.2.

2.1. Examples

The first example with periodicity is on finite periodic orbits. In the following, for $d \geq 2$ we denote by $1_{d}$ the $d \times d$ -identity matrix.

Example 2.1

Consider the principal congruence lattice $Γ = Γ (2) = {g \in {SL}_{2} (Z) ∣ g \equiv 1_{2} mod 2}$ in $G = {SL}_{2} (R)$ . Being the kernel of the reduction homomorphism from ${SL}_{2} (Z)$ to ${SL}_{2} (Z / 2 Z)$ , we recognize $Γ (2)$ as a finite-index normal subgroup of ${SL}_{2} (Z)$ . In particular, $Γ (2)$ is a lattice in G. Let $μ = \frac{1}{2} (δ_{h_{1}} + δ_{h_{2}})$ with $h_{1} = (\begin{matrix} 1 & 1 \\ 0 & 1 \end{matrix}), h_{2} = (\begin{matrix} 1 & 0 \\ 1 & 1 \end{matrix}) .$ Then the closed subgroup $G$ generated by $supp (μ) = {h_{1}, h_{2}}$ is $G = {SL}_{2} (Z)$ , which is Zariski dense in G. The $G$ -orbit of $x_{0} = 1_{2} Γ \in G / Γ$ is $\begin{aligned} O & = {x_{0}, h_{1} x_{0}, h_{2} x_{0}, h_{2} h_{1} x_{0} = (\begin{matrix} 1 & 1 \\ 1 & 2 \end{matrix}) x_{0}, h_{1} h_{2} x_{0} = (\begin{matrix} 2 & 1 \\ 1 & 1 \end{matrix}) x_{0}, \\ h_{1} h_{2} h_{1} x_{0} = h_{2} h_{1} h_{2} x_{0} = (\begin{matrix} 2 & - 1 \\ 1 & 0 \end{matrix}) x_{0}}, \end{aligned}$ with transitions as shown in the following diagram:

Consequently, we see that the random walk with starting point $x_{0}$ alternates between the two sets $O_{1} = {x_{0}, h_{1} h_{2} x_{0}, h_{2} h_{1} x_{0}} and O_{2} = {h_{1} x_{0}, h_{2} x_{0}, h_{1} h_{2} h_{1} x_{0}} .$ The 2-step random walks on these sets constitute irreducible, aperiodic, finite state Markov chains, so that $\begin{aligned} μ^{* 2 n} * δ_{x_{0}} & ⟶ \frac{1}{3} \sum_{p \in O_{1}} δ_{p}, \\ μ^{* (2 n + 1)} * δ_{x_{0}} & ⟶ \frac{1}{3} \sum_{p \in O_{2}} δ_{p}, \end{aligned}$ as $n \to \infty$ in the weak* topology.

In the example above, the support of µ generates a Zariski dense subgroup of G and the lattice Γ in G is irreducible. (Recall that, loosely speaking, ‘irreducibility’ of Γ means that it does not arise from a product construction, cf. [Citation20, Definition 5.20]). By the work of Benoist–Quint [Citation8, Corollary 1.8], these properties force any orbit closure $\bar{S x_{0}}$ to be either finite or all of X. As soon as intermediate orbit closures are possible, however, one can also construct examples with periodic behaviour on non-discrete orbit closures.

Example 2.2

Let G, Γ, $X = G / Γ$ , $h_{1}, h_{2}$ , $x_{0}$ and $G$ be as in Example 2.1 and choose a diagonal matrix $a \in {SL}_{2} (R)$ such that the diagonal entries of $a^{2}$ are irrational. We are going to consider the random walk on the product space $X \times X = (G \times G) / (Γ \times Γ)$ given by the probability measure $μ = \frac{1}{4} \sum_{i = 1}^{4} δ_{g_{i}}$ on $G \times G$ with $\begin{aligned} g_{1} & = (h_{1}, a h_{1} a^{- 1}), g_{2} = (h_{1}, 1_{2}), \\ g_{3} & = (h_{2}, a h_{2} a^{- 1}), g_{4} = (h_{2}, 1_{2}) . \end{aligned}$ The (closed) subgroup generated by the support of this measure µ is given by $G \times a G a^{- 1} = {SL}_{2} (Z) \times a {SL}_{2} (Z) a^{- 1}$ . Indeed, the correct entry in the second copy of G can be arranged using a finite product of $g_{1}^{\pm 1}, g_{3}^{\pm 1}$ , and then the entry in the first copy can be corrected using $g_{2}^{\pm 1}, g_{4}^{\pm 1}$ . By Theorem 1.1 we thus know that for the starting point $(x_{0}, x_{0}) \in X \times X$ we have the weak* convergence $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{(x_{0}, x_{0})} ⟶ ν_{(x_{0}, x_{0})}$ as $n \to \infty$ , where $ν_{(x_{0}, x_{0})}$ is the homogeneous probability measure on the closure of the $G \times a G a^{- 1}$ -orbit of $(x_{0}, x_{0})$ . (Recall that it makes no difference for the closure whether one considers the orbit under the generated subgroup or subsemigroup.)

Let us identify this orbit closure. In the first copy of X, we recognize the finite orbit $O$ from Example 2.1. In the second copy, we see the action of irrational conjugates of $h_{1}, h_{2}$ . As the acting group has product structure, the orbit closure in question is the product of these two orbit closures in the components: $\bar{(G \times a G a^{- 1}) (x_{0}, x_{0})} = O \times \bar{a G a^{- 1} x_{0}} .$ Since the orbit $a G a^{- 1} x_{0}$ is infinite by our choice of the matrix a, it follows from [Citation8, Corollary 1.8] that $\bar{a G a^{- 1} x_{0}} = X$ , so that $\bar{(G \times a G a^{- 1}) (x_{0}, x_{0})} = O \times X and ν_{(x_{0}, x_{0})} = m_{O} \otimes m_{X}$ for the normalized counting measure $m_{O}$ on $O$ and the normalized Haar measure $m_{X}$ on X. However, in analogy to Example 2.1, the random walk is found to alternate between the sets $O_{1} \times X and O_{2} \times X,$ in the sense that $supp (μ^{* 2 n} * δ_{(x_{0}, x_{0})}) \subset O_{1} \times X$ and $supp (μ^{* (2 n + 1)} * δ_{(x_{0}, x_{0})}) \subset O_{2} \times X$ for all $n \in N$ . Hence, we conclude that the random walk starting from $(x_{0}, x_{0})$ does not converge to $ν_{(x_{0}, x_{0})}$ .

Remark 2.3

The same behaviour as in the previous example can be arranged inside a homogeneous space $X^{'} = G^{'} / Γ^{'}$ that is the quotient of a semisimple real Lie group $G^{'}$ by an irreducible lattice $Γ^{'}$ . Indeed, this is only a matter of choosing suitable embeddings $G \times G ↪ G^{'}$ and $X \times X ↪ X^{'}$ , where G and X are as in Example 2.2. Concretely, one can e.g. consider the $4 \times 4$ -congruence lattice $Γ^{'} = Γ (2) = {g \in {SL}_{4} (Z) ∣ g \equiv 1_{4} mod 2}$ in $G^{'} = {SL}_{4} (R)$ and the diagonal embeddings $\begin{aligned} G \times G & ↪ G^{'}, X \times X ↪ X^{'}, \\ (g, h) & \mapsto (\begin{matrix} g \\ h \end{matrix}), (g Γ, h Γ) \mapsto (\begin{matrix} g \\ h \end{matrix}) Γ^{'} . \end{aligned}$ We therefore see that Example 2.2, i.e. periodic behaviour on a non-discrete orbit closure, can be realized inside $X^{'} = G^{'} / Γ^{'}$ . Of course, after applying this embedding, the subgroup generated by the support of µ will no longer be Zariski dense in $G^{'}$ .

2.2. An aperiodicity criterion

Inspecting the examples above, one may notice that their common salient feature is that the orbit closure $\bar{S x_{0}}$ is disconnected. This naturally raises the question whether periodic behaviour can also occur when this orbit closure is connected. In what follows, we answer this question in the negative. We shall use the following formalization of periodicity.

Definition 2.4

Assume that the random walk on X given by µ converges on average to a probability measure ν on X from the starting point $x_{0} \in X$ . We say that this convergence is periodic if there exists an integer $d \geq 2$ and pairwise disjoint measurable subsets $D_{0}, \dots, D_{d - 1} \subset X$ with $ν (\partial D_{i}) = 0$ for $0 \leq i < d$ and such that $(μ^{* n} * δ_{x_{0}}) (D_{n mod d}) = 1$ for every $n \in N$ . Otherwise, we call the convergence aperiodic.

The requirement on the boundaries of the sets $D_{i}$ is needed to ensure that the cyclic behaviour is witnessed by the limit measure ν. Without a condition of this sort, one could try to artificially define $D_{i}$ as the set of all points in X that can be reached from $x_{0}$ precisely in $n \equiv i mod d$ steps. Indeed, this construction is possible for example when µ is finitely supported with the property that its support freely generates a discrete subsemigroup $S$ of G and the starting point $x_{0} \in X$ has a free $S$ -orbit. The latter is the case e.g. for $X = {SL}_{2} (R) / {SL}_{2} (Z)$ , $μ = \frac{1}{2} (δ_{h_{1}} + δ_{h_{2}})$ with $h_{1} = (\begin{matrix} 1 & 2 \\ 0 & 1 \end{matrix})$ and $h_{2} = (\begin{matrix} 1 & 0 \\ 2 & 1 \end{matrix})$ , and $x_{0} = a {SL}_{2} (Z)$ for a diagonal matrix $a \in {SL}_{2} (R)$ such that the diagonal entries of $a^{2}$ are irrational.

We are now ready to state the announced aperiodicity theorem.

Theorem 2.5

Retain the notation and assumptions from Theorem 1.1 and let $x_{0} \in X$ be such that the orbit closure $\bar{S x_{0}}$ is connected. Then the Cesàro convergence to $ν_{x_{0}}$ of the random walk on X given by µ starting from $x_{0}$ is aperiodic.

For the proof we need the following simple lemma.

Lemma 2.6

Let H be a Zariski connected real algebraic group and S a subset of H generating a Zariski dense subsemigroup. Then for every $d \in N$ , also the d-fold product set $S^{d} = {g_{d} \dots g_{1} ∣ g_{1}, \dots, g_{d} \in S}$ generates a Zariski dense subsemigroup of H. In particular, if $supp (μ)$ generates a Zariski dense subsemigroup for some probability measure µ on H, the same is true for $supp (μ^{* d})$ .

Proof.

Let $U \subset H$ be a non-empty Zariski open subset and consider the map $ϕ : H \to H, g \mapsto g^{d}$ . Since ϕ is Zariski continuous, $ϕ^{- 1} (U)$ is Zariski open. Moreover, this preimage is non-empty because U is dense in the Lie group topology and ϕ is a diffeomorphism near the identity. By the assumption that S generates a Zariski dense subsemigroup, we can thus find an element $g \in ϕ^{- 1} (U)$ that is the product of finitely many elements of S. It follows that $ϕ (g) = g^{d}$ lies in the intersection of U with the subsemigroup generated by $S^{d}$ .

The second claim involving µ immediately follows from the above together with the inclusion $supp (μ^{* d}) \supset supp (μ)^{d}$ .

Proof

Proof of Theorem 2.5

Suppose $d \in N$ is an integer such that there are pairwise disjoint $D_{0}, \dots, D_{d - 1} \subset X$ with $ν_{x_{0}} (\partial D_{i}) = 0$ for all $0 \leq i < d$ and such that $(μ^{* n} * δ_{x_{0}}) (D_{n mod d}) = 1$ for all $n \in N$ as in the definition of periodicity. We have to show that d = 1.

First note that from Theorem 1.1 and the properties of the sets $D_{i}$ it follows that (4) $ν_{x_{0}} (D_{0}) = lim_{n \to \infty} \frac{1}{n} \sum_{k = 0}^{n - 1} (μ^{* k} * δ_{x_{0}}) (D_{0}) = \frac{1}{d},$ (4) where the application of weak* convergence to the set $D_{0}$ is justified since it has negligible boundary with respect to the limit measure $ν_{x_{0}}$ . In view of Lemma 2.6, Theorem 1.1 also applies to the d-step random walk given by $μ^{* d}$ . Assuming for the moment that the limit measure for this d-step random walk starting from $x_{0}$ coincides with $ν_{x_{0}}$ , we deduce that (5) $ν_{x_{0}} (D_{0}) = lim_{n \to \infty} \frac{1}{n} \sum_{k = 0}^{n - 1} (μ^{* dk} * δ_{x_{0}}) (D_{0}) = 1.$ (5) Together, (Equation4(4) $ν_{x_{0}} (D_{0}) = lim_{n \to \infty} \frac{1}{n} \sum_{k = 0}^{n - 1} (μ^{* k} * δ_{x_{0}}) (D_{0}) = \frac{1}{d},$ (4) ) and (Equation5(5) $ν_{x_{0}} (D_{0}) = lim_{n \to \infty} \frac{1}{n} \sum_{k = 0}^{n - 1} (μ^{* dk} * δ_{x_{0}}) (D_{0}) = 1.$ (5) ) imply d = 1, the desired conclusion.

It thus remains to show that the d-step random walk starting from $x_{0}$ does indeed have the same limit measure as the 1-step random walk. Denoting by $S$ and $S_{d}$ the closed subsemigroups of G generated by $supp (μ)$ and $supp (μ^{* d})$ , respectively, this statement is equivalent to the equality $\bar{S x_{0}} = \bar{S_{d} x_{0}}$ of orbit closures. To prove this, let $g \in supp (μ)$ be arbitrary. We claim that $\bar{S x_{0}} = ⋃_{k = 0}^{d - 1} g^{- k} \bar{S_{d} x_{0}} .$ Indeed, since $\bar{S x_{0}}$ is homogeneous, it is invariant under the group generated by $S$ . As $\bar{S x_{0}}$ clearly contains $\bar{S_{d} x_{0}}$ , the inclusion ‘ $\supset$ ’ follows. For the reverse inclusion let $g_{n}, \dots, g_{1} \in supp (μ)$ for some $n \in N$ . Choose $0 \leq k < d$ such that $n + k \equiv 0 mod d$ . Then $g^{k} g_{n} \dots g_{1} x_{0} \in \bar{S_{d} x_{0}}$ and hence $g_{n} \dots g_{1} x_{0} \in g^{- k} \bar{S_{d} x_{0}}$ , giving the claim.

We already noted that Theorem 1.1 applies to $μ^{* d}$ . In particular, the orbit closure $\bar{S_{d} x_{0}}$ and its translates by $g^{- k}$ , $0 \leq k < d$ , are submanifolds of $\bar{S x_{0}}$ . Necessarily, all these translates have the same dimension, and since together they make up $\bar{S x_{0}}$ by the claim above, their shared dimension coincides with that of $\bar{S x_{0}}$ . This implies that $\bar{S_{d} x_{0}}$ is open in $\bar{S x_{0}}$ . However, it is also closed, so that the assumed connectedness of $\bar{S x_{0}}$ forces $\bar{S x_{0}} = \bar{S_{d} x_{0}}$ . This completes the proof.

Remark 2.7

It was recently shown by Bénard–de Saxcé [Citation3] that the compact support assumption on µ in Theorem 1.1 can be relaxed. Indeed, their [Citation3, Theorem C] establishes the same conclusion under the substantially weaker assumption that µ has a finite first moment, meaning that $\int_{G} \log ‖ Ad (g) ‖ d μ (g) < \infty .$ Relying on this stronger result, also our Theorem 2.5 above and Corollary 2.8 below are seen to hold under a finite first moment assumption on µ, instead of requiring compact support as in Theorem 1.1.

We end this section by recording a corollary of the proof above.

Corollary 2.8

Retain the notation and assumptions from Theorem 1.1 and suppose that $\bar{S x_{0}}$ is connected. Let $d \in N$ and denote by $S_{d}$ the closed subsemigroup of G generated by $supp (μ^{* d})$ . Then $\bar{S x_{0}} = \bar{S_{d} x_{0}}$ , and for the homogeneous probability measure $ν_{x_{0}}$ on this orbit closure we have for arbitrary $r \in N_{0}$ that (6) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* (dk + r)} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (6) as $n \to \infty$ in the weak* topology.

Proof.

The statement about orbit closures was established as part of the proof of Theorem 2.5. From Theorem 1.1 we thus get the weak* convergence (7) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* dk} * δ_{x_{0}} \overset{n \to \infty}{⟶} ν_{x_{0}},$ (7) which is (Equation6(6) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* (dk + r)} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (6) ) for r = 0. Given $f \in C_{c} (X)$ , the general case follows by applying (Equation7(7) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* dk} * δ_{x_{0}} \overset{n \to \infty}{⟶} ν_{x_{0}},$ (7) ) to the compactly supported continuous function $f_{r}$ defined by $f_{r} (x) := \int_{G} f (gx) d μ^{* r} (g) = \int_{G^{r}} f (g_{r} \dots g_{1} x) d μ^{\otimes r} (g_{1}, \dots, g_{r})$ for $x \in X$ .

This corollary sharpens the convergence statement in Theorem 1.1 in the case of a connected orbit closure: The Cesàro convergence to $ν_{x_{0}}$ holds along arbitrary arithmetic progressions. Although this does not provide an answer to Question 1.2, it at least allows the following conclusion to be drawn: If $(n_{i})_{i}$ is a sequence of indices such that $μ^{* n_{i}} * δ_{x_{0}}$ converges to a weak* limit different from $ν_{x_{0}}$ as $i \to \infty$ , then $(n_{i})_{i}$ cannot contain a density 1 subset of an infinite arithmetic progression.

3. Spectral gap

In this section, we will explain how a spectral gap of the convolution operator $π (μ)$ associated to a random walk entails the convergence of $μ^{* n} * δ_{x}$ towards $m_{X}$ for $m_{X}$ -a.e. $x \in X$ . In its simplest form, the involved argument works in great generality and also produces an exponential rate of convergence from almost every starting point when the test function f is fixed. This is done in Section 3.1. The following Sections 3.2–3.4 are dedicated to a substantial refinement of this spectral gap argument for random walks on homogeneous spaces of real Lie groups, making the exponentially fast convergence uniform over smooth test functions.

3.1. Generic points

Recall that $π (μ) : L^{\infty} (X, m_{X}) \to L^{\infty} (X, m_{X})$ is defined by $π (μ) f (x) := \int_{X} f d (μ * δ_{x}) = \int_{G} f (gx) d μ (g)$ for $f \in L^{\infty} (X, m_{X})$ and $x \in X$ , and that it extends to a continuous contraction on each $L^{p}$ -space (see [Citation9, Corollary 2.2]). We shall study its behaviour on $L^{2} (X, m_{X})$ . By ergodicity, the G-fixed functions are the constant functions, so we restrict our attention to their orthogonal complement $L_{0}^{2} (X, m_{X})$ of $L^{2}$ -functions with mean 0.

Definition 3.1

We say that µ has a spectral gap on X if the associated convolution operator $π (μ)$ restricted to $L_{0}^{2} (X, m_{X})$ has spectral radius strictly less than 1.

We are going to use the notation $ρ (T)$ to denote the spectral radius of an operator T. Then by the spectral radius formula, µ having a spectral gap on X can be reformulated as the requirement that $ρ (π (μ) |_{L_{0}^{2}}) = lim_{n \to \infty} \sqrt[n]{‖ π (μ) |_{L_{0}^{2}}^{n} ‖_{op}} < 1.$ Given the existence of a spectral gap, we obtain an almost everywhere convergence result in a quite general setup.

Theorem 3.2

Suppose that µ has a spectral gap on X. Then $m_{X}$ -a.e. $x \in X$ is generic for the random walk on X given by µ, meaning that $μ^{* n} * δ_{x} ⟶ m_{X}$ as $n \to \infty$ in the weak* topology. This convergence is exponentially fast in the sense that for every fixed $f \in L^{2} (X, m_{X})$ we have (8) $\underset{n \to \infty}{lim sup} {| \int_{X} f d (μ^{* n} * δ_{x}) - \int f d m_{X} |}^{1 / n} \leq ρ (π (μ) |_{L_{0}^{2}})$ (8) for $m_{X}$ -a.e. $x \in X$ .

Proof.

By separability of $C_{c} (X)$ , for the statement about weak* convergence it suffices to prove $m_{X}$ -a.s. convergence for one fixed function $f \in C_{c} (X)$ . Consequently, it is enough to prove the second assertion of the theorem. To this end, fix a function $f \in L^{2} (X, m_{X})$ and a rational number $ρ (π (μ) |_{L_{0}^{2}}) < α < 1$ , and consider the $L_{0}^{2}$ -function $f_{0} = f - \int f d m_{X}$ . Then in view of the spectral radius formula we have $‖ π (μ)^{n} f - \int f d m_{X} ‖_{L^{2}} = ‖ π (μ)^{n} f_{0} ‖_{L^{2}} \leq ‖ π (μ) |_{L_{0}^{2}}^{n} ‖_{op} ‖ f_{0} ‖_{L^{2}} \leq α^{n} ‖ f_{0} ‖_{L^{2}}$ for sufficiently large $n \in N$ .

Fix in addition a rational number $ε \in (0, 1)$ . By Chebyshev's inequality, the above implies that for large n we have $\begin{aligned} m_{X} ({x \in X | | π (μ)^{n} f (x) - \int f d m_{X} | \geq α^{n (1 - ε)} ‖ f_{0} ‖_{L^{2}}}) \\ \leq \frac{‖ π (μ)^{n} f - \int f d m_{X} ‖_{L^{2}}^{2}}{α^{2 n (1 - ε)} ‖ f_{0} ‖_{L^{2}}^{2}} \leq α^{2 εn} . \end{aligned}$ By Borel–Cantelli it follows that for all x in a full measure set $A_{α, ε}$ , the inequality $| π (μ)^{n} f (x) - \int f d m_{X} | \geq α^{n (1 - ε)} ‖ f_{0} ‖_{L^{2}}$ holds only for finitely many $n \in N$ . Since $π (μ)^{n} f (x) = \int f d (μ^{* n} * δ_{x})$ , we conclude that (Equation8(8) $\underset{n \to \infty}{lim sup} {| \int_{X} f d (μ^{* n} * δ_{x}) - \int f d m_{X} |}^{1 / n} \leq ρ (π (μ) |_{L_{0}^{2}})$ (8) ) holds for all x in a countable intersection of the sets $A_{α, ε}$ over rational numbers α approaching $ρ (π (μ) |_{L_{0}^{2}})$ and ε approaching 0 from above.

Remark 3.3

In the second conclusion of Theorem 3.2, how long it takes for the exponential rate of convergence to kick in depends on the point x. However, the measure of sets on which one has to wait for a long time can be controlled as follows: Given $ρ (π (μ) |_{L_{0}^{2}}) < α < 1$ , choose $N \in N$ such that $‖ π (μ) |_{L_{0}^{2}}^{n} ‖_{op} \leq α^{n}$ for all $n \geq N$ . Then if we additionally take $ε \in (0, 1)$ and denote $\begin{aligned} B_{α, ε, n, f} = {x \in X | | π (μ)^{n^{'}} f (x) - \int f d m_{X} | \geq α^{n^{'} (1 - ε)} ‖ f_{0} ‖_{L^{2}} for some n^{'} \geq n}, \end{aligned}$ the proof above gives the bound $m_{X} (B_{α, ε, n, f}) \leq \frac{α^{2 εn}}{1 - α^{2 ε}}$ for every $n \geq N$ . In particular, the measure of the set on which the exponential convergence does not start during the first n steps decays exponentially in n.

We now demonstrate that the previous result covers the case announced in Section 1.

Proposition 3.4

Let G be a connected semisimple real Lie group without compact factors and with finite centre, $Γ \subset G$ a lattice, and X the homogeneous space $G / Γ$ endowed with the Haar measure $m_{X}$ . Suppose that the closed subsemigroup $S$ generated by $supp (μ)$ has the property that $Ad (S)$ is Zariski dense in $Ad (G)$ . Then µ has a spectral gap on X.

Proof.

Consider the regular representation of G on $L_{0}^{2} (X, m_{X})$ . By Bekka [Citation1, Lemma 3] it doesn't weakly contain the trivial representation. From this, in view of [Citation25, Theorem C], the result follows if we can argue that the projection of µ to any simple factor of G is not supported on a closed amenable subgroup. However, since amenability passes to the Zariski closure (see e.g. [Citation28, Theorem 4.1.15]) the latter would imply that one of the simple factors of $Ad (G)$ is amenable, hence compact by a classical result of Furstenberg (see e.g. [Citation28, Proposition 4.1.8]).

3.2. Good height functions

Inspecting the proof of Theorem 3.2, one observes that every step is effective, with explicit bounds and good control over the measure of exceptional sets, except for the very first one: separability of the space $C_{c} (X)$ of compactly supported continuous functions. In the remainder of this section, we aim to also make effective this step, the goal being exponentially fast convergence $μ^{* n} * δ_{x} \to m_{X}$ from almost every starting point, uniformly over functions f on X. As merely continuous functions can behave arbitrarily badly (with respect to the convergence problem at hand), there is no hope of achieving this feat for all $f \in C_{c} (X)$ . We shall therefore restrict our attention to smooth functions of compact support, and take into account their regularity by considering not just their $L^{2}$ , but also certain Sobolev norms. Built into the definition of these norms will be what we call a good height function, the concept of which is introduced in this subsection.

Our setup is as follows: Let G be a real Lie group with Lie algebra $g$ . We endow $g$ with a scalar product, which we use to define a right-invariant metric $d^{G}$ on G. Given a lattice $Γ \subset G$ , this metric descends to a metric $d^{X}$ on $X = G / Γ$ such that the projection $G \to X$ is locally an isometry. Moreover, we fix an orthonormal basis of $g$ , using which we will identify $g$ with $R^{\dim g}$ . Here is the crucial definition.

Definition 3.5

We call a measurable function $ht : X \to (0, \infty)$ a good height function if there exists $0 < R \leq 1$ and a function $r : X \to (0, R]$ with the following properties:

The restriction of the exponential map $\exp : (- R, R)^{\dim g} \to G$ is a diffeomorphism onto its image and we have $\exp ((- r / 2, r / 2)^{\dim g}) \subset B_{r}^{G} (e)$ for all $r \leq R$ , where $B_{r}^{G} (e)$ denotes the open ball of radius r around the identity $e \in G$ with respect to the metric $d^{G}$ on G.
For all $x \in X$ , the projection $G \supset B_{r (x)}^{G} (e) \to X, g \mapsto gx$ is injective.
There exist constants $c, κ > 0$ such that $r (x) \geq cht (x)^{- κ}$ for all $x \in X$ .
There exists a constant $σ > 1$ such that $ht (x) \leq σht (gx)$ for all $x \in X$ and all $g \in B_{r (x)}^{G} (e)$ .

The definition suggests to think of a good height function as reciprocal of the injectivity radius. And indeed, this viewpoint allows their construction on any homogeneous space $X = G / Γ$ .

Proposition 3.6

Let G be a real Lie group and Γ a lattice in G. Then $X = G / Γ$ admits a good height function.

Proof.

Choose R>0 such that condition (i) of the definition is satisfied and set $r (x) = min {R, r_{inj} (x)}$ , where $r_{inj} (x)$ is the injectivity radius at $x \in X$ , i.e. the maximal radius such that (ii) holds at x. Define $ht (x) = r (x)^{- 1} .$ Then the only thing that needs to be verified is the validity of (iv). We claim that it holds with $σ = 2$ . This will follow if we can show that (9) $r_{inj} (gx) \leq 2 r_{inj} (x)$ (9) whenever $g \in B_{r (x)}^{G} (e)$ . To this end, let $r > r_{inj} (x)$ . Then by definition, there are distinct $g_{1}, g_{2} \in B_{r}^{G} (e)$ such that $g_{1} x = g_{2} x$ . As $g \in B_{r (x)}^{G} (e)$ , right-invariance of the metric implies $d^{G} (g_{i} g^{- 1}, e) = d^{G} (g_{i}, g) \leq d^{G} (g_{i}, e) + d^{G} (g, e) < r + r (x) < 2 r$ for i = 1, 2, and we also have $(g_{1} g^{- 1}) gx = (g_{2} g^{- 1}) gx$ . This shows that $r_{inj} (gx) \leq 2 r$ , and as $r > r_{inj} (x)$ was arbitrary, we see that (Equation9(9) $r_{inj} (gx) \leq 2 r_{inj} (x)$ (9) ) holds.

Often, however, one might want to work with different, naturally occurring height functions. The flexibility in our definition of a good height function accommodates this possibility.

In the examples below, we denote by $λ_{1} (Λ)$ the length of a shortest non-zero vector in a lattice $Λ \subset R^{d}$ .

Example 3.7

Let $G = {SL}_{d} (R)$ and $Γ = {SL}_{d} (Z)$ . Then $X = G / Γ$ can be identified with the space of lattices in $R^{d}$ with covolume 1 via $X ∋ g {SL}_{d} (Z) ⟷ g Z^{d} \subset R^{d} .$ Then the function $ht = λ_{1}^{- 1}$ , defined on X via the above identification, is a good height function. Indeed, one can first choose R>0 such that (i) is satisfied, and then set $r (x) = min {R, r_{inj} (x)}$ as in the proof of Proposition 3.6. Then (ii) is automatically satisfied, and (iv) is valid for a suitable choice of σ due to the inequality $λ_{1} (gx) \leq ‖ g ‖ λ_{1} (x)$ for $g \in G$ and $x \in X$ , where $‖ \cdot ‖$ denotes any matrix norm. To see that also (iii) holds, let $x = g Γ$ and suppose that hx = x for some $h \in G$ with $h \neq e$ . Then for all $γ \in {SL}_{d} (Z)$ , the matrix $(gγ)^{- 1} h (gγ)$ fixes the lattice $Z^{d}$ but is not the identity, so that $‖ gγ ‖^{κ_{1}} ‖ h - e ‖ \geq ‖ (gγ)^{- 1} (h - e) (gγ) ‖ = ‖ (gγ)^{- 1} h (gγ) - e ‖ \geq c_{1}$ for some constants $c_{1}, κ_{1} > 0$ . For a basis change $γ \in {SL}_{d} (Z)$ such that $gγ$ consists of a reduced basis of the lattice x we have $‖ gγ ‖ \leq c_{2} λ_{1} (x)^{- κ_{2}}$ for some $c_{2}, κ_{2} > 0$ (cf. e.g. [Citation26, Chapter III]). With this choice, the above inequality implies $‖ h - e ‖ \geq c λ_{1} (x)^{κ}$ for $c = c_{1} / c_{2}$ and $κ = κ_{1} κ_{2}$ . Since near the identity, the metric $d^{G}$ on G is Lipschitz-equivalent to the distance induced by $‖ \cdot ‖$ , this establishes (iii).

A similar construction is possible in a more general context.

Example 3.8

[Citation13]

Let $G = G (R)$ be the group of real points of a semisimple $Q$ -group $G$ and Γ an arithmetic lattice in G. Choose a rational $Ad (Γ)$ -stable lattice $g_{Z} \subset g$ . Then, using similar reasoning as in the previous example, the function $ht$ on $X = G / Γ$ defined by $ht (x) = λ_{1} (Ad (g) g_{Z})^{- 1}$ for $x = g Γ \in X$ is seen to be a good height function (cf. [Citation13, Section 3.6]).

3.3. Sobolev norms

Given a good height function $ht$ on X, the associated Sobolev norm of degree $ℓ \geq 0$ of a compactly supported smooth function $f \in C_{c}^{\infty} (X)$ is defined by $S_{ℓ} (f)^{2} = \sum_{\deg D \leq ℓ} ‖ ht (\cdot)^{ℓ} D f ‖_{L^{2}}^{2},$ where the sum runs over differential operators $D$ given by monomials of degree at most ℓ in elements of the fixed orthonormal basis of $g$ in the universal enveloping algebra.

In other words, the differential operators $D$ appearing above are $\partial_{v_{1}} \dots \partial_{v_{k}}$ for any k-tuple $(v_{1}, \dots, v_{k})$ of elements of the fixed basis of $g$ , $0 \leq k \leq ℓ$ , where $\partial_{v}$ for $v \in g$ is defined by $\partial_{v} f (x) = lim_{t \to 0} \frac{f (\exp (tv) x) - f (x)}{t}$ for $f \in C_{c}^{\infty} (X)$ and $x \in X$ .

Here are two immediate observations.

Lemma 3.9

Let $ht$ be a good height function on X and $S_{ℓ}$ the associated Sobolev norms.

The norms $S_{ℓ}$ are induced by inner products $⟨ \cdot, \cdot ⟩_{ℓ}$ on $C_{c}^{\infty} (X)$ .
Given $0 \leq ℓ_{0} \leq ℓ_{1}$ , there exists a constant $\tilde{c} > 0$ such that $S_{ℓ_{0}} \leq \tilde{c} S_{ℓ_{1}}$ .

Proof.

Part (i) is clear. Part (ii) is also immediate from the definition of the Sobolev norms, once we know that a good height function must be bounded away from 0. The latter, however, follows directly from property (iii) in the definition of a good height function, as the function r appearing there is assumed to be bounded.

The proof of our convergence result in Section 3.4 will depend on the following proposition.

Proposition 3.10

[Citation13]

For the Sobolev norms associated to a good height function on X, there exists a non-negative integer $ℓ_{0} \geq 0$ and a constant C>0 with the following properties:

(Sobolev embedding estimate [Citation13, (3.9)]) For every $f \in C_{c}^{\infty} (X)$ it holds that $‖ f ‖_{\infty} \leq C S_{ℓ_{0}} (f)$ .
(Finite relative traces [Citation13, (3.10)]) For all integers $ℓ \geq 0$ the relative trace $Tr (S_{ℓ}^{2} | S_{ℓ + ℓ_{0}}^{2})$ is finite, meaning that for any orthogonal basis $(e^{(k)})_{k}$ in the completion of $C_{c}^{\infty} (X)$ with respect to $S_{ℓ + ℓ_{0}}$ $\begin{aligned} Tr (S_{ℓ}^{2} | S_{ℓ + ℓ_{0}}^{2}) := \sum_{k} \frac{S_{ℓ} (e^{(k)})^{2}}{S_{ℓ + ℓ_{0}} (e^{(k)})^{2}} < \infty . \end{aligned}$

We refer to Bernstein–Reznikov [Citation10] for a systematic treatment of relative traces. In particular, it is proved in this reference that the above expression is independent of the choice of orthogonal basis.

The proofs in [Citation13] of the statements in the above proposition are given for the height function from Example 3.8. However, the only properties used are those in our definition of a good height function. In fact, the arguments only depend on validity of the second statement in [Citation13, Lemma 5.1], which holds in our context, as we demonstrate below.

Lemma 3.11

Let $ht$ be a good height function on X. Then there exists a non-negative integer $ℓ_{0} \geq 0$ and a constant C>0 such that for every non-negative integer $ℓ \geq 0$ and every differential operator $D$ given by a monomial of degree at most ℓ in elements of the fixed basis of $g$ we have $| ht (x)^{ℓ} D f (x) | \leq C S_{ℓ + ℓ_{0}} (f)$ for every $f \in C_{c}^{\infty} (X)$ and $x \in X$ .

Proof.

We inspect the function $F = D f$ in a chart around x given by the exponential map: We set $ε = r (x) / 2$ , where $r : X \to (0, R]$ is the function from the definition of a good height function, $d = \dim g$ , and consider $\tilde{F} : (- ε, ε)^{d} \to R, v \mapsto F (\exp (v) x) .$ Then by the first statement of [Citation13, Lemma 5.1], which is simply a Sobolev embedding estimate on $R^{d}$ , we know (10) $| F (x) | = | \tilde{F} (0) | \leq C_{1} 2^{d} r (x)^{- d} S_{d, ε} (\tilde{F}),$ (10) where $C_{1} > 0$ is a constant depending only on the dimension d of $g$ and $S_{d, ε}$ is the standard degree d Sobolev norm on the open subset $(- ε, ε)^{d}$ of $R^{d}$ , i.e. $S_{d, ε} (\tilde{F})^{2} = \sum_{| α | \leq d} ‖ \partial_{α} \tilde{F} ‖_{L^{2} ((- ε, ε)^{d})}^{2},$ where the sum is over all multi-indices $α$ of degree at most d and $\partial_{α} \tilde{F}$ is the corresponding standard partial derivative of $\tilde{F}$ . Using property (iii) in the definition of a good height function, (Equation10(10) $| F (x) | = | \tilde{F} (0) | \leq C_{1} 2^{d} r (x)^{- d} S_{d, ε} (\tilde{F}),$ (10) ) implies that (11) $| ht (x)^{ℓ} F (x) | \leq C_{2} ht (x)^{ℓ + ℓ_{0}} S_{d, ε} (\tilde{F}),$ (11) where $C_{2} > 0$ is another constant and we used that $ht$ is bounded away from 0 to replace $κd$ appearing in the exponent by $ℓ_{0} = max {⌈ κd ⌉, d}$ . Using properties (i) and (ii) in the definition of a good height function, we find $C_{3} > 0$ such that (12) $S_{d, ε} (\tilde{F}) \leq C_{3} \sqrt{\sum_{\deg D^{'} \leq d} ‖ D^{'} F |_{B_{r (x)}^{X} (x)} ‖_{L^{2}}^{2}} .$ (12) To see this, one needs to note two things: firstly, that by the chain rule the partial derivatives of $\tilde{F}$ at a point $v \in (- ε, ε)^{d}$ in the chart can be expressed as linear combinations of derivatives $D^{'} F$ appearing on the right-hand side in (Equation12(12) $S_{d, ε} (\tilde{F}) \leq C_{3} \sqrt{\sum_{\deg D^{'} \leq d} ‖ D^{'} F |_{B_{r (x)}^{X} (x)} ‖_{L^{2}}^{2}} .$ (12) ) evaluated at the corresponding point $x^{'} = \exp (v) x$ , with fixed coefficient functions depending only on finitely many derivatives of the exponential map on $(- ε, ε)^{d}$ ; and secondly, that the Haar measure $m_{X}$ is a smooth measure, meaning that it has a smooth and nowhere vanishing density w.r.t. Lebesgue measure in the chart.

Combining (Equation11(11) $| ht (x)^{ℓ} F (x) | \leq C_{2} ht (x)^{ℓ + ℓ_{0}} S_{d, ε} (\tilde{F}),$ (11) ), (Equation12(12) $S_{d, ε} (\tilde{F}) \leq C_{3} \sqrt{\sum_{\deg D^{'} \leq d} ‖ D^{'} F |_{B_{r (x)}^{X} (x)} ‖_{L^{2}}^{2}} .$ (12) ), condition (iv) in the definition of a good height function, and plugging back in the definition of F, we finally arrive at $| ht (x)^{ℓ} D f (x) | \leq C_{4} \sqrt{\sum_{\deg D^{'} \leq d} ‖ ht (\cdot)^{ℓ + ℓ_{0}} D^{'} D f |_{B_{r (x)}^{X} (x)} ‖_{L^{2}}^{2}} \leq C_{4} S_{ℓ + ℓ_{0}} (f),$ for yet another constant $C_{4} > 0$ , which is the one appearing in the lemma.

3.4. Exponentially generic points

Now we are ready to define the notion of effective genericity we wish to establish, and to prove the main convergence result of this section.

Until the end of this section, we fix a good height function $ht$ on X. Moreover, given a bounded measurable function f on X and $n \in N$ we will use the notation $D_{n} (f) (x) = π (μ)^{n} f (x) - \int f d m_{X}$ for $x \in X$ . We refer to $D_{n} (f)$ as the time n discrepancy for the function f.

Definition 3.12

We say that a point $x \in X$ is $(ℓ, β)$ -exponentially generic if $ℓ \geq 0$ is a non-negative integer and β a real number in $(0, 1)$ satisfying $\begin{aligned} \underset{n \to \infty}{lim sup} sup_{f \in C_{c}^{\infty} (X) ∖ {0}} {(\frac{| D_{n} (f) (x) |}{S_{ℓ} (f)})}^{1 / n} \leq β, \end{aligned}$ where $S_{ℓ}$ is the degree ℓ Sobolev norm associated to $ht$ .

With this terminology, we have the following result, which quantifies the dependence on the function f in the effective part of Theorem 3.2.

Theorem 3.13

Let G be a real Lie group, $Γ \subset G$ a lattice and $X = G / Γ$ endowed with the Haar measure $m_{X}$ . Suppose that µ has a spectral gap on X. Then there exists a non-negative integer $ℓ_{1} \geq 0$ such that $m_{X}$ -almost every point $x \in X$ is $(ℓ_{1}, ρ (π (μ) |_{L_{0}^{2}}))$ -exponentially generic.

Our argument uses ideas from the proof of [Citation13, Proposition 9.2]. Recall that $⟨ \cdot, \cdot ⟩_{ℓ}$ denotes the inner product associated to the Sobolev norm $S_{ℓ}$ .

Proof.

Set $ℓ_{1} = 2 ℓ_{0}$ with $ℓ_{0}$ from Proposition 3.10. We denote by $H$ the completion of $C_{c}^{\infty} (X)$ with respect to $S_{ℓ_{1}}$ .

The first step of the proof is to argue that $H$ admits an orthonormal basis $(e^{(k)})_{k}$ with respect to $S_{ℓ_{1}}$ that is also orthogonal with respect to $S_{ℓ_{0}}$ . To this end, let us endow $H$ with the scalar product $⟨ \cdot, \cdot ⟩_{ℓ_{1}}$ associated to $S_{ℓ_{1}}$ . This makes $H$ into a Hilbert space. As a consequence of Lemma 3.9(ii), $⟨ \cdot, \cdot ⟩_{ℓ_{0}}$ defines a bounded positive definite Hermitian form on $(H, ⟨ \cdot, \cdot ⟩_{ℓ_{1}})$ . Using Riesz representation it follows that there is a bounded positive self-adjoint operator T on $(H, ⟨ \cdot, \cdot ⟩_{ℓ_{1}})$ such that $⟨ v, w ⟩_{ℓ_{0}} = ⟨ Tv, w ⟩_{ℓ_{1}}$ for all $v, w \in H$ . Finiteness of the relative trace $Tr (S_{ℓ_{0}}^{2} | S_{ℓ_{1}}^{2})$ from Proposition 3.10(ii) then translates into the statement that T is a trace-class operator on $(H, ⟨ \cdot, \cdot ⟩_{ℓ_{1}})$ (cf. [Citation14, Proposition 6.44]); in particular, the operator T is compact (cf. [Citation14, Proposition 6.42]). By the spectral theorem, T is thus diagonalizable. Hence, an orthonormal basis $(e^{(k)})_{k}$ of $(H, ⟨ \cdot, \cdot ⟩_{ℓ_{1}})$ consisting of eigenvectors of T is a basis with the desired properties.

Next, fix rational numbers $ρ (π (μ) |_{L_{0}^{2}}) < α < 1$ and $ε \in (0, 1)$ . As in the proof of Theorem 3.2, using Chebyshev's inequality we find that for every $k \geq 0$ and large enough n we have (13) $\begin{aligned} m_{X} ({x \in X | | D_{n} (e^{(k)}) (x) | \geq α^{n (1 - ε)} S_{ℓ_{0}} (e^{(k)})}) \\ \leq \frac{‖ e_{0}^{(k)} ‖_{L^{2}}^{2}}{S_{ℓ_{0}} (e^{(k)})^{2}} α^{2 εn} \leq \frac{‖ e^{(k)} ‖_{L^{2}}^{2}}{S_{ℓ_{0}} (e^{(k)})^{2}} α^{2 εn}, \end{aligned}$ (13) where $e_{0}^{(k)} = e^{(k)} - \int e^{(k)} d m_{X}$ . Since the relative trace $Tr (S_{0}^{2} | S_{ℓ_{0}}^{2})$ is finite by Proposition 3.10, the terms on the right-hand side of (Equation13(13) $\begin{aligned} m_{X} ({x \in X | | D_{n} (e^{(k)}) (x) | \geq α^{n (1 - ε)} S_{ℓ_{0}} (e^{(k)})}) \\ \leq \frac{‖ e_{0}^{(k)} ‖_{L^{2}}^{2}}{S_{ℓ_{0}} (e^{(k)})^{2}} α^{2 εn} \leq \frac{‖ e^{(k)} ‖_{L^{2}}^{2}}{S_{ℓ_{0}} (e^{(k)})^{2}} α^{2 εn}, \end{aligned}$ (13) ) are summable over $k, n \geq 0$ . Borel–Cantelli thus implies that $\begin{aligned} \underset{k, n \geq 0}{lim sup} {x \in X | | D_{n} (e^{(k)}) (x) | \geq α^{n (1 - ε)} S_{ℓ_{0}} (e^{(k)})} \end{aligned}$ is a null set. Let $A_{α, ε}$ be the complement of this null set. We claim that any $x \in A_{α, ε}$ is $(ℓ_{1}, α^{1 - ε})$ -exponentially generic. Fix such a point x. Then we know that there are only finitely many pairs $(k, n)$ with $| D_{n} (e^{(k)}) (x) | \geq α^{n (1 - ε)} S_{ℓ_{0}} (e^{(k)})$ . Thus, there exists $n_{0}$ such that for $n \geq n_{0}$ the inequality $| D_{n} (e^{(k)}) (x) | < α^{n (1 - ε)} S_{ℓ_{0}} (e^{(k)})$ holds for all k. Now let $f \in C_{c}^{\infty} (X) ∖ {0}$ be arbitrary and write $f = \sum_{k} f_{k} e^{(k)}$ for the expansion of f in terms of the orthonormal basis $(e^{(k)})_{k}$ . Then, using the triangle inequality, we can estimate the time n discrepancy for f as follows: (14) $| D_{n} (f) (x) | \leq \sum_{k} | f_{k} | | D_{n} (e^{(k)}) (x) | .$ (14) The exchange of integral and summation involved in the above estimate is justified by part (i) of Proposition 3.10: It ensures that the functions $e^{(k)}$ are defined pointwise and the series expansion of f converges uniformly. Next, for $n \geq n_{0}$ an application of the Cauchy–Schwarz inequality implies that the right-hand side of (Equation14(14) $| D_{n} (f) (x) | \leq \sum_{k} | f_{k} | | D_{n} (e^{(k)}) (x) | .$ (14) ) is strictly less than (15) $α^{n (1 - ε)} {(\sum_{k} | f_{k} |^{2})}^{1 / 2} {(\sum_{k} S_{ℓ_{0}} (e^{(k)})^{2})}^{1 / 2} = α^{n (1 - ε)} S_{ℓ_{1}} (f) Tr (S_{ℓ_{0}}^{2} | S_{ℓ_{1}}^{2})^{1 / 2} .$ (15) Again by Proposition 3.10, the relative trace $Tr (S_{ℓ_{0}}^{2} | S_{ℓ_{1}}^{2})$ is finite. Hence, in view of our definition of exponential genericity and the fact that $n_{0}$ does not depend on f, combining (Equation14(14) $| D_{n} (f) (x) | \leq \sum_{k} | f_{k} | | D_{n} (e^{(k)}) (x) | .$ (14) ) and (Equation15(15) $α^{n (1 - ε)} {(\sum_{k} | f_{k} |^{2})}^{1 / 2} {(\sum_{k} S_{ℓ_{0}} (e^{(k)})^{2})}^{1 / 2} = α^{n (1 - ε)} S_{ℓ_{1}} (f) Tr (S_{ℓ_{0}}^{2} | S_{ℓ_{1}}^{2})^{1 / 2} .$ (15) ) establishes the claim. It follows that all x in a countable intersection of the sets $A_{α, ε}$ over rational numbers α approaching $ρ (π (μ) |_{L_{0}^{2}})$ and ε approaching 0 from above are $(ℓ_{1}, ρ (π (μ) |_{L_{0}^{2}}))$ -exponentially generic, giving the theorem.

Remark 3.14

In analogy to Remark 3.3, we can control the measure of the set of points where exponentially generic behaviour is not observed for a given number of steps: If we define $\begin{aligned} B_{α, ε, n} = {x \in X | | D_{n^{'}} (f) (x) | \geq α^{n^{'} (1 - ε)} S_{ℓ_{1}} (f) & Tr (S_{ℓ_{0}}^{2} | S_{ℓ_{1}}^{2})^{1 / 2} \\ for some n^{'} \geq n, f \in C_{c}^{\infty} (X)} \end{aligned}$ for $ρ (π (μ) |_{L_{0}^{2}}) < α < 1$ , $ε \in (0, 1)$ and $n \in N$ , and $N \in N$ is chosen such that $‖ π (μ) |_{L_{0}^{2}}^{n} ‖_{op} \leq α^{n}$ for all $n \geq N$ , then for every $n \geq N$ it holds that $m_{X} (B_{α, ε, n}) \leq Tr (S_{0}^{2} | S_{ℓ_{0}}^{2}) \frac{α^{2 εn}}{1 - α^{2 ε}} .$ Indeed, we have $B_{α, ε, n} \subset ⋃_{n^{'} \geq n, k \geq 0} {x \in X | | D_{n^{'}} (e^{(k)}) (x) | \geq α^{n^{'} (1 - ε)} S_{ℓ_{0}} (e^{(k)})}$ , as the proof of Theorem 3.13 demonstrates. Thus, again, the measure of the set of ‘bad points’, on which exponential genericity takes more than n steps to manifest, is itself exponentially small in n.

4. Uniform Cesàro convergence

In this last section, we explore the situation where the only possible limit in Theorem 1.1 is the normalized Haar measure $m_{X}$ . In this setting, by analogy with the case of unique ergodicity in classical ergodic theory, it is reasonable to expect the Cesàro convergence (Equation2(2) $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x_{0}} ⟶ ν_{x_{0}}$ (2) ) to hold (locally) uniformly in the starting point $x_{0}$ . We shall prove in Section 4.1 below that this indeed holds true. In Section 4.2, we conclude the article by showing that in many naturally occurring situations something even stronger than locally uniform can be achieved.

Before continuing with the pertinent definitions, let us recall that even though the setup of Theorem 1.1 is our motivation and useful to have in mind, formally we are working with the assumptions stated at the end of Section 1: $(X, m_{X})$ is merely required to be a space with a G-action for which $m_{X}$ is invariant and ergodic.

Definition 4.1

A probability measure ν on X is called µ-stationary if $μ * ν = ν$ . The random walk on X induced by µ is called uniquely ergodic if $m_{X}$ is the unique µ-stationary probability measure on X.

In particular, for a random walk to be uniquely ergodic, there must be no finite $G$ -orbits in X, where $G$ denotes the closed subgroup of G generated by µ. In the case that $X = G / Γ$ for a lattice Γ in G, this happens if and only if $G$ is not virtually contained in a conjugate of Γ. (Recall that a subgroup H of G is said to be virtually contained in a subgroup L of G if $H \cap L$ has finite index in H.) In fact, in many cases of interest, finite orbits are the only obstruction to unique ergodicity: For example, this is true when G is a connected semisimple Lie group without compact factors, Γ is an irreducible lattice, $X = G / Γ$ , and $Ad (S)$ is Zariski dense in $Ad (G)$ (see [Citation8, Corollary 1.8]); and also in the setting of [Citation27], a special case of which is reproduced below as Example 4.8.

4.1. Locally uniform convergence

The notion of unique ergodicity introduced above coincides with the classical property of unique ergodicity of the Markov operator $π (μ)$ . When the space X is compact, this is enough to guarantee that the Cesàro convergence $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x} \to m_{X}$ as $n \to \infty$ is uniform in x (see e.g. [Citation16, Section 5.1]). Without compactness, we also need to assume a form of recurrence.

Definition 4.2

We say that the random walk on X given by µ is locally uniformly recurrent if for every compact subset $K \subset X$ and $ε > 0$ there exists $n_{0} \in N$ and a compact subset $M \subset X$ with $μ^{* n} * δ_{x} (M) \geq 1 - ε$ for all $n \geq n_{0}$ and $x \in K$ . It is called locally uniformly recurrent on average if the above holds with the Cesàro averages $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x}$ in place of $μ^{* n} * δ_{x}$ .

It is a simple exercise to check that locally uniform recurrence implies locally uniform recurrence on average. In concrete examples, recurrence properties such as these are typically established by constructing a Lyapunov function; see Section 4.2 below.

The following well-known fact explains why these properties are referred to as ‘non-escape of mass’.

Lemma 4.3

Let the sequence ${x_{n}}_{n}$ of points in X be relatively compact and suppose that the random walk on X is locally uniformly recurrent (resp. on average). Then every weak* limit of the sequence $(μ^{* n} * δ_{x_{n}})_{n}$ (resp. $(\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x_{n}})_{n}$ ) is a probability measure. □

The proof is immediate and left to the reader.

We are now ready to state and prove our first result on locally uniform Cesàro convergence.

Theorem 4.4

Suppose that the random walk on X induced by µ is uniquely ergodic and locally uniformly recurrent on average. Then for every $f \in C_{c} (X)$ , every compact $K \subset X$ , and every $ε > 0$ , there exists $n_{0} \in N$ such that for every $n \geq n_{0}$ and $x \in K$ we have $| \frac{1}{n} \sum_{k = 0}^{n - 1} \int_{X} f d (μ^{* k} * δ_{x}) - \int_{X} f d m_{X} | < ε .$ Equivalently, considering the space of probability measures on X as endowed with the weak* topology, the sequence of functions $X ∋ x \mapsto \frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x}$ converges to $m_{X}$ uniformly on compact subsets of X as $n \to \infty$ .

Proof.

The equivalence of the two formulations is due to the definition of neighbourhoods in the weak* topology by finitely many test functions in $C_{c} (X)$ .

To prove the statement for individual functions, we proceed by contradiction. If the conclusion is false, then for some $f \in C_{c} (X)$ , $K \subset X$ compact and $ε > 0$ there exist indices $n (j) \to \infty$ and $x_{j} \in K$ with (16) $| \frac{1}{n (j)} \sum_{k = 0}^{n (j) - 1} \int_{X} f d (μ^{* k} * δ_{x_{j}}) - \int_{X} f d m_{X} | \geq ε$ (16) for all $j \in N$ . Let ν be a weak* limit point of the sequence ${(\frac{1}{n (j)} \sum_{k = 0}^{n (j) - 1} μ^{* k} * δ_{x_{j}})}_{j} .$ Then ν is µ-stationary, and a probability measure because of our recurrence assumption and the fact that all $x_{j}$ lie in the fixed compact set K Lemma 4.3. But by unique ergodicity this forces $ν = m_{X}$ , contradicting (Equation16(16) $| \frac{1}{n (j)} \sum_{k = 0}^{n (j) - 1} \int_{X} f d (μ^{* k} * δ_{x_{j}}) - \int_{X} f d m_{X} | \geq ε$ (16) ).

4.2. Lyapunov functions & stronger uniformity

Loosely speaking, (Foster–)Lyapunov functions are functions enjoying certain contraction properties with respect to the random walk, to the effect that (on average) its dynamics are directed towards the ‘centre’ of the space, where the function takes values below some threshold. They were introduced into the study of random walks on homogeneous spaces by Eskin–Margulis [Citation15], whose ideas were further developed by Benoist–Quint [Citation6].

Definition 4.5

A measurable function $V : X \to [0, \infty]$ is called a Lyapunov function for the random walk on X induced by µ if

it is proper, in the sense that the sublevel sets $V^{- 1} ([0, L])$ are relatively compact for $L \in [0, \infty)$ , and
there exist constants $α < 1$ , $β \geq 0$ such that $π (μ) V \leq αV + β$ , where $π (μ)$ is the convolution operator associated to µ introduced in Section 3.

The inequality in the second condition above is referred to as the contraction property of V.

Allowing Lyapunov functions to take the value ∞ is conceptually important for the proofs of results such as Theorem 1.1, in order to show that the random walk does not accumulate near a lower-dimensional homogeneous subspace. Also, affording the possibility of non-continuous Lyapunov functions is crucial in recent constructions given in the literature [Citation6,Citation19]. For the purposes of the discussion in this section, however, it is no big restriction to have in mind the case of a continuous Lyapunov function which is finite on all of X.

Remark 4.6

Let us collect some immediate observations about Lyapunov functions.

If V is a Lyapunov function, then so are cV and V + c for any constant c>0. In particular, one may impose an arbitrary lower bound on V, so that it is no restriction to assume that a Lyapunov function takes values $\geq 1$ , say.
Given a Lyapunov function $V^{'} : X \to [0, \infty]$ for the $n_{0}$ -step random walk (induced by the convolution power $μ^{* n_{0}}$ ), one can construct a Lyapunov function V for the random walk given by µ itself by setting $V = \sum_{k = 0}^{n_{0} - 1} α^{\frac{n_{0} - 1 - k}{n_{0}}} π (μ)^{k} V^{'} .$
By enlarging α and using properness, the contraction property in the definition of a Lyapunov function V may be replaced by $π (μ) V \leq αV + β 1_{K}$ for some compact $K \subset X$ , where $1_{K}$ denotes the indicator function of K (cf. [Citation17, Lemma 15.2.8]).

Two examples in which a Lyapunov function exists are the following.

Example 4.7

[Citation15]

Identify $X = {SL}_{2} (R) / {SL}_{2} (Z)$ with the space of unimodular lattices in $R^{2}$ as in Example 3.7 and recall that we denote by $λ_{1} (x)$ the length of a shortest non-zero vector in $x \in X$ . Then for every compactly supported probability measure µ on G whose support generates a Zariski dense subgroup there exist $ε, δ > 0$ such that $V^{'} = 1 + ε λ_{1}^{- δ}$ is a finite continuous Lyapunov function for the $n_{0}$ -step random walk on X induced by $μ^{* n_{0}}$ for some $n_{0} \in N$ . This construction can be generalized to higher dimensions by taking into account the higher successive minima $λ_{2}, \dots, λ_{d}$ of lattices in $R^{d}$ . A more advanced construction also ensures existence of Lyapunov functions for Zariski dense probability measures with finite exponential moments when $G = G (R)$ is the group of real points of a Zariski connected semisimple algebraic group $G$ defined over $R$ such that G has no compact factors.

Example 4.8

[Citation27]

Let $G = {SL}_{d + 1} (R)$ , $Γ = {SL}_{d + 1} (Z)$ and $X = G / Γ$ . For $0 \leq i \leq m$ let $c_{i} > 1$ be positive real numbers, $y_{i} \in R^{d}$ vectors such that $y_{0} = 0$ and $y_{1}, \dots, y_{m}$ span $R^{d}$ , $O_{i} \in {SO}_{d} (R)$ and set $g_{i} = (\begin{matrix} c_{i} O_{i} & y_{i} \\ 0 & c_{i}^{- d} \end{matrix}) \in G .$ Then for any choice of $p_{0}, \dots, p_{m} > 0$ with $\sum_{i = 0}^{m} p_{i} = 1$ , the measure $μ = \sum_{i = 0}^{m} p_{i} δ_{g_{i}}$ defines a uniquely ergodic random walk on X admitting a finite continuous Lyapunov function.

It is well known that existence of a Lyapunov function implies recurrence properties of the random walk.

Lemma 4.9

[Citation15, Lemma 3.1]

Suppose the random walk on X given by µ admits a finite continuous Lyapunov function V. Then this random walk is locally uniformly recurrent.

The intuitive reason for this behaviour is simple: The contraction property means that after a step of the random walk, the value of the Lyapunov function V on average gets smaller by a constant factor, at least when starting outside some compact set K (cf. Remark 4.6(iii) above), which one can think of as the ‘centre’ of the space. The set K can be chosen as (closure of) a sublevel set of V. By the contraction property, the number of steps required to reach it is uniform over starting points x in any given sublevel set of V, or in any given compact subset of X in the case that V is finite and continuous. This suggests that we might even let the starting points diverge, as long as this divergence is outcompeted by the geometric rate of contraction of V. We are led to the following notion of recurrence.

Definition 4.10

Let $(K_{n})_{n}$ be a sequence of subsets of X. We say that the random walk on X given by µ is $(K_{n})_{n}$ -uniformly recurrent if for every $ε > 0$ there exists $n_{0} \in N$ and a compact subset $M \subset X$ with $μ^{* n} * δ_{x} (M) \geq 1 - ε$ for all $n \geq n_{0}$ and $x \in K_{n}$ . It is called $(K_{n})_{n}$ -uniformly recurrent on average if the above holds with the Cesàro averages $\frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x}$ in place of $μ^{* n} * δ_{x}$ .

Remark 4.11

We point out that contrary to the locally uniform situation, for the two versions of this property (with/without average) it is generally not clear whether one implies the other.

We are now going to establish such recurrence properties for certain families $(K_{n})_{n}$ of sublevel sets of Lyapunov functions, which can be chosen to be increasing and to exhaust the part of X where the Lyapunov function is finite. Recall that the Lyapunov exponent of a function $φ : N \to [1, \infty)$ is the exponential growth rate $λ (φ) = \underset{n \to \infty}{lim sup} \frac{1}{n} \log φ (n) .$ If $λ (φ) = 0$ , we say that φ has sub-exponential growth.

Proposition 4.12

Let $φ : N \to [1, \infty)$ be a function. Suppose that the random walk on X induced by µ admits a Lyapunov function V with contraction factor $α < 1$ and set $K_{n} = V^{- 1} ([0, φ (n)])$ .

If φ has Lyapunov exponent $λ (φ) < \log (α^{- 1})$ , then the random walk on X given by µ is $(K_{n})_{n}$ -uniformly recurrent. The number $n_{0}$ in the definition can be chosen independently of ε.
If φ has sub-exponential growth, then the random walk on X given by µ is $(K_{n})_{n}$ -uniformly recurrent on average.

The proof is a refinement of the methods in [Citation6,Citation15].

Proof.

Let $α, β$ be the constants from the contraction property of V and define $B = \frac{β}{1 - α}$ . We are going to use the same set M for both parts of the proposition, namely $M = \bar{V^{- 1} ([0, 2 B / ε])}$ , which is compact since V is proper. Then for $n \in N$ and $x \in K_{n}$ we find, by repeatedly using the contraction property of V, $μ^{* n} * δ_{x} (M^{c}) \leq \frac{ε}{2 B} π (μ)^{n} V (x) \leq \frac{ε}{2 B} (α^{n} V (x) + B) \leq \frac{ε}{2 B} α^{n} φ (n) + \frac{ε}{2} .$ When the exponential growth rate of φ is less than $\log (α^{- 1})$ , for some $n_{0} \in N$ we have $α^{n} φ (n) \leq B$ for all $n \geq n_{0}$ . This proves (i).

In order to prove (ii) we use a similar estimate, but have to ensure that the values $μ^{* k} * δ_{x} (M^{c})$ are small for a sufficiently large proportion of $0 \leq k < n$ . For $x \in K_{n}$ we find, as above, (17) $μ^{* k} * δ_{x} (M^{c}) \leq \frac{ε}{2 B} α^{k} φ (n) + \frac{ε}{2} .$ (17) Using straightforward manipulations, we further see $α^{k} φ (n) \leq B / 2 ⟺ \frac{k}{n} \geq \log (α^{- 1})^{- 1} (\frac{1}{n} \log φ (n) - \frac{1}{n} \log (B / 2)),$ the right-hand side of which tends to 0 as $n \to \infty$ by sub-exponential growth of φ. Hence, with $k (n) = ⌊ εn / 4 ⌋$ , we may choose $n_{0}$ large enough to ensure the above inequality holds for all $k \geq k (n)$ for $n \geq n_{0}$ . For such n we conclude, using (Equation17(17) $μ^{* k} * δ_{x} (M^{c}) \leq \frac{ε}{2 B} α^{k} φ (n) + \frac{ε}{2} .$ (17) ), $\begin{aligned} \frac{1}{n} \sum_{k = 0}^{n - 1} μ^{* k} * δ_{x} (M^{c}) & = \frac{1}{n} \sum_{k = 0}^{k (n) - 1} μ^{* k} * δ_{x} (M^{c}) + \frac{1}{n} \sum_{k = k (n)}^{n - 1} μ^{* k} * δ_{x} (M^{c}) \\ \leq \frac{k (n)}{n} + \frac{3 ε}{4} \leq ε, \end{aligned}$ which ends the proof of (ii).

Theorem 4.4 can now be strengthened in the following way.

Theorem 4.13

In addition to the assumptions of Theorem 4.4, suppose that the random walk on X induced by µ admits a Lyapunov function V. Let $φ : N \to [1, \infty)$ have sub-exponential growth. Then for every $f \in C_{c} (X)$ we have $lim_{n \to \infty} sup_{V (x) \leq φ (n)} | \frac{1}{n} \sum_{k = 0}^{n - 1} \int_{X} f d (μ^{* k} * δ_{x}) - \int_{X} f d m_{X} | = 0.$

Proof.

Using $(K_{n})_{n}$ -uniform recurrence on average for $K_{n} = V^{- 1} ([0, φ (n)])$ from Proposition 4.12(ii), the proof of Theorem 4.4 goes through with the obvious modifications.

Acknowledgments

The author would like to express his gratitude to Andreas Wieser for valuable comments on preliminary versions of the article, and to Manfred Einsiedler for explaining how relative traces can be used to make separability effective. Thanks also go to HIM Bonn and the organizers of the trimester program ‘Dynamics: Topology and Numbers’, in the course of which parts of this manuscript were completed, for hospitality and providing an excellent working environment. Finally, the author is grateful to the anonymous referee for pointing out a simple way to establish a better speed of convergence in Theorems 3.2 and 3.13.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

Bekka M.B., On uniqueness of invariant means, Proc. Amer. Math. Soc. 126 (1998), pp. 507–514.
Web of Science ®Google Scholar
Bénard T., Equidistribution of mass for random processes on finite-volume spaces, Israel J. Math.255 (2023), pp. 417–422.
Web of Science ®Google Scholar
Bénard T. and de Saxcé N., Random walks with bounded first moment on finite-volume spaces, Geom. Funct. Anal. 32 (2022), pp. 687–724.
Web of Science ®Google Scholar
Benoist Y. and Quint J.-F., Mesures stationnaires et fermés invariants des espaces homogènes, Ann. Math. (2) 174 (2011), pp. 1111–1162.
Google Scholar
Benoist Y. and Quint J.-F., Introduction to random walks on homogeneous spaces, Jpn. J. Math. 7 (2012), pp. 135–166.
Web of Science ®Google Scholar
Benoist Y. and Quint J.-F., Random walks on finite volume homogeneous spaces, Invent. Math. 187 (2012), pp. 37–59.
Web of Science ®Google Scholar
Benoist Y. and Quint J.-F., Stationary measures and invariant subsets of homogeneous spaces (II), J. Amer. Math. Soc. 26 (2013), pp. 659–734.
Web of Science ®Google Scholar
Benoist Y. and Quint J.-F., Stationary measures and invariant subsets of homogeneous spaces (III), Ann. Math. (2) 178 (2013), pp. 1017–1059.
Web of Science ®Google Scholar
Benoist Y. and Quint J.-F., Random Walks on Reductive Groups, Springer, Cham, 2016.
Google Scholar
Bernstein J. and Reznikov A., Sobolev norms of automorphic functionals, Int. Math. Res. Not. 2002 (2002), pp. 2155–2174.
Web of Science ®Google Scholar
Breuillard E. F., Equidistribution of random walks on nilpotent Lie groups and homogeneous spaces, PhD thesis, Yale University, 2004.
Google Scholar
Davis Buenger C., Quantitative non-divergence, effective mixing, and random walks on homogeneous spaces, PhD thesis, The Ohio State University, 2016.
Google Scholar
Einsiedler M., Margulis G., and Venkatesh A., Effective equidistribution for closed orbits of semisimple groups on homogeneous spaces, Invent. Math. 177 (2009), pp. 137–212.
Web of Science ®Google Scholar
Einsiedler M. and Ward T., Functional Analysis, Spectral Theory, and Applications, Springer, Cham, 2017.
Google Scholar
Eskin A. and Margulis G., Recurrence properties of random walks on finite volume homogeneous manifolds, in Random walks and geometry, Vadim A. Kaimanovich, ed., De Gruyter, Berlin, 2004, pp. 431–444. Proceedings of a Workshop at the Erwin Schrödinger Institute, Vienna, June 18–July 13, 2001. Corrected version: https://www.math.uchicago.edu/eskin/return.ps.
Google Scholar
Krengel U., Ergodic Theorems, de Gruyter, Berlin, 1985.
Google Scholar
Meyn S. and Tweedie R. L., Markov Chains and Stochastic Stability, 2nd ed., Cambridge University Press, Cambridge, 2009.
Google Scholar
Prohaska R., Spread out random walks on homogeneous spaces, Ergodic Theory Dynam. Syst. 41 (2021), pp. 3439–3473.
Web of Science ®Google Scholar
Prohaska R., Sert C., and Shi R., Expanding measures: Random walks and rigidity on homogeneous spaces, Forum Math. Sigma 11(e59) (2023), pp. 1–61.
Google Scholar
Raghunathan M.S., Discrete Subgroups of Lie Groups, Springer, Berlin, 1972.
Google Scholar
Ratner M., On measure rigidity of unipotent subgroups of semisimple groups, Acta Math. 165 (1990), pp. 229–309.
Web of Science ®Google Scholar
Ratner M., Strict measure rigidity for unipotent subgroups of solvable groups, Invent. Math. 101 (1990), pp. 449–482.
Web of Science ®Google Scholar
Ratner M., On Raghunathan's measure conjecture, Ann. Math. (2) 134 (1991), pp. 545–607.
Web of Science ®Google Scholar
Ratner M., Raghunathan's topological conjecture and distributions of unipotent flows, Duke Math. J. 63 (1991), pp. 235–280.
Web of Science ®Google Scholar
Shalom Y., Explicit Kazhdan constants for representations of semisimple and arithmetic groups, Ann. Inst. Fourier (Grenoble) 50 (2000), pp. 833–863.
Web of Science ®Google Scholar
Siegel C. L., Lectures on the Geometry of Numbers, Springer, Berlin, 1989.
Google Scholar
Simmons D. and Weiss B., Random walks on homogeneous spaces and diophantine approximation on fractals, Invent. Math. 216 (2019), pp. 337–394.
Web of Science ®Google Scholar
Zimmer R. J., Ergodic Theory and Semisimple Groups, Birkhäuser, Boston, 1984.
Google Scholar

Aspects of convergence of random walks on finite volume homogeneous spaces

Abstract

1. Introduction

Benoist–Quint [Citation8]

1.1. Standing assumptions & notation

2. Periodicity

2.1. Examples

2.2. An aperiodicity criterion

Proof of Theorem 2.5

3. Spectral gap

3.1. Generic points

3.2. Good height functions

[Citation13]

3.3. Sobolev norms

[Citation13]

3.4. Exponentially generic points

4. Uniform Cesàro convergence

4.1. Locally uniform convergence

4.2. Lyapunov functions & stronger uniformity

[Citation15]

[Citation27]

[Citation15, Lemma 3.1]

Acknowledgments

Disclosure statement

References

Information for

Open access

Opportunities

Help and information

Aspects of convergence of random walks on finite volume homogeneous spaces

Abstract

1. Introduction

Benoist–Quint [Citation8]

1.1. Standing assumptions & notation

2. Periodicity

2.1. Examples

2.2. An aperiodicity criterion

Proof of Theorem 2.5

3. Spectral gap

3.1. Generic points

3.2. Good height functions

[Citation13]

3.3. Sobolev norms

[Citation13]

3.4. Exponentially generic points

4. Uniform Cesàro convergence

4.1. Locally uniform convergence

4.2. Lyapunov functions & stronger uniformity

[Citation15]

[Citation27]

[Citation15, Lemma 3.1]

Acknowledgments

Disclosure statement

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date