Full article: On the word problem for weakly compressible monoids

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

We study the language-theoretic properties of the word problem, in the sense of Duncan & Gilman, of weakly compressible monoids, as defined by Adian & Oganesian. We show that if C is a reversal-closed super- $AFL$ , as defined by Greibach, then M has word problem in C if and only if its compressed left monoid L(M) has word problem in C. As a special case, we may take C to be the class of context-free or indexed languages. As a corollary, we find many new classes of monoids with decidable rational subset membership problem. Finally, we show that it is decidable whether a one-relation monoid containing a non-trivial idempotent has context-free word problem. This answers a generalization of a question first asked by Zhang in 1992.

Keywords:

2020 Mathematics Subject Classification:

1 Introduction

The word problem for finitely presented monoids was introduced in 1914 by Thue [Citation39]. Though this problem would eventually turn out to be undecidable in general, via the remarkable work by Markov [Citation31] and Post [Citation38], there are many tractable cases. Particularly the word problem for one-relation monoids has garnered a great deal of attention. Though it remains a tantalizing open problem whether this problem is decidable, much can be said (see especially the recent survey [Citation35]). Thue himself studied this problem, and solved the problem for some cases when the single defining relation is of the form w = 1. Such monoids are known as special one-relation monoids. Adian [Citation1, Citation2] studied special monoids in-depth, and proved that the word problem is decidable for all special one-relation monoids, via a reduction to the word problem for one-relator groups; this latter problem is decidable by Magnus’ famous theorem [Citation26]. Adian’s student Makanin [Citation28, Citation29, Citation33] later extended this to show that the word problem for any special k-relation monoid reduces to the word problem for some k-relator group. The author has recently proved similar language-theoretic reductions for special monoids [Citation36].

The work by Adian on special monoids would later be extended by Adian and his student Oganesian in the joint work [Citation3]. They defined a form of “compression,” which is called weak compression by the author in [Citation35]. A monoid M with defining relations u_i = v_i (for $1 \leq i \leq k$ ) is called weakly compressible if there is some non-empty word α such that every u_i and v_i begins and ends with α. Associated to any weakly compressible monoid is a “compressed” left monoid L(M), in which the sum of the lengths of the defining relations are shorter than in the original monoid. The main idea is that the word problem for M can be reduced to the word problem for L(M). In particular, by combining this with Adian’s result, this solves the word problem in the class of subspecial one-relation monoids, being the class of monoids which can be compressed to a special one-relation monoid. Subspecial monoids were studied already by Lallement [Citation24], and are (essentially) the one-relation monoids containing a non-trivial idempotent.

Weak compression of subspecial monoids has received some further attention in the past few decades. Zhang [Citation41] extended the above results on the word problem to solve the conjugacy problem in certain subspecial monoids. Kobayashi [Citation22] used compression to prove that all one-relation monoids satisfy the topological finiteness condition $FDT$ , for which the subspecial case was the only outstanding case. Building on this, Gray & Steinberg showed that all one-relation monoids satisfy the homological finiteness property ${FP}_{\infty}$ , at the same time studying the algebraic properties of subspecial monoids [Citation14].

In another direction, a highly successful approach to studying the word problem for groups came through the methods of formal language theory. Such methods were introduced to group theory by An i¯s i¯mov [Citation5], and focus on the language-theoretic properties of the set of all words representing the identity element, calling this set the word problem of the group (its properties are generally independent of the finite generating set chosen). An i¯s i¯mov showed that the word problem of a group is a regular language if and only if the group is finite, and that the class of groups with context-free word problem is closed under free products. A little over ten years later, Muller & Schupp (supplemented by [Citation12]) gave a remarkable algebraic charactersiation, generally referred to as the Muller–Schupp theorem: a finitely generated group has context-free word problem if and only if it is virtually free [Citation32].

The language of words representing the identity element in a monoid is generally not very enlightening (though for special monoids it is [Citation36], as in the case of groups). To remedy this, Duncan & Gilman introduced a new language which encodes equality in a monoid [Citation11]. This set, which is also termed the word problem for the monoid, has many of the attractive properties of the analogous set for groups, such as invariance under generating set chosen. Furthermore, it is easy to show that a monoid has regular word problem if and only if it is finite, mimicking An i¯s i¯mov’s result. The word problem also generally behaves well under taking free products [Citation9, Citation37]. This notion of the word problem has also been studied in e.g. [Citation17, Citation18].

This paper seeks to study the language-theoretic properties of Adian & Oganesian’s weak compression. We will paint a fairly complete picture. The main theorem of this article (Theorem 1.1) will involve classes of languages C which are super- $AFL$ s, as introduced by Greibach [Citation15]. Such classes generalize the context-free languages and the indexed languages; roughly speaking, a full $AFL$ is a super- $AFL$ if it is closed under “recursion” (see Section 2.2 for details). We will prove the following main theorem:

Theorem 1.1.

Let M be a weakly compressible monoid, and let L(M) be its left monoid. Let C be a reversal-closed super- $AFL$ . Then M has word problem in C if and only if L(M) has word problem in C.

An overview of the article is as follows. In Section 2 we give some notation, useful concepts, and define two language-theoretic operations from [Citation37]. In Section 3, we define weak compression, amalgamating several authors’ definitions and notations. In Section 4, we prove Theorem 1.1. Finally, in Section 5 we give some corollaries of Theorem 1.1, particularly focused on context-free monoids. In particular, we will obtain many monoids for which the rational subset membership problem is decidable (Corollary 5.1); and that it is decidable whether a one-relation monoid containing a non-trivial idempotent has context-free word problem (Theorem 5.4).

This article is a condensed form of Chapter 4 of the author’s PhD thesis [Citation34].

2 Notation and auxiliary results

We assume the reader is familiar with the fundamentals of formal language theory. In particular, a full $AFL$ (abstract family of languages) is a class of languages closed under homomorphism, inverse homomorphism, intersection with regular languages, union, concatenation, and the Kleene star. For some background on this, and other topics in formal language theory, we refer the reader to standard books on the subject [Citation6, Citation16, Citation19]. The paper also assumes familiarity with the basics of the theory of semigroup, monoid, and group presentations, which will be written as $Sgp 〈 A | R 〉, Mon 〈 A | R 〉$ , and $Gp 〈 A | R 〉$ , respectively. For further background see e.g. [Citation2, Citation10, Citation25, Citation27]. We also refer the reader to [Citation36, Citation37].

2.1 Monoids, words, rewriting

Let A be a finite alphabet, and let $A^{*}$ denote the free monoid on A, with identity element denoted ε or 1, depending on the context. Let A⁺ denote the free semigroup on A, i.e. $A^{+} = A^{*} - {ε}$ . For $u, v \in A^{*}$ , by $u \equiv v$ we mean that u and v are the same word. For $w \in A^{*}$ , we let $| w |$ denote the length of w, i.e. the number of letters in w. We have $| ε | = 0$ . If $w \equiv a_{1} a_{2} \dots a_{n}$ for $a_{i} \in A$ , then we let $w^{rev}$ denote the reverse of w, i.e. the word $a_{n} a_{n - 1} \dots a_{1}$ . For a language L, we define $L^{rev}$ as the collection of all words $w^{rev}$ where $w \in L$ . We say that a class of languages C is reversal-closed if for every $L \in C$ , we have $L^{rev} \in C$ . We say that the word $w \in A^{*}$ is self-overlap free if it is empty, or else if it is non-empty and none of the proper non-empty prefixes of w is also a proper non-empty suffix of w. Thus xyxyy is self-overlap free, but xyxyx is not. If the words $u, v \in A^{*}$ are equal in the monoid $M = Mon 〈 A | R 〉$ , then we denote this $u =_{M} v$ . Finally, when we say that a monoid M is generated by a finite set A, we mean that there exists a surjective homomorphism $π : A^{*} \to M$ . In this case, $u =_{M} v$ will be used synonymously with $π (u) = π (v)$ .

We give some notation for rewriting systems. For an in-depth treatment and further explanations of the terminology, see e.g. [Citation8, Citation21]. A rewriting system $R$ on A is a subset of $A^{*} \times A^{*}$ . An element of $R$ is called a rule. The system $R$ induces several relations on $A^{*}$ . We will write $u {\overset{}{\to}}_{R} v$ if there exist $x, y \in A^{*}$ and a rule $(l, r) \in R$ such that $u \equiv x l y$ and $v \equiv xry$ . We let ${\overset{}{\to}}_{R}^{*}$ denote the reflexive and transitive closure of ${\overset{}{\to}}_{R}$ . We denote by ${\overset{*}{\leftrightarrow}}_{R}$ the symmetric, reflexive, and transitive closure of ${\overset{}{\to}}_{R}$ . The relation ${\overset{*}{\leftrightarrow}}_{R}$ defines the least congruence on $A^{*}$ containing $R$ . For $X \subseteq A^{*}$ , we let ${〈 X 〉}_{R}$ denote the set of ancestors of X, i.e. ${〈 X 〉}_{R} = {w \in A^{*} | \exists x \in X such that w {\overset{}{\to}}_{R}^{*} x}$ . The monoid $Mon 〈 A | R 〉$ is identified with the quotient $A^{*} / {\overset{*}{\leftrightarrow}}_{R}$ . For a rewriting system $T \subseteq A^{*} \times A^{*}$ and a monoid $M = Mon 〈 A | R 〉$ , we say that $T$ is M-invariant if for every rule $(u, v) \in T$ , we have $u =_{M} v$ . That is, $T$ is M-invariant if and only if ${\overset{*}{\leftrightarrow}}_{T} \subseteq {\overset{*}{\leftrightarrow}}_{R}$ .

A rewriting system $R \subseteq A^{*} \times A^{*}$ is said to be monadic if $(u, v) \in R$ implies $| u | \geq | v |$ and $v \in A \cup {ε}$ . We say that $R$ is special if $(u, v) \in R$ implies $v \equiv ε$ . Every special system is monadic. Let C be a class of languages. A monadic rewriting system $R$ is said to be C if for every $a \in A \cup {ε}$ , the language ${u | (u, a) \in R}$ is in C. Thus, we may speak of e.g. monadic C-rewriting systems or monadic context-free rewriting systems.

Let M be a monoid with a finite generating set A. One way of encoding the structure of M as a language is by using the identity problem of M (with respect to A), defined as(2.1) ${IP}_{A}^{M} = {w \in A^{*} | w =_{M} 1} .$ (2.1)

When M is a group, this is the standard definition of the language-theoretic word problem, appearing e.g. in Muller & Schupp [Citation32] and Anisimov [Citation5]. However, for general monoids, (2.1) is generally not very useful. Instead, the central definition of this article will be the following. The (monoid) word problem of M (with respect to A) is defined as the language(2.2) ${WP}_{A}^{M} : = {u # v^{rev} | u, v \in A^{*}, u =_{M} v},$ (2.2) where $#$ is some fixed symbol not in A. For a class of languages C, we say that M has C-word problem if ${WP}_{A}^{M}$ is in C. If C is closed under inverse homomorphism, then M having C-word problem does not depend on the finite generating set A chosen for M (see e.g. [Citation17]). The above definition (2.2) is due to Duncan and Gilman [Citation11, p. 522]. We will throughout this article use $A_{#}$ as a short-hand for the set $A \cup {#}$ .

2.2 Super- $AFL$ s

The theorems in this paper involve a special type of classes of languages, called super- $AFL$ s. These were introduced by Greibach [Citation15], using nested iterated substitutions. In [Citation37], the author gave an equivalent characterization of super- $AFL$ s, which we shall use here. We begin with a simple definition. Recall that for a language L and a rewriting system $R$ , the language ${〈 L 〉}_{R}$ denotes the language of ancestors of L under $R$ , i.e. the set of words which can be rewritten to some word in L.

Definition 1.

Let C be a class of languages. Let $R \subseteq A^{*} \times A^{*}$ be a rewriting system. Then we say that $R$ is C-ancestry preserving if for every $L \subseteq A^{*}$ with $L \in C$ , we have ${〈 L 〉}_{R} \in C$ . If every monadic C-rewriting system is C-ancestry preserving, then we say that C has the monadic ancestor property.

Example 1.

If $R \subseteq A^{*} \times A^{*}$ is a monadic context-free rewriting system, and $L \subseteq A^{*}$ is a context-free language, then ${〈 L 〉}_{R}$ is a context-free language [Citation7, Theorem 2.2]. That is, if we let CF denote the class of context-free languages, then every monadic CF-rewriting system is CF-ancestry preserving; in other words, the class of context-free languages has the monadic ancestor property.

Definition 2.

Let C be a full $AFL$ . Then C is said to be a super- $AFL$ if C has the monadic ancestor property.

For example, the class CF of context-free languages forms a super- $AFL$ by [Citation23, Theorem 1.2], and the class IND of indexed languages is also a super- $AFL$ [Citation4, Citation13]. Both are closed under reversal. On the other hand, neither the classes REG nor DCF of regular, resp. deterministic context-free, languages are super- $AFL$ s. Indeed, if C is a super- $AFL$ , then $CF \subseteq C$ , by [Citation15, Theorem 2.2]. For more examples and generalizations, we refer the reader to the so-called hyper- $AFL$ s defined by Engelfriet [Citation13], all of which are super- $AFL$ s.

We remark that Greibach [Citation15] originally defines super- $AFL$ s via “nested iterated substitutions”, which behave similarly to taking ancestors under monadic rewriting systems. The author [Citation37, Proposition 2.2] has proved that the above definition, using rewriting systems, is equivalent to Greibach’s, and so we will use her results about super- $AFL$ s without restriction.

2.3 Alternating products and bipartisan ancestry

With general definitions taken care of, we will now turn to give some useful auxiliary language-theoretic results.

We first define an operation (the alternating product) on certain languages, which mimics the operation of the free product of semigroups. This operation appears in [Citation37]. Fix an alphabet A and let $#$ be a symbol not in A. Let $L \subseteq A^{*} # A^{*}$ be any language. We say that L is concatenation-closed (with respect to $#$ ) if(2.3) $u_{1} # v_{1} \in L and u_{2} # v_{2} \in L \Rightarrow u_{1} u_{2} # v_{2} v_{1} \in L,$ (2.3) where $u_{1}, v_{1}, u_{2}, v_{2} \in A^{*}$ . The word problem of any finitely generated monoid is always a concatenation-closed language.

Let $X : N \to {1, 2}$ be a parametrisation such that either $X (2 j) = 1$ and $X (2 j + 1) = 2$ , or else $X (2 j) = 2$ and $X (2 j + 1) = 1$ , for all $j \in N$ . A parametrisation X of this form will be called standard. Given two concatenation-closed languages in $A^{*} # A^{*}$ , we now present an operation for combining them. Let $L_{1}, L_{2} \subseteq A^{*} # A^{*}$ be concatenation-closed. Then the alternating product $L_{1} ⋆ L_{2}$ of L₁ and L₂ is defined as the language consisting of all words of the form:(2.4) $u_{1} u_{2} \dots u_{k} # v_{k} \dots v_{2} v_{1},$ (2.4) where for all $1 \leq i \leq k$ we have $u_{i} # v_{i} \in L_{X (i)}$ , where X is a standard parametrisation. In particular, $# \in L_{1} ⋆ L_{2}$ if and only if $# \in L_{1}$ or $# \in L_{2}$ . Alternating products are modeled on the (semigroup) free product, as the following example shows.

Example 2.

Let $L_{1} = {a^{n} # a^{n} | n \geq 0}$ and $L_{2} = {b^{n} # b^{n} | n \geq 0}$ . Then, of course, $L_{1} = {WP}_{{a}}^{{a}^{*}}$ and $L_{2} = {WP}_{{b}}^{{b}^{*}}$ . Now both languages L₁ and L₂ are concatenation-closed, and it is easy to see that we have $L_{1} ⋆ L_{2} = {a^{n_{1}} b^{n_{2}} \dots a^{n_{k}} # a^{n_{k}} \dots b^{n_{2}} a^{n_{1}} | k \geq 0; n_{1}, n_{2}, \dots, n_{k - 1} \geq 1; n_{k} \geq 0} .$

Thus $L_{1} ⋆ L_{2} = {w # w^{rev} | w \in {a, b}^{*}} = {WP}_{{a, b}}^{{a, b}^{*}}$ .

Using the monadic ancestor property, one can prove the following useful statement.

Proposition 2.1.

[Citation37, Proposition 2.3] Let C be a super- $AFL$ . Let $L_{1}, L_{2} \subseteq A^{*} # A^{*}$ be concatenation-closed languages. Then $L_{1}, L_{2} \in C \Rightarrow L_{1} ⋆ L_{2} \in C$ .

We will require another operation with useful preservation properties, also introduced in [Citation37]. Let $R_{1}, R_{2} \subseteq A^{*} \times A^{*}$ be two rewriting systems. Let $L \subseteq A^{*} # A^{*}$ . Then we define the bipartisan $(R_{1}, R_{2})$ -ancestor of L as the language(2.5) $^{R_{1}} L^{R_{2}} = {w_{1} # w_{2} | \exists u_{1} # u_{2} \in L such that w_{i} \in {〈 u_{i} 〉}_{R_{i}} for i = 1, 2} .$ (2.5)

Bipartisan ancestors can, informally speaking, manipulate the left and the right side (of $#$ in words in L) using $R_{1}$ resp. $R_{2}$ , and these manipulations can be independent of one another. We give an example of this independence.

Example 3.

Let $A = {a, b}$ , and let $L = {a # a}$ . Let $R_{1}$ be the rewriting system with the rules $(b^{n}, a)$ for all $n \geq 1$ . Then $^{R_{1}} L^{R_{1}} = {b^{n_{1}} # b^{n_{2}} | n_{1}, n_{2} \geq 1} \cup {b^{n} # a | n \geq 1} \cup {a # b^{n} | n \geq 1} \cup {a # a} .$

In particular, we have $b^{n_{1}} # b^{n_{2}} \in^{R_{1}} L^{R_{1}}$ even if $n_{1} \neq n_{2}$ .

Proposition 2.2.

[Citation37, Proposition 2.5] Let C be a class of languages closed under rational transductions. Let $L \subseteq A^{*} # A^{*}$ , and let $R_{1}, R_{2} \subseteq A^{*} \times A^{*}$ be rewriting systems. If $R_{1}, R_{2}$ are C-ancestry preserving, then $L \in C \Rightarrow^{R_{1}} L^{R_{2}} \in C$ .

Bipartisan ancestors are useful in describing the language theory of monoid (and group) free products. See [Citation37] for further details. Alternating products and bipartisan ancestors are the language-theoretic tools we shall require for this article. We now turn to the main subject of the article.

3 Weak compression

We present weak compression as it is defined in the survey [Citation35, Section 3.1] by the author. This is an amalgamation of other approaches [Citation3, Citation14, Citation22, Citation24, Citation41].

Let A be an alphabet. We say that a pair (u, v) of words is sealed by $w \in A^{+}$ if $u, v \in w A^{*} \cap A^{*} w$ . If a pair is sealed by some word, then it is clearly sealed by a unique self-overlap free word α. For example, (xyxpxyx, xyxqxyx) is sealed by xyx, but also by the self-overlap free word x.

Let M be the monoid defined by the presentation(3.1) $Mon 〈 A | u_{i} = v_{i} (1 \leq i \leq p) 〉 .$ (3.1)

Definition 3.

We say that the monoid defined by (3.1) is weakly compressible (with respect to α) if there is some self-overlap free word $α \in A^{+}$ such that for all $1 \leq i \leq p$ , the pair (u_i , v_i ) is sealed by α.

If M is not weakly compressible, then we say that M is incompressible. Any special monoid is incompressible by default (in particular groups are incompressible).

Let M be a weakly compressible monoid defined by (3.1). We will assume $| A | > 1$ , for otherwise M is a finite cyclic monoid, and all is trivial. A word $w \in A^{*}$ is called a left α-conjugator if $w \in α A^{*}$ . Let $Σ_{*} (α)$ be the set of left α-conjugators which contain exactly one occurrence of α, i.e. $Σ_{*} (α) = α (A^{*} - A^{*} α A^{*})$ . For ease of notation, we write $Σ_{*}$ instead of $Σ_{*} (α)$ . Clearly, $Σ_{*}$ is a countably infinite suffix code (as $| A | > 1$ ), and $Σ_{*}^{+}$ is the set of all left α-conjugators. Enumerate the words of $Σ_{*}$ as ${w_{1}, w_{2}, \dots}$ and fix a set $Γ_{*} (α) = Γ_{*}$ of symbols ${γ_{w_{1}}, γ_{w_{2}}, \dots}$ in bijective correspondence with $Σ_{*}$ via the map $φ : w_{i} \mapsto γ_{w_{i}}$ . As $Σ_{*}$ is a suffix code, we can extend $φ$ to an isomorphism of the free monoids $Σ_{*}^{*}$ and $Γ_{*}^{*}$ .

Every defining relation u_i = v_i in (3.1) is sealed by α, and as α is self-overlap free, we have $u_{i}, v_{i} \in Σ_{*}^{*} α$ . We factor u_i , v_i uniquely over the suffix code $Σ_{*}$ , yielding(3.2) $u_{i} \equiv u_{i, 1} u_{i, 2} \dots u_{i, m_{i}} α, and v_{i} \equiv v_{i, 1} v_{i, 2} \dots v_{i, n_{i}} α .$ (3.2)

Any word $u_{i, j}$ or $v_{i, k}$ appearing in the factorizations (3.2) for some $1 \leq i \leq p$ is called a left piece of M. The set of all left pieces of M is denoted $Σ (α)$ , which will be shortened to Σ. As M is finitely presented, Σ is a finite set of words. We let $Γ = Γ (α)$ be the set $φ (Σ)$ of symbols from $Γ_{*}$ .

From the factorization (3.2), we define a new presentation(3.3) $Mon 〈 Γ_{*} | φ (u_{i, 1} u_{i, 2} \dots u_{i, m_{i}}) = φ (v_{i, 1} v_{i, 2} \dots v_{i, n_{i}}) (1 \leq i \leq p) 〉 .$ (3.3)

Definition 4.

The monoid defined by the presentation (3.3) is called the extended left monoid associated to the monoid M (defined by (3.1)), and is denoted $L_{*} (M)$ . The submonoid of $L_{*} (M)$ generated by Γ is the left monoid associated to M, and is denoted L(M).

It is obvious from (3.3) that $L_{*} (M) = F * L (M)$ , where * denotes the monoid free product, and $F$ is a free monoid of countably infinite rank, freely generated by $Γ_{*} - Γ$ . We emphasize that it is always decidable to compute the presentation (3.3) starting from the weakly compressible (3.1), as it only requires computing with regular languages. Of course, L(M) is finitely presented, with the same defining relations as in (3.3). Furthermore, the sum of the lengths of the defining relations in L(M) is strictly shorter than the corresponding sum for M. In particular, setting $L^{1} (M) = L (M)$ and $L^{i} (M) = L (L^{i - 1} (M))$ for i > 1, there exists some $n \geq 1$ such that $L^{n} (M)$ is incompressible.Footnote¹

We give an example. One can easily verify that if(3.4) $M_{1} = Mon 〈 x, y | xyyxxxyxxyyxxxy = x y 〉,$ (3.4) then the defining relation of M₁ is sealed by xy. Factorizing both sides of the defining relation, we find $Σ = {xyx, xyyxx}$ and hence $Γ = {γ_{xyx}, γ_{xyyxx}}$ . Thus(3.5) $L (M_{1}) = Mon 〈 γ_{xyx}, γ_{xyyxx} | γ_{xyyxx} γ_{xyx} γ_{xyyxx} = 1 〉 .$ (3.5)

This (special) monoid is incompressible. It is isomorphic to $Mon 〈 a, b | aba = 1 〉$ , which is isomorphic to the infinite cyclic group $Z$ by removing the redundant generator b.

We return to the general case, and present the normal form results from [Citation3, Citation24]. First, note that it is obvious that if two words $u, v \in A^{*}$ are equal in M, then u contains an occurrence of the self-overlap free word α if and only if v does; and furthermore, if neither u nor v contain α, then $u =_{M} v$ if and only if $u \equiv v$ . Second, given any $u \in A^{*} α A^{*}$ , there exist unique $u', u ″ \in A^{*} - A^{*} α A^{*}$ and $u^{†} \in α A^{*} \cap A^{*} α$ such that $u \equiv u' u^{†} u ″$ . We call such a factorization the canonical form of u, and $u^{†}$ is called the α-part of u. Write $u^{†} \equiv u_{l}^{†} α$ , where $u_{l}^{†} \in Σ_{*}^{*}$ . The word $φ (u_{l}^{†})$ is called the γ-part of u. The following theorem is fundamental to weakly compressible monoids.

Theorem 3.1

(Adian & Oganesian, Lallement). Let M be a weakly compressible monoid defined by (3.1). Let $u, v \in A^{*} α A^{*}$ have canonical forms $u' u^{†} u ″$ and $v' v^{†} v ″$ , respectively. Let $γ_{u}, γ_{v}$ be the γ-parts of u and v, respectively. Then $u =_{M} v$ if and only if (1) $u' \equiv v'$ and $u ″ \equiv v ″$ ; and (2) $γ_{u} = γ_{v}$ in $L_{*} (M)$ . Furthermore, (2) is equivalent to (2’) $u^{†} =_{M} v^{†}$ .

Remark 1.

The above result appears as [Citation24, Lemma 3.2] and [Citation3, Theorem 3.2] only in the case of one defining relation. However, the proofs of these results use no properties of one-relation monoids, and can, mutatis mutandis, also be used to prove the above result.

In particular, the word problem for M is decidable if and only if the word problem for L(M) is decidable. Furthermore, one can without much difficulty show that the left (right) divisibility problem for M reduces to the word and left (right) divisibility problems in L(M). In particular this solves the word and divisibility problems in the monoid M₁ defined by (3.4).

4 Proof of Theorem 1.1

Let M be a weakly compressible (with respect to some self-overlap free word α) monoid defined by (3.1), and let $φ, Σ_{*}, Γ_{*}, Σ$ , and Γ be as in Section 3. We will now prove that the language-theoretic properties of the monoid M defined by (3.1) can be reduced to the properties of the left monoid L(M). Let C be a reversal-closed super- $AFL$ , which will remain fixed throughout this section.

To simplify some technical notation we shall sometimes consider Σ, rather than Γ, as a finite generating set for the monoid L(M), as there exists a surjective homomorphism $π_{Σ}$ from $Σ^{*}$ onto L(M) given by the composition $π_{Σ} = π_{Γ} ° φ$ , where $π_{Γ} : Γ^{*} \to L (M)$ is the natural surjective homomorphism. However, it is important to notice that if $u, v \in Σ^{*}$ , then generally $π_{Σ} (u) = π_{Σ} (v)$ is very distinct from $u =_{M} v$ , unlike what Theorem 3.1 may seem to suggest. Instead, using the canonical forms we find that we have(4.1) $π_{Σ} (u) = π_{Σ} (v) \Leftrightarrow u α =_{M} v α .$ (4.1)

By the first remark preceding Theorem 3.1, it follows that the language ${WP}_{A}^{M}$ is a union $WP {[α]}_{A}^{M} \cup W_{α}^{-}$ of the two disjoint languages(4.2) $WP {[α]}_{A}^{M} = {w_{1} # w_{2}^{rev} | w_{1}, w_{2} \in A^{*} α A^{*} such that w_{1} =_{M} w_{2}},$ (4.2) (4.3) $W_{α}^{-} = {w # w^{rev} | w \in A^{*} - A^{*} α A^{*}} .$ (4.3)

The language $W_{α}^{-}$ defined by (4.3) is the intersection of the context-free language ${WP}_{A}^{A^{*}}$ with the regular language $(A^{*} - A^{*} α A^{*}) # {(A^{*} - A^{*} α A^{*})}^{rev}$ . In particular, $W_{α}^{-}$ is a context-free language. Any super- $AFL$ contains the class of context-free languages [Citation15, Theorem 2.2]. It follows that as C is a super- $AFL$ , we have that $WP {[α]}_{A}^{M} \in C$ implies ${WP}_{A}^{M} \in C$ , as C is closed under union.

Having reduced the language-theoretic properties of ${WP}_{A}^{M}$ to those of $WP {[α]}_{A}^{M}$ , we perform another reduction, which will yield Lemma 4.1. Let(4.4) $WP {[α ⊓ α]}_{A}^{M} = {w_{1} # w_{2}^{rev} | w_{1}, w_{2} \in α A^{*} \cap A^{*} α such that w_{1} =_{M} w_{2}} .$ (4.4)

Of course, we have the strict inclusions $WP {[α ⊓ α]}_{A}^{M} \subset WP {[α]}_{A}^{M} \subset {WP}_{A}^{M} .$

Using this new language, we define the rewriting system(4.5) $R_{α} = {(w_{1} # w_{2}^{rev} \to #) | w_{1} # w_{2}^{rev} \in WP {[α ⊓ α]}_{A}^{M} \cup {WP}_{A}^{A^{*}}} .$ (4.5)

This is a monadic rewriting system. As C is a super- $AFL$ , if we now have $WP {[α ⊓ α]}_{A}^{M} \in C$ , then it follows that $R_{α}$ is a C-rewriting system.

Lemma 4.1.

If $WP {[α ⊓ α]}_{A}^{M} \in C$ , then ${WP}_{A}^{M} \in C$ .

Proof.

Suppose $WP {[α ⊓ α]}_{A}^{M} \in C$ . By the remarks following (4.2) and (4.3), it suffices to show that $WP {[α]}_{A}^{M} \in C$ . We claim that(4.6) $WP {[α]}_{A}^{M} = {〈 {WP}_{A}^{A^{*}} 〉}_{R_{α}} \cap (A^{*} α A^{*} # A^{*} α^{rev} A^{*}) .$ (4.6)

This suffices to establish that $WP {[α]}_{A}^{M} \in C$ , as C is closed under intersection with regular languages; ${WP}_{A}^{A^{*}}$ is a context-free language and hence in C; and C has the monadic ancestor property.

Note first that for every rule $(s, t) \in R_{α}$ we have $s, t \in {WP}_{A}^{M}$ . Hence, if $w \in {〈 {WP}_{A}^{A^{*}} 〉}_{R_{α}}$ , then it is clear by induction on the number of $R_{α}$ -rewritings necessary to transform w into an element of ${WP}_{A}^{A^{*}}$ that $w \in {WP}_{A}^{M}$ . Hence the inclusion $\supseteq$ in (4.6) is proved.

Second, if $w \in WP {[α]}_{A}^{M}$ , then $w \equiv u # v^{rev}$ for some $u, v \in A^{*} α A^{*}$ with $u =_{M} v$ . Let $u' u^{†} u ″$ and $v' v^{†} v ″$ be the canonical forms of u and v, respectively. By Theorem 3.1, we have $u' \equiv v', u ″ \equiv v ″$ , and $u^{†} =_{M} v^{†}$ . Hence $u' # {(v')}^{rev}, u ″ # {(v ″)}^{rev} \in {WP}_{A}^{A^{*}} .$

Furthermore, $u^{†} # {(v^{†})}^{rev} \in WP {[α ⊓ α]}_{A}^{M}$ . Thus from (4.5) we find $(s, #) \in R_{α} for all s \in {u' # {(v')}^{rev}, u ″ # {(v ″)}^{rev}, u^{†} # {(v^{†})}^{rev}} .$

It follows that $w \equiv u # v^{rev} \equiv u' u^{†} u ″ # {(v ″)}^{rev} {(v^{†})}^{rev} {(v')}^{rev} {\overset{}{\to}}_{R_{α}}^{*} # .$

Thus $w \in {〈 # 〉}_{R_{α}} \subseteq {〈 {WP}_{A}^{A^{*}} 〉}_{R_{α}}$ . As w was arbitrary, and as both u and v contain α, we have proved the inclusion $\subseteq$ in (4.6). This establishes the desired equality (4.6). □

Thus, to prove (the hard direction of) the main theorem, it suffices to show that the properties of $WP {[α ⊓ α]}_{A}^{M}$ reduce to the properties of L(M). This is non-trivial; a reduction to the properties of $L_{*} (M)$ is suggested by Theorem 3.1, but $L_{*} (M)$ is not a finitely generated monoid, being a free product of L(M) by an infinite rank free monoid $F$ . However, the word problem of $F$ is, informally speaking, essentially the context-free language $L_{F} = {WP}_{A}^{A^{*}} \cap (Σ_{*} - Σ) # {(Σ_{*} - Σ)}^{rev}$ . We will show that $WP {[α ⊓ α]}_{A}^{M}$ is, up to technical details, describable as an alternating product of the word problem of L(M) by this language $L_{F}$ , together with an appropriate amount of ancestry.Footnote² By the preservation results for alternating products and ancestry, this yields the proof strategy for reducing $WP {[α ⊓ α]}_{A}^{M}$ to L(M).

Lemma 4.2.

If L(M) has word problem in C, then $WP {[α ⊓ α]}_{A}^{M} \in C$ .

Before giving the proof of this key lemma, we need some setup and auxiliary lemmas. We introduce some notation for convenience. Let $Σ_{1} = Σ$ , and let $Σ_{2} = Σ_{*} - Σ$ . Obviously $Σ_{1} \cap Σ_{2} = \emptyset$ . Let $Γ_{i} = φ (Σ_{i})$ for i = 1, 2. As Σ_i is a suffix code, it follows that $φ (Σ_{i}^{*}) = φ {(Σ_{i})}^{*} = Γ_{i}^{*}$ , for i = 1, 2. For further ease of notation, we write $M_{1} = {〈 Γ_{1} 〉}_{L_{*} (M)} = L (M)$ , and $M_{2} = {〈 Γ_{2} 〉}_{L_{*} (M)} = F$ . Then $L_{*} (M) = M_{1} * M_{2}$ , where * denotes the monoid free product. In this new notation, it follows directly from Theorem 3.1 that if $u, v \in Σ_{1}^{*}$ , then(4.7) $u α =_{M} v α \Leftrightarrow φ (u) =_{M_{1}} φ (v),$ (4.7) as in this case $φ (u), φ (v) \in Γ_{1}^{*}$ , and all defining relations of the presentation (3.3) of $L_{*} (M)$ are pairs of words over the alphabet $Γ_{1}$ . Analogously, if $u, v \in Σ_{2}^{*}$ , then(4.8) $u α =_{M} v α \Leftrightarrow u \equiv v .$ (4.8)

We define two rewriting systems on $Σ_{1}^{*}$ resp. ${(Σ_{1}^{rev})}^{*}$ . Let(4.9) $I_{α} = {(w α, α) | w \in Σ_{1}^{+} : w α =_{M} α},$ (4.9) (4.10) $I_{α}^{r} = {({(w α)}^{rev}, α^{rev}) | w \in Σ_{1}^{+} : w α =_{M} α} .$ (4.10)

Let now $u \in {(Σ_{1} \cup Σ_{2})}^{*}$ be any word, factorized uniquely as $u_{0} u_{1} \dots u_{n}$ with $u_{i} \in Σ_{X (i)}^{+}$ for all $0 \leq i < n$ and $u_{n} \in Σ_{X (n)}^{*}$ , where X is a standard parametrisation. We say that (this factorization of) u is reduced if $u \equiv α$ or $u \equiv ε$ ; or if $u_{i} α \neq_{M} α$ for all $0 \leq i \leq n$ . Obviously, any irreducible descendant of u modulo $I_{α}$ is reduced, and any reduced word is clearly irreducible modulo $I_{α}$ . Furthermore, as $I_{α}$ is M-invariant, if $u {\overset{}{\to}}_{I_{α}}^{*} u'$ , then $u =_{M} u'$ , so we conclude that every word $u \in {(Σ_{1} \cup Σ_{2})}^{*}$ is equal in M to some reduced word $u'$ (though this is generally not unique). Given any reduced $u'$ , we can uniquely factorize it as a reduced factorization $u_{0}^{'} u_{1}^{'} \dots u_{k}^{'}$ , where $u_{i}^{'} \in Σ_{Y (i)}^{*}$ for $0 \leq i \leq k$ , where Y is a standard parametrisation. This factorization $u_{0}^{'} u_{1}^{'} \dots u_{k}^{'}$ of $u'$ is called the normal form of the reduced word $u'$ .

Lemma 4.3.

Let $u, v \in {(Σ_{1} \cup Σ_{2})}^{*}$ . Let $u'$ , resp. $v'$ , be any reduced forms of u, resp. v, with normal forms $u' \equiv u_{0}^{'} u_{1}^{'} \dots u_{m}^{'}$ and $v' \equiv v_{0}^{'} v_{1}^{'} \dots v_{n}^{'}$ respectively. Then $u α =_{M} v α$ if and only if (1) n = m, and (2) $u_{i}^{'}, v_{i}^{'} \in Σ_{X (i)}^{*}$ and $u_{i}^{'} α =_{M} v_{i}^{'} α$ for all $0 \leq i \leq n$ , for some standard parametrisation X.

Proof.

The “if” direction follows by induction on n, and the simple observation that if $u_{i}, v_{i}, u_{i + 1}, v_{i + 1} \in Σ^{+}$ , then as $u_{i + 1}$ and $v_{i + 1}$ begin with α we have $(u_{i} α =_{M} v_{i} α and u_{i + 1} α =_{M} v_{i + 1} α) \Rightarrow u_{i} u_{i + 1} α =_{M} v_{i} v_{i + 1} α .$

For the “only if” direction, by Theorem 3.1, it follows that $u α =_{M} v α$ if and only if $φ (u') =_{M_{1} * M_{2}} φ (v')$ . Now $φ (u') \equiv φ (u_{0}^{'}) φ (u_{1}^{'}) \dots φ (u'_{m})$ . As $φ (u_{i}^{'}) =_{M_{1} * M_{2}} 1$ if and only $u_{i}^{'} α =_{M} α$ , it follows from the fact that $u'$ is reduced that $φ (u')$ is reduced with respect to the monoid free product $M_{1} * M_{2}$ , i.e. no non-empty subword of $φ (u')$ equals 1 in $M_{1} * M_{2}$ . The analogous statement is true for $φ (v')$ . Hence, as $φ (u') =_{M_{1} * M_{2}} φ (v')$ , it follows from the usual normal form lemma for monoid free products (see [Citation20, Section 8.2] or [Citation37, Section 1]) that (1) n = m; and that (2) $φ (u_{i}^{'}), φ (v_{i}^{'}) \in Γ_{X (i)}^{*}$ and $φ (u_{i}^{'}) =_{M_{X (i)}} φ (v_{i}^{'})$ for $0 \leq i \leq n$ , where X is some standard parametrisation. As $u_{i}^{'} α =_{M} v_{i}^{'} α$ if and only $φ (u_{i}^{'}) =_{M_{X (i)}} φ (v_{i}^{'})$ , the result follows. □

The rewriting systems $I_{α}$ and $I_{α}^{r}$ defined in (4.9) and (4.10) are close to being monadic. We show that, via appropriate rational transductions, we can extend the monadic ancestor property to also apply to these rewriting systems, using the self-overlap free property of α in a non-trivial way.

Lemma 4.4.

If L(M) has word problem in C, then $I_{α}$ and $I_{α}^{r}$ are C-ancestry preserving.

Proof.

Let $L \subseteq {(Σ_{1} \cup Σ_{2})}^{*}$ be an arbitrary language with $L \in C$ . To prove that $I_{α}$ is C-ancestry preserving, it suffices to show that ${〈 L 〉}_{I_{α}} \in C$ . Let $I_{α}^{+}$ be the set of left-hand sides of rules in $I_{α}$ . We claim $I_{α}^{+} \in C$ . Indeed, $w \in Σ_{1}^{*}$ is such that $w α =_{M} α$ if and only if $φ (w) =_{L (M)} 1$ by (4.7). Recalling the definition of ${IP}_{A}^{M}$ from (2.1), it follows that(4.11) $I_{α}^{+} = φ^{- 1} ({IP}_{Γ_{1}}^{M_{1}}) α - {α} = φ^{- 1} ({IP}_{Γ_{1}}^{M_{1}}) α \cap Σ_{1}^{+} α .$ (4.11)

As L(M) has word problem in C, it follows that ${IP}_{Γ_{1}}^{M_{1}} \in C$ , by an easy transduction (cf. [Citation37, Lemma 3.5]). As C is a full $AFL$ and $φ$ is a homomorphism, it hence follows from (4.11) that $I_{α}^{+} \in C$ .

Let now $⋄$ be a new symbol. Let $A_{⋄} = A \cup {⋄}$ , and define the homomorphism $σ_{⋄} : A_{⋄}^{*} \to A^{*}$ by $a \mapsto a$ for all $a \in A$ , and $⋄ \mapsto α$ . Define a new rewriting system $I_{α}^{⋄} = {(W \to ⋄) | W \in σ_{⋄}^{- 1} (I_{α}^{+}) \cap (A_{⋄}^{*} - A_{⋄}^{*} α A_{⋄}^{*})} .$

Clearly $I_{α}^{⋄}$ is a monadic rewriting system. The language of left-hand sides of $⋄$ in $I_{α}^{⋄}$ is the intersection $σ_{⋄}^{- 1} (I_{α}^{+}) \cap (A_{⋄}^{*} - A_{⋄}^{*} α A_{⋄}^{*})$ , and as $I_{α}^{+} \in C$ it follows from the closure of C under rational transduction that $I_{α}^{⋄}$ is a C-rewriting system.

Now, as α is self-overlap free, $Σ_{1}$ is a suffix code, so for any word $u \in A^{*} α A^{*}$ we can uniquely factor u as $u_{0} α u_{1} α \dots α u_{k}$ , where $u_{i} \in A^{*} - A^{*} α A^{*}$ for $0 \leq i \leq k$ . Hence there is exactly one word $u_{⋄} \in A_{⋄}^{*}$ with the properties that (1) $u_{⋄}$ does not contain α; and (2) $σ_{⋄} (u_{⋄}) = u$ . The uniqueness of this word depends on the fact that α is self-overlap free.Footnote³ Of course, for any $u \in A^{*} - A^{*} α A^{*}$ , there is also a unique such $u_{⋄}$ , namely $u_{⋄} \equiv u$ . Let $L_{⋄}$ be the language ${w_{⋄} | w \in L}$ . Then $σ_{⋄} (L_{⋄}) = L$ . Furthermore, by the above uniqueness argument, we have(4.12) $L_{⋄} = σ_{⋄}^{- 1} (L) \cap (A_{⋄}^{*} - A_{⋄}^{*} α A_{⋄}^{*}) .$ (4.12)

Hence, from $L \in C$ we conclude $L_{⋄} \in C$ .

We now claim that ${〈 L 〉}_{I_{α}} = σ_{⋄} ({〈 L_{⋄} 〉}_{I_{α}^{⋄}})$ . The right-hand side is the image under the homomorphism $σ_{⋄}$ of the ancestor of $L_{⋄}$ under the monadic C-rewriting system $I_{α}^{⋄}$ . As C is a super- $AFL$ , the right-hand side is in C, and thus we would conclude that $I_{α}$ is C-ancestry preserving. The desired equality is easy to prove. Indeed, it follows directly from the fact that if $w \in A^{*}$ and $u \in L$ , then $w {\overset{}{\to}}_{I_{α}}^{*} u$ if and only if $w_{⋄} {\overset{}{\to}}_{I_{α}^{⋄}}^{*} u_{⋄}$ . This latter fact is proved by an easy induction on the number of rules applied, and we omit this proof. We conclude that $I_{α}$ is C-ancestry preserving.

The case of $I_{α}^{r}$ is symmetric, bearing in mind that (1) C is reversal-closed; (2) $α^{rev}$ is self-overlap free if and only if α is self-overlap free, and (3) $Σ_{1}^{rev}$ is a prefix code, rather than a suffix code (factorization over $Σ_{1}^{rev}$ is still unique). We omit the details. □

One more step is needed before proving Lemma 4.2. Let $P_{2} = {w # w^{rev} | w \in Σ_{2}^{*}}$ , where the $P$ abbreviates “palindrome.” By (4.8), we have(4.13) ${WP}_{A}^{M} \cap (Σ_{2}^{*} # {(Σ_{2}^{rev})}^{*}) = P_{2} .$ (4.13)

Clearly $P_{2}$ is a context-free language, as $Σ_{2}$ is regular, so $P_{2} \in C$ . Let $τ_{#} : A_{#}^{*} \to A_{#}^{*}$ be defined by $τ_{#} (a) = a$ for all $a \in A$ , and $τ_{#} (#) = α # α^{rev}$ . We define the (rather complicated-looking) language(4.14) $L_{α} =^{I_{α}} {(τ_{#} ({WP}_{Σ_{1}}^{L (M)} ⋆ P_{2}))}^{I_{α}^{r}} .$ (4.14)

We shall see in the below proof of Lemma 4.2 that $L_{α} = WP {[α ⊓ α]}_{A}^{M}$ . We first prove that $L_{α}$ is a language encoding the language-theoretic properties of L(M).

Lemma 4.5.

If L(M) has word problem in C, then $L_{α} \in C$ .

Proof.

If L(M) has word problem in C, we have ${WP}_{Σ_{1}}^{L (M)} \in C$ (cf. also the remark preceding (4.1)). This is a concatenation-closed language, as it is a word problem. Furthermore, $P_{2}$ is a context-free language and is easily checked to be concatenation-closed. Hence by Proposition 2.1, the alternating product ${WP}_{Σ_{1}}^{L (M)} ⋆ P_{2}$ is in C, as is its image under the homomorphism $τ_{#}$ . By Lemma 4.4, $I_{α}$ and $I_{α}^{r}$ are C-ancestry preserving, so by Proposition 2.2, we have $L_{α} \in C$ . □

Proof of Lemma 4.2.

It suffices, by Lemma 4.5, to prove that $WP {[α ⊓ α]}_{A}^{M} = L_{α}$ . We prove the inclusions one at a time.

$(\subseteq)$ . Let $w \in WP {[α ⊓ α]}_{A}^{M}$ be arbitrary. Then we can write $w \equiv u α # {(v α)}^{rev}$ with $u, v \in {(Σ_{1} \cup Σ_{2})}^{*}$ and $u α =_{M} v α$ . Let $u', v'$ be reduced forms of u resp. v such that $u {\overset{}{\to}}_{I_{α}}^{*} u'$ and $v {\overset{}{\to}}_{I_{α}}^{*} v'$ . As $I_{α}$ is M-invariant, we have $u =_{M} u'$ and $v =_{M} v'$ . Then by Lemma 4.3 we can write $u' \equiv u_{0}^{'} u_{1}^{'} \dots u_{m}^{'} and v' \equiv v_{0}^{'} v_{1}^{'} \dots v_{n}^{'},$ with n = m, $u_{i}^{'}, v_{i}^{'} \in Σ_{X (i)}^{*}$ and $u_{i}^{'} α =_{M} v_{i}^{'} α$ for all $0 \leq i \leq n = m$ , for some standard parametrisation X. Let X be such a parametrisation. Let(4.15) $w' \equiv u_{1}^{'} u_{2}^{'} \dots u_{n}^{'} # {(v_{n}^{'})}^{rev} \dots {(v_{2}^{'})}^{rev} {(v_{1}^{'})}^{rev} .$ (4.15)

Now for every $0 \leq i \leq n$ , we have $u_{i}^{'} α =_{M} v_{i}^{'} α$ if and only if $φ (u_{i}^{'}) =_{M_{X (i)}} φ (v_{i}^{'})$ . When X(i) = 1, then by (4.1) and (4.7) it follows that $π_{Σ_{1}} (u_{i}^{'}) = π_{Σ_{1}} (v_{i}^{'})$ , so $u_{i}^{'} # {(v_{i}^{'})}^{rev} \in {WP}_{Σ_{1}}^{L (M)}$ . When X(i) = 2, then by (4.8) we have $u_{i}^{'} \equiv v_{i}^{'}$ , so $u_{i}^{'} # {(v_{i}^{'})}^{rev} \in P_{2}$ . It follows that the right-hand side of (4.15) is an element of the alternating product of ${WP}_{Σ_{1}}^{L (M)}$ by $P_{2}$ , and hence so too is $w'$ . Let $Q = {WP}_{Σ_{1}}^{L (M)} ⋆ P_{2}$ , i.e. we just proved $w' \in Q$ . Let(4.16) $w ″ \equiv u_{1}^{'} u_{2}^{'} \dots u_{n}^{'} α # α^{rev} {(v_{n}^{'})}^{rev} \dots {(v_{2}^{'})}^{rev} {(v_{1}^{'})}^{rev} \equiv u' # {(v')}^{rev} .$ (4.16)

Then $w ″ \equiv τ_{#} (w')$ . We have $u {\overset{}{\to}}_{I_{α}}^{*} u'$ , so of course $u α {\overset{}{\to}}_{I_{α}}^{*} u' α$ . As $v {\overset{}{\to}}_{I_{α}}^{*} v'$ , we have $v^{rev} {\overset{}{\to}}_{I_{α}^{r}}^{*} {(v')}^{rev}$ and so also $α^{rev} v^{rev} {\overset{}{\to}}_{I_{α}^{r}}^{*} α^{rev} {(v')}^{rev}$ . Hence, by the definition of $(I_{α}, I_{α}^{r})$ -ancestors, we have(4.17) $u α # α^{rev} v^{rev} \in^{I_{α}} {(τ_{#} (Q))}^{I_{α}^{r}} .$ (4.17)

But $u α # α^{rev} v^{rev}$ is just w; and the right-hand side of (4.17) is $L_{α}$ , so we are done.

$(\supseteq)$ This argument is similar to the direction $(\subseteq)$ , and differs only in that it uses the M-invariance of $I_{α}$ . The proof is therefore only sketched here (but is detailed in the author’s Ph.D. thesis [Citation34, Lemma 4.2.13]).

Consider any word $u # v \in L_{α}$ . Two things must be proved; first, that (1) $u, v^{rev} \in α A^{*} \cap A^{*} α$ ; and second, that (2) $u =_{M} v^{rev}$ . For (1), it suffices to note that any word in $τ_{#} ({WP}_{Σ_{1}}^{L (M)} ⋆ P_{2})$ is of the form $U # V$ with $U, V^{rev} \in α A^{*} \cap A^{*} α$ . As $I_{α}$ is M-invariant, any ancestor $U'$ of $U \in α A^{*} \cap A^{*} α$ will be equal to U in M; hence, in particular, we will also have $U' \in α A^{*} \cap A^{*} α$ by Theorem 3.1. The analogous argument for V implies that (1) holds.

For (2), since $u # v \in L_{α}$ , there is some $u' # v' \in {WP}_{Σ_{1}}^{L (M)} ⋆ P_{Σ_{2}}$ such that $\begin{matrix} u {\overset{}{\to}}_{I_{α}}^{*} u' α, \\ v {\overset{}{\to}}_{I_{α}^{r}}^{*} v' α, \end{matrix}$ as $τ_{#} (u' # v') \equiv (u' α) # (α^{rev} v')$ . By the M-invariance of $I_{α}$ , we have $u =_{M} u' α$ , and analogously also $v =_{M} v' α$ . Thus to show that $u # v$ satisfies (2), it suffices to show $u' α =_{M} {(v')}^{rev} α$ . But that this is the case can be easily seen by writing out $u' # v'$ as an element of the alternating product ${WP}_{Σ_{1}}^{L (M)} ⋆ P_{Σ_{2}}$ . □

By combining Lemmas 4.1 and 4.2, we conclude that if L(M) has word problem in C, then M has word problem in C. This is the difficult direction of the proof of Theorem 1.1. It remains to prove the converse:

Lemma 4.6.

If M has word problem in C, then L(M) has word problem in C.

Proof.

Indeed, as before let the homomorphism $τ_{#} : A_{#}^{*} \to A_{#}^{*}$ be defined by $τ_{#} (a) = a$ for all $a \in A$ , and $τ_{#} (#) = α # α^{rev}$ . Then it is easy to check, using (4.1), that(4.18) $τ_{#} ({WP}_{Σ_{1}}^{L (M)}) = {WP}_{A}^{M} \cap (Σ_{1}^{*} α # α^{rev} {(Σ_{1}^{rev})}^{*}) .$ (4.18)

Now $τ_{#}$ is an injective homomorphism, as $A \cup {α # α^{rev}}$ is a code, implying that any word in the image of $τ_{#}$ can be decoded by finding all the occurrences of $τ_{#}$ , which must be surrounded by non-overlapping occurrences of α, and the remaining letters are then decoded as themselves. In particular, $τ_{#}^{- 1} ° τ_{#} (L) = L$ for any language L. Hence, applying $τ_{#}^{- 1}$ to both sides in (4.18), we find that ${WP}_{Σ_{1}}^{L (M)}$ is a rational transduction of ${WP}_{A}^{M}$ . As C is closed under rational transductions, it follows that L(M) has word problem in C. □

This completes the proof of Theorem 1.1.

5 Corollaries of Theorem 1.1

In this section, we present some corollaries of Theorem 1.1. In particular, we will show that it is decidable whether a one-relation monoid containing a non-trivial idempotent has context-free word problem (Theorem 5.4). This answers a generalisation of a question first asked by Zhang in 1992, which was answered affirmatively by the author in [Citation36]. We begin by applying the result to the rational subset membership problem for monoids.

5.1 Rational subset membership problem

Throughout this section we will fix a weakly compressible monoid M, defined by the presentation (3.1), which is compressible with respect to α. As the class of context-free (resp. indexed) languages is a super- $AFL$ closed under reversal, it of course follows from Theorem 1.1 that: M has context-free (resp. indexed) word problem if and only if L(M) does.

One of the direct consequences for a monoid having context-free word problem is decidability of its rational subset membership problem. Recall that for a finitely generated monoid M, generated by a finite set A and with associated surjective homomorphism $π : A^{*} \to M$ , this decision problem asks: given a regular language $R \subseteq A^{*}$ and a word $w \in A^{*}$ , can one decide whether $π (w) \in π (R)$ ? The rational subset membership problem clearly specializes to the submonoid membership problem, the divisibility problems, and the word problem for M. It is easy to check that any context-free monoid has decidable rational subset membership problem; indeed, if $π (w) \in π (R)$ if and only if w is equal to some word in R, i.e. if and only if $w \in {WP}_{A}^{M} / # R^{rev}$ , where $/$ denotes the right quotient. As $# R^{rev}$ is a regular language, and ${WP}_{A}^{M}$ is a context-free language, the quotient is a context-free language; and membership in context-free languages is well-known to be (uniformly) decidable (cf. also [Citation36, Theorem 3.5]). We conclude:

Corollary 5.1.

Let M be a weakly compressible monoid. If L(M) has context-free word problem, then the rational subset membership problem for M is decidable.

Example 4.

Let $n, k, β_{1}, \dots, β_{n}$ be natural numbers such that n > 1, $1 \leq k \leq n$ , and $β_{i} \geq 1$ for all $1 \leq i \leq n$ . Let $Π = Π (n, k, {β_{i}}_{i = 1}^{n})$ be the monoid defined by $Mon 〈 a_{1}, a_{2}, \dots, a_{n} | a_{1}^{β_{1}} a_{2}^{β_{2}} \dots a_{n}^{β_{n}} = a_{k} 〉 .$

Then as the rewriting system with the single rule $(a_{1}^{β_{1}} a_{2}^{β_{2}} \dots a_{n}^{β_{n}}, a_{k})$ is complete, monadic, context-free, and defines Π, it follows that Π has context-free word problem [Citation7, Corollary 3.8]. Hence, by Theorem 1.1 any monoid which compresses to Π has context-free word problem. Let now A be a new alphabet, let $α \in A^{*}$ be self-overlap free, and let $w_{i} \in α (A^{*} - A^{*} α A^{*})$ for $1 \leq i \leq n$ be pairwise distinct words. Let τ be the homomorphism defined by mapping $a_{i} \mapsto w_{i}$ for all $1 \leq i \leq n$ . Let $Π' = Π' (n, k, {β_{i}}_{i = 1}^{n}, {w_{i}}_{i = 1}^{n})$ be the monoid defined by(5.1) $Π' = Mon 〈 A | τ (a_{1}^{β_{1}} a_{2}^{β_{2}} \dots a_{n}^{β_{n}}) α = τ (a_{k}) α 〉 .$ (5.1)

Then one readily sees that $L (Π') = Π$ . Hence $Π'$ has context-free word problem, for any choice of $n, k, β_{i}$ and w_i ( $1 \leq i \leq n$ ). As a concrete example, if $A = {x, y}$ and $α = x y$ , then the monoid $Π' = Π' (3, 2, {2, 3, 4}, {x y, xyx, xyy})$ defined by(5.2) $Mon 〈 x, y | {(x y)}^{2} {(xyx)}^{3} {(xyy)}^{4} x y = xyxxy 〉$ (5.2) has context-free word problem, as it compresses to the monoid $Π (3, 2, {2, 3, 4})$ defined by $Mon 〈 a_{1}, a_{2}, a_{3} | a_{1}^{2} a_{2}^{3} a_{3}^{4} = a_{2} 〉 .$

Verifying directly that (5.2) has context-free word problem appears to be somewhat tedious.

We conclude that the monoid (5.2) has decidable rational subset membership problem, and more generally so too does any monoid of the form (5.1).

5.2 Subspecial one-relation monoids

We turn to the case when M in (3.1) is defined by a single relation u = v. Assume without loss of generality that $| u | \geq | v |$ . We say that M is subspecial if $u \in v A^{*} \cap A^{*} v$ . Any special monoid (i.e. when the defining relation is u = 1) is obviously subspecial. An element $e \in M$ is called an idempotent if $e^{2} = e$ . Of course, the identity element 1 is always a (trivial) idempotent. Associated to any idempotent e is a maximal subgroup, being the set of elements which are invertible with respect to this idempotent (alternatively, the group $H$ -class of e). Lallement [Citation24] proved that a one-relation monoid M contains a non-trivial idempotent if and only if (1) M is special, and not right cancellative; (2) $| u | > | v | > 0$ and M is subspecial. By using Adian’s overlap algorithm (see [Citation36]), it is decidable whether a one-relation special monoid is right cancellative. Hence it is decidable whether a one-relation monoid M contains a non-trivial idempotent.

As M is subspecial, it is clearly weakly compressible. Furthermore, it is not hard to see that L(M) is also subspecial (see e.g. [Citation22, Lemma 5.4]). Thus to any subspecial monoid M we can associate a special monoid $L_{s} (M)$ , obtained by iterating weak compression until we arrive at a special monoid. Hence M has context-free word problem if and only if $L_{s} (M)$ does, by Theorem 1.1 (of course, the same is also true for any reversal-closed super- $AFL$ ). Using the structural results about maximal subgroups of subspecial monoids by Gray & Steinberg [Citation14], we can prove the following extension of the Muller–Schupp theorem:

Theorem 5.2.

Let M be a subspecial (one-relation) monoid. Then M has context-free word problem if and only if all of its maximal subgroups are virtually free.

Proof.

First, assume M is special. Then, by a result due to Malheiro [Citation30, Theorem 4.6], all maximal subgroups of M are isomorphic to the group of units U(M) of M. The main theorem of [Citation36] states that M has context-free word problem if and only if U(M) is virtually free. The result thus follows in this case.

Suppose that M is subspecial, but not special. By [Citation14, Lemma 5.2], the maximal subgroups of M are all isomorphic to the group of units of $L_{s} (M)$ , with the exception of the group of units of M, which is trivial (and hence virtually free). Let G be one of the non-trivial maximal subgroups of M, so that $G ≅ U (L_{s} (M))$ . Now $L_{s} (M)$ is a finitely generated one-relation special monoid. Hence, by classical results (cf. [Citation36]), G is a finitely generated one-relator group.

Assume now that M has context-free word problem. Then, as G is a finitely generated subsemigroup of M, it follows from [Citation17, Proposition 8(b)] that G has context-free word problem. Hence, by the Muller–Schupp theorem, G is virtually free, as desired. For the converse, assume G is virtually free. By the easy direction of the Muller–Schupp theorem, G has context-free word problem. Then, by the main theorem of [Citation36], $L_{s} (M)$ has context-free word problem. Hence, by the main theorem of the present paper, M has context-free word problem. □

This gives a complete algebraic characterization of the subspecial monoids with context-free word problem, extending the Muller–Schupp theorem to this class.

Corollary 5.3.

Let M be a subspecial one-relation monoid such that all of its maximal subgroups are virtually free. Then M has decidable rational subset membership problem.

For example, one can check that the subspecial monoid(5.3) $M_{3} = Mon 〈 x, y | xyxyyxyx = x 〉$ (5.3) compresses (in a single step) to $L_{s} (M_{3}) ≅ Mon 〈 a, b, c | abca = 1 〉$ . Now it follows from [Citation2] that the group of units of $L_{s} (M_{3})$ is isomorphic to $Gp 〈 p, q | pqp = 1 〉$ , which is isomorphic to $Z$ by removing the redundant generator q. Hence the non-trivial maximal subgroups of M₃ are all infinite cyclic, and in particular virtually free. Hence the monoid M₃ defined by (5.3) has context-free word problem by Theorem 5.2, and it has decidable rational subset membership problem by Corollary 5.3.

We end with a final corollary of Theorem 5.2, the statement of which does not use compression or subspeciality.

Theorem 5.4.

There is an algorithm which takes as input a one-relation monoid $M = Mon 〈 A | u = v 〉$ containing a non-trivial idempotent, and decides whether M has context-free word problem.

Proof.

Suppose we are given a one-relation monoid presentation $Mon 〈 A | u = v 〉$ for M. Assume without loss of generality that $| u | \geq | v |$ . Then as M contains a non-trivial idempotent, either M is special, or else M is subspecial with $| u | > | v | \geq 0$ . In the first case, the defining relation is u = 1, and by [Citation36, Theorem B] we can hence decide (uniformly in u) whether this special one-relation monoid has context-free word problem.

In the latter case, we repeatedly compress the relation u = v until we find(5.4) $L_{s} (M) = Mon 〈 B | w = 1 〉 .$ (5.4)

By Theorem 1.1, M has context-free word problem if and only if the special one-relation monoid (5.4) has context-free word problem; this latter problem is (uniformly) decidable by another application of [Citation36, Theorem B]. □

In 1992, Zhang [Citation40, Problem 3] asked if it is decidable whether a special one-relation monoid has context-free word problem. This was recently answered affirmatively by the author [Citation36, Theorem B]. The above Theorem 5.4 thus extends this answer from special to subspecial one-relation monoids.

Acknowledgments

The research in this article was carried out while the author was a Ph.D. student at the University of East Anglia. The author wishes to thank his supervisor Dr Robert D. Gray for many useful comments and discussions, and the anonymous referee for very thorough and careful comments, all of which significantly improved the article.

Additional information

Funding

The author gratefully acknowledges funding from the Dame Kathleen Ollerenshaw Trust, which is supporting his current research at the University of Manchester.

Notes

1 The compression defined by Lallement [24] transforms M into $L^{n} (M)$ in a single step, whereas that by Adian & Oganesian [3] or indeed Kobayashi [22] corresponds to transforming M into L(M).

2 The alternating products and ancestry defined and used by the author in [37] were, in fact, first developed by the author to deal with precisely the problem of describing the word problem of $L_{*} (M)$ .

3 For example, if we take the word $α \equiv xyx$ , which has self-overlaps, then $σ_{⋄} (x y ⋄) = σ_{⋄} (⋄ y x) = xyxyx$ .

References

Adian, S. I. (1960). The problem of identity in associative systems of a special form. Soviet Math. Dokl. 1:1360–1363.
Google Scholar
Adian, S. I. (1966). Defining relations and algorithmic problems for groups and semigroups. Proceedings of the Steklov Institute of Mathematics, No. 85.
Google Scholar
Adian, S. I., Oganesian, G. U. (1978). On problems of equality and divisibility in semigroups with a single defining relation. Math. USSR, Izv. 12:207–212. [Izv. Akad. Nauk SSSR Ser. Mat. 42(2)].
Web of Science ®Google Scholar
Aho, A. V. (1968). Indexed grammars—an extension of context-free grammars. J. Assoc. Comput. Mach. 15: 647–671. DOI: 10.1145/321479.321488.
Web of Science ®Google Scholar
An i¯s i¯mov, A. V. (1971). The group languages. Kibernetika (Kiev) 4:18–24.
Google Scholar
Berstel, J., Perrin, D. (1985). Theory of Codes, volume 117 of Pure and Applied Mathematics. Orlando, FL: Academic Press, Inc.
Google Scholar
Book, R. V., Jantzen, M., Wrathall, C. (1982). Monadic Thue systems. Theoret. Comput. Sci. 19(3):231–251. DOI: 10.1016/0304-3975(82)90036-6.
Web of Science ®Google Scholar
Book, R. V., Otto, F. (1993). String-Rewriting Systems. Texts and Monographs in Computer Science. New York: Springer-Verlag.
Google Scholar
Brough, T., Cain, A. J., Pfeiffer, M. (2019). Context-free word problem semigroups. In: Hofman, P., Skrzypczak, M., eds. Developments in Language Theory, volume 11647 of Lecture Notes in Computer Science. Cham: Springer, pp. 292–305.
Google Scholar
Campbell, C. M., Robertson, E. F., Ruškuc, N., Thomas, R. M. (1995). Semigroup and group presentations. Bull. London Math. Soc. 27(1):46–50. DOI: 10.1112/blms/27.1.46.
Google Scholar
Duncan, A., Gilman, R. H. (2004). Word hyperbolic semigroups. Math. Proc. Cambridge Philos. Soc. 136(3): 513–524. DOI: 10.1017/S0305004103007497.
Google Scholar
Dunwoody, M. J. (1985). The accessibility of finitely presented groups. Invent. Math. 81(3):449–457. DOI: 10.1007/BF01388581.
Web of Science ®Google Scholar
Engelfriet, J. (1985). Hierarchies of hyper-AFLs. J. Comput. System Sci. 30(1):86–115. DOI: 10.1016/0022-0000(85)90006-6.
Web of Science ®Google Scholar
Gray, R. D., Steinberg, B. (2022). A Lyndon’s identity theorem for one-relator monoids. Selecta Math. (N.S.) 28(3):Paper No. 59, 53.
Web of Science ®Google Scholar
Greibach, S. A. (1970). Full AFLs and nested iterated substitution. Inf. Control 16:7–35.
Google Scholar
Harrison, M. A. (1978). Introduction to Formal Language Theory. Reading, MA: Addison-Wesley Publishing Co.
Google Scholar
Hoffmann, M., Holt, D. F., Owens, M. D., Thomas, R. M. (2012). Semigroups with a context-free word problem. In: Yen, H.-C., Ibarra, O. H., eds. Developments in Language Theory, volume 7410 of Lecture Notes in Computer Science. Heidelberg: Springer, pp. 97–108.
Google Scholar
Holt, D. F., Owens, M. D., Thomas, R. M. (2008). Groups and semigroups with a one-counter word problem. J. Aust. Math. Soc. 85(2):197–209. DOI: 10.1017/S1446788708000864.
Web of Science ®Google Scholar
Hopcroft, J. E., Ullman, J. D. (1979). Introduction to Automata Theory, Languages, and Computation. Addison-Wesley Series in Computer Science. Reading, MA: Addison-Wesley Publishing Co.
Google Scholar
Howie, J. M. (1995). Fundamentals of Semigroup Theory, volume 12 of London Mathematical Society Monographs. New Series. New York: The Clarendon Press, Oxford University Press, Oxford Science Publications.
Google Scholar
Jantzen, M. (1988). Confluent String Rewriting, volume 14 of EATCS Monographs on Theoretical Computer Science. Berlin: Springer-Verlag.
Google Scholar
Kobayashi, Y. (2000). Finite homotopy bases of one-relator monoids. J. Algebra 229(2):547–569. DOI: 10.1006/jabr.1999.8251.
Web of Science ®Google Scholar
Král, J. (1970). A modification of a substitution theorem and some necessary and sufficient conditions for sets to be context-free. Math. Systems Theory 4:129–139. DOI: 10.1007/BF01691097.
Google Scholar
Lallement, G. (1974). On monoids presented by a single relation. J. Algebra 32:370–388. DOI: 10.1016/0021-8693(74)90146-X.
Web of Science ®Google Scholar
Lyndon, R. C., Schupp, P. E. (1977). Combinatorial Group Theory. Ergebnisse der Mathematik und ihrer Grenzgebiete, Band 89. Berlin-New York: Springer-Verlag.
Google Scholar
Magnus, W. (1932). Das Identitätsproblem für Gruppen mit einer definierenden Relation. Math. Ann. 106(1): 295–307. DOI: 10.1007/BF01455888.
Google Scholar
Magnus, W., Karrass, A., Solitar, D. (1966). Combinatorial Group Theory: Presentations of Groups in Terms of Generators and Relations. New York-London-Sydney: Interscience Publishers [John Wiley & Sons, Inc.].
Google Scholar
Makanin, G. S. (1966). On the identity problem for finitely presented groups and semigroups. PhD thesis, Steklov Mathematical Institute, Moscow.
Google Scholar
Makanin, G. S. (1966). On the identity problem in finitely defined semigroups. Dokl. Akad. Nauk SSSR 171: 285–287.
Google Scholar
Malheiro, A. (2005). Complete rewriting systems for codified submonoids. Int. J. Algebra Comput. 15(2):207–216. DOI: 10.1142/S0218196705002220.
Web of Science ®Google Scholar
Markov, A. A. (1947). On certain insoluble problems concerning matrices. Doklady Akad. Nauk SSSR (N. S.) 57: 539–542.
Google Scholar
Muller, D. E., Schupp, P. E. (1983). Groups, the theory of ends, and context-free languages. J. Comput. System Sci. 26(3):295–310. DOI: 10.1016/0022-0000(83)90003-X.
Web of Science ®Google Scholar
Nyberg-Brodda, C.-F. (2021). A translation of G. S. Makanin’s 1966 Ph.D. thesis “On the Identity Problem for Finitely Presented Groups and Semigroups”. Available online at arXiv:2102.00745.
Google Scholar
Nyberg-Brodda, C.-F. (2021). The word problem and combinatorial methods for groups and semigroups. PhD thesis, University of East Anglia, UK.
Google Scholar
Nyberg-Brodda, C.-F. (2021). The word problem for one-relation monoids: a survey. Semigroup Forum 103(2): 297–355. DOI: 10.1007/s00233-021-10216-8.
Web of Science ®Google Scholar
Nyberg-Brodda, C.-F. (2022). On the word problem for special monoids. Semigroup Forum 105(1):295–327. DOI: 10.1007/s00233-022-10286-2.
Web of Science ®Google Scholar
Nyberg-Brodda, C.-F. (2023). On the word problem for free products of semigroups and monoids. J. Algebra 622:721–741. DOI: 10.1016/j.jalgebra.2023.02.007.
Web of Science ®Google Scholar
Post, E. L. (1947). Recursive unsolvability of a problem of Thue. J. Symbolic Logic 12:1–11. DOI: 10.2307/2267170.
Google Scholar
Thue, A. (1914). Problem über Veränderungen von Zeichenreihen nach gegebenen Regeln. Christiana Videnskaps-Selskabs Skrifter, I. Math. naturv. Klasse, 10.
Google Scholar
Zhang, L. (1992). Congruential languages specified by special string-rewriting systems. In: Ito, M., ed. Words, Languages and Combinatorics (Kyoto, 1990). River Edge, NJ: World Scientific Publishing, pp. 551–563.
Google Scholar
Zhang, L. (1992). On the conjugacy problem for one-relator monoids with elements of finite order. Int. J. Algebra Comput. 2(2):209–220. DOI: 10.1142/S021819679200013X.
Google Scholar

On the word problem for weakly compressible monoids

Abstract

1 Introduction

2 Notation and auxiliary results

2.1 Monoids, words, rewriting

2.2 Super- $AFL$ s

2.3 Alternating products and bipartisan ancestry

3 Weak compression

4 Proof of Theorem 1.1

5 Corollaries of Theorem 1.1

5.1 Rational subset membership problem

5.2 Subspecial one-relation monoids

Acknowledgments

References

Information for

Open access

Opportunities

Help and information

On the word problem for weakly compressible monoids

Abstract

1 Introduction

2 Notation and auxiliary results

2.1 Monoids, words, rewriting

2.2 Super-AFL s

2.3 Alternating products and bipartisan ancestry

3 Weak compression

4 Proof of Theorem 1.1

5 Corollaries of Theorem 1.1

5.1 Rational subset membership problem

5.2 Subspecial one-relation monoids

Acknowledgments

Additional information

Funding

Notes

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

2.2 Super- $AFL$ s