Algebraic tunings

Abstract

We propose an approach to tuning systems in which the octave ratio of 2 is replaced by a suitable algebraic unit τ, and note frequencies are proportional to a subset of the ring $Z [τ]$. Then it is possible for many difference tones between notes in the tuning to also appear in the tuning. After outlining more general principles, we consider in detail some natural examples based on the golden ratio $ϕ = (1 + \sqrt{5}) / 2$, limited by norm or by the number of digits in the greedy β-expansion. We discuss additive and multiplicative properties, implementation and composition using these tunings. The Online Supplement contains MIDI and websynths files to implement the tuning $S_{β}^{5} (ϕ)$ (based on β-expansions to $ϕ^{- 5}$) on websynths.com and a composition, Three Places.

1. Introduction

Conventional modern Western music is based on a 12-tone equal temperament (12TET) tuning in which consecutive pitches differ by a ratio $2^{1 / 12} \approx 1.059$, so that 12 of these smallest intervals (semitones) give a ratio of 2 (the conventional octave). This choice allows music to be transposed exactly into any key. Rational frequency ratios other than powers of 2 are also available, albeit only approximately; for example, 7 semitones give $2^{7 / 12} \approx 1.498 \approx \frac{3}{2}$. For music using only a few keys, including much composed prior to 1700, tuning in other temperaments may be preferred, in which these intervals are exact or closer to the rational ratios they approximate; see Lindley (2001). This is particularly an issue for keyboard music; performers of many other instruments (including voice) are able, to a greater or lesser extent, to adjust the pitch during performance.

One motivation for temperaments using exactly rational frequency ratios is that of difference tones (Greated 2001). Due to nonlinearities in the ear, these tones may be perceived when intervals or chords are played. Their frequency is the difference, or another simple linear combination, of the original frequencies, the latter termed “combination tones.” If the frequencies in the scale are exact rational ratios, difference tones will then often correspond to other frequencies in the scale. A recent discussion of difference tones and their use in electronic music composition may be found in Chechile (2020).

There are also many tuning systems quite different from 12TET, either from non-Western musical traditions (though it should be noted that 12TET originated in China; Cho 2010) or of more recent origin in theory, instruments and composition. These are often termed “microtonal”, though the intervals need not be smaller than in 12TET. Intervals are still typically measured in cents. As is standard in music, the interval corresponding to the frequency ratio $f_{2} / f_{1}$ is defined to be
$$1200 \log_{2} \left( \frac{f_{2}}{f_{1}} \right) \text{ cents} \tag{1}$$
so that a 100 cent interval is a 12TET semitone.
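Equation (1) is straightforward to evaluate. The following Python sketch (the function name `cents` and the variable names are chosen here purely for illustration) checks that a 12TET semitone is 100 cents and compares the tempered fifth of seven semitones with the exact ratio 3/2.

```python
import math

def cents(ratio: float) -> float:
    """Size in cents of the interval with frequency ratio f2/f1, as in equation (1)."""
    return 1200.0 * math.log2(ratio)

semitone = cents(2 ** (1 / 12))        # one 12TET semitone
tempered_fifth = cents(2 ** (7 / 12))  # seven 12TET semitones
just_fifth = cents(3 / 2)              # exact rational fifth

print(round(semitone, 3), round(tempered_fifth, 3), round(just_fifth, 3))
```

The tempered and just fifths differ by about two cents, which is the sense in which 12TET supplies the ratio 3/2 only approximately.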

Perception of octave equivalence for a frequency ratio of 2 is not universal; for example, the Tsimané people of Bolivia do not appear to have this (or any other) octave equivalence (Jacoby et al. 2019). This suggests that the human ear may become accustomed to other interval equivalences.

The ninth-century treatises Musica enchiriadis and Scolica enchiriadis (Erickson 2001) use “dasian” notation, which has equivalence at a perfect fifth (frequency ratio 3/2), and a 3-limit tuning system, that is, frequency ratios that are products of powers of 2 and 3. This fits naturally with monophonic chant, and with parallel organum using an interval of a perfect fifth, but not with parallel organum with a fourth (ratio 4/3) or with one or more parts doubled at an octave (ratio 2), also common in this period as described in these treatises.

A notable recent example of a tuning system with a different octave is the Bohlen-Pierce (BP) scale (Mathews et al. 1988), where consecutive pitches differ by a ratio $3^{1 / 13}$. This is periodic with a frequency ratio of 3 (“tritave”) and has intervals approximating ratios involving 3, 5 and 7. This makes it well suited for instruments with strong odd harmonics, such as the clarinet, and BP-tuned instruments are available commercially. Interestingly, one motivation for this scale was that of difference tones (Mathews et al. 1988).

The aim of this work is to use the above ideas for generating tuning systems, namely difference tones and non-standard octaves, in the context of algebraic number theory. Specifically, we observe that if the octave periodicity is an algebraic unit τ, and frequencies are proportional to elements of the corresponding ring $Z [τ]$, then difference tones also lie in the ring. The simplest case is the golden ratio $τ = ϕ = (1 + \sqrt{5}) / 2 \approx 1.618$. The frequencies involve two integers, being proportional to $aϕ + b$ where $a, b \in Z$, and in this sense have the same complexity as the rational numbers. This will be our main example, but we also motivate and develop this approach in more generality.

First, we discuss some relevant approaches in the existing literature. O'Connell (1993) noted that $2^{25} = 33554432$ is close to $ϕ^{36} \approx 33385282$, so that dividing each semitone into thirds (that is, 36TET) yields an interval (25/3 semitones, or close to 833 cents) very close to the golden ratio, motivated as below by sum and difference tones. A very similar scale is obtained by splitting a golden octave (frequency ratio of ϕ) into 25 equally spaced intervals. He then described compositions based on pentachords, noting the factorization of 25. Frequency ratios of 2, $\sqrt{5}$ and 3 were highlighted. In the present work the main ratio is ϕ, although the above ratios also appear.
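The closeness of these numbers is easy to confirm numerically. The short sketch below, with illustrative variable names, expresses the golden ratio in cents and compares it with 25 steps of 36TET.

```python
import math

PHI = (1 + math.sqrt(5)) / 2

golden_octave_cents = 1200 * math.log2(PHI)  # the golden ratio as an interval in cents
steps_36tet = 25 * (1200 / 36)               # 25 steps of 36TET, i.e. 25/3 semitones
ratio_of_powers = PHI ** 36 / 2 ** 25        # how close phi^36 is to 2^25

print(round(golden_octave_cents, 2), round(steps_36tet, 2), round(ratio_of_powers, 4))
```

The two intervals agree to within about a quarter of a cent, consistent with the ratio of powers being within one percent of 1.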

Some related history and scales can be found in Smethurst (2016). Of note is the Bohlen 833 scale, which does not appear to be published other than on websites such as Bohlen (2012). This scale consists of a fundamental and its 2nd, 3rd and 4th harmonics and subharmonics, repeated every golden octave. Unlike the equally tempered approach of O'Connell, this scale has closest intervals of three different sizes. It also has several difference tones in the scale, though the numbers of distinct sums and differences are not much less than the generic value of $n (n + 1) / 2$ for n notes; see Figure 2 below.

Figure 1. Arithmetic sequences in the golden scales, with notes at $x = aϕ + b$ for integers $(a, b)$ . The sequences are straight line segments in the $(a, b)$ plane, but curved in these coordinates. See section 5.

Figure 2. Sum-product phenomenon for the $S_{N}$ scale (upper), $S_{β}$ scale (middle) and Bohlen 833 scale (lower). The number of distinct sums, positive differences, and products are shown in blue, orange and green, respectively, plotted against the number of notes used. See section 7.

In this paper we continue further in this direction, constructing scales with many difference tones equal to each other and/or contained in the scale. This has the effect that almost all intervals (i.e. multiplicative ratios) are different.

In section 2, we begin by generalizing the difference tone rationale, and discuss the algebraic number systems and choice of the octave periodicity ratio τ. Section 3 contains relevant properties of ϕ and similar algebraic numbers. Section 4 defines the scales we propose, based on ϕ. Section 5 considers their additive properties (i.e. difference tones) and section 6 their multiplicative properties (i.e. intervals). Section 7 gives a relation to an interesting open mathematical problem, the sum-product conjecture. Section 8 gives considerations in implementing and experimenting with these scales, and section 9 some ideas for composition, and discussion of a first composition, Three Places.

2. Octave equivalence using algebraic units

Define a scale $\hat{S} \subset R_{> 0}$ as a nonempty set of positive real numbers representing frequencies, measured in Hz. One element $f \in \hat{S}$ denotes the “fundamental” frequency. We consider $S = \{x : fx \in \hat{S}\}$ for now; this is equivalent to setting f = 1. In section 4, we will choose another convenient value. We also assume that S is locally finite away from zero.

We require that S be log-periodic, that is, the set of logarithms of elements of S has period $\ln τ$ for some real number $τ > 1$ . In other words, $\forall x \in S$ , $τx \in S$ and $τ^{- 1} x \in S$ ; the pitch class of x is the set $τ^{n} x$ for $n \in Z$ . In most traditional scales, $τ = 2$ , corresponding to the octave, but here it will differ. We choose τ to be the smallest such value, since S is also invariant under multiplication and division by integer powers of τ. Though S is locally finite away from zero by assumption, the log-periodicity implies that S accumulates at zero.

We now introduce some standard definitions in algebraic number theory. We denote by $Z [τ, τ^{- 1}]$ the minimal set containing $\{1, τ, τ^{- 1}\}$ and closed under addition, subtraction and multiplication. This consists of all integer linear combinations of arbitrary positive and negative powers of τ, and so is in general not finitely generated over $Z$. However, if τ is an algebraic integer of degree d, i.e. has a minimal polynomial of degree d with integer coefficients and leading coefficient 1, then all powers $\geq d$ in the sum can be written in terms of the powers $0 \dots d - 1$. If, in addition, τ is an algebraic unit, i.e. its minimal polynomial also has constant term $\pm 1$, then all negative powers in the sum can also be written in terms of the powers $0 \dots d - 1$. In this case $τ^{- 1}$ is in $Z [τ]$ and we can use the latter notation rather than $Z [τ, τ^{- 1}]$. The space now has dimension d over $Z$ and is a good candidate for constructing scales with (in some reasonable sense) many difference tones in the scale.

One more standard definition needed here is that of the (field) norm of an element of $Z [τ]$. Multiplication by an element $x \in Z [τ]$ corresponds to a d-dimensional linear operator over $Z$. The norm $N (x)$ is the determinant of this operator, and hence is multiplicative, i.e. $N (xy) = N (x) N (y)$. Strictly speaking, the norm is defined relative to the original ring $Z$ and extension $Z [τ]$, but these are clear from the context and omitted from the notation.

Later we will need the norm in $Z [ϕ]$, where $ϕ = (1 + \sqrt{5}) / 2$ is the golden ratio. The minimal polynomial $x^{2} - x - 1$ has highest and lowest terms with coefficient $\pm 1$, so ϕ is an algebraic unit. If we write $x \in Z [ϕ]$ as $x = aϕ + b$, then $ϕ^{2} = ϕ + 1$ implies that $xϕ = (a + b) ϕ + a$ and the determinant of the multiplication operator is
$$N (aϕ + b) = \begin{vmatrix} a + b & a \\ a & b \end{vmatrix} = b^{2} + ab - a^{2} \tag{2}$$

In conventional just intonation scales, the octave equivalence is $τ = 2$ (not an algebraic unit). Frequencies are of the form $p_{1}^{k_{1}} p_{2}^{k_{2}} \dots / 2^{q}$ where the numerator is typically given as a product of small non-negative powers of primes (for example 2, 3, 5 for 5-limit tuning). Combining the numerator into a single integer p, we can reduce the number of required integer parameters to two, that is, frequencies are dyadic rational numbers of the form $p / 2^{q} \in Z [1 / 2]$. The dasian scale discussed in the introduction has $τ = 3 / 2$ (not an algebraic integer). Frequencies are of the 3-limit form $2^{k_{1}} 3^{k_{2}} = p / 6^{q} \in Z [1 / 6]$, where now $k_{1}$ and $k_{2}$ are integers satisfying $5 \leq k_{1} + k_{2} \leq 8$. Representing either in terms of p and q or $k_{1}$ and $k_{2}$, there are two integer parameters.
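Returning to the norm of equation (2), the multiplication rule and the multiplicative property can be checked with exact integer arithmetic. The sketch below is an illustration only: it stores $aϕ + b$ as the integer pair `(a, b)`, and the helper names `mul` and `norm` are assumptions introduced here, not notation from the text.

```python
from typing import Tuple

Elem = Tuple[int, int]  # (a, b) stands for a*phi + b

def mul(x: Elem, y: Elem) -> Elem:
    """Multiply (a*phi + b)(c*phi + d), reducing with phi**2 = phi + 1."""
    a, b = x
    c, d = y
    # a*c*phi^2 + (a*d + b*c)*phi + b*d = (a*c + a*d + b*c)*phi + (a*c + b*d)
    return (a * c + a * d + b * c, a * c + b * d)

def norm(x: Elem) -> int:
    """Field norm N(a*phi + b) = b^2 + a*b - a^2, as in equation (2)."""
    a, b = x
    return b * b + a * b - a * a

PHI_ELEM = (1, 0)   # phi itself
SQRT5 = (2, -1)     # 2*phi - 1 = sqrt(5)
y = (3, 4)          # an arbitrary test element

print(norm(PHI_ELEM), norm(SQRT5), norm(mul(SQRT5, y)), norm(SQRT5) * norm(y))
```

Because ϕ is irrational, two pairs represent the same real number only if they are equal, so comparisons in this representation are exact.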

The same number of parameters is required for τ a degree 2 (i.e. quadratic) algebraic unit. These are solutions to $x^{2} - ax \pm 1 = 0$ for integer a, thus those greater than unity are of the form $(a + \sqrt{a^{2} \mp 4}) / 2$ , and easily enumerated. Those less than 3 are the golden ratio $ϕ \approx 1.618$ , the silver ratio $s = 1 + \sqrt{2} \approx 2.414$ , and also $ϕ^{2} \approx 2.618$ . The degree 3 (i.e. cubic) units in contrast are dense unless there is an additional condition such as the Pisot property discussed in the next section.
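The enumeration of quadratic units is simple enough to automate. The following sketch (the function name and the search range are illustrative assumptions) lists the larger roots of $x^{2} - ax \pm 1 = 0$ that lie strictly between 1 and a given limit, skipping cases with no real root or with a rational root.

```python
import math

def quadratic_units_below(limit):
    """Return (a, sign, root) for irrational roots in (1, limit) of x^2 - a*x + sign = 0."""
    found = []
    # The larger root exceeds a/2, so a < 2*limit is enough to search.
    for a in range(1, 2 * int(limit) + 2):
        for sign in (+1, -1):
            disc = a * a - 4 * sign
            if disc <= 0 or math.isqrt(disc) ** 2 == disc:
                continue  # no real root, or a rational root
            root = (a + math.sqrt(disc)) / 2
            if 1 < root < limit:
                found.append((a, sign, root))
    return sorted(found, key=lambda item: item[2])

units = quadratic_units_below(3)
for a, sign, root in units:
    print(a, sign, round(root, 3))
```

For a limit of 3 this returns the three values named above: the golden ratio, the silver ratio and the square of the golden ratio.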

3. The golden ratio, properties and generalizations

In this paper we will focus on the golden ratio ϕ as the ratio for octave equivalence. This section describes some other properties of ϕ, that are relevant in selecting algebraic units on which to base alternative scales, and in selecting elements of $Z [τ]$ to include in the scale.

Algebraic numbers give the asymptotic growth of solutions of linear recurrences with coefficients from their minimal polynomial. For ϕ, this is the Fibonacci sequence defined by $F_{1} = F_{2} = 1$, $F_{n} = F_{n - 1} + F_{n - 2}$. It is easy to show (for example by induction) that
$$ϕ^{n} - (- ϕ)^{- n} = F_{n} \sqrt{5} \tag{3}$$
$$ϕ^{n} + (- ϕ)^{- n} = F_{n + 1} + F_{n - 1} \tag{4}$$
Equation (3) may be used to calculate $F_{n}$. Equation (4) shows that for large n, $ϕ^{n}$ is close to an integer; this is due to the following property.
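Equations (3) and (4) can be spot-checked numerically. The sketch below uses floating point, so it is a sanity check rather than a proof; the helper names are chosen here for illustration, and the convention $F_{0} = 0$ is assumed so that equation (4) makes sense at n = 1.

```python
import math

PHI = (1 + math.sqrt(5)) / 2
SQRT5 = math.sqrt(5)

def fib(n):
    """Fibonacci numbers with F_0 = 0 and F_1 = F_2 = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a

def identity_errors(n):
    """Residuals of equations (3) and (4) for a given n >= 1."""
    conjugate_term = (-PHI) ** (-n)
    err3 = PHI ** n - conjugate_term - fib(n) * SQRT5
    err4 = PHI ** n + conjugate_term - (fib(n + 1) + fib(n - 1))
    return err3, err4

max_error = max(abs(e) for n in range(1, 21) for e in identity_errors(n))
print(max_error)
```

The residuals stay at the level of rounding error for n up to 20.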

A Pisot number (Bertin et al. 2012) is a real algebraic integer greater than 1 all of whose other Galois conjugates (i.e. the other roots of the minimal polynomial) have complex magnitude less than 1. The distance between the nth power of a Pisot number and the nearest integer converges to zero as $n \to \infty$. A positive power of a Pisot number is Pisot. The golden ratio is Pisot; this can be confirmed from the roots of the minimal polynomial. In general there are criteria using inequalities for the coefficients of the minimal polynomial; see Theorem 2.2 of Akiyama and Gjini (2005). The quadratic and cubic Pisot units less than 3 are tabulated in Table 1. This table also includes information about the β-expansion (see below) and common names for the golden ratio and some of the others.

Table 1. Pisot units of degree 2 and 3, of magnitude less than 3, and not powers of smaller Pisot units.

A β-expansion (Charlier, Cisternino, and Dajani 2021) is an expression for a real number in powers of an arbitrary real $β > 1$,
$$x = \sum_{j = - \infty}^{j_{\max}} c_{j} β^{j} \tag{5}$$
where $c_{j} \in Z \cap [0, β)$. In general this is non-unique: there are many possible $\{c_{j}\}$ that satisfy equation (5). The greedy β-expansion is obtained by starting from the largest possible $j_{\max}$ and choosing the largest possible $c_{j}$ for each j decreasing towards $- \infty$. The β-expansion of 1, denoted $d_{β} (1)$, is the greedy β-expansion starting from j = −1. For Pisot β it is known to be finite or repeating; see Table 1. The greedy condition can be stated as follows: no consecutive set of $c_{j}$ is lexicographically greater than or equal to $d_{β} (1)$. For $τ = ϕ$ we have $d_{β} (1) = 0.11$, which leads to the simple condition $c_{j} c_{j + 1} = 0$, that is, the finite sequences of $c_{j}$ in the greedy β-expansion are exactly those without consecutive 1s (also, similar to the infinite trailing sequence of 9s that does not occur in decimal expansions, the β-expansion excludes an infinite trailing sequence $\overline{01}$). Here, we use β-expansions as an approach to deciding which elements of $Z [β]$ to include in the scale.
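The greedy procedure for base ϕ can be sketched in a few lines. The version below is a floating-point illustration with a small tolerance `eps` and a cut-off exponent `j_min`, both of which are assumptions made here for the sake of a runnable example; an exact implementation would work in $Z [ϕ]$ with integer pairs instead.

```python
import math

PHI = (1 + math.sqrt(5)) / 2

def greedy_phi_exponents(x, j_min=-12, eps=1e-9):
    """Exponents j with digit c_j = 1 in the greedy base-phi expansion of x > 0.

    Uses floating point with tolerance eps; digits below phi**j_min are dropped.
    """
    if x <= 0:
        raise ValueError("x must be positive")
    j = 0
    while PHI ** (j + 1) <= x + eps:   # raise j while the next power still fits
        j += 1
    while PHI ** j > x + eps:          # lower j until phi**j fits into x
        j -= 1
    exponents, remainder = [], x
    for k in range(j, j_min - 1, -1):
        if remainder >= PHI ** k - eps:   # the only possible nonzero digit is 1, since phi < 2
            exponents.append(k)
            remainder -= PHI ** k
    return exponents

for value in (1.0, 2.0, 3.0, math.sqrt(5)):
    print(value, greedy_phi_exponents(value))
```

In each printed expansion the exponents differ by at least 2, which is the "no consecutive 1s" condition $c_{j} c_{j + 1} = 0$ described above.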

Finally, we mention the Diophantine properties of the golden ratio and related algebraic units (see for example Rockett and Szusz 1992), in other words, how close are intervals to the rational frequency ratios appearing in conventional (just intonation) music? It turns out that the golden ratio is, in a precise sense, maximally irrational, that is, badly approximable by rationals. Hurwitz's theorem states that for any irrational τ, the inequality
$$\left| τ - \frac{p}{q} \right| < \frac{K}{q^{2}} \tag{6}$$
has infinitely many integer solutions $(p, q)$ if $K \geq \frac{1}{\sqrt{5}}$. If $τ = \frac{a + bϕ}{c + dϕ}$ for integers a, b, c, d with $| ad - bc | = 1$ then this bound is sharp. If $τ = \frac{a + bs}{c + ds}$ with $| ad - bc | = 1$ and $s = 1 + \sqrt{2}$ (the silver ratio) then the bound $K \geq \frac{1}{\sqrt{8}}$ is sharp. All other irrationals are better approximated by rationals, in that the bound may be replaced by $K \geq \frac{5}{\sqrt{221}} = \frac{1}{\sqrt{8.84}}$. Each quadratic irrational has its own bound; these are exactly the numbers with eventually periodic continued fraction expansions. There is a quadratic irrational ratio in 12TET, namely six semitones, which give a frequency ratio $2^{6 / 12} = \sqrt{2}$.
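The sharpness of the constant $1 / \sqrt{5}$ for the golden ratio can be seen numerically from its continued fraction convergents, which are ratios of consecutive Fibonacci numbers. The sketch below is illustrative only (the function name is an assumption, and floating point limits it to moderate denominators); it evaluates $q^{2} | ϕ - p / q |$ along these convergents.

```python
import math

PHI = (1 + math.sqrt(5)) / 2

def scaled_errors(count):
    """q^2 * |phi - p/q| for the first `count` Fibonacci convergents p/q of phi."""
    p, q = 1, 1                      # F_2 / F_1
    values = []
    for _ in range(count):
        values.append(q * q * abs(PHI - p / q))
        p, q = p + q, p              # next convergent F_{n+2} / F_{n+1}
    return values

values = scaled_errors(20)
print([round(v, 4) for v in values[:6]], round(values[-1], 6), round(1 / math.sqrt(5), 6))
```

The values oscillate around $1 / \sqrt{5} \approx 0.4472$ and converge to it, so no smaller K in inequality (6) admits infinitely many of these convergents.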

However, for algebraic numbers of higher degree, the continued fraction expansions appear to have similar properties to generic real numbers but less is known; it is conjectured that K may be made arbitrarily small in equation (6) and it is known that replacing $q^{2}$ by $q^{2 + ϵ}$ leads to only finitely many solutions (Roth 1955). In summary, cubic units have more irregular approximation properties than quadratic units. For musical purposes, the ear can distinguish only the first few approximations; for example, the plastic number p is 11 cents from 4/3. The next approximant has an error of less than 1 cent, but at 49/37 is not a simple ratio. Other cubic units have approximations with larger denominators; for example, the supergolden number is 1 cent from 22/15 and the tribonacci number 6 cents from 11/6. For comparison, the cubic irrationals in 12TET are four and eight semitones, with frequency ratios $2^{1 / 3}$ and $2^{2 / 3}$; these are about 14 cents from 5/4 and 8/5 respectively, and less than 1 cent from 63/50 and 100/63 respectively.
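These cent values can be reproduced with a few lines of Python. The sketch below assumes the standard defining cubics for the three constants ($x^{3} = x + 1$ for the plastic number, $x^{3} = x^{2} + 1$ for the supergolden number and $x^{3} = x^{2} + x + 1$ for the tribonacci number), which are not stated in the text above, and finds each root by bisection on the interval [1, 2].

```python
import math

def cents(ratio):
    return 1200 * math.log2(ratio)

def root_between_1_and_2(f):
    """Bisection for a root of f on [1, 2], assuming f(1) < 0 < f(2)."""
    lo, hi = 1.0, 2.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

plastic = root_between_1_and_2(lambda x: x**3 - x - 1)
supergolden = root_between_1_and_2(lambda x: x**3 - x**2 - 1)
tribonacci = root_between_1_and_2(lambda x: x**3 - x**2 - x - 1)

deviations = {
    "plastic vs 4/3": cents(plastic / (4 / 3)),
    "plastic vs 49/37": cents(plastic / (49 / 37)),
    "supergolden vs 22/15": cents(supergolden / (22 / 15)),
    "tribonacci vs 11/6": cents(tribonacci / (11 / 6)),
    "2^(1/3) vs 5/4": cents(2 ** (1 / 3) / (5 / 4)),
}
for name, value in deviations.items():
    print(name, round(value, 2))
```

A negative value means the irrational lies below the rational ratio it is compared with.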

4. Defining scales

Since scales consist of isolated frequencies, we must keep only a finite number of values per τ-octave. There are at least two natural choices based on the definitions we have considered so far. We can use the norm as a bound, that is, define $S_{N}^{B} (τ) = \{x > 0 \mid | N (x) | \leq B\}$, where the unit τ appears implicitly in the norm N, and the fact that it is a unit implies the log-periodicity of the set. Another approach is to note that the set of x>0 with $N (x) > 0$ forms a cone, which if convex (as it is for ϕ) is closed under both addition and multiplication and hence suitable for defining a scale $S_{N}^{B +} (τ)$, replacing the inequality above by $0 < N (x) \leq B$; we will not consider this further here, except to note that $S_{N}^{36 +} (ϕ)$ has the same number of notes as $S_{β}^{5} (ϕ)$ and so fits on a MIDI keyboard (see equations (7), (8) below).

Alternatively, we can use the greedy β-expansion and define
$$S_{β}^{B} (τ) = \left\{ x > 0 \;\middle|\; \exists\, c_{j} \text{ greedy, so that } x = \sum_{j = j_{\max} - B}^{j_{\max}} c_{j} τ^{j} \right\} \tag{7}$$
which is again log-periodic by definition. It is possible (but perhaps less natural) to remove the greedy condition, and impose only $c_{j} < τ$. We will not consider this further here, except to note that $S_{β}^{5} (ϕ)_{\text{greedy}} = S_{β}^{4} (ϕ)_{\text{non-greedy}}$.

We now focus on the golden ratio, and consider two scales in $Z [ϕ]$ satisfying the previously defined properties. The scale $S_{β}^{5} (ϕ)$ consists of notes with greedy β-expansion of $\leq 6$ digits, leading to 8 notes per ϕ-octave, and will be denoted $S_{β}$ for brevity. The scale $S_{N}^{20} (ϕ)$ contains notes with norms of magnitude $\leq 20$ , leading to 10 notes per golden octave, which turns out to be those of $S_{β}$ together with two others (of 7 digit beta representation). This will be denoted $S_{N}$ for brevity.
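The note counts quoted above can be reproduced by brute force. The sketch below is an illustration under stated assumptions: elements $aϕ + b$ are stored as integer pairs `(a, b)`, the leading digit of each expansion is placed at $ϕ^{0}$ so that one octave is the interval $[1, ϕ)$, and the function names and the search bound of 40 are choices made here rather than anything prescribed by the text.

```python
import math
from itertools import product

PHI = (1 + math.sqrt(5)) / 2

def value(e):
    a, b = e
    return a * PHI + b

def norm(e):
    a, b = e
    return b * b + a * b - a * a

def phi_inverse_power(k):
    """phi**(-k) as an integer pair, using phi**(-1) = phi - 1."""
    a, b = 0, 1
    for _ in range(k):
        a, b = b, a - b   # (a*phi + b)(phi - 1) = b*phi + (a - b)
    return (a, b)

def s_beta_octave(B=5):
    """Notes in [1, phi) with a greedy expansion of at most B + 1 digits."""
    notes = set()
    for tail in product((0, 1), repeat=B):
        digits = (1,) + tail
        if any(x and y for x, y in zip(digits, digits[1:])):
            continue      # consecutive 1s are not greedy in base phi
        terms = [phi_inverse_power(k) for k, d in enumerate(digits) if d]
        notes.add((sum(t[0] for t in terms), sum(t[1] for t in terms)))
    return notes

def s_norm_octave(bound=20, search=40):
    """Notes in [1, phi) with 0 < |N(x)| <= bound, by exhaustive search."""
    return {(a, b)
            for a in range(-search, search + 1)
            for b in range(-search, search + 1)
            if 1 <= value((a, b)) < PHI and 0 < abs(norm((a, b))) <= bound}

beta_notes, norm_notes = s_beta_octave(), s_norm_octave()
print(len(beta_notes), len(norm_notes), sorted(norm_notes - beta_notes, key=value))
```

The two pairs printed at the end are the extra notes of $S_{N}$ that are not in $S_{β}$.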

As noted in the previous section, the greedy β-expansion base ϕ is characterized by sequences with no consecutive 1s. The norm is given in equation (2) above. We have $N (ϕ) = - 1$, so that multiplying or dividing by ϕ, corresponding to going up or down by a golden octave, leads to a change of sign in the norm. Using these definitions, the notes of the scales $S_{N}$ and $S_{β}$ defined above are those presented in Table 2.

Table 2. Notes of the golden scale $S_{N}$ .

The naming of the pitch classes uses $\{α, β, γ, δ, ϵ\}$ for numbers represented by at most 5 digits in the β-expansion. A ♭ or ♯ corresponds to subtracting or adding $ϕ^{- 6}$, respectively. For more general $S_{β}^{B} (τ)$ or $S_{N}^{B} (τ)$, it is natural to give notes in scales with smaller B separate letters, and accidentals for the rest, but it seems difficult to make from this a precise and workable standard nomenclature.

The $S_{β}$ scale of eight pitch classes can be deployed by retuning a MIDI keyboard, since three golden octaves (24 notes) span close to the same number of semitones, more precisely
$$1200 \frac{\log ϕ^{3}}{\log 2} \approx 2499.27 \tag{8}$$
cents, that is, just under 25 semitones. The $S_{N}$ scale has too many pitch classes for a MIDI keyboard but has the advantage that it contains more arithmetic progressions, that is, chords of fixed difference tone frequency.

For a suitable choice of fundamental frequency f, there are sequences of the form $f ϕ^{n}$ which for large n are all close to integers, following from the Pisot property of ϕ above. We choose f to be $f = ϕ^{10} / \sqrt{5} \approx 55.0036$ Hz, which due to equation (3) is close to a Fibonacci number. The Fibonacci frequency 55 Hz is A1 on the usual 12TET scale, being three octaves below concert pitch ($440 \text{ Hz} = 55 \times 2^{3} \text{ Hz}$). Here we use Greek letters for notes on the golden scale, and denote the fundamental frequency $α 1$. The frequencies for both scales are given in Table 3 and a mapping (of $S_{β}$) to a MIDI keyboard is given in Table 4; see also the Online Supplement files described at the end of this paper. Note that we have shifted this correspondence by two semitones, so $α 1$ corresponds to B1, in order to reduce the amount of tuning required in higher octaves, and also to balance the most extreme accidentals (C♭ and E♯).
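The choice of fundamental and the span of three golden octaves can both be checked directly. In the sketch below the variable names are illustrative; the α notes are taken to be the fundamental multiplied by successive powers of ϕ, as described above.

```python
import math

PHI = (1 + math.sqrt(5)) / 2

fundamental = PHI ** 10 / math.sqrt(5)                 # alpha 1, in Hz
alpha_notes = [fundamental * PHI ** n for n in range(6)]
three_octaves_cents = 1200 * math.log2(PHI ** 3)       # the quantity in equation (8)

print([round(f, 4) for f in alpha_notes], round(three_octaves_cents, 2))
```

The α frequencies land within a few thousandths of a hertz of the Fibonacci numbers 55, 89, 144, 233, 377 and 610, alternately above and below, as equation (3) predicts.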

Table 3. Frequencies in the golden scales.

Table 4. Keyboard mapping of $S_{β}$ , over the full MIDI range.

5. Additive properties – arithmetic sequences

Arithmetic sequences are one of the main motivations for considering scales based on number fields, since they have common difference tones, which may also be in the scale. Looking at the lattice representations of the notes in Table 2, it is clear that there are some arithmetic sequences, for example $\{α, γ, ϵ ♭\}$. However, it is not obvious how to identify all such sequences, given the repetition of the scale in different octaves. If all can be identified for a given fixed scale, it is still not clear how to choose a finite set of pitch classes to maximise (in some precise sense) the number of such sequences.

Whilst we cannot give a definitive answer to these questions here, we can point to a useful approach for identifying arithmetic sequences. In Figure 1 we plot the notes of the scale with log magnitude $\ln (aϕ + b)$ on the x-axis and norm $b^{2} + ab - a^{2}$ on the y-axis. The plot is invariant under transposition by a ϕ-octave, which corresponds to translation by $\ln ϕ \approx 0.4812$ to the right and reflection across the horizontal axis.

Arithmetic sequences are straight lines in $(a, b)$ space, which become suitably curved in these coordinates. We included the most relevant curved segments in Figure 1; clearly they are in fact dense. The additional two notes in $S_{N}$, $α ♯$ and $γ ♯$, are thus helpful in providing the long arithmetic sequence $\{ϵ 1, δ ♭ 2, α ♯ 3, γ ♯ 3, ϵ 3, β ♭ 4, γ 4, δ 4, ϵ 4\}$ and in extending two of the others.
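Because arithmetic sequences are straight lines in $(a, b)$ space, they can be tested exactly with integer pairs. The sketch below checks the example $\{α, γ, ϵ ♭\}$ from this section and confirms that its common difference is itself a note of $S_{β}$ in a lower octave. The integer pairs and the assignment of names to them are assumptions made here, inferred from the β-expansion digits and the intervals quoted in the text, since the table of notes is not reproduced in this version.

```python
# Pitch classes of S_beta in the octave [1, phi), as pairs (a, b) meaning a*phi + b.
# Names are an assumption inferred from the text, not copied from the original table.
S_BETA = {
    "alpha": (0, 1), "beta_flat": (5, -7), "beta": (-3, 6), "gamma": (2, -2),
    "delta_flat": (7, -10), "delta": (-1, 3), "epsilon_flat": (4, -5), "epsilon": (-4, 8),
}

def shift_octaves(e, n):
    """Multiply by phi**n: up uses (a, b) -> (a + b, a), down uses (a, b) -> (b, a - b)."""
    a, b = e
    for _ in range(abs(n)):
        a, b = (a + b, a) if n > 0 else (b, a - b)
    return (a, b)

def common_difference(seq):
    """Return the common difference of an arithmetic sequence of pairs, else None."""
    diffs = {(y[0] - x[0], y[1] - x[1]) for x, y in zip(seq, seq[1:])}
    return diffs.pop() if len(diffs) == 1 else None

def in_scale(e, octaves=range(-8, 9)):
    """True if e equals some pitch class of S_BETA shifted by a whole number of octaves."""
    return any(shift_octaves(p, n) == e for p in S_BETA.values() for n in octaves)

chord = [S_BETA["alpha"], S_BETA["gamma"], S_BETA["epsilon_flat"]]
step = common_difference(chord)
print(step, in_scale(step))
```

Here the common difference is $2ϕ - 3 = ϕ^{- 3}$, that is, the pitch class α three golden octaves below the lowest note of the chord.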

6. Multiplicative properties – intervals

Given the emphasis on additive properties, that is, many difference tones are equal, it is not surprising that with regard to multiplicative properties, the reverse is true, that is, almost all ratios (corresponding to musical intervals) are different. The intervals, up to three golden octaves, are given in Table 5.

Table 5. Intervals in the golden scales, in cents.

Table 5 illustrates this: since the norm is multiplicative, the ratios are seen to differ for almost all combinations. Even where two norms have magnitude 11 or 19, the ratios are not algebraic integers, since the only units in $Z [ϕ]$ are powers of ϕ.

Table 5 also illustrates some special intervals using the “exact” column. There are rational intervals with ratios 2 ($α 1$ - $γ 2$, $γ 1$ - $ϵ 2$, $δ 1$ - $α ♯ 3$), 3 ($α 1$ - $β 3$) and related combinations (4/3, 3/2, 4), as well as maximally irrational intervals ϕ (all notes up a golden octave) and $\sqrt{5}$ ($α 1$ - $δ 2$, $γ 1$ - $α ♯ 3$). Combining ϕ with the other intervals also yields combinations that occur more than once; for example, a golden third of ratio $2 / ϕ = \sqrt{5} - 1 \approx 1.236$ is found for $α 1$ - $γ 1$, $γ 1$ - $ϵ 1$ and $δ 1$ - $α ♯ 2$. This interval lies between the minor and major thirds, for example the just minor third $6 / 5 = 1.2$ and 12TET minor third $2^{1 / 4} \approx 1.189$, and the just major third $5 / 4 = 1.25$ and 12TET major third $2^{1 / 3} \approx 1.260$. Similarly for other intervals such as a golden fourth $ϕ^{2} / 2 \approx 1.309$, which is a little less than the just perfect fourth $4 / 3 \approx 1.333$ and 12TET perfect fourth $2^{5 / 12} \approx 1.335$ and corresponds to $γ 1$ - $α 2$, $ϵ 1$ - $γ 2$ and $α ♯ 1$ - $δ 1$. The only geometric sequences are $αa$ - $γb$ - $ϵc$ and $ξa$ - $ξb$ - $ξc$ where $(a, b, c)$ are integers in arithmetic progression (including all equal) and ξ is any pitch class.

Equation (3) gives some large “approximately integer” intervals. For $δ 1$ - $αn$ we have a frequency ratio $ϕ^{n} / \sqrt{5}$, and for large n the $ϕ^{- n}$ term can be neglected, leading to a Fibonacci interval. For example, n = 6 gives
$$\frac{ϕ^{6}}{\sqrt{5}} \approx 8.0249 \tag{9}$$
which is about 5 cents above three conventional octaves.

7. The sum-product conjecture

The observation above, that having few distinct differences goes together with having many distinct intervals, is close to a well-known problem in the mathematical literature. The sum-product phenomenon asserts that for a finite set A in a suitable ambient space (here $R$), at least one of the sum set A + A and the product set AA should be large. One form of this is the Erdős-Szemerédi sum-product conjecture, which states that for all large $| A |$ we have
$$\max \{ | A + A |, | AA | \} \geq | A |^{2 - o (1)} \tag{10}$$
The best rigorous bound has an exponent of 1.335 (Rudnev and Stevens 2022).

Our scales have relatively few sums but many products, whilst the closest relative, the Bohlen 833 scale (see the introduction), has fewer products. This is depicted in Figure 2. Note that our scales have fewer sums and differences as expected, with a transition to quadratically increasing behaviour at around five golden octaves, the precision of the β-expansion. For seven octaves of $S_{N}$, we have $| A | = 70$, $| A + A | = 516$, $| AA | = 678$, so both $| A + A |$ and $| AA |$ are less than $| A |^{1.54}$. This suggests that a growing sequence of sets along these lines (for example, all elements of a ring with bounds on magnitude and norm) may be a possible example to test the sum-product conjecture.
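Counts of this kind can be computed exactly, since sums and products of elements $aϕ + b$ are again integer pairs. The sketch below is one possible way to set up the experiment, under assumptions made here: notes are found by exhaustive search in one octave $[1, ϕ)$ and copied up six further golden octaves, and the sum set includes each note added to itself. The printed counts can then be compared with the figures quoted above; they are deliberately not hard-coded.

```python
import math

PHI = (1 + math.sqrt(5)) / 2

def value(e):
    return e[0] * PHI + e[1]

def norm(e):
    a, b = e
    return b * b + a * b - a * a

def mul(x, y):
    """(a*phi + b)(c*phi + d) reduced with phi**2 = phi + 1."""
    a, b = x
    c, d = y
    return (a * c + a * d + b * c, a * c + b * d)

def s_norm_octave(bound=20, search=40):
    return {(a, b)
            for a in range(-search, search + 1)
            for b in range(-search, search + 1)
            if 1 <= value((a, b)) < PHI and 0 < abs(norm((a, b))) <= bound}

def scale(n_octaves=7):
    notes = set()
    for e in s_norm_octave():
        for _ in range(n_octaves):
            notes.add(e)
            e = mul(e, (1, 0))        # up one golden octave
    return notes

A = scale()
sums = {(x[0] + y[0], x[1] + y[1]) for x in A for y in A}
differences = {(x[0] - y[0], x[1] - y[1]) for x in A for y in A if value(x) > value(y)}
products = {mul(x, y) for x in A for y in A}

print(len(A), len(sums), len(differences), len(products))
```

For comparison, a generic set of 70 positive reals would have 2485 distinct sums and 2485 distinct products.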

8. Implementing the scales

It is possible to synthesize sounds of arbitrary waveform and frequency electronically, though many software packages assume octave equivalence for powers of 2, even where notes within each octave may be tuned individually. One option for either computer or MIDI keyboard is websynths.com; Tables 3 and 4 provide the relevant mappings; see the Online Supplement files.

It may also be possible to retune stringed instruments. Guitars with retuned strings and moved frets are commonly used in microtonal music (Nielsen 2003). The same may be possible for bowed stringed instruments or the piano (using the above keyboard mapping), though we have not yet tried this and cannot vouch for its safety. The scales can also be played on instruments allowing arbitrary pitch, including trombone and voice.

Musical instruments apart from synthesized sine waves generate harmonics other than the fundamental frequency. For those with uniform strings or columns of air, these are at close to integer multiples of the fundamental, which is one of the main explanations for rational frequency ratios in music. These however do not correspond to notes on the golden ratio scales except where the fundamental is α or δ.

It is possible to design strings to have specified overtones by using variable thickness. Unfortunately it does not seem practical in this way to generate all the powers of ϕ, which would thus lie in the scale for all the notes; see Sethares and Hobby (2018). As a simpler case, we have considered a string of length L and mass $m_{S} = μ_{S} L$ with a small weight of mass $m_{2}$ at a point xL along it, using the formalism in the above paper; see equations (1)–(4) there. More specifically, we have a string of three segments with linear mass densities $μ_{j}$ and lengths $l_{j}$ where $l_{1} = xL$, $l_{3} = (1 - x) L$, $μ_{3} = μ_{1} = μ_{S}$ and $μ_{2} = m_{2} / l_{2}$. Taking the limit $l_{2} \to 0$ with $m_{2}$ fixed, the mode equation with appropriate boundary conditions (equation (4) in the above paper) reduces to
$$\frac{\sin \tilde{ω}}{\tilde{ω}} = \frac{m_{2}}{m_{S}} \sin (\tilde{ω} x) \sin (\tilde{ω} (1 - x)) \tag{11}$$
where $\tilde{ω} = ω \sqrt{m_{S} L / T}$ is a dimensionless frequency and T is the string tension. Optimising to find x = 0.116934 and $m_{2} / m_{S} = 0.519991$, the frequencies of vibration relative to the fundamental become $\{1, 1.618, 2.618, 3.812, 5.037, 6.271, \dots\}$. In other words, it is possible with only a single additional mass to make the harmonic series of a string start with frequency ratios $1 : ϕ : ϕ^{2}$.
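The mode frequencies can be recovered from equation (11) with a simple root search. The sketch below is illustrative rather than the procedure used to obtain the quoted values: it multiplies equation (11) through by $\tilde{ω}$, scans for sign changes of the resulting function on a grid (the step size is an assumption made here), and refines each root by bisection, using the values of x and $m_{2} / m_{S}$ given above.

```python
import math

X = 0.116934           # position of the added mass as a fraction of L (from the text)
MASS_RATIO = 0.519991  # m_2 / m_S (from the text)

def mode_function(w, x=X, r=MASS_RATIO):
    """Vanishes when the dimensionless frequency w satisfies equation (11)."""
    return math.sin(w) - r * w * math.sin(w * x) * math.sin(w * (1 - x))

def bisect(f, lo, hi, tol=1e-12):
    """Refine a root of f bracketed by a sign change on [lo, hi]."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_lo > 0) == (f_mid > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def modes(count=6, step=0.01):
    """First `count` positive roots, found by scanning for sign changes."""
    roots, w = [], step
    while len(roots) < count:
        if (mode_function(w) > 0) != (mode_function(w + step) > 0):
            roots.append(bisect(mode_function, w, w + step))
        w += step
    return roots

roots = modes()
ratios = [w / roots[0] for w in roots]
print([round(r, 3) for r in ratios])
```

The second and third ratios should come out close to ϕ and $ϕ^{2}$, while the higher modes are not constrained by the two free parameters.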

Alternatively, geometric sequences of vibration modes may be created in fractal structures; see Strichartz (1999). Note that irrational harmonics lead to aperiodic waveforms; for harmonics of sufficiently large amplitude, they may have interesting non-differentiable or fractal properties, along the lines of Weierstrass's original construction of a nowhere-differentiable function (Kaplan, Mallet-Paret, and Yorke 1984).

9. Composition

Some ideas and principles can be found in the previous literature on (especially) tunings with a golden ratio octave (refer to the introduction). The main characteristics of the tunings considered here are the arithmetic progressions and many different intervals.

The arithmetic progressions have common difference tones (refer to Figure 1), and can be used for scales and chords. The long progression containing $β ♭$ and $ϵ ♭$ is relatively close to the harmonic series (the frequencies are in ratios $n - ϕ^{- 4}$ for integer n).

The large variety of intervals can also be used to great effect. For example, the dissonant interval $ϵ ♭ 1 - β 3$ of just greater than an octave at 1232 cents can resolve to the perfect octave $α 1 - γ 2$ , where the $γ 2$ is a perfect fifth below the $β 3$ . The golden third and sixth intervals are paradoxical in that thirds and sixths are normally considered consonant, but these are as irrational as possible, in the sense given in section 3.

Ideally, composition using the golden tuning should be on its own terms, rather than imitating well-known 12TET approaches. The 12TET tuning is homogeneous and cyclic: each note is equivalent, and keys are arranged in the well-known circle of fifths. In contrast, the golden tunings are based on a linear backbone of α notes, decorated by the other notes, each with its own place and character, at increasing distance from the backbone. Distance here can refer to the number of digits needed in the β-expansion, resulting in the ordering α, δ, γ, $\{β, ϵ\}$, $\{β ♭, δ ♭, ϵ ♭\}$, $\{α ♯, β ♯, γ ♯, δ ♯, ϵ ♯\}$, …. Or it can refer to the norm, which gives the similar ordering α, γ, δ, β, $\{β ♭, ϵ ♭\}$, ϵ, $\{γ ♯, δ ♭\}$, $α ♯$, ….
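The two notions of distance can be made concrete for a general element $a + bϕ$ of $Z [ϕ]$. The Python sketch below computes the field norm $a^{2} + ab - b^{2}$ and a greedy expansion in powers of ϕ using exact integer arithmetic. It is an illustration under stated assumptions rather than the authors' procedure: the function names are hypothetical, the cutoff at $ϕ^{- 5}$ follows the description of $S_{β}^{5} (ϕ)$ in the abstract, and the assignment of note names to particular elements is not reproduced here.

```python
def sign(a, b):
    """Sign of a + b*phi, computed with integers only."""
    p, q = 2 * a + b, b            # a + b*phi = (p + q*sqrt(5)) / 2
    if p == 0 and q == 0:
        return 0
    if p >= 0 and q >= 0:
        return 1
    if p <= 0 and q <= 0:
        return -1
    d = p * p - 5 * q * q          # nonzero here because sqrt(5) is irrational
    if p > 0:                      # p > 0 > q: positive iff p^2 > 5 q^2
        return 1 if d > 0 else -1
    return 1 if d < 0 else -1      # q > 0 > p: positive iff 5 q^2 > p^2


def norm(a, b):
    """Field norm of a + b*phi in Q(sqrt(5))."""
    return a * a + a * b - b * b


def times_phi(a, b):
    """(a + b*phi) * phi, using phi^2 = phi + 1."""
    return (b, a + b)


def over_phi(a, b):
    """(a + b*phi) / phi, using 1/phi = phi - 1."""
    return (b - a, a)


def greedy_expansion(a, b, min_exp=-5):
    """Exponents k of the powers phi^k chosen greedily, down to min_exp.

    Returns (exponents, remainder); a remainder of (0, 0) means the
    expansion terminated within the allowed digits.
    """
    if sign(a, b) <= 0:
        raise ValueError("expected a positive element of Z[phi]")
    power, k = (1, 0), 0
    while True:                    # climb to the largest power not exceeding the input
        nxt = times_phi(*power)
        if sign(a - nxt[0], b - nxt[1]) < 0:
            break
        power, k = nxt, k + 1
    while sign(a - power[0], b - power[1]) < 0:   # or descend if the input is below 1
        power, k = over_phi(*power), k - 1
    exponents = []
    while sign(a, b) > 0 and k >= min_exp:
        if sign(a - power[0], b - power[1]) >= 0:
            a, b = a - power[0], b - power[1]
            exponents.append(k)
        power, k = over_phi(*power), k - 1
    return exponents, (a, b)


print(norm(0, 1))              # norm of phi itself
print(greedy_expansion(2, 0))  # 2 = phi + phi^-2
print(greedy_expansion(5, 0))  # 5 = phi^3 + phi^-1 + phi^-4
```

Working with integer pairs avoids the floating-point comparisons that can misplace a digit when an element sits exactly on a power of ϕ.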

The parallel organum common in ninth century chant mentioned in the introduction also works here, with the perfect fourth or fifth replaced by the golden ratio octave. We could alternatively (or as well) use an interval of two golden ratio octaves, which is 34 cents below a 12TET perfect eleventh.
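The 34 cent figure follows directly from the cents formula $1200 \log_{2} (f_{2} / f_{1})$. A minimal Python check, with illustrative variable names and assuming the 12TET perfect eleventh is taken as an octave plus a perfect fourth (17 semitones):

```python
import math

PHI = (1 + math.sqrt(5)) / 2


def cents(ratio):
    """Size of a frequency ratio in cents."""
    return 1200 * math.log2(ratio)


golden_octave = cents(PHI)             # one golden ratio octave
two_golden_octaves = 2 * golden_octave
perfect_eleventh_12tet = 17 * 100      # octave (12 semitones) plus fourth (5 semitones)
gap = perfect_eleventh_12tet - two_golden_octaves

print(round(golden_octave, 2), round(two_golden_octaves, 2), round(gap, 2))
```

The gap rounds to 34 cents, matching the value quoted above.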

The composition Three Places (in the Supplemental Online material) is written using the keyboard mapping of $S_{β}$ shown in Table 4. The intention was to approach the tuning system entirely intuitively, allowing the ear to guide the process and pick out interesting chords and pitch combinations. Given the lack of octave periodicity, the piece is formed from three localized musical ideas and their interactions. Each musical idea is described as a ‘place’, and whilst the notes contained in each idea are relatively consonant, the transition from one idea to the next can be abrupt and surprising. With music composed using unfamiliar tuning systems, a ‘settling in’ period of adjustment can be helpful to the listener, provided in Three Places by the oscillating tones at the opening of the piece.

10. Conclusion

Golden ratio scales have led us to wide vistas of mathematics, including many aspects of algebraic number theory, the sum-product conjecture, and non-differentiable curves. There are many open mathematical questions. For example, if we constrain the number of notes per octave, are these scales optimal from the point of view of the number of arithmetic sequences or of sum or difference tones in the scale? From a musical point of view there remains the challenging task of further developing relevant principles of melody and harmony. The same approach, defining scales by bounding the β-expansion or norm, can be applied to other algebraic units, such as those in Table 1.

Acknowledgments

The authors acknowledge helpful discussions with Dan Fretwell, Jake Langham and Misha Rudnev.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplemental data

Supplemental data for this article can be accessed online at http://dx.doi.org/10.1080/17459737.2023.2234126.

Additional information

Funding

The authors are grateful for the support of the University of Bristol School of Mathematics composer-in-residence project, in conjunction with CREATE-REACT.

References

Akiyama, Shigeki, and Nertila Gjini. 2005. “Connectedness of Number Theoretic Tilings.” Discrete Mathematics and Theoretical Computer Science 7 (1): 269–312. https://doi.org/10.46298/dmtcs.353.
Bell, Jason P., and Kevin G. Hare. 2005. “A Classification of (Some) Pisot-Cyclotomic Numbers.” Journal of Number Theory 115 (2): 215–229. https://doi.org/10.1016/j.jnt.2004.11.009.
Bertin, Marie J., Annette Decomps-Guilloux, Marthe Grandet-Hugot, Martine Pathiaux-Delefosse, and Jean Schreiber. 1992. Pisot and Salem Numbers. Basel: Birkhäuser.
Bohlen, Heinz. 2012. “An 833 Cents Scale.” http://www.huygens-fokker.org/bpsite/833cent.html.
Charlier, Émilie, Célia Cisternino, and Karma Dajani. 2021. “Dynamical Behavior of Alternate Base Expansions.” Ergodic Theory and Dynamical Systems. Published online. https://doi.org/10.1017/etds.2021.161.
Chechile, Alexander. 2020. “Practical Applications of Difference Tones in Electronic Music Composition and Synthesis.” PhD thesis, Stanford University. https://www.proquest.com/dissertations-theses/practical-applications-difference-tones/docview/2457330846/se-2?accountid=9730.
Cho, Gene J. 2010. “The Significance of the Discovery of the Musical Equal Temperament in the Cultural History.” Journal of Xinghai Conservatory of Music 2: 1–4. http://caod.oriprobe.com/articles/24155792/The_Significance_of_the_Discovery_of_the_Musical_E.htm.
Dettmann, C. P., and N. E. Frankel. 1993. “Structure Factor of Deterministic Fractals with Rotations.” Fractals 1 (2): 253–261. https://doi.org/10.1142/S0218348X93000265.
Erickson, Raymond. 2001. “Musica enchiriadis, Scolica enchiriadis.” In Grove Music Online. https://www.oxfordmusiconline.com/grovemusic/display/10.1093/gmo/9781561592630.001.0001/omo-9781561592630-e-0000019405.
Greated, Clive. 2001. “Combination Tone.” In Grove Music Online. https://www.oxfordmusiconline.com/grovemusic/view/10.1093/gmo/9781561592630.001.0001/omo-9781561592630-e-0000006170.
Jacoby, Nori, Eduardo A. Undurraga, Malinda J. McPherson, Joaquín Valdés, Tomás Ossandón, and Josh H. McDermott. 2019. “Universal and Non-Universal Features of Musical Pitch Perception Revealed by Singing.” Current Biology 29 (19): 3229–3243. https://doi.org/10.1016/j.cub.2019.08.020.
Kaplan, James L., John Mallet-Paret, and James A. Yorke. 1984. “The Lyapunov Dimension of a Nowhere Differentiable Attracting Torus.” Ergodic Theory and Dynamical Systems 4 (2): 261–281. https://doi.org/10.1017/S0143385700002431.
Lindley, Mark. 2001. “Temperaments.” In Grove Music Online. https://www.oxfordmusiconline.com/grovemusic/view/10.1093/gmo/9781561592630.001.0001/omo-9781561592630-e-0000027643.
Mathews, Max V., John R. Pierce, Alyson Reeves, and Linda A. Roberts. 1988. “Theoretical and Experimental Explorations of the Bohlen-Pierce Scale.” The Journal of the Acoustical Society of America 84 (4): 1214–1222. https://doi.org/10.1121/1.396622.
Nielsen, Michael. 2003. “Microtonal Systems and Guitar Composition.” Master's thesis, Technological University Dublin.
O'Connell, Walter. 1993. “The Tonality of the Golden Section.” Xenharmonikôn 15: 3–18. https://xh.xentonic.org/tables-of-contents.html; https://anaphoria.com/oconnell.pdf.
Rockett, Andrew M., and Peter Szusz. 1992. Continued Fractions. Singapore: World Scientific Publishing Company.
Roth, Klaus Friedrich. 1955. “Rational Approximations to Algebraic Numbers.” Mathematika 2 (1): 1–20. https://doi.org/10.1112/mtk.v2.1.
Rudnev, Misha, and Sophie Stevens. 2022. “An Update on the Sum-Product Problem.” Mathematical Proceedings of the Cambridge Philosophical Society 173 (2): 411–430. https://doi.org/10.1017/S0305004121000633.
Sethares, William A., and Kevin Hobby. 2018. “Designing Inharmonic Strings.” Journal of Mathematics and Music 12 (2): 107–122. https://doi.org/10.1080/17459737.2018.1491649.
Smethurst, Reilly. 2016. “Two Non-Octave Tunings by Heinz Bohlen: A Practical Proposal.” In Proceedings of Bridges, Jyväskylä, Finland. 519–522.
Strichartz, Robert S. 1999. “Analysis on Fractals.” Notices AMS 46 (10): 1199–1208. https://www.ams.org/notices/