Publication Cover
Journal of Mathematics and Music
Mathematical and Computational Approaches to Music Theory, Analysis, Composition and Performance
Volume 18, 2024 - Issue 2
1,163
Views
0
CrossRef citations to date
0
Altmetric
Articles

Mathematical foundations of complex tonality

& ORCID Icon
Pages 173-202 | Received 13 Aug 2022, Accepted 19 Jun 2023, Published online: 17 Jul 2023

Abstract

Equal temperament, in which semitones are tuned in the irrational ratio of 21/12:1, is best seen as a serviceable compromise, sacrificing purity for flexibility. Just intonation, in which intervals are given by products of powers of 2, 3, and 5, is more natural, but of limited flexibility. We propose a new scheme in which ratios of Gaussian integers form the basis of an abstract tonal system. The tritone, so problematic in just temperament, given ambiguously by the ratios 4532, 6445, 3625, 2518, none satisfactory, is in our scheme represented by the complex ratio 1+i:1. The major and minor whole tones, given by intervals of 98 and 109, can each be factorized into products of complex semitones, giving us a major complex semitone 34(1+i) and a minor complex semitone 13(3+i). The perfect third, given by the interval 54, factorizes into the product of a complex whole tone 12(1+2i) and its complex conjugate. Augmented with these supplementary tones, the resulting scheme of complex intervals based on products of low powers of Gaussian primes leads to the construction of a complete system of major and minor scales in all keys.

1. Introduction

For centuries, musicians and mathematicians have struggled with the challenges of temperament and tonality. The problem is that there is a paradox at the heart of Western music. The paradox is that no tuning system exists for keyboard instruments such that all the harmonious intervals are perfectly tuned. Here we propose a radical solution based on the notion of complex tonality. We begin with an overview of the just system of intonation, which originated in antiquity and was much studied in our era during the 16th, 17th and 18th centuries, reaching a high point in the work of CitationEuler (1739Citation1774). We can define the just system as the totality of the various ways of representing tones by the vibrational frequencies of strings of various lengths, each such frequency being given by products of powers of the primes 2, 3, and 5 times some fixed frequency, which we call the tonal centre. Over time, many different ways of choosing the twelve notes of the chromatic scale in the just system have been proposed. Our goal is to construct a complex analogue of the just system, with broad applications, based on the ratios of certain Gaussian integers, in such a way that some of the insufficiencies of the just system can be resolved.

Among the various diatonic scales that have been considered in the just system, the so-called intense diatonic scale of Claudius Ptolemeus is especially notable (CitationSolomon 2000). Ptolemy's scale, expressed in units of time such that the frequency of middle C is unity, takes the form (1) C=1,D=98,E=54,F=43,G=32,A=53,B=158,C=2.(1) One can then ask how the Ptolemaic scale can be embedded in a chromatic scale. There are many possibilities, but one good example is the following, due to Johannes Kepler, which is known as Kepler's Monochord No. 2 (CitationBarbour 1953), here transposed down a fifth: (2) C=1,D=1615,D=98,E=65,E=54,F=43,G=4532,G=32,A=85,A=53,B=169,B=158,C=2.(2) In Section 2 we look at the just system in some detail and we sketch the arguments leading to Ptolemy's diatonic scale and to Kepler's chromatic scale. Among the defects of the just system, clearly visible in Kepler's scale, are the multiplicities of primes required for the tritone interval from C to F. Also troubling, when one looks further, are the multiple tunings for the notes D and B, and the unpleasant “wolf” tunings of certain intervals.

As a first step forward, we propose an alternative approach to building up scales in the just system by use of an analogy with chemistry. In Section 3, Atoms, molecules and particles, we generate the just scale by the aggregation of a small number of whole tone and semitone intervals, which we call atoms. We show that one recovers the seven modal scales (in their modern form) by permutations of the aggregation sequence of the given whole tones and semitones. This leads to a consistent assignment of all the tones of the chromatic scale with the exception of F. The equal temperament system, which evolved as a practical response to the difficulties of the just system, can also be cast into an “atomic” framework by use of the chemical analogy, but one loses the mathematically pleasant representation of harmonious intervals as products of powers of primes. One is thus motivated to delve more deeply back into the just system, with a view to generalizing it in some way.

In Section 4, Chromatic tone systems, we turn to the idea of constructing the just chromatic scale by the aggregation of three types of semitones. In Theorem 4.1, we use the aggregative approach to give a complete description of the various possible three-semitone chromatic extensions of Ptolemy's scale. More precisely, we ask for all systems of three rational semitones with the property that the chromatic scales assembled by aggregating such tones hit all seven values of the Ptolemaic scale. We prove the existence of exactly three distinct families of solutions, each in thirty-two variations, giving ninety-six scales in all. There is an overall integrity to the three families thus arising, since the semitones in the various systems exchange syntonic commas in forming whole tones, in analogy with the behaviour of electrons in atomic physics. We present some applications of the resulting classification scheme to several of the known historical just schemes catalogued by CitationBarbour (1953). Nonetheless, one is forced to admit that the three triples of rational semitones underlying this scheme contain high multiplicities of primes, hence contributing to the well-known perceived difficulties in the sonorities of the just system.

Following our description of the various just extensions of Ptolemy's scale using three rational semitones, and taking note of its shortcomings, but at the same time taking advantage of its methods, we introduce the idea of complex tonality, in which tones take on complex values given by ratios of Gaussian integers.

The complex interval systems thereby arising can be regarded as the basis for a new abstract form of music, but can also be used as a new way of analyzing the harmonic and melodic structures of the established corpus of Western music. Our construction proposes that the tritone interval from C to F can be given the value 1+i, a single power of a single complex prime, which can thus be seen as the most consonant interval in the system.

In Section 5, Complex diatonic scales, we pursue an analogy with our exposition of the just system in terms of atoms and molecules, and exhibit a set of complex whole tones and semitones, seven in total, including the X, Y, and Z tones of the real just system, all involving small multiplicities of complex primes, sufficient to enable us to write down aggregation sequences for the complex analogues of the twelve major diatonic scales.

In Section 6, Well-tempered complexity, we turn to the mathematical foundations required for further exploration of the complex system. We summarize various properties of the Gaussian integers and in Definition 6.1 we introduce the notion of a complex n-limit tone, one that is a product of powers of Gaussian integers that are prime factors of rational integers of magnitude no more than n. This is a new idea that generalizes the existing theory of real n-limit tones very naturally. Specializing to the case of complex 5-limit tones, we work out the general form of a complex 5-limit phase factor in Proposition 6.1 and we point out that the set of complex 5-limit intervals modulo units and complex 5-limit phase factors forms a Generalized Interval System in the sense of CitationLewin (1987).

In Section 7, Complex chromatic scales, we show that a complex chromatic scale incorporating all the notes of the twelve given complex diatonic scales can be constructed by aggregation of three complex semitones. This construction is carried out in such a way that the chromatic scale extends the harmonious values of Ptolemy's diatonic scale, leading to our main result, Theorem 7.1, which gives a full description of the totality of complex systems that (a) embrace the harmonious Ptolemaic values, (b) admit three complex semitones as generators, and (c) observe a certain symmetry condition that ensures that the first six tones aggregate to the value 1+i for F. We establish the existence and uniqueness of three such complex systems, the first of which is of particular interest on account of its full use of the factorization of the number five and the abundance of intervals that it exhibits involving low multiplicities of primes.

In Section 8, Complex Pythagorean systems, we consider the possibility of an analogous construction for two-semitone complex 3-limit scales by fixing the values of the Pythagorean pentatonic scale and asking that the system take the value 1+i at F. The existence and uniqueness of such a scheme is shown in Theorem 8.1. In Section 9, Back to reality, we return to the study of real tuning systems and ask for the construction of “shadow” systems based on the magnitudes of the intervals arising in the complex schemes. This program can be carried out both for complex 3-semitone systems and for complex 2-semitone systems. The results are summarized in our Theorems 9.1 and 9.2.

In Section 10, Applications, conjectures, and conclusions, we wrap up and speculate on some applications to the theory of harmony.

In Tables , , and  we lay out the values of the tones of the chromatic scales for the systems that we have considered, alongside the associated aggregation sequences, showing the variations that arise in some cases.

In the Appendix we present a list of intervals occurring frequently in our analysis of complex tone systems. Our brief list can be compared to the more complete catalogues of real tones provided by CitationEllis (1864), CitationHelmholtz (1885), CitationLonguet-Higgins (1987), CitationHaluška (2004), and others, with which there is of course some overlap.

Table 1. Structures of the three complex tone systems based on three semitones. Each system occurs in a number of variants, one example of which is presented for each system. For each of the three complex systems, the first column gives the aggregation sequence and the second column gives the values taken by the notes of the scale.

Table 2. Four variants of the Complex System II scale based on three semitones. For each variant, the first column gives the aggregation sequence and the second column gives the values of the notes.

Table 3. Two variants of the Complex System III chromatic scale based on three semitones. For each scale, the first column gives the aggregation sequence and the second column gives the values of the notes of the scale.

Table 4. Chromatic scales generated by two semitones for (a) the complex Pythagorean scale, (b) the associated real shadow scales, and (c) a version of the traditional real Pythagorean scale. For each scale the first column gives the aggregation sequence and the second column gives the values of the notes of the scale. The variants that arise in each case are indicated.

2. Just intonation

The question of which musical intervals are harmonious and which are dissonant is at least, in part, a matter of physiology, convention and culture – though it is undeniable that mathematical elements enter (albeit unconsciously) into musical aesthetics in an essential way, even at the level of tonality (CitationEuler 1739; CitationHelmholtz 1885; CitationLonguet-Higgins 1987; CitationPenrose 2000). The tradition in Western music is that the unison, the octave, the fifth, the fourth, the major third, the minor third, the major sixth and the minor sixth are harmonious. But the whole tone, the semitone, the major seventh, the minor seventh and the tritone are all to varying degrees dissonant.

It has been known since ancient times that there is a relation between harmonious intervals and the ratios of the frequencies at which strings vibrate. Suppose we fix a length of string so that it will vibrate at about 261.62 Hertz, so-called middle C (or C4) on the piano. If we halve the length of string then the frequency of vibration is doubled, and we get C5, the note which is one octave above middle C. If we take one third of the length of string then the frequency is trebled, and we get G5, the G that lies a fifth above C5.

One fourth of the length of the string vibrates at four times the original frequency, and one fifth vibrates at five times the original. A string vibrating at a frequency four times C4 sounds at C6, two octaves above C4, and a string vibrating at five times C4 sounds at E6. It follows that the frequency of G4 will be half that of G5. If we use units of time such that the frequency of C4 is unity, then G5=3 and we deduce that G4=32. Likewise since E6=5 we deduce that the E just above middle C sounds at E4=54. We conclude that the interval of a fifth is given by the ratio 32, the major third is the ratio 54, and the minor third is the ratio of 32 to 54, namely 65. Hence, starting at C = 1 we get E=65, E=54, G=32 and C=2. Similar arguments show that F=43, A=85 and A=53.

Thus, we see that the harmonious intervals are products of powers of the prime numbers 2, 3 and 5. The minor third is 65, the major third is 54, the fourth is 43, the fifth is 32, the minor sixth is 85, the major sixth is 53, and the octave is 2. The logical deduction of such ratios is the basis of the just system of intonation, which CitationBarbour (1953) defines as “a system of tuning based on the octave (2:1), the pure fifth (3:2), and the pure major third (5:4).”

The situation with dissonant intervals is a little trickier. The tones that we have derived so far come about as a consequence of regarding C as the tonal centre. But suppose we think of G as a new tonal centre. This is like taking a new string two-thirds the length of the original and starting the analysis over. In this way we get a value for D, which is a fifth above G. In particular, since G4=32, we multiply 32 times 32 to obtain D5=94 and thus D4=98. Since D4 is a whole tone above C4, we are then tempted to assign the value 98 to the interval of a whole tone. Likewise, since B is a major third above G we multiply 32 times 54 to obtain B=158. And since B is a fourth above F, and F is a fourth above C, we can set B to be equal to 43 times 43 and hence B=169. Finally, since F=43 is a semitone above E=54, and C=2 is a semitone above B=158, we conclude that the interval of a semitone can be given the ratio 1615, and hence C=1615. Then we can take F to be given by a combination of a third and a whole tone above C. This gives the product of 54 and 98, namely F=4532.

So, to sum up, we have assigned rational values to all the notes of the chromatic scale given by products of powers of the prime numbers 2, 3, and 5. The resulting system of tuning is a good example of what we call “just intonation” and it has many attractive features.

In particular, the diatonic scale takes the form (Equation1). This scale has been discussed by Gioseffo Zarlino (1517–1590) and others from the time of the Renaissance onward, though its roots lay in the work of Didymus, Ptolemy, Porphyry and other scholars of the early Roman Empire (CitationBarker 1984Citation1994), and it is now commonly referred to as Ptolemy's intense diatonic. For the associated chromatic scale in its entirety, according to the logical development presented thus far, we obtain (Equation2). CitationBarbour (1953) refers to this version of the chromatic scale as Alexander Malcolm's Monochord, and points out that it is equivalent to Johannes Kepler's Monochord No. 2 when the latter is transposed down by a perfect fifth. We can use the term “just intonation” to refer broadly to all of the various tuning systems constructed out of intervals involving products of powers of the primes 2, 3 and 5. The just diatonic scale (Equation1) is fairly rigid as a mathematical object in the sense that there is relatively limited scope in modifying its entries. The just chromatic scale (Equation2) is more flexible and there are many other choices that can be made for its entries, as we shall see in Section 4.

A good account of the virtues of the just system can be found in the works of CitationLonguet-Higgins (1987). It would appear that in practice the just system, whether it be in the narrow sense of the Ptolemaic and Keplerian scales mentioned above, or more broadly in the sense of the totality of all rational 5-limit dodecatonic scale systems, has served primarily as an idée fixe, in relation to which a vast range of ingenious temperaments have been constructed over the last few centuries, rather than as a viable solution to the problem of keyboard tuning in itself. An exception to this principle may be that of Leonhard Euler, who (if he is to be judged on the basis of what he has said in his writings) was a staunch believer in just intonation, even if he seems to wobble on the question of whether the prime factors should be restricted to 2, 3, and 5, or extended to include 7 as well and perhaps higher primes. Be that as it may, there are many subtle albeit well-known issues that arise in connection with the 5-limit just system, and it is in the numerous attempts that have been made at resolving such issues that new ideas have been originated both in musical analysis and in practice.

To get a sense of some of these issues, let us return to the idea of shifting the tonal centre. Multiplication by 32 leads us successively from C4 to G4 to D5 to A5 and then on to E6. We obtain a value of 278 for A5 and hence 8116 for E6. But that implies E4=8164, whereas we had already deduced E4=8064, that is to say, 54. The multiplicative discrepancy κ=8180 between the two values of E4 is called the syntonic comma (or comma of Didymus). Likewise, we had set A=53(=8048), whereas now we get A=2716(=8148), again with a discrepancy of 8180. The just system is evidently unstable under such shifts in the tonal centre.

Going forward, let us write P/Q for the ratio of the tones P and Q. This ratio represents the interval from the tone Q to the tone P (often written Q – P). For example, the interval from E=54 to G=32 is the ratio G/E=32÷54=65, a minor third.

If we recall that our value for D was obtained by applying the cycle of fifths to G, we have to ask whether D=98 is consistent with A=53. In fact, one finds that the interval A/D is a bit flat for a fifth if D=98. We obtain (3) A/D=53÷98=4027=κ132.(3) Since κ>1, we see that the interval A/D obtained in this way is slightly smaller than 32, i.e. somewhat flat. To sensitive ears it sounds out of tune, particularly if played on the harpsichord, and hence has come to be known as an example of a “wolf” interval. But if we set D=109, the resulting interval (4) A/D=53÷109=32(4) is a perfect fifth. Indeed, one can check that 98÷109=κ. Thus, it seems inevitable that we must augment the just system with a second version of the note D taking the value 109.

Similarly, the value B=169 that we tentatively assigned earlier is consistent with the idea that B should be a fourth above F. On the other hand, B can also be regarded as a minor third above G, or a fifth above E and these both give B=95. Thus, we need to assign two different values to B, namely 169 and 95, and we can check that 95÷169=κ. Indeed, the situation of B precisely parallels that of D.

Likewise, one can show that four different values for F are required, corresponding to the various ways that the interval F/C can be formed out of simpler intervals. These are (5) F=4532,F=6445,F=2518,F=3625.(5) Note that 4532÷2518=κ and that 4532 and 6445 are complements in the sense that their product is the octave. The multiplicity of the intervals representing F is troubling, and all are rather complicated when they are factorized into products of powers of primes. We call this “the problem of F”. Is there a way around the problem of F? This question motivates us both to revisit the foundations of the just system and to introduce the idea of complex tonality.

3. Atoms, molecules and particles

To enable a way forward, leading to the complex systems that we introduce later in the article, it will be helpful first if we look at the Ptolemaic system in terms of so-called “atoms”. The idea is that we introduce a collection of simple tones (called atoms) such that each tone in the diatonic scale can be regarded as a multiplicative aggregation of atoms into molecules. The notes of the just system can be built up in this way step by step. We introduce three atoms, given by (6) X=98,Y=109,Z=1615.(6) Then one sees that (7) XY=54,XYZ=43,X2YZ=32,X2Y2Z=53,X3Y2Z=158,X3Y2Z2=2,(7) and hence that for the C-major scale in the just system we can write (8) C=1,D=X,E=XY,F=XYZ,G=X2YZ,A=X2Y2Z,B=X3Y2Z,C=X3Y2Z2.(8) Each note in the scale is represented by a “molecule” with a particular configuration of atoms, and as the scale progresses more atoms are aggregated. Our atoms consist of the two “whole tones”, viz., 98 and 109, alongside the semitone 1615. We can use a shorthand to indicate the order in which the atoms are added in. Thus, for the diatonic scale starting at C = 1 we have the aggregation sequence 1 : X, Y, Z, X, Y, X, Z. The note before the colon denotes the initial tone, which is followed by a sequence of atoms. The initial unity here ensures that the scale begins at our chosen value for C, but in principle one can choose a different initial value. The algebraic identity X3Y2Z2=2 that closes the octave implies some flexibility in the order in which the atoms can be aggregated. The seven ecclesiastical modes arise in this way, for example, corresponding to permutations of the original aggregation sequence:

  1. Ionian mode: C,D,E,F,G,A,B,C 1:X,Y,Z,X,Y,X,Z

  2. Dorian mode: C,D,E,F,G,A,B,C 1:X,Z,Y,X,Y,Z,X

  3. Phrygian mode: C,D,E,F,G,A,B,C 1:Z,X,Y,X,Z,Y,X

  4. Lydian mode: C,D,E,F,G,A,B,C 1:X,Y,X,Z,Y,X,Z

  5. Mixolydian mode: C,D,E,F,G,A,B,C 1:X,Y,Z,X,Y,Z,X

  6. Aeolian mode: C,D,E,F,G,A,B,C 1:X,Z,Y,X,Z,Y,X

  7. Locrian mode: C,D,E,F,G,A,B,C 1:Z,X,Y,Z,X,Y,X.

One can think of the note representing the underlying key (in this case, C) as being given in the form of a structureless monad, represented by the tone unity. But set in a larger context, this note itself can be given structure. It may be helpful to introduce a chemical analogy. The note C corresponds to the completed shell corresponding to an inert gas, such as argon. The creation of a scale, in this analogy, is represented by the addition of electrons, one by one, until the next shell is complete, corresponding to the note C.

It is interesting to note that Isaac Newton seems to have been in favour of the just system but preferred a version of the scheme based on the Dorian mode on account of the symmetric arrangement of the atoms arising in that case (CitationBibby 2003). It may be that Newton was attracted to the fact that the aggregation sequence forms a palindrome in this mode. In any case it would be interesting to pursue the theory of modes arising in the present context from the point of view of the algebraic combinatorics of words, in the spirit of CitationDominguez, Clampitt, and Noll (2009), CitationŽabka (2011), CitationClampitt (2017), CitationClampitt and Noll (2018), CitationClampitt (2019), and others.

Inspection of the modal scales leads to another method for arriving at the accidentals of the chromatic scale. The resulting values for D, E, A, and B are consistent with the values we had tentatively recorded at (Equation2). We obtain B=XZYXYZ=169 for the Dorian mode, B=ZXYXZY=169 for the Phrygian, B=XYZXYZ=169 for the Mixolydian, B=XZYXZY=169 for the Aeolian, and B=ZXYZXY=169 for the Locrian.

Likewise, we obtain E=XZ=65 for the Dorian and Aeolian modes, E=ZX=65 for the Phrygian and Locrian modes, A=ZXYXZ=85 for the Phrygian, A=XZYXZ=85 for the Aeolian, A=ZXYZX=85 for the Locrian, and D=Z=1615 for the Locrian. But we get F=XYX=4532 in the Lydian mode and G=ZXYZ=6445 in the Locrian mode. So the problem of F rears its head, since the modes give two different values for it.

The modal scales as we have presented them are all rooted in the same tonality, that of C. If we change the tonal centre, instabilities emerge in the form of new tones. Suppose, for example, we wish to make a selection of notes from the just chromatic scale to represent the notes of the G-major scale. As with the C-major diatonic scale, we can ask that the G-major scale should be constructed via the accumulation of atoms. This can be achieved as follows. We begin with the molecule for G4, which we know to be of the form X2YZ. Then we add on successively the atoms Y, X, Z, X, Y, X, Z. The resulting tones correspond to existing notes on the just keyboard.

At the next step, problems begin to emerge. If we try to construct a D-major scale, then we can start at D=98 and add successive atoms by applying the sequence Y,X,Z,Y,X,X,Z. We obtain F=4532, which we have agreed is acceptable, but for C5 we obtain 13564, which is not one of the pre-existing tones.

When we go on to look at the remaining sharp keys we obtain three more tones, given by G=2516, D=7532, and A=225128, contradicting the values previously assigned. The flat keys fare better, but in the key of D major we obtain G=6445, which is different from the F=4532 we agreed on earlier. And in the key of G major, we obtain C=4825, which is different from the previously proposed tuning at B=158. Apparently, if we try to represent the major scales by successive aggregation of the atoms proposed to construct the just diatonic scale, then we obtain new values for notes to which we have already tentatively assigned values.

The generally accepted way around these difficulties in practice is, of course, the system of equal temperament, in which we introduce a irrational semitone (9) ψ=212,(9) which functions as the single “particle” out of which all the tones of the chromatic scale can be assembled by aggregation. Thus for the equally tempered chromatic scale we obtain (10) C=1,C=ψ,D=ψ2,D=ψ3,E=ψ4,F=ψ5,F=ψ6,G=ψ7,G=ψ8,A=ψ9,A=ψ10,B=ψ11,C=ψ12.(10) and we have the aggregation sequence (11) 1:ψ,ψ2,ψ3,ψ4,ψ5,ψ6,ψ7,ψ8,ψ9,ψ10,ψ11,ψ12.(11) Note that F=2, and that C=2. Under equal temperament none of the harmonious intervals are perfect. Nevertheless, overall, the system produces satisfactory results for music written to be played in that system. This includes, for example, a good deal of the repertoire of music written for the piano. Equal temperament thus offers a tentative resolution to the paradox of tuning by spreading the dissonance equally across the keyboard. CitationScholes (1955) goes further in his Oxford Companion to Music at page 1018, and says:

The whole development of musical composition in the nineteenth century was based on the acceptance of the twelve equal keys into which the composer could pass without let or hindrance. Wagner's chromatic harmonies, for example, necessarily imply an orchestra with its instruments tuned “equally”.

Opponents of equal temperament, of which there are many, claim that it flattens out the music, leaving it homogeneous and characterless. Worse still, although the fifths are reasonably accurate under equal temperament, the thirds are noticeably wide. The disadvantages have been succinctly summarized by CitationKlopp (1974):

Obviously, all keys are available and the possibilities of enharmonization are unlimited. However, these freedoms are dearly paid for: there is not one single pure interval; the thirds are especially poor, giving the triad an insecure and restless sound; there is no differentiation between the keys; melodic tensions are reduced; the temperament is difficult to set.

And according to CitationTovey (1929):

No true harmonic ideas are based on equal temperament.

Euler (Citation1739, Citation1774) was dead against equal temperament for essentially the same reasons and expressed the opinion that the majority of musicians had rejected it. But in this he seems to have been mistaken, and it is clear that he did not anticipate the great wave of nineteenth and twentieth century composers who would embrace the advantages that transpositions into any key would offer. Nonetheless, one can take the view that from the Age of Enlightenment onward the ever-increasing demand for modulations into keys far removed from the tonic pushed the principles of Western tonal music as far as they could be taken – and that despite the fact that music predicated on equal temperament has produced numerous striking, important, and completely convincing compositions, there remains a lingering sense of incompleteness in the centuries-old mathematical project of developing a system of music based entirely on products of powers of primes, in the spirit of Euler's explorations. This suggests that we should return to the idea of a tonal system based on the rationals and consider in more precise terms the conditions that we would like it to satisfy, with an eye on the possibility of novel generalizations.

4. Chromatic tone systems

Let us enquire, therefore, whether it is possible to create a version of the just chromatic scale by the successive aggregation of “particles” (semitones of various magnitudes) in such a way that the result includes all of the notes of the just diatonic scale as a subscale. Although not, perhaps, obvious, the answer is affirmative, as the following example (CitationOrbach 1999), based on Kepler's scale, clearly illustrates. As particles we take (12) p=1615,q=135128,r=2524.(12) An aggregation sequence that leads to the just chromatic scale (Equation2) is then given as follows: (13) 1:p,q,p,r,p,q,p,p,r,p,q,p.(13) That this sequence reproduces the atomic sequence 1 : X, Y, Z, X, Y, X, Z for the just diatonic scale with X=98, Y=109 and Z=1615 can easily be verified if one observes that X = pq, Y = pr and Z = p. But is this the only chromatic scale that can be constructed from three rational constituents by successive aggregation in such a way that it includes the just diatonic scale as a subscale?

The answer, perhaps surprisingly, turns out to be negative: in fact, there are ninety-six different versions of the chromatic scale that interleave the just diatonic scale. The situation is summarized in the following.

Theorem 4.1

Let S denote a set of three rational semitones with the property that there exists a chromatic scale of twelve notes constructed by aggregation from elements of S interleaving the just diatonic scale C = 1, D=98, E=54, F=43, G=32, A=53, B=158, C=2. Three and only three such sets of semitones exist, given by

  1. Just System I: S={1615,135128,2524},

  2. Just System II: S={135128,1615,256243},

  3. Just System III: S={2524,2725,1615}.

Each such system gives rise to thirty-two distinct chromatic scales.

Proof.

To construct a twelve-tone chromatic scale that interleaves the just diatonic scale, we need to assemble the atoms X=98, Y=109, Z=1615 from a set of three particles {p,q,r} in such a way that X and Y each contain two particles, and Z contains a single particle. Clearly, X and Y share a particle in common, say, p. Then without loss of generality we can set X = pq and Y = pr, and Z will take one of the values p, q, r. Thus, pq=98 and pr=109, and either p=1615 (System 1), or q=1615 (System 2), or r=1615 (System 3). Solving for p, q, r, we obtain the three systems stated in the theorem. In any one of the systems, the order of aggregation for the constituents of X and the constituents of Y can be interchanged for each X and each Y. Since there are three Xs and two Ys, that gives us five possible interchanges, and hence thirty-two different variations for each system.

Just System I is the historic scheme of semitones set out in (Equation12). Within any of the three systems, we can present the constituents of X either in the order p, q or the order q, p. Likewise the constituents of Y can be presented in the order p, r or the order r, p. Thus for each of the five “flat” notes of the chromatic scale, we have two (and only two) choices.

For example, in System I we obtain either 1615 or 135128 for D, either 65(=981615) or 7564(=982524) for E, either 4532(=43135128) or 6445(=431615) for G, either 85(=321615) or 2516(=322524) for A, and either 169(=531615) or 225128(=53135128) for B. Thus we have thirty-two different versions of System I, depending on which choices we make for the five accidentals.

If we take the first option in each accidental, for which the tones are the simplest, then we recover the aggregation sequence (Equation13), which under System I gives rise to (Equation2), which appears in CitationBarbour (1953) as the historic tuning system known as Malcolm's Monochord. Barbour gives the tones in cents, so one has to translate these approximate cent values to the corresponding exact rational numbers to obtain our chromatic scale (Equation2). By (Equation12) and (Equation13), the aggregation sequence for Malcolm's Monochord is given more explicitly by (14) 1:1615,135128,1615,2524,1615,135128,1615,1615,2524,1615,135128,1615,(14) starting at C. Now let's consider Kepler's Monochord No. 2, which is given by the scale (15) C=1,D=135128,D=98,E=65,E=54,F=43,G=4532,G=32,A=85,A=2716,B=95,B=158,C=2.(15) This scale as such does not belong to Just System I, since A=2716 rather than 53. But if we transpose Kepler's tuning down by a fifth, along the whole keyboard, thus assigning F3=23, F3=23135128=4564, G3=2398=34, and so forth, then we recover the chromatic scale (Equation2), as we mentioned earlier. Hence Kepler's system is isomorphic to that of Malcolm, and admits the same aggregation sequence of semitones once a suitable starting point has been established following a transposition.

It may be helpful if we set out the structure of the three systems in a little more detail, showing the breakdown of composite atoms X and Y into their particle constituents:

  1. Just System I:  X=1615135128,  Y=16152524,

  2. Just System II:  X=1351281615,  Y=135128256243,

  3. Just System III:  X=25242725,  Y=25241615.

Note that the intervals arising here are well-established in the theory of tone systems, under a variety of names; see, e.g., CitationHelmholtz (1885), CitationLonguet-Higgins (1987), CitationHaluška (2004). We have met the major whole tone 98, the minor whole tone 109, and the (minor) diatonic semitone 1615. Now we meet the minor chromatic semitone 2524, the major chromatic semitone 135128, the major diatonic semitone 2725, and the minor Pythagorean semitone 256243.

We observe that various relations involving multiplication by the syntonic comma κ=8180 hold between the semitone intervals determined by Theorem 4.1. In particular, we have (16) 256243κ1615κ2725,2524κ135128.(16) What this means is that a set of discrete transformations can be constructed taking the three systems from one to another. Let us write (17) θ1=256243,θ0=1615,θ1=2725,ϕ0=2524,ϕ1=135128.(17) Then one can regard the three θ semitones as being different versions of a common underlying entity, differing by factors of the syntonic comma, and similarly for the two ϕ semitones. One can think of θ0 as being a kind of “neutral” semitone, whereas θ1 has a “charge” of 1 and θ1 has a charge of 1. Similarly, ϕ0 is neutral and ϕ1 has charge one.

Then the relations between the different systems amount to transferring units of charge from one semitone to another in such a away that the total charges of X and Y remain unchanged. Thus X is a charged whole tone, Y is a neutral whole tone, Z is a neutral semitone, and the three just tuning systems can be set out in the following scheme:

  1. Just System I:  X=θ0ϕ1,  Y=θ0ϕ0,  Z=θ0,

  2. Just System II:  X=θ0ϕ1,  Y=θ1ϕ1,  Z=θ0,

  3. Just System III:  X=θ1ϕ0,  Y=θ0ϕ0,  Z=θ0.

Thus in moving from System I to System II, we transfer a unit of charge from θ0 to ϕ0 in the Y atom; moving from System I to System III we transfer a unit of charge from ϕ1 to θ0 in the X atom; whereas in moving from System II to System III we transfer a unit of charge from ϕ1 to θ0 in the X atom and we transfer a unit from ϕ1 to θ1 in the Y atom. We see that the three systems are closely related and that there is order to the arrangement.

Remark 4.1

The existence of a five-semitone chromatic scale agreeing with Ptolemy's intense diatonic scale on the notes of that scale can be demonstrated by use of the various semitones appearing in (Equation17). This can be achieved by setting (18) X=θ1ϕ0,Y=θ1ϕ1,Z=θ0(18) and using the chromatic aggregation sequence (19) 1:θ1,ϕ0,θ1,ϕ1,θ0,θ1,ϕ0,θ1,ϕ1,θ1,ϕ0,θ0.(19)

An example of a historical scale that (modulo transposition) belongs to Just System II can be found in the work of CitationRamis de Parejo (1482). Ramis's Monochord is given by CitationBarbour (1953) both in cents and in the Eitz notation, i.e. in Pythagorean tunings that may be adjusted up or down by one or more syntonic commas. Expressed in rationals, Ramis's tuning starting at C takes the form (20) C=1,D=135128,D=109,E=3227,E=54,F=43,G=4532,G=32,A=12881,A=53,B=169,B=158,C=2.(20) This is clearly not within the scope of Theorem 4.1, since D=109. Nonetheless, the associated aggregation sequence, which is of the form (21) 1:135128,256243,1615,135128,1615,135128,1615,256243,135128,1615,135128,1615,(21) exhibits the particle structure of System II, which suggests, as we saw in the case of Kepler's scale, that a transposition might bring it into line with Theorem 4.1. Indeed, this turns out to be the case, for if we transpose the scale of Ramis de Pareja down by a fourth, multiplying each note by 34 all along the keyboard, then starting at C, we obtain (22) C=1,D=135128,D=98,E=3227,E=54,F=43,G=4532,G=32,A=405256,A=53,B=169,B=158,C=2,(22) which evidently satisfies the conditions of Theorem 4.1; and for the associated aggregation sequence starting at C = 1, which is consistent with Just System II, we have (23) 1:135128,1615,256243,135128,1615,135128,1615,135128,256243,1615,135128,1615.(23) Historical examples of scales belonging to Just System III that can be found (without need of transposition) in Barbour's list include Fogliano's Monochord No. 2, and Rousseau's Monochord (CitationFogliano 1529; CitationRousseau 1768). But there are other historical just tunings in Barbour's list that are of interest, some involving as many as four semitones, such as Euler's Monochord, that would also be worthy of study using the methods we have proposed.

Summing up, we have seen that in the just system the notes of various well-established scales can be assembled from a small number of semitone particles, and we have been able to present a more or less complete account of the tuning schemes that can be constructed on the premise that the structure of the Ptolemaic scale is preserved and that the number of semitones is three.

Nevertheless, we are left with the uncomfortable fact that the resulting systems of semitones involve products of rather high powers of prime numbers. Is it possible to improve on the situation in some way, by introducing a new scheme for music that allows one to bring the multiplicities of the primes down, so we are only left with low powers? This is the question we seek to answer in our investigation of complex intonation.

5. Complex diatonic scales

We shall give an interesting example of such a scheme. It goes outside of the usual conventions of music by allowing tones to be represented by complex numbers. We begin with the observation that all the intervals under consideration in the just system are products of powers of the numbers 2, 3 and 5. These numbers are primes in the conventional sense, but 2 and 5 can be factorized over the complex numbers. That is to say, we have (24) 2=(1+i)(1i)and5=(1+2i)(12i).(24) In particular, since the note F lies midway between C and C, this suggests we should set (25) F=1+i.(25) It may not be obvious a priori what kind of acoustic event will trigger the sensation of a complex tone, but since the numbers involved are simple and striking, we are encouraged to explore further, adding further complex tones.

First, we shall pursue the matter informally, as we did in our initial analysis of the just system, following which we presented Theorem 4.1, which gave a precise account of the system in its entirety. Thus here we shall construct an example of a complex scheme by use of clever albeit essentially inductive reasoning. Then once we have by this line of argument shown the existence of a viable complex scheme we proceed to formalize the theory via the presentation of Theorem 7.1.

Let us write ξ for 1+i. Since an octave is composed of two tritones, this suggests that we should identify both 1+i and 1i as representations of the tritone. As we remark later, our suggestion that 1+i and 1i should be regarded as representing the same tone can be understood as a reflection of the fact that they differ by a “unit”, in this case an overall factor of i. As basic intervals we retain the diatonic whole tones X=98 and Y=109, and the semitone Z=1615, and supplement these with a complex semitone α defined by (26) α=34ξ(26) and a complex whole tone δ defined by (27) δ=45ξ.(27) The argument is that since 43α=ξ we can say that ξ is a fourth above α. Similarly, we have 54δ=ξ, so we can say that ξ is a third above δ. Note also that ξξ¯=2, αα¯=X, δ=αZ and αδ¯=XZ. These various relations justify our interpretation of α as a type of semitone and δ as a type of whole tone.

In our complex scheme we keep the just values for E=54, F=43, G=32, and A=53. Then in addition to F=ξ we can set C=34ξ, D=56ξ, and A=54ξ, since these notes are respectively a fourth below F, a minor third below F, and a third above F. We also need values for D, G and B. If we set D=98 as one does in just tuning, then although it is a fourth below G, it is a “wolf” fifth below A, since A/D=53÷98=4027. But if we set D=109, then D would lie a fifth below A, but would be a “wolf” fourth below G, since 32÷109=2720. Clearly, neither of the choices D=98 and D=109 are satisfactory.

The resolution of this dilemma brings in the other complex factorization that we have at our disposal, namely 5=(1+2i)(12i). Then, in addition to the complex semitone α=34ξ, which has the property that αα¯=98, we obtain another complex semitone β, defined by (28) β=13(3+i),(28) with the property that ββ¯=109. Then we can set G=12(3i), a semitone above G. We observe that both of the diatonic whole tones factorize, which justifies the interpretation of β as another species of semitone. The product of these complex semitones gives a complex whole tone which we define by setting αβ=γ, and one can check that (29) γ=12(1+2i).(29) Note that γγ¯=54, so γ lies midway between C and E in the sense that ξ lies midway in the range between the notes C and C. Thus, we set D=γ, banishing the wolves by moving D off into the complex.

We observe that the intervals G/D=35(12i) and A/D=23(12i) are products of low powers of Gaussian primes and hence both can be regarded as consonant in our scheme. These intervals can be contrasted with the wolves 4027 and 2720 that arise in the just scheme.

Another unsatisfactory aspect of the just scheme is the value of B=158. This gives us B/F=4532, the same harsh interval that we eliminated by setting F=ξ. That suggests that instead of B=158 we should set B=43ξ. Then we obtain the simple ratio B/F=ξ, which by virtue of the fact that it involves only a single prime can be regarded as consonant.

Summing up, with the various adjustments in place, we conclude that the C-major scale in our complex scheme takes the form (30) C=1,D=γ,E=54,F=43,G=32,A=53,B=43ξ,C=2,(30) with the aggregation sequence 1:γ,γ¯,Z,X,Y,δ,α¯. Now let us see what happens when we change keys, noting how the new choice for F fits into the scheme. Recall that in the just system a change of key from C to G worked out, but at the next level with the key of D one encounters an obstacle. The G-major scale in the complex scheme is (31) G=34,A=56,B=23ξ,C=1,D=γ,E=54,F=ξ,G=32,(31) starting at G3. For the corresponding sequence of atoms we have 1:Y,δ,α¯,γ,γ¯,δ,α¯. In particular, one notices that δ=45ξ lifts the E=54 up to F=ξ. Then one sees that α¯=34ξ¯ lifts the F up to G=32, by use of the relation ξξ¯=2.

In the key of D major, there is an added level of complexity on account of the two sharps and the fact that D itself is a complex tone: (32) D=γ,E=54,F=ξ,G=32,A=53,B=43ξ,C=32ξ,D=2γ.(32) For the corresponding sequence of atoms we obtain 1:γ¯,δ,α¯,Y,δ,X,β. The treatment of the F is like that of the G-major scale. For the C, we see that X=98 lifts B=43ξ to C=32ξ, and then β lifts C to D=2γ, by use of the identity αβ=γ. Essentially the same line of reasoning can be applied to the keys of A major, E major and B major.

The flat keys show a slightly different pattern. For example, in the case of F major, starting at F4, we have (33) F=43,G=32,A=53,B=54ξ,C=2,D=2γ,E=52,F=83,(33) with the aggregation sequence 1:X,Y,α,δ¯,γ,γ¯,Z. Thus the semitone α=34ξ lifts A=53 to B=54ξ. Then δ¯=45ξ¯ lifts B to C=2. Similar constructions apply to the keys of B, E, A, D and G.

Let us take stock of the complex whole tones and semitones at our disposal. We have the three semitones (34) α=34(1+i),β=13(3+i),Z=1615,(34) along with the four whole tones (35) X=98,Y=109,γ=12(1+2i),δ=45(1+i).(35) These intervals can be used to construct complex analogues of the major and minor scales in all twelve keys. The resulting scheme of intervals for the twelve major scales is as follows:

  1. C major  1ξ:γ,γ¯,Z,X,Y,δ,α¯F major  43ξ:X,Y,α,δ¯,γ,γ¯,Z

  2. G major  32ξ:Y,δ,α¯,γ,γ¯,δ,α¯B major  58ξ:δ¯,γ,β¯,δ¯,X,Y,α

  3. D major  γ32:γ¯,δ,α¯,Y,δ,X,βE major  56ξ:δ¯,X,β¯,γ,δ¯,γ,β¯

  4. A major  56ξ:δ,X,β,γ¯,δ,γ¯,βA major  34β¯:γ,δ¯,α,Y,δ¯,X,β¯

  5. E major  54ξ:δ,γ¯,β,δ,X,Y,α¯D major  34ξ:Y,δ¯,α,γ¯,γ,δ¯,α

  6. B major  23ξ:X,Y,α¯,δ,γ¯,γ,ZG major  ξ43:γ¯,γ,Z,X,Y,δ¯,α.

Here, the initial note is given for each major scale, followed by the aggregation sequence that generates that scale. The notes C, E, F, G and A take the Ptolemaic values and for F we have ξ=1+i. Each minor scale is the relative minor of an associated major scale. For example, for C minor we have the sequence 1:γ,β¯,δ¯,X,β¯,γ,δ¯ obtained by permuting the E-major sequence. In this way we obtain a system of major and minor scales in all keys.

6. Well-tempered complexity

Motivated by the constructions of the previous section, we are now in a position to present the complex scheme in its entirety. We begin by recalling a few results concerning so-called Gaussian integers. We write Z for the ring of ordinary real integers, which we call rational integers. We write Z[i] for the ring of Gaussian integers, by which we mean complex numbers of the form a+bi, where a,bZ and i=1. Gaussian integers can be added, subtracted and multiplied to yield other Gaussian integers. In what follows we often refer to Gaussian integers simply as “integers”. The complex conjugate of an integer c=a+ib is denoted c¯=abi, and we write |c|=a2+b2 for the magnitude of c.

By a unit we mean an integer that divides any integer. There are four units: ±1,±i, and for any integer n we refer to the four numbers ±n,±in as its associates. By a prime we mean any integer p, not 0, and not a unit, that is divisible only by its associates ±p,±ip and the units ±1,±i. Every integer n can be expressed in the form n=Πj=1rpj, where the pj are primes. The fundamental theorem of arithmetic for Gaussian integers asserts that this expression is unique, apart from the order of the primes, the presence of units, and ambiguities between associated primes (CitationHardy and Wright 1938).

By a real prime, we mean a Gaussian prime, in the sense just defined, that is real; the conventional primes are all rational primes. For example, 3 and 7 are real primes, whereas 2=(1+i)(1i) and 5=(1+2i)(12i) are not; but 2, 3, 5 and 7 are rational primes. A rational prime p>0 is a real prime only if it is of the form p = 4n + 3 for nN.

A Gaussian integer n=a+bi, a,bZ, is said to be even if a, b are both even or both odd in the usual sense, and otherwise n is odd. For real n this definition of even and odd coincides with the usual one. Then one can show that an integer is even iff it is divisible by 1+i. This generalizes the idea that a rational integer is even iff it is divisible by two.

As usual, we write Q for rational numbers, by which we mean numbers of the form a/b for a,bZ and b0.

The aggregate of numbers of the form r+is, with r,sQ, constitute the so-called Gaussian rationals Q[i].

Given the fundamental theorem of arithmetic for Gaussian integers, one can show that any Gaussian rational has a unique factorization of the form z=Πj=1r(pj)aj, where the primes are distinct and the aj are non-vanishing integers. The uniqueness is qualified up to permutation, the presence of units, and ambiguities between associated primes.

It will have been observed that the tones (or intervals) that we have considered so far, apart from the equal temperament semitones, are Gaussian rationals and thus inherit from Q[i] the multiplicative group structure that is characteristic of the concatenation of intervals. In practice, we are concerned with Gaussian rationals that can be built up from a small basis of Gaussian primes.

Definition 6.1

We say that a tone t is complex five-limit and write tL(5,C) if it takes the form (36) t=(1+i)a(1i)b3c(1+2i)d(12i)e(36) for some collection of integers a,b,c,d,eZ.

It should be plain that a real 5-limit tone tL(5,R), by which we mean a tone of the form t=2p3q5r, for some choice of p,q,rZ, is necessarily a complex 5-limit tone; for clearly, if t is to be real in (Equation36), we must have a = b and d = e.

More generally, we write tL(n,C) for nN if t can be expressed as a product of powers of prime factors of rational integers that are of magnitude less than or equal to n. Thus, for example, tL(3,C) consists of intervals of the form (37) t=(1+i)a(1i)b3c.(37) It should be evident that L(n,C) has for each nN the structure of a free Abelian group and that L(m,C) is a subgroup of L(n,C) for mn.

This leads us to comment on the significance of units in our tonal language. The discussion above suggests that if tL(5,C) represents a complex tone and ϵ is a unit, then ϵt represents the same tone as t. Thus, t represents the same musical value as its associates. In particular, ξ, ξ, iξ, and iξ all represent the tone F. But iξ=ξ¯, so ξ and ξ¯ both represent F.

One might wonder then if there is some redundancy in the expression for (Equation36), since 1+i and 1i are associates, as we have just learned, and hence represent the same prime factor up to a unit. There is indeed some redundancy, but we can use (38) (1+i)a(1i)b=(i)b(1+i)a+b(38) to put (Equation37) in the form (39) t=ϵ(1+i)a3b,(39) and to put (Equation36) in the form (40) t=ϵ(1+i)a3b(1+2i)c(12i)d,(40) where ϵ is a unit, since, depending on whether b = 0, 1, 2, 3 mod 4 in (Equation38) we find that (i)b=1,i,1,i. Then, relabelling the exponents, we obtain (Equation39) and (Equation40). Depending on the context, we can use one or the other of the equivalent expressions (Equation36) and (Equation40) as a canonical representation for a given tone in the complex 5-limit case.

The latter arises more naturally in prime number theory; but the former arises more directly in music theory. In fact, we can take matters further, and apply a similar line of reasoning to the complex prime factors of the number five. As we have pointed out, since 12(1+2i)×12(12i)=54, we can regard each of the complex numbers γ=12(1+2i) and γ¯=12(12i) as representing a type of whole tone interval. But their quotient is given by the complex phase factor (41) ω=12i1+2i=(3+4i5).(41) We see that ωω¯=1 and γ¯=ωγ. Thus, if we assume that γ and γ¯ represent the same tone, one can ask what the situation is if higher powers of ω are brought into play.

Proposition 6.1

A complex tone ζ will be said to be a phase factor if ζζ¯=1. A phase factor is complex 5-limit if and only if for some unit ϵ and some kZ it is of the form (42) ζ=ϵ(3+4i5)k.(42)

Proof.

If ζ is a complex 5-limit tone then it can be put into the form (Equation36). Then for it to be a phase factor we require b = −a, c = 0 and e = −d. This implies that (43) ζ=(1i1+i)j(12i1+2i)k=(i)j(1)k(3+4i5)k(43) for some j,kZ, which shows that any complex 5-limit phase factor is of the form (Equation42). Conversely, any tone of the form (Equation42) is indeed a complex 5-limit phase factor.

Remark 6.1

Thus, by virtue of (Equation36) and (Equation43) it follows that any complex 5-limit tone admits a unique factorization of the form (44) t=(1+i)a3b(1+2i)c,(44) modulo units and complex 5-limit phase factors.

Remark 6.2

The space L(5,C), modulo such factors, forms a Generalized Interval System (GIS) in the sense of CitationLewin (1987). We have a triple (S,I,int), where S=L(5,C) modulo units and complex 5-limit phases is regarded as a space of complex tones, I is another copy of L(5,C), modulo such factors, now regarded as a group of intervals, and int is the map that takes the complex tones s,tS to the complex interval t/sI, mod units and complex 5-limit phase factors.

7. Complex chromatic scales

Remarkably, we can construct a single complex chromatic scale that can be used for the formation of all of the major scales given in the previous section, along with their modal variations. This scale is as follows: (45) C=1,C=34(1+i)D=12(1+2i),D=56(1+i),E=54,F=43,F=1+i,G=32,G=12(3i),A=53,A=54(1+i),B=43(1+i),C=2.(45) Moreover, this complex chromatic scale can be constructed by aggregation in a very simple way. Only three semitones are required and the aggregation sequence starting at C = 1 is (46) 1:α,β,β¯,α¯,Z,α,α¯,β¯,β,α,Z,α¯,(46) where α=34(1+i), β=13(3+i), Z=1615. Note that the first six intervals in the sequence are complex conjugate to the second six. Indeed, one can check that (47) αββ¯α¯=1+i,α¯β¯βαZα¯=1i.(47) Thus, we have demonstrated the existence of a twelve-tone chromatic scale in the Gaussian rationals, constructed by aggregation from a set of three complex semitones α,β,ZQ[i] such that (48) C=1,E=54,F=43,F=1+i,G=32,A=53,C=2,(48) and one can check that the chromatic scale (Equation45) is consistent with the scheme of major and minor scales introduced in Section 5. We are then led to ask if any other complex tuning systems exist based on three semitones. The answer is yes, and surprisingly we find that exactly three such systems exist.

Theorem 7.1

Let S denote a set of three complex 5-limit semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from the elements of S and their complex conjugates such that C = 1, E=54, F=43, F=1+i, G=32, A=53, C=2, and (b) the aggregation is conjugate symmetric, in the sense that the first six elements of the chromatic series are conjugates of the second six elements, in the same order. Then three and only three such sets exist, given by

  1. Complex System I: S={34(1+i),13(3+i)ζ,1615}, ζ=ϵ(3+4i5)k, ϵ{1,1,i,i}, kZ,

  2. Complex System II: S={34(1+i),2027(1i),1615},

  3. Complex System III: S={34(1+i),2524,1615}.

Proof.

Let the sequence of aggregation take the form (49) 1:X1,X2,X3,X4,X5,X6,X7,X8,X9,X10,X11,X12,(49) where each of the Xs belong to S. Conjugate symmetry implies the sequence takes the form (50) 1:X1,X2,X3,X4,X5,X6,X¯1,X¯2,X¯3,X¯4,X¯5,X¯6.(50) Then we impose the following constraints to fix the designated notes: (51) X1X2X3X4=54,X1X2X3X4X5=43,X1X2X3X4X5X6=1+i,X1X2X3X4X5X6X¯1=32,X1X2X3X4X5X6X¯1X¯2X¯3=53.(51) These relations give X1=α, X2X3=109, X4=α¯, X5=Z, and X6=α, where α=34(1+i) and Z=1615. Thus, S must contain α and Z alongside another particle. This remaining particle is determined by the constraint X2X3=109, and there are three possibilities:

Complex System I. X2 and X3 are both new particles in this case, and hence complex conjugates, since there can be no more than three particles altogether. Thus we set X2=β and X3=β¯ for some new semitone β. The relation ββ¯=109 then implies (52) β=13(3+i)ζ,(52) for some phase factor ζ. Imposing the condition that β should be complex 5-limit and using Proposition 6.1, one finds the solution for β is by (53) β=13(3+i)ϵ(3+4i5)k,(53) where ϵ is a unit and k is an integer. Thus for System I we obtain (54) S={α,β,Z}.(54) This is identical to the example we considered earlier in this section, only now with the inclusion of a phase factor parametrized by ϵ and k. If tones correspond to equivalence classes of elements of L(5,C) modulo units and phase factors, then S is uniquely determined.

Complex System II. In this case one of the semitones X2 and X3 is taken to be α or α¯. Let us say X2=α¯. Then X2X3=109 implies X3=2027(1+i). But if we express X3 as a multiple of α, then we obtain X3=κ1α, and we see that the syntonic comma makes an appearance. Thus for System II we have (55) S={α,α,Z},(55) where α=κ1α. Note that the two complex semitones are proportional in this case, with a real factor of proportionality. The relation between α and α is analogous to that between X and Y in the just system, where Y=κ1X. Note that instead of X2=α¯, we could have taken X2=α, or X3=α, or X3=α¯. Thus, System II occurs in four variants. But if tones are equivalence classes of elements of L(5,C) modulo units and complex 5-limit phase factors, then there are only two variants, since α and α¯ are equivalent modulo units.

Complex System III. Here one of the semitones X2 and X3 is taken to be 1615. Let us say X3=1615. Then 1615X2=109 which gives X2=2524. Thus for System III we have (56) S={α,ρ,Z},(56) where ρ=2524. The interval ρ is well known in the theory of tuning and is variously referred to as the minor chromatic semitone or the small half-tone. Again, we could have taken X2=1615 and X3=ρ, so we see that System III occurs in two variants.

We have already presented the complex chromatic scale and the associated aggregation sequence for System I at (Equation45) and (Equation46). These values are recorded in Table  along with the details of the other two systems of complex tonalities with three semitones. Under System II, a calculation shows that the aggregation sequence starting at C = 1 is (57) 1:α,α¯,α,α¯,Z,α,α¯,α,α¯,α,Z,α¯(57) and the chromatic scale is given by (58) C=1,C=34(1+i),D=98,D=56(1+i),E=54,F=43,F=1+i,G=32,G=98(1+i),A=53,A=54(1+i),B=43(1+i),C=2.(58) This scale is characterized by the fact that the notes C, D, E, F, G, A, C take on just values, whereas the remaining notes are given by multiples of ξ. In the variant shown, we have taken X2=α¯,X3=α, which leads to D=98. In the variant where X2=α¯,X3=α, we obtain D=109. In both cases there is a wolf interval, either between D and A or between D and G. See Table  for further details of the variants of System II. Turning now to System III, we see that under its first variant the aggregation sequence takes the form (59) 1:α,Z,ρ,α¯,Z,α,α¯,Z,ρ,α,Z,α¯(59) and for the chromatic scale we obtain (60) C=1,C=34(1+i),D=45(1+i),D=56(1+i),E=54,F=43,F=1+i,G=32,G=85,A=53,A=54(1+i),B=43(1+i),C=2.(60) Note that we have the simple ratio A/D=ρξ. We also have G=85, which is the just value of this note. In the second variant of System III, we have D=2532ξ and G=2516, the latter being the so-called augmented fifth, which can be thought of as a compounding of two intervals of a third. In both variants of System III one has G/D=ξ¯.

8. Complex Pythagorean systems

It is not unreasonable to ask whether one can construct complex chromatic systems based on two semitones, rather than three. Whereas the point of departure for the three-semitone complex system was the just scale, the case of two complex semitones begins with consideration of the ancient Pythagorean system, the elements of which we briefly recall.

We use the term “Pythagorean” to refer broadly to the entire class of tuning systems for which intervals are products of powers of the numbers two and three (CitationHaluška 1999; CitationThirring 1990). If we begin at C = 1, then an ascending sequence of tones for C,G,D,A, can be constructed using the cycle of fifths by multiplying successive terms by 32 and then normalizing with a power of two to bring the pitch back into the original octave. We obtain (61) C=1,G=32,D=98,A=2716,E=8164,B=243128,F=729512,,(61) and so on up the cycle of fifths, but lowering the tones where necessary by one or more octaves to ensure that they all fall within the same octave. A descending sequence can be similarly constructed by multiplying successive terms by 23, and normalizing, to give (62) C=1,F=43,B=169,E=3227,A=12881,D=256243,G=1024729,,(62) and so on down the cycle of fifths. The resulting F is different from the G, but if we choose one or the other of them then we merge the two series to obtain a Pythagorean version of the chromatic scale, given (in this case, with the G) by (63) C=1,D=256243,D=98,E=3227,E=8164,F=43,G=1024729,G=32,A=12881,A=2716,B=169,B=243128,C=2.(63) It is straightforward to verify that this scale is generated by aggregation from a pair of semitones, given by (64) σ=256243,τ=21872048,(64) corresponding respectively to the D of the descending series and the C of the ascending series (one step beyond the F that we have shown). The aggregation sequence is (65) 1:σ,τ,σ,τ,σ,σ,τ,σ,τ,σ,τ,σ.(65) The fact that the diesis σ appears seven times and apotome τ appears five times is reflected in the identity σ7τ5=2, which is easily checked by use of the relations σ=2835 and τ=21137. See Table . The origins of the Pythagorean system are shrouded in mystery, but it should be evident that both the pentatonic scale, (66) C=1,D=98,F=43,G=32,A=2716,C=2,(66) and the heptatonic scale (67) C=1,D=98,E=8164,F=43,G=32,A=2716,B=243128,C=2(67) can be generated aggregatively within this system by pairs of certain atoms, where each atom has a certain particle composition.

In the case of the pentatonic scale, we have the whole tone X=98 given by στ and the minor third W=3227 given by σ2τ. The associated aggregation sequence starting at C = 1 is 1 : X, W, X, X, W. But in the case of the heptatonic scale, the system is generated by the whole tone X and the semitone σ=256243, with the aggregation sequence (68) 1:X,X,σ,X,X,X,σ.(68) One observes that if we sharpen the notes E, A and B of Ptolemy's intense diatonic scale (Equation1) by applying the syntonic comma, then we obtain the Pythagorean heptatonic scale. In particular, we have (69) 54κ8164,53κ2716,158κ243128.(69) Once we have selected a twelve-tone chromatic Pythagorean scale by taking a strip of twelve successive notes from the infinite sequence rising above and descending below C, we can for convenience re-write our chromatic scale in such a way that the black keys are represented by notes with sharps, and we have (70) C=1,C=256243,D=98,D=3227,E=8164,F=43,F=1024729,G=32,G=12881,A=2716,A=169,B=243128,C=2.(70) This expression for the chromatic scale is equivalent to that of (Equation63), but is rather more well suited for a comparison to the scales we have developed based on three semitones. As in the case of three semitones, one sees that the F is composed of high powers of primes.

Thus we are prompted to search for a complex Pythagorean system involving lower powers of primes. Following the method of Theorem 7.1, our approach will be (i) to take as given the real Pythagorean tones of the pentatonic scale, (ii) to supplement these with F=1+i, and (iii) to impose conjugate symmetry. We obtain the following:

Theorem 8.1

Let S denote a system of two complex semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from elements of S and their complex conjugates such that C = 1, D=98, F=43, F=1+i, G=32, A=2716, C=2 and (b) the aggregation is conjugate symmetric, in the sense that the first six elements of the chromatic series are conjugates of the second six, in the same order. Then S={34(1+i),256243}.

Proof.

Let the aggregation sequence take the form (Equation49). When we impose conjugate symmetry the sequence reduces to (Equation50), as in the case of three semitones. Then we impose the following constraints to fix the notes of the pentatonic scale together with F=1+i: (71) X1X2=98,X1X2X3X4X5=43,X1X2X3X4X5X6=1+i,X1X2X3X4X5X6X¯1=32,X1X2X3X4X5X6X¯1X¯2X¯3=2716.(71) These relations can be solved to deduce that X1=α, X2=α¯, X3=α, X6=α, and X4X5=6481ξ. Taking into account that we need to have two semitones in total, we find that the system is uniquely determined to be S={α,σ}, where α=34(1+i) and σ=256243.

There are two variants: (i) X4=α¯, X5=256243, and (ii) X4=256243, X5=α¯. The resulting chromatic scale thus takes two different possible forms, depending on the choice of variant. This affects the notes E and A. For variant (i) we obtain (72) C=1,C=34(1+i)D=98,D=2732(1+i),E=8164,F=43,F=1+i,G=32,G=98(1+i),A=2716,A=8164(1+i),B=43(1+i),C=2,(72) with the aggregation sequence (73) 1:α,α¯,α,α¯,σ,α,α¯,α,α¯,α,σ,α¯.(73) Then for variant (ii) we obtain (74) C=1,C=34(1+i)D=98,D=2732(1+i),E=89(1+i),F=43,F=1+i,G=32,G=98(1+i),A=2716,A=169,B=43(1+i),C=2,(74) with the aggregation sequence (75) 1:α,α¯,α,σ,α¯,α,α¯,α,α¯,σ,α,α¯.(75)

Remark 8.1

It is interesting to note that it was not necessary to assume that the semitones are complex 3-limit in the formulation of Theorem 8.1. Rather, this feature emerges as a consequence of the weaker conditions that we have imposed, in contrast with Theorem 7.1, where it was assumed that the semitones are complex 5-limit.

9. Back to reality

One can ask whether the complex tone systems that we have considered give rise to real “shadow” systems, where the magnitudes of the complex numbers under consideration might play a role. It would thus seem natural to look for an analogue of Theorem 7.1 in which the role of ξ is replaced by |ξ|=2. This takes us out of the rational framework, but is nevertheless of interest since the results run in parallel with the complex system in a nice way. Since one has F=2 in equal temperament, as well as in historic tuning systems (for example, that of CitationGrammateus 1518) where 2 arises as the geometric mean between F=43 and G=32 (CitationBarbour 1953), it makes sense to pursue the matter. We obtain the following:

Theorem 9.1

Let S denote a system of three real semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from elements of S such that C = 1, E=54, F=43, F=2, G=32, A=53, C=2, and (b) the aggregation is symmetric, in the sense that the first six elements of the chromatic series equal the second six elements, in the same order. Then there are three such systems, given by

  1. Shadow System I: S={342,1310,1615},

  2. Shadow System II: S={342,20272,1615},

  3. Shadow System III: S={342,2524,1615}.

Proof.

The aggregation sequence is evidently of the form (76) 1:X1,X2,X3,X4,X5,X6,X1,X2,X3,X4,X5,X6,(76) with real entries, and the constraints are given by (77) X1X2X3X4=54,X1X2X3X4X5=43,X1X2X3X4X5X6=2,X1X2X3X4X5X6X1=32,X1X2X3X4X5X6X1X2X3=53.(77) It follows that X1=X4=X6=θ, X5=1615, and X2X3=109, where θ=342. The need for three semitones in total implies that either (i) X2X3=η2, or (ii) X2X3=θη, or else (iii) X2X3=1615η, where η is another semitone. Then by solution of the constraint X2X3=109 we are led to the three systems in the claimed result.

As in Theorem 7.1, the first and third semitones are the same in all three systems, and it is the second semitone that differs. Note that Systems II and III each have two variants, depending on the order of the semitones X2 and X3.

An analogous result can be obtained for the real shadow of the complex Pythagorean system, where we look at the construction of a real two-semitone system matching the Pythagorean pentatonic scale and satisfying F=2.

Theorem 9.2

Let S denote a system of two real semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from elements of S such that C = 1, D=98, F=43, F=2, G=32, A=2716, C=2 and (b) the aggregation is symmetric in the sense that the first six elements of the chromatic series equal the second six elements in the same order. Then there are two such systems, given by

  1. Pythagorean Shadow System I: S={342,256243},

  2. Pythagorean Shadow System II: S={342,8924}.

Proof.

The aggregation sequence takes the form (Equation76), with real entries, and the constraints (78) X1X2=98,X1X2X3X4X5=43,X1X2X3X4X5X6=2,X1X2X3X4X5X6X1=32,X1X2X3X4X5X6X1X2X3=2716.(78) It follows that X1=X2=X3=X6=342 and X4X5=64812. That there should be two semitones implies that either X4X5=θσ, where θ=342 and σ=256243 is the second semitone, or X4X5=ν2, where ν=8924 is the second semitone, and this gives the systems stated in the theorem. Note that for System I there are two variants: (a) X4=θ, X5=σ, (b) X4=σ, X5=θ.

Remark 9.1

Pythagorean Shadow System I is based on the magnitudes of the complex Pythagorean system presented in Theorem 8.1, but Pythagorean Shadow System II arises independently as a second solution of the conditions of Theorem 9.2.

10. Applications, conjectures, and conclusions

Complex tonality opens up the possibility of genuinely new types of musical structures and perhaps also helps us to understand existing structures. This becomes evident in the various harmonic relations holding between scales and between chords. In our Complex System I, all the major scales are different from one another. Thus if we play a simple melody in one key and then transpose it to a different key, the new melody is usually not quite the same as the original. In other words, the various major keys are not transpositionally equivalent. The same is true of the minor keys. In the key of C major, the tonic and subdominant triads are equivalent (they have the same interval structure) but the tonic and the dominant triads are not, on account of the complex values for B and D. Thus for C major and F major we have the triads [1,54,32] and [43,53,2], but for G major we have the triad [32,43ξ,85ξ].

The hallmark of the dominant in this case is that the interval B/G is given by 89ξ and that the interval D/G is given by 1615ξ, whereas the interval D/B is given by 65, as in the just system. This closeness of relationship between the F-major and the C-major triads may contribute to the perceived gentleness of the plagal resolution from F major to C major in contrast to the more robust authentic resolution from G major to C major.

A well-known problem in the theory of harmony is the question of why major and minor chords should be so different in character from one another to the ears. After all, they are composed of the same intervals, 54 and 65, merely stacked in the opposite order. There is no obvious acoustic, physiological or psychological reason why they should produce such a different effect on the listener. One might argue along the lines of cultural conditioning, and to be sure, such elements may come in; but surely that cannot be the whole story. In Complex System I, the C-major chord takes the familiar form [1,54,32], whereas by (Equation45) the parallel C-minor chord is [1,56ξ,32], which can be contrasted to the just Aeolian [1,65,32]. But for the relative minor in the complex system, we have [56,1,54], in which the intervals are the same as those of the relative minor in the just system. Thus, in the complex system there is a clear distinction, in the key of C major, between the structure of the parallel minor (with its complex minor third interval) and the relative minor (with its real minor third interval), whereas in the just system these two minor chords have the same structure.

Such distinctions may help us better understand, for example, the startling contrasts in the relations between C major and C minor, on the one hand, and between C major and A minor, on the other hand, in the two movements of Beethoven's Opus 111 Piano Sonata. That the first relation is like that of heaven and hell, and the second like that of acceptance and resignation. One recalls Kretschmar's lecture in Thomas Mann's Doctor Faustus.

Our remarks cannot do justice to the profundity of Beethoven's composition; nonetheless, the relations between these two different means of forming a minor chord from a major chord, and the nature of the mental states induced by these chords, are so important that any insight into the way the mind takes on board these distinctions may be of some interest.

Indeed, if one looks at the Opus 111 Sonata from the point of view of its embedding in the complex system, more of its structure becomes apparent. Structural relations between major and minor chords are evident more generally if one views the neo-Riemannian theories (CitationCohn 1998Citation2012) from this angle. For instance, the system of hexatonic relations beginning at C major involves a progression from the real to the complex and back under Complex System I. The well-known hexatonic cycle (79) C majorC minorA majorA minorE majorE minorC major,(79) obtained by successive parallel and leading-tone shifts, is given in the Keplerian system by (80) [1,54,32][1,65,32][1,65,85][1516,65,85][1516,54,85][1516,54,32][1,54,32].(80) But in Complex System I this sequence of transformations takes the following form: (81) [1,54,32][1,56(1+i),32][1,56(1+i),12(3i)][23(1+i),56(1+i),12(3i)][23(1+i),54,12(3i)][23(1+i),54,32][1,54,32].(81) One sees in particular that the starting chord C major has no complex elements, whereas the associated polar chord A minor is constructed entirely from complex elements. Again, more structure is revealed.

Let us consider another example, once more involving the complex A-minor chord. This example concerns the role of the tritone in so-called vagrant chords (CitationSchoenberg 1978). Ordinarily, the tritone B/F is taken to be a dissonant interval, but there are settings in which it has the opposite effect. Such a setting occurs notably, to varying degrees, in the context of the minor sixth chord and its permutations. The most well known of these harmonious chords involving the tritone is the A-minor sixth, which after permutations we recognize as the Tristan chord, which in just intonation takes the form [23,1516,65,85].

The remarkable aspect of this chord is that in the presence of the two higher notes the tritone seems to lose its dissonance in such a way that the chord as an entièreté acquires that uncanny sense of beauty tinged with tragedy that permeates the whole of the opera Tristan und Isolde. We can only speculate (among many others – see, e.g., CitationCohn 2012; CitationSchoenberg 1978) how this comes about, but it may be that in isolation the tritone is perceived by the ear as the dissonant interval 4532, with its multiplicity of prime powers; but when nudged by the presence of the other elements of the chord the mind flips (as with optical illusions) to a perception of this interval in its representation by the most basic of the Gaussian primes.

It is interesting in this context to recall that Lewin (1993) in a notable passage considers the harmonic intuition associated with the stimulus of the interval 4532. In essence he argues (quite persuasively) that the intuition of the interval F/C takes the F to be the mediant of the dominant of the dominant, and thus interprets the F as a leading tone:

Observe that the number () 4532 arises here not as the product of 25 and 32 and 5, which is the most natural mathematical factorization. Rather, 4532 arises as the product of the four factors 2, 54, 34 and 34, reflecting its “natural” way of measuring an intuited chain of intuitions in the given situation.

These arguments appear in the context of a wider discussion of the just system, in which musical intervals take the form 2a3b5c for a,b,cZ, in connection with which Lewin asserts the following:

It is not immediately clear what intuitions of “distance” or “motion” we are measuring by these intervals. Personally, I am convinced that our intuitions are highly conditioned by cultural factors. In particular, I do not think that the acoustics of harmonically vibrating bodies provide in themselves an adequate basis for grounding those intuitions.

Thus it may be that complex intervals map to certain intuitions of chains of intuitions, just as do the intervals of the just system in Lewin's analysis – that an acoustic stimulus may indeed trigger this intuition, in a setting where cultural factors play a significant role. Pursuing the idea further one could then conjecture that the setting of the tritone in the environment of the two higher notes of the Tristan chord is sufficient as an acoustic trigger for the interval to map to the complex interval ξ rather, than, say, a leading tone represented by the mediant of the dominant of a dominant – the latter would suggest a resolution into C major, which is clearly not what one observes, despite the many tactics that Wagner does employ in his score to achieve what one might call tentative resolutions.

In short, we propose that when the tritone is heard in isolation it is registered by the ear as harsh, that when it is heard in the context of, say, an eighteenth century contrapuntal development, it triggers a leading-tone interpretation along the lines of Lewin's gestalt; but that in the setting of the minor sixth chord the mind identifies this interval as the complex prime 1+i, which it treats as pleasurable, luxurious, even exotic – thus perhaps justifying the term diabolus in musica which has been applied to this most unstable of intervals.

According to this view, acoustic phenomena can trigger certain intuitions already established in the mind – either innately or through cultural conditioning – which can then be identified with certain mathematical objects. These mathematical objects are constructed from sets of complex 5-limit intervals, which we recognize as melodies, harmonies, rhythms, and so forth. The complex interval system itself can be viewed as an abstraction, and hence the precise form taken by this map from acoustic phenomena to the interval system will inevitably vary somewhat from person to person, from piece to piece, from culture to culture, and over time, even if the existence of such a map can be regarded as a universal.

Under the Complex System I, the Tristan chord takes the form [23,23ξ,56ξ,12(3i)]. Let us put this array under the microscope and deconstruct it into its constituent intervals, of which there are six in all, comparing these to the corresponding intervals arising in the just version of the chord. For the interval B/F we obtain ξ in the complex case, instead of the Keplerian tritone 4532, as we have already noted. We take ξ to be consonant since it is a low power of a prime. For the interval E/B we obtain a perfect third 54 in the complex system, as opposed to the fraction 65÷1516=3225 that arises in the just system. For the dissonant interval E/F in the just system we obtain 95. The corresponding interval in the complex system takes the form 54ξ, which involves fewer primes (if we do not count powers of 2) and hence can be viewed as no more dissonant that 95. The interval A/E is given by 43 in the just system. In the complex system, we use the prime factorization 3i=ξ(12i) to get A/E=35(12i), more complicated than the just expression, but involving low powers of primes, and hence consonant. We can parse this interval as A/E=6512(12i), showing it can be interpreted as the product of a minor third and a complex whole tone. This may explain why the fourth between the E and the A is softened, and seems darker in the context of the Tristan chord than is does on its own. The interval A/F is given by 125 in the just system, whereas it takes the form 34ξ(12i) in the complex system, again with low powers of primes.

Finally, we observe that the sixth between B and A is given by fraction 85÷1516=12875 in the just system, whereas it takes the form A/B=34(12i) in the complex system, involving only a small number of primes. All the intervals within the Tristan chord arising in the complex system are harmonious (or are at least as harmonious as one might wish), whereas in the just system three of the intervals have large powers of primes and are unacceptable as candidates for consonance. In equal temperament all the intervals are out of tune, though it is difficult to say they are approximately in tune, since one cannot say what form “exact” tuning might take. The equally tempered version of the chord sounds acceptable – after all, this is what we hear when we attend the opera or play passages from it on the piano – much as it would have in Wagner's day. It follows that the conventionally performed version of the chord in the setting of the score is sufficient to act as a stimulus for the mind to receive it as a signal for the relevant tones of the complex system, along the lines we have discussed. We remark, incidentally, that all of the tritone intervals in Complex System I take the value ξ, modulo units and 5-limit phase factors. This is consistent with the observation that even under Wagner's many modulations, the Tristan chord does not lose its magic.

In conclusion, it may be appropriate to make a few brief remarks concerning our research methodology, even if these remarks may be irrelevant to the question of how one might further progress the work. It is not unusual in science for new ideas to arrive in a flash, after lengthy consideration of a body of earlier material, in reaction to which the earlier material can be reorganized and better understood. It may even then look to the outsider as if these developments of the earlier material somehow led to the new ideas, though in reality the sequence of events was opposite. In the present work, our realization that setting F=1+i led rather naturally to the construction of complex scales with interesting properties prompted fresh investigations into the foundations of the just system, which in turn led to Theorem 4.1. It is not surprising therefore that our Theorem 4.1 does not quite have the character of an incremental development based on established ideas in music analysis. This is because its methods are fresh and offer new insights, as is evidenced in our analysis of the historical just scales catalogued by CitationBarbour (1953). With Theorem 4.1 at hand, we were then in a position to investigate the complex system more systematically, leading to Theorem 7.1. This required bringing in more mathematics, in the form of various results from the theory of Gaussian integers. The connections to word theory, generalized musical intervals, and the theory of scales came later, and although these latter areas have influenced the presentation of our work, they were not involved in its initial formulation.

It would be disingenuous to suggest that the results we have reported here came about simply as the product of a methodical research program aimed at improving on the just system, somehow eventually leading to the introduction of complex tonality. Nonetheless, going forward, it would make sense to pursue the approach that we have introduced more systematically in the investigation of both real and complex n-limit tone systems. That the prime decompositions 2=(1+i)(1i) and 5=(1+2i)(12i) lead to such a wealth of musical structure may come as a surprise. But we can take the view that musical cognition entails the formation of a map from a class of acoustic phenomena to a suitably established tone system, the latter being essentially a mental construct. And if the tuning of our ears cannot stretch beyond the confines of 5-limit tones, whether real or complex, then our minds can. In Nietzsche's words, “Neue Ohren für neue Musik.” Hence, from the perspective of the mathematician, the complex n-limit tone systems may well constitute a natural framework within which music of the future can be composed.

Acknowledgements

We are grateful for comments from participants at RealTime Seminars in the Department of Computing at Goldsmiths University of London, June 2022 and May 2023, and the IMA Conference on Mathematics in Music at the Royal College of Music, London, July 2022, where drafts of this work were presented. We thank J. Armstrong, T. Blackwell, D. M. Blasius, T. Crawford, J. Forth, G. Di Graziano, D. P. Hewett, K. Rietsch and two anonymous referees for helpful comments and suggestions.

Disclosure statement

No potential conflict of interest was reported by the authors.

References

  • Barbour, J. M. 1953. Tuning and Temperament: A Historical Survey. 2nd ed. East Lansing: Michigan State College Press.
  • Barker, A. 1984, 1989. Greek Musical Writings. Vols. I, II. Cambridge: CUP.
  • Barker, A. 1994. “Greek Musicologists in the Roman Empire.” Apeiron 27 (4): 53–74. https://doi.org/10.1515/APEIRON.1994.27.4.53.
  • Bibby, N. 2003. “Tuning and Temperament: Closing the Spiral.” In Music and Mathematics, edited by J. Fauvel, R. Flood, and R. Wilson, Oxford: OUP.
  • Clampitt, D. 2017. “Lexicographic Orderings of Modes and Morphisms.” In The Musical-Mathematical Mind: Patterns and Transformations, edited by G. Pareyón, S. Pina-Romero, O. A. Agustín-Aquino, and E. Lluis-Puebla, 89–97. Cham: Springer.
  • Clampitt, D. 2019. “An Overview of Scale Theory Via Word Theory.” Brazilian Journal of Music and Mathematics 3 (2): 1–17.
  • Clampitt, D., and T. Noll. 2018. “Naming and Ordering the Modes, in Light of Combinatorics on Words.” Journal of Mathematics and Music 12 (3): 134–153. https://doi.org/10.1080/17459737.2018.1519729.
  • Cohn, R. 1998. “Introduction to Neo–Riemannian Theory: A Survey and a Historical Perspective.” Journal of Music Theory 42 (2): 167–180. https://doi.org/10.2307/843871.
  • Cohn, R. 2012. Audacious Euphony: Chromaticism and the Consonant Triad's Second Nature. Oxford Studies in Music Theory. Oxford: OUP.
  • Dominguez, M., D. Clampitt, and T. Noll. 2009. “WF Scales, ME Sets, and Christoffel Words.” In Mathematics and Computation in Music, edited by T. Klouche and T. Noll, 477–488. Berlin: Springer.
  • Ellis, A. J. 1864. “On the Conditions, Extent, and Realization of a Perfect Musical Scale on Instruments with Fixed Tones.” Proceedings of the Royal Society 13 (60): 93–108.
  • Euler, L. 1739. “Tentamen Novae Theoriae Musicae Ex Certissismis Harmoniae Principiis Dilucide Expositae. Petropoli, Ex Typographia Academiae Scientiarum.” Opera Omnia: Series 3, 1:197–427. Eneström E33
  • Euler, L. 1774. “De Harmoniae Veris Principiis Per Speculum Musicum Repraesentatis.” Novi Commentarii Academiae Scientiarum Petropolitanae 18:330–353. Opera Omnia: Series 3, 1: 568–586. Eneström E457
  • Fogliano, Lodovico. 1529. Musica Theorica. Venice.
  • Grammateus, Henricus. 1518. “Arithmetica Applicirt Oder Gezogen Auff die edel Kunst Musica.” In Ayn New Kunstlich Beuch. Nürnberg.
  • Haluška, J. 1999. “Searching the Frontier of the Pythagorean System.” Tatra Mountains Mathematical Publications 16:273–282.
  • Haluška, J. 2004. The Mathematical Theory of Tone Systems. Boca Raton, Florida: CRC, Taylor & Francis.
  • Hardy, G. H., and E. M. Wright. 1938. An Introduction to the Theory of Numbers. Oxford: Clarendon Press.
  • Helmholtz, H. 1885. On the Sensation of Tone as a Physiological Basis for the Theory of Music. Translated by A. J. Ellis. London: Longmans, Green and Co.
  • Klopp, G. C. 1974. Harpsichord Tuning. Translated by Glenn Wilson. Garderen, Holland: Werkplaats Voor Clavecimbelbouw.
  • Lewin, D. 1987. Generalized Musical Intervals and Transformations. New Haven: Yale University Press.
  • Longuet-Higgins, H. C. 1987. Mental Processes: Studies in Cognitive Science. Cambridge, Massachusetts: MIT Press.
  • Orbach, J. 1999. Sound and Music. Lanham, Maryland: University Press of America.
  • Penrose, R. 2000. “The Heritage of Pythagoras: Nineteen to the Dozen.” In Musicology and Sister Disciplines, edited by D. Greer. Oxford: OUP.
  • Ramis de Parejo, Bartolomeus. 1482. Musica Practica. Bologna.
  • Rousseau, Jean-Jacques. 1768. Dictionnaire de Musique. Paris: Chez la Veuve Duchesne.
  • Schoenberg, A. 1978. Theory of Harmony. Translated by Roy E. Carter. Berkeley: University of California Press.
  • Scholes, P. A. 1955. Oxford Companion to Music. 9th ed. Oxford: OUP.
  • Solomon, J. 2000. Ptolemy Harmonics: Translation and Commentary. Leiden: Brill.
  • Thirring, W. 1990. “Über Vollkommene Tonsysteme.” Annalen der Physik 502 (2–3): 245–250. https://doi.org/10.1002/(ISSN)1521-3889.
  • Tovey, D. 1929. “Harmony.” In Encyclopedia Brittanica. 14th ed. J. L. Garvin (editor). London: Encyclopedia Brittanica.
  • Žabka, M. 2011. “Introduction to Scale Theory over Words in Two Dimensions.” In Mathematics and Computation in Music, edited by C. Agon, M. Andreatta, G. Assayag, E. Amiot, J. Bresson, and J. Mandereau. Lecture Notes in Computer Science (LNCS 6726). Berlin: Springer.

Appendix

List of frequently occurring intervals