Abstract
Equal temperament, in which semitones are tuned in the irrational ratio of , is best seen as a serviceable compromise, sacrificing purity for flexibility. Just intonation, in which intervals are given by products of powers of 2, 3, and 5, is more natural, but of limited flexibility. We propose a new scheme in which ratios of Gaussian integers form the basis of an abstract tonal system. The tritone, so problematic in just temperament, given ambiguously by the ratios , , , , none satisfactory, is in our scheme represented by the complex ratio . The major and minor whole tones, given by intervals of and , can each be factorized into products of complex semitones, giving us a major complex semitone and a minor complex semitone . The perfect third, given by the interval , factorizes into the product of a complex whole tone and its complex conjugate. Augmented with these supplementary tones, the resulting scheme of complex intervals based on products of low powers of Gaussian primes leads to the construction of a complete system of major and minor scales in all keys.
1. Introduction
For centuries, musicians and mathematicians have struggled with the challenges of temperament and tonality. The problem is that there is a paradox at the heart of Western music. The paradox is that no tuning system exists for keyboard instruments such that all the harmonious intervals are perfectly tuned. Here we propose a radical solution based on the notion of complex tonality. We begin with an overview of the just system of intonation, which originated in antiquity and was much studied in our era during the 16th, 17th and 18th centuries, reaching a high point in the work of CitationEuler (1739, Citation1774). We can define the just system as the totality of the various ways of representing tones by the vibrational frequencies of strings of various lengths, each such frequency being given by products of powers of the primes 2, 3, and 5 times some fixed frequency, which we call the tonal centre. Over time, many different ways of choosing the twelve notes of the chromatic scale in the just system have been proposed. Our goal is to construct a complex analogue of the just system, with broad applications, based on the ratios of certain Gaussian integers, in such a way that some of the insufficiencies of the just system can be resolved.
Among the various diatonic scales that have been considered in the just system, the so-called intense diatonic scale of Claudius Ptolemeus is especially notable (CitationSolomon 2000). Ptolemy's scale, expressed in units of time such that the frequency of middle C is unity, takes the form (1) (1) One can then ask how the Ptolemaic scale can be embedded in a chromatic scale. There are many possibilities, but one good example is the following, due to Johannes Kepler, which is known as Kepler's Monochord No. 2 (CitationBarbour 1953), here transposed down a fifth: (2) (2) In Section 2 we look at the just system in some detail and we sketch the arguments leading to Ptolemy's diatonic scale and to Kepler's chromatic scale. Among the defects of the just system, clearly visible in Kepler's scale, are the multiplicities of primes required for the tritone interval from C to . Also troubling, when one looks further, are the multiple tunings for the notes D and , and the unpleasant “wolf” tunings of certain intervals.
As a first step forward, we propose an alternative approach to building up scales in the just system by use of an analogy with chemistry. In Section 3, Atoms, molecules and particles, we generate the just scale by the aggregation of a small number of whole tone and semitone intervals, which we call atoms. We show that one recovers the seven modal scales (in their modern form) by permutations of the aggregation sequence of the given whole tones and semitones. This leads to a consistent assignment of all the tones of the chromatic scale with the exception of . The equal temperament system, which evolved as a practical response to the difficulties of the just system, can also be cast into an “atomic” framework by use of the chemical analogy, but one loses the mathematically pleasant representation of harmonious intervals as products of powers of primes. One is thus motivated to delve more deeply back into the just system, with a view to generalizing it in some way.
In Section 4, Chromatic tone systems, we turn to the idea of constructing the just chromatic scale by the aggregation of three types of semitones. In Theorem 4.1, we use the aggregative approach to give a complete description of the various possible three-semitone chromatic extensions of Ptolemy's scale. More precisely, we ask for all systems of three rational semitones with the property that the chromatic scales assembled by aggregating such tones hit all seven values of the Ptolemaic scale. We prove the existence of exactly three distinct families of solutions, each in thirty-two variations, giving ninety-six scales in all. There is an overall integrity to the three families thus arising, since the semitones in the various systems exchange syntonic commas in forming whole tones, in analogy with the behaviour of electrons in atomic physics. We present some applications of the resulting classification scheme to several of the known historical just schemes catalogued by CitationBarbour (1953). Nonetheless, one is forced to admit that the three triples of rational semitones underlying this scheme contain high multiplicities of primes, hence contributing to the well-known perceived difficulties in the sonorities of the just system.
Following our description of the various just extensions of Ptolemy's scale using three rational semitones, and taking note of its shortcomings, but at the same time taking advantage of its methods, we introduce the idea of complex tonality, in which tones take on complex values given by ratios of Gaussian integers.
The complex interval systems thereby arising can be regarded as the basis for a new abstract form of music, but can also be used as a new way of analyzing the harmonic and melodic structures of the established corpus of Western music. Our construction proposes that the tritone interval from C to can be given the value , a single power of a single complex prime, which can thus be seen as the most consonant interval in the system.
In Section 5, Complex diatonic scales, we pursue an analogy with our exposition of the just system in terms of atoms and molecules, and exhibit a set of complex whole tones and semitones, seven in total, including the X, Y, and Z tones of the real just system, all involving small multiplicities of complex primes, sufficient to enable us to write down aggregation sequences for the complex analogues of the twelve major diatonic scales.
In Section 6, Well-tempered complexity, we turn to the mathematical foundations required for further exploration of the complex system. We summarize various properties of the Gaussian integers and in Definition 6.1 we introduce the notion of a complex n-limit tone, one that is a product of powers of Gaussian integers that are prime factors of rational integers of magnitude no more than n. This is a new idea that generalizes the existing theory of real n-limit tones very naturally. Specializing to the case of complex 5-limit tones, we work out the general form of a complex 5-limit phase factor in Proposition 6.1 and we point out that the set of complex 5-limit intervals modulo units and complex 5-limit phase factors forms a Generalized Interval System in the sense of CitationLewin (1987).
In Section 7, Complex chromatic scales, we show that a complex chromatic scale incorporating all the notes of the twelve given complex diatonic scales can be constructed by aggregation of three complex semitones. This construction is carried out in such a way that the chromatic scale extends the harmonious values of Ptolemy's diatonic scale, leading to our main result, Theorem 7.1, which gives a full description of the totality of complex systems that (a) embrace the harmonious Ptolemaic values, (b) admit three complex semitones as generators, and (c) observe a certain symmetry condition that ensures that the first six tones aggregate to the value for . We establish the existence and uniqueness of three such complex systems, the first of which is of particular interest on account of its full use of the factorization of the number five and the abundance of intervals that it exhibits involving low multiplicities of primes.
In Section 8, Complex Pythagorean systems, we consider the possibility of an analogous construction for two-semitone complex 3-limit scales by fixing the values of the Pythagorean pentatonic scale and asking that the system take the value at . The existence and uniqueness of such a scheme is shown in Theorem 8.1. In Section 9, Back to reality, we return to the study of real tuning systems and ask for the construction of “shadow” systems based on the magnitudes of the intervals arising in the complex schemes. This program can be carried out both for complex 3-semitone systems and for complex 2-semitone systems. The results are summarized in our Theorems 9.1 and 9.2.
In Section 10, Applications, conjectures, and conclusions, we wrap up and speculate on some applications to the theory of harmony.
In Tables , , , and we lay out the values of the tones of the chromatic scales for the systems that we have considered, alongside the associated aggregation sequences, showing the variations that arise in some cases.
In the Appendix we present a list of intervals occurring frequently in our analysis of complex tone systems. Our brief list can be compared to the more complete catalogues of real tones provided by CitationEllis (1864), CitationHelmholtz (1885), CitationLonguet-Higgins (1987), CitationHaluška (2004), and others, with which there is of course some overlap.
2. Just intonation
The question of which musical intervals are harmonious and which are dissonant is at least, in part, a matter of physiology, convention and culture – though it is undeniable that mathematical elements enter (albeit unconsciously) into musical aesthetics in an essential way, even at the level of tonality (CitationEuler 1739; CitationHelmholtz 1885; CitationLonguet-Higgins 1987; CitationPenrose 2000). The tradition in Western music is that the unison, the octave, the fifth, the fourth, the major third, the minor third, the major sixth and the minor sixth are harmonious. But the whole tone, the semitone, the major seventh, the minor seventh and the tritone are all to varying degrees dissonant.
It has been known since ancient times that there is a relation between harmonious intervals and the ratios of the frequencies at which strings vibrate. Suppose we fix a length of string so that it will vibrate at about 261.62 Hertz, so-called middle C (or ) on the piano. If we halve the length of string then the frequency of vibration is doubled, and we get , the note which is one octave above middle C. If we take one third of the length of string then the frequency is trebled, and we get , the G that lies a fifth above .
One fourth of the length of the string vibrates at four times the original frequency, and one fifth vibrates at five times the original. A string vibrating at a frequency four times sounds at , two octaves above , and a string vibrating at five times sounds at . It follows that the frequency of will be half that of . If we use units of time such that the frequency of is unity, then and we deduce that . Likewise since we deduce that the E just above middle C sounds at . We conclude that the interval of a fifth is given by the ratio , the major third is the ratio , and the minor third is the ratio of to , namely . Hence, starting at C = 1 we get , , and . Similar arguments show that , and .
Thus, we see that the harmonious intervals are products of powers of the prime numbers 2, 3 and 5. The minor third is , the major third is , the fourth is , the fifth is , the minor sixth is , the major sixth is , and the octave is 2. The logical deduction of such ratios is the basis of the just system of intonation, which CitationBarbour (1953) defines as “a system of tuning based on the octave , the pure fifth , and the pure major third .”
The situation with dissonant intervals is a little trickier. The tones that we have derived so far come about as a consequence of regarding C as the tonal centre. But suppose we think of G as a new tonal centre. This is like taking a new string two-thirds the length of the original and starting the analysis over. In this way we get a value for D, which is a fifth above G. In particular, since , we multiply times to obtain and thus . Since is a whole tone above , we are then tempted to assign the value to the interval of a whole tone. Likewise, since B is a major third above G we multiply times to obtain . And since is a fourth above F, and F is a fourth above C, we can set to be equal to times and hence . Finally, since is a semitone above , and is a semitone above , we conclude that the interval of a semitone can be given the ratio , and hence . Then we can take to be given by a combination of a third and a whole tone above C. This gives the product of and , namely .
So, to sum up, we have assigned rational values to all the notes of the chromatic scale given by products of powers of the prime numbers 2, 3, and 5. The resulting system of tuning is a good example of what we call “just intonation” and it has many attractive features.
In particular, the diatonic scale takes the form (Equation1(1) (1) ). This scale has been discussed by Gioseffo Zarlino (1517–1590) and others from the time of the Renaissance onward, though its roots lay in the work of Didymus, Ptolemy, Porphyry and other scholars of the early Roman Empire (CitationBarker 1984, Citation1994), and it is now commonly referred to as Ptolemy's intense diatonic. For the associated chromatic scale in its entirety, according to the logical development presented thus far, we obtain (Equation2(2) (2) ). CitationBarbour (1953) refers to this version of the chromatic scale as Alexander Malcolm's Monochord, and points out that it is equivalent to Johannes Kepler's Monochord No. 2 when the latter is transposed down by a perfect fifth. We can use the term “just intonation” to refer broadly to all of the various tuning systems constructed out of intervals involving products of powers of the primes 2, 3 and 5. The just diatonic scale (Equation1(1) (1) ) is fairly rigid as a mathematical object in the sense that there is relatively limited scope in modifying its entries. The just chromatic scale (Equation2(2) (2) ) is more flexible and there are many other choices that can be made for its entries, as we shall see in Section 4.
A good account of the virtues of the just system can be found in the works of CitationLonguet-Higgins (1987). It would appear that in practice the just system, whether it be in the narrow sense of the Ptolemaic and Keplerian scales mentioned above, or more broadly in the sense of the totality of all rational 5-limit dodecatonic scale systems, has served primarily as an idée fixe, in relation to which a vast range of ingenious temperaments have been constructed over the last few centuries, rather than as a viable solution to the problem of keyboard tuning in itself. An exception to this principle may be that of Leonhard Euler, who (if he is to be judged on the basis of what he has said in his writings) was a staunch believer in just intonation, even if he seems to wobble on the question of whether the prime factors should be restricted to 2, 3, and 5, or extended to include 7 as well and perhaps higher primes. Be that as it may, there are many subtle albeit well-known issues that arise in connection with the 5-limit just system, and it is in the numerous attempts that have been made at resolving such issues that new ideas have been originated both in musical analysis and in practice.
To get a sense of some of these issues, let us return to the idea of shifting the tonal centre. Multiplication by leads us successively from to to to and then on to . We obtain a value of for and hence for . But that implies , whereas we had already deduced , that is to say, . The multiplicative discrepancy between the two values of is called the syntonic comma (or comma of Didymus). Likewise, we had set , whereas now we get , again with a discrepancy of . The just system is evidently unstable under such shifts in the tonal centre.
Going forward, let us write P/Q for the ratio of the tones P and Q. This ratio represents the interval from the tone Q to the tone P (often written Q – P). For example, the interval from to is the ratio , a minor third.
If we recall that our value for D was obtained by applying the cycle of fifths to G, we have to ask whether is consistent with . In fact, one finds that the interval A/D is a bit flat for a fifth if . We obtain (3) (3) Since , we see that the interval A/D obtained in this way is slightly smaller than , i.e. somewhat flat. To sensitive ears it sounds out of tune, particularly if played on the harpsichord, and hence has come to be known as an example of a “wolf” interval. But if we set , the resulting interval (4) (4) is a perfect fifth. Indeed, one can check that . Thus, it seems inevitable that we must augment the just system with a second version of the note D taking the value .
Similarly, the value that we tentatively assigned earlier is consistent with the idea that should be a fourth above F. On the other hand, can also be regarded as a minor third above G, or a fifth above and these both give . Thus, we need to assign two different values to , namely and , and we can check that . Indeed, the situation of precisely parallels that of D.
Likewise, one can show that four different values for are required, corresponding to the various ways that the interval can be formed out of simpler intervals. These are (5) (5) Note that and that and are complements in the sense that their product is the octave. The multiplicity of the intervals representing is troubling, and all are rather complicated when they are factorized into products of powers of primes. We call this “the problem of ”. Is there a way around the problem of ? This question motivates us both to revisit the foundations of the just system and to introduce the idea of complex tonality.
3. Atoms, molecules and particles
To enable a way forward, leading to the complex systems that we introduce later in the article, it will be helpful first if we look at the Ptolemaic system in terms of so-called “atoms”. The idea is that we introduce a collection of simple tones (called atoms) such that each tone in the diatonic scale can be regarded as a multiplicative aggregation of atoms into molecules. The notes of the just system can be built up in this way step by step. We introduce three atoms, given by (6) (6) Then one sees that (7) (7) and hence that for the C-major scale in the just system we can write (8) (8) Each note in the scale is represented by a “molecule” with a particular configuration of atoms, and as the scale progresses more atoms are aggregated. Our atoms consist of the two “whole tones”, viz., and , alongside the semitone . We can use a shorthand to indicate the order in which the atoms are added in. Thus, for the diatonic scale starting at C = 1 we have the aggregation sequence 1 : X, Y, Z, X, Y, X, Z. The note before the colon denotes the initial tone, which is followed by a sequence of atoms. The initial unity here ensures that the scale begins at our chosen value for C, but in principle one can choose a different initial value. The algebraic identity that closes the octave implies some flexibility in the order in which the atoms can be aggregated. The seven ecclesiastical modes arise in this way, for example, corresponding to permutations of the original aggregation sequence:
Ionian mode:
Dorian mode:
Phrygian mode:
Lydian mode:
Mixolydian mode:
Aeolian mode:
Locrian mode: .
One can think of the note representing the underlying key (in this case, C) as being given in the form of a structureless monad, represented by the tone unity. But set in a larger context, this note itself can be given structure. It may be helpful to introduce a chemical analogy. The note C corresponds to the completed shell corresponding to an inert gas, such as argon. The creation of a scale, in this analogy, is represented by the addition of electrons, one by one, until the next shell is complete, corresponding to the note .
It is interesting to note that Isaac Newton seems to have been in favour of the just system but preferred a version of the scheme based on the Dorian mode on account of the symmetric arrangement of the atoms arising in that case (CitationBibby 2003). It may be that Newton was attracted to the fact that the aggregation sequence forms a palindrome in this mode. In any case it would be interesting to pursue the theory of modes arising in the present context from the point of view of the algebraic combinatorics of words, in the spirit of CitationDominguez, Clampitt, and Noll (2009), CitationŽabka (2011), CitationClampitt (2017), CitationClampitt and Noll (2018), CitationClampitt (2019), and others.
Inspection of the modal scales leads to another method for arriving at the accidentals of the chromatic scale. The resulting values for , , , and are consistent with the values we had tentatively recorded at (Equation2(2) (2) ). We obtain for the Dorian mode, for the Phrygian, for the Mixolydian, for the Aeolian, and for the Locrian.
Likewise, we obtain for the Dorian and Aeolian modes, for the Phrygian and Locrian modes, for the Phrygian, for the Aeolian, for the Locrian, and for the Locrian. But we get in the Lydian mode and in the Locrian mode. So the problem of rears its head, since the modes give two different values for it.
The modal scales as we have presented them are all rooted in the same tonality, that of C. If we change the tonal centre, instabilities emerge in the form of new tones. Suppose, for example, we wish to make a selection of notes from the just chromatic scale to represent the notes of the G-major scale. As with the C-major diatonic scale, we can ask that the G-major scale should be constructed via the accumulation of atoms. This can be achieved as follows. We begin with the molecule for , which we know to be of the form . Then we add on successively the atoms Y, X, Z, X, Y, X, Z. The resulting tones correspond to existing notes on the just keyboard.
At the next step, problems begin to emerge. If we try to construct a D-major scale, then we can start at and add successive atoms by applying the sequence . We obtain , which we have agreed is acceptable, but for we obtain , which is not one of the pre-existing tones.
When we go on to look at the remaining sharp keys we obtain three more tones, given by , , and , contradicting the values previously assigned. The flat keys fare better, but in the key of major we obtain , which is different from the we agreed on earlier. And in the key of major, we obtain , which is different from the previously proposed tuning at . Apparently, if we try to represent the major scales by successive aggregation of the atoms proposed to construct the just diatonic scale, then we obtain new values for notes to which we have already tentatively assigned values.
The generally accepted way around these difficulties in practice is, of course, the system of equal temperament, in which we introduce a irrational semitone (9) (9) which functions as the single “particle” out of which all the tones of the chromatic scale can be assembled by aggregation. Thus for the equally tempered chromatic scale we obtain (10) (10) and we have the aggregation sequence (11) (11) Note that , and that . Under equal temperament none of the harmonious intervals are perfect. Nevertheless, overall, the system produces satisfactory results for music written to be played in that system. This includes, for example, a good deal of the repertoire of music written for the piano. Equal temperament thus offers a tentative resolution to the paradox of tuning by spreading the dissonance equally across the keyboard. CitationScholes (1955) goes further in his Oxford Companion to Music at page 1018, and says:
The whole development of musical composition in the nineteenth century was based on the acceptance of the twelve equal keys into which the composer could pass without let or hindrance. Wagner's chromatic harmonies, for example, necessarily imply an orchestra with its instruments tuned “equally”.
Opponents of equal temperament, of which there are many, claim that it flattens out the music, leaving it homogeneous and characterless. Worse still, although the fifths are reasonably accurate under equal temperament, the thirds are noticeably wide. The disadvantages have been succinctly summarized by CitationKlopp (1974):
Obviously, all keys are available and the possibilities of enharmonization are unlimited. However, these freedoms are dearly paid for: there is not one single pure interval; the thirds are especially poor, giving the triad an insecure and restless sound; there is no differentiation between the keys; melodic tensions are reduced; the temperament is difficult to set.
And according to CitationTovey (1929):
No true harmonic ideas are based on equal temperament.
Euler (Citation1739, Citation1774) was dead against equal temperament for essentially the same reasons and expressed the opinion that the majority of musicians had rejected it. But in this he seems to have been mistaken, and it is clear that he did not anticipate the great wave of nineteenth and twentieth century composers who would embrace the advantages that transpositions into any key would offer. Nonetheless, one can take the view that from the Age of Enlightenment onward the ever-increasing demand for modulations into keys far removed from the tonic pushed the principles of Western tonal music as far as they could be taken – and that despite the fact that music predicated on equal temperament has produced numerous striking, important, and completely convincing compositions, there remains a lingering sense of incompleteness in the centuries-old mathematical project of developing a system of music based entirely on products of powers of primes, in the spirit of Euler's explorations. This suggests that we should return to the idea of a tonal system based on the rationals and consider in more precise terms the conditions that we would like it to satisfy, with an eye on the possibility of novel generalizations.
4. Chromatic tone systems
Let us enquire, therefore, whether it is possible to create a version of the just chromatic scale by the successive aggregation of “particles” (semitones of various magnitudes) in such a way that the result includes all of the notes of the just diatonic scale as a subscale. Although not, perhaps, obvious, the answer is affirmative, as the following example (CitationOrbach 1999), based on Kepler's scale, clearly illustrates. As particles we take (12) (12) An aggregation sequence that leads to the just chromatic scale (Equation2(2) (2) ) is then given as follows: (13) (13) That this sequence reproduces the atomic sequence 1 : X, Y, Z, X, Y, X, Z for the just diatonic scale with , and can easily be verified if one observes that X = pq, Y = pr and Z = p. But is this the only chromatic scale that can be constructed from three rational constituents by successive aggregation in such a way that it includes the just diatonic scale as a subscale?
The answer, perhaps surprisingly, turns out to be negative: in fact, there are ninety-six different versions of the chromatic scale that interleave the just diatonic scale. The situation is summarized in the following.
Theorem 4.1
Let S denote a set of three rational semitones with the property that there exists a chromatic scale of twelve notes constructed by aggregation from elements of S interleaving the just diatonic scale C = 1, , , , , , , . Three and only three such sets of semitones exist, given by
Just System I: ,
Just System II: ,
Just System III: .
Each such system gives rise to thirty-two distinct chromatic scales.
Proof.
To construct a twelve-tone chromatic scale that interleaves the just diatonic scale, we need to assemble the atoms , , from a set of three particles in such a way that X and Y each contain two particles, and Z contains a single particle. Clearly, X and Y share a particle in common, say, p. Then without loss of generality we can set X = pq and Y = pr, and Z will take one of the values p, q, r. Thus, and , and either (System 1), or (System 2), or (System 3). Solving for p, q, r, we obtain the three systems stated in the theorem. In any one of the systems, the order of aggregation for the constituents of X and the constituents of Y can be interchanged for each X and each Y. Since there are three Xs and two Ys, that gives us five possible interchanges, and hence thirty-two different variations for each system.
Just System I is the historic scheme of semitones set out in (Equation12(12) (12) ). Within any of the three systems, we can present the constituents of X either in the order p, q or the order q, p. Likewise the constituents of Y can be presented in the order p, r or the order r, p. Thus for each of the five “flat” notes of the chromatic scale, we have two (and only two) choices.
For example, in System I we obtain either or for , either or for , either or for , either or for , and either or for . Thus we have thirty-two different versions of System I, depending on which choices we make for the five accidentals.
If we take the first option in each accidental, for which the tones are the simplest, then we recover the aggregation sequence (Equation13(13) (13) ), which under System I gives rise to (Equation2(2) (2) ), which appears in CitationBarbour (1953) as the historic tuning system known as Malcolm's Monochord. Barbour gives the tones in cents, so one has to translate these approximate cent values to the corresponding exact rational numbers to obtain our chromatic scale (Equation2(2) (2) ). By (Equation12(12) (12) ) and (Equation13(13) (13) ), the aggregation sequence for Malcolm's Monochord is given more explicitly by (14) (14) starting at C. Now let's consider Kepler's Monochord No. 2, which is given by the scale (15) (15) This scale as such does not belong to Just System I, since rather than . But if we transpose Kepler's tuning down by a fifth, along the whole keyboard, thus assigning , , , and so forth, then we recover the chromatic scale (Equation2(2) (2) ), as we mentioned earlier. Hence Kepler's system is isomorphic to that of Malcolm, and admits the same aggregation sequence of semitones once a suitable starting point has been established following a transposition.
It may be helpful if we set out the structure of the three systems in a little more detail, showing the breakdown of composite atoms X and Y into their particle constituents:
Just System I: , ,
Just System II: , ,
Just System III: , .
Note that the intervals arising here are well-established in the theory of tone systems, under a variety of names; see, e.g., CitationHelmholtz (1885), CitationLonguet-Higgins (1987), CitationHaluška (2004). We have met the major whole tone , the minor whole tone , and the (minor) diatonic semitone . Now we meet the minor chromatic semitone , the major chromatic semitone , the major diatonic semitone , and the minor Pythagorean semitone .
We observe that various relations involving multiplication by the syntonic comma hold between the semitone intervals determined by Theorem 4.1. In particular, we have (16) (16) What this means is that a set of discrete transformations can be constructed taking the three systems from one to another. Let us write (17) (17) Then one can regard the three θ semitones as being different versions of a common underlying entity, differing by factors of the syntonic comma, and similarly for the two ϕ semitones. One can think of as being a kind of “neutral” semitone, whereas has a “charge” of 1 and has a charge of . Similarly, is neutral and has charge one.
Then the relations between the different systems amount to transferring units of charge from one semitone to another in such a away that the total charges of X and Y remain unchanged. Thus X is a charged whole tone, Y is a neutral whole tone, Z is a neutral semitone, and the three just tuning systems can be set out in the following scheme:
Just System I: , , ,
Just System II: , , ,
Just System III: , , .
Thus in moving from System I to System II, we transfer a unit of charge from to in the Y atom; moving from System I to System III we transfer a unit of charge from to in the X atom; whereas in moving from System II to System III we transfer a unit of charge from to in the X atom and we transfer a unit from to in the Y atom. We see that the three systems are closely related and that there is order to the arrangement.
Remark 4.1
The existence of a five-semitone chromatic scale agreeing with Ptolemy's intense diatonic scale on the notes of that scale can be demonstrated by use of the various semitones appearing in (Equation17(17) (17) ). This can be achieved by setting (18) (18) and using the chromatic aggregation sequence (19) (19)
An example of a historical scale that (modulo transposition) belongs to Just System II can be found in the work of CitationRamis de Parejo (1482). Ramis's Monochord is given by CitationBarbour (1953) both in cents and in the Eitz notation, i.e. in Pythagorean tunings that may be adjusted up or down by one or more syntonic commas. Expressed in rationals, Ramis's tuning starting at C takes the form (20) (20) This is clearly not within the scope of Theorem 4.1, since . Nonetheless, the associated aggregation sequence, which is of the form (21) (21) exhibits the particle structure of System II, which suggests, as we saw in the case of Kepler's scale, that a transposition might bring it into line with Theorem 4.1. Indeed, this turns out to be the case, for if we transpose the scale of Ramis de Pareja down by a fourth, multiplying each note by all along the keyboard, then starting at C, we obtain (22) (22) which evidently satisfies the conditions of Theorem 4.1; and for the associated aggregation sequence starting at C = 1, which is consistent with Just System II, we have (23) (23) Historical examples of scales belonging to Just System III that can be found (without need of transposition) in Barbour's list include Fogliano's Monochord No. 2, and Rousseau's Monochord (CitationFogliano 1529; CitationRousseau 1768). But there are other historical just tunings in Barbour's list that are of interest, some involving as many as four semitones, such as Euler's Monochord, that would also be worthy of study using the methods we have proposed.
Summing up, we have seen that in the just system the notes of various well-established scales can be assembled from a small number of semitone particles, and we have been able to present a more or less complete account of the tuning schemes that can be constructed on the premise that the structure of the Ptolemaic scale is preserved and that the number of semitones is three.
Nevertheless, we are left with the uncomfortable fact that the resulting systems of semitones involve products of rather high powers of prime numbers. Is it possible to improve on the situation in some way, by introducing a new scheme for music that allows one to bring the multiplicities of the primes down, so we are only left with low powers? This is the question we seek to answer in our investigation of complex intonation.
5. Complex diatonic scales
We shall give an interesting example of such a scheme. It goes outside of the usual conventions of music by allowing tones to be represented by complex numbers. We begin with the observation that all the intervals under consideration in the just system are products of powers of the numbers 2, 3 and 5. These numbers are primes in the conventional sense, but 2 and 5 can be factorized over the complex numbers. That is to say, we have (24) (24) In particular, since the note lies midway between C and , this suggests we should set (25) (25) It may not be obvious a priori what kind of acoustic event will trigger the sensation of a complex tone, but since the numbers involved are simple and striking, we are encouraged to explore further, adding further complex tones.
First, we shall pursue the matter informally, as we did in our initial analysis of the just system, following which we presented Theorem 4.1, which gave a precise account of the system in its entirety. Thus here we shall construct an example of a complex scheme by use of clever albeit essentially inductive reasoning. Then once we have by this line of argument shown the existence of a viable complex scheme we proceed to formalize the theory via the presentation of Theorem 7.1.
Let us write ξ for . Since an octave is composed of two tritones, this suggests that we should identify both and as representations of the tritone. As we remark later, our suggestion that and should be regarded as representing the same tone can be understood as a reflection of the fact that they differ by a “unit”, in this case an overall factor of . As basic intervals we retain the diatonic whole tones and , and the semitone , and supplement these with a complex semitone α defined by (26) (26) and a complex whole tone δ defined by (27) (27) The argument is that since we can say that ξ is a fourth above α. Similarly, we have , so we can say that ξ is a third above δ. Note also that , , and . These various relations justify our interpretation of α as a type of semitone and δ as a type of whole tone.
In our complex scheme we keep the just values for , , , and . Then in addition to we can set , , and , since these notes are respectively a fourth below , a minor third below , and a third above . We also need values for D, and B. If we set as one does in just tuning, then although it is a fourth below G, it is a “wolf” fifth below A, since . But if we set , then D would lie a fifth below A, but would be a “wolf” fourth below G, since . Clearly, neither of the choices and are satisfactory.
The resolution of this dilemma brings in the other complex factorization that we have at our disposal, namely . Then, in addition to the complex semitone , which has the property that , we obtain another complex semitone β, defined by (28) (28) with the property that . Then we can set , a semitone above G. We observe that both of the diatonic whole tones factorize, which justifies the interpretation of β as another species of semitone. The product of these complex semitones gives a complex whole tone which we define by setting , and one can check that (29) (29) Note that , so γ lies midway between C and E in the sense that ξ lies midway in the range between the notes C and . Thus, we set , banishing the wolves by moving D off into the complex.
We observe that the intervals and are products of low powers of Gaussian primes and hence both can be regarded as consonant in our scheme. These intervals can be contrasted with the wolves and that arise in the just scheme.
Another unsatisfactory aspect of the just scheme is the value of . This gives us , the same harsh interval that we eliminated by setting . That suggests that instead of we should set . Then we obtain the simple ratio , which by virtue of the fact that it involves only a single prime can be regarded as consonant.
Summing up, with the various adjustments in place, we conclude that the C-major scale in our complex scheme takes the form (30) (30) with the aggregation sequence . Now let us see what happens when we change keys, noting how the new choice for fits into the scheme. Recall that in the just system a change of key from C to G worked out, but at the next level with the key of D one encounters an obstacle. The G-major scale in the complex scheme is (31) (31) starting at . For the corresponding sequence of atoms we have . In particular, one notices that lifts the up to . Then one sees that lifts the up to , by use of the relation .
In the key of D major, there is an added level of complexity on account of the two sharps and the fact that D itself is a complex tone: (32) (32) For the corresponding sequence of atoms we obtain . The treatment of the is like that of the G-major scale. For the , we see that lifts to , and then β lifts to , by use of the identity . Essentially the same line of reasoning can be applied to the keys of A major, E major and B major.
The flat keys show a slightly different pattern. For example, in the case of F major, starting at , we have (33) (33) with the aggregation sequence . Thus the semitone lifts to . Then lifts to . Similar constructions apply to the keys of , , , and .
Let us take stock of the complex whole tones and semitones at our disposal. We have the three semitones (34) (34) along with the four whole tones (35) (35) These intervals can be used to construct complex analogues of the major and minor scales in all twelve keys. The resulting scheme of intervals for the twelve major scales is as follows:
C major F major
G major major
D major major
A major major
E major major
B major major .
Here, the initial note is given for each major scale, followed by the aggregation sequence that generates that scale. The notes C, E, F, G and A take the Ptolemaic values and for we have . Each minor scale is the relative minor of an associated major scale. For example, for C minor we have the sequence obtained by permuting the -major sequence. In this way we obtain a system of major and minor scales in all keys.
6. Well-tempered complexity
Motivated by the constructions of the previous section, we are now in a position to present the complex scheme in its entirety. We begin by recalling a few results concerning so-called Gaussian integers. We write for the ring of ordinary real integers, which we call rational integers. We write for the ring of Gaussian integers, by which we mean complex numbers of the form , where and . Gaussian integers can be added, subtracted and multiplied to yield other Gaussian integers. In what follows we often refer to Gaussian integers simply as “integers”. The complex conjugate of an integer is denoted , and we write for the magnitude of c.
By a unit we mean an integer that divides any integer. There are four units: , and for any integer n we refer to the four numbers as its associates. By a prime we mean any integer p, not 0, and not a unit, that is divisible only by its associates and the units . Every integer n can be expressed in the form , where the are primes. The fundamental theorem of arithmetic for Gaussian integers asserts that this expression is unique, apart from the order of the primes, the presence of units, and ambiguities between associated primes (CitationHardy and Wright 1938).
By a real prime, we mean a Gaussian prime, in the sense just defined, that is real; the conventional primes are all rational primes. For example, 3 and 7 are real primes, whereas and are not; but 2, 3, 5 and 7 are rational primes. A rational prime p>0 is a real prime only if it is of the form p = 4n + 3 for .
A Gaussian integer , , is said to be even if a, b are both even or both odd in the usual sense, and otherwise n is odd. For real n this definition of even and odd coincides with the usual one. Then one can show that an integer is even iff it is divisible by . This generalizes the idea that a rational integer is even iff it is divisible by two.
As usual, we write for rational numbers, by which we mean numbers of the form a/b for and .
The aggregate of numbers of the form , with , constitute the so-called Gaussian rationals .
Given the fundamental theorem of arithmetic for Gaussian integers, one can show that any Gaussian rational has a unique factorization of the form , where the primes are distinct and the are non-vanishing integers. The uniqueness is qualified up to permutation, the presence of units, and ambiguities between associated primes.
It will have been observed that the tones (or intervals) that we have considered so far, apart from the equal temperament semitones, are Gaussian rationals and thus inherit from the multiplicative group structure that is characteristic of the concatenation of intervals. In practice, we are concerned with Gaussian rationals that can be built up from a small basis of Gaussian primes.
Definition 6.1
We say that a tone t is complex five-limit and write if it takes the form (36) (36) for some collection of integers .
It should be plain that a real 5-limit tone , by which we mean a tone of the form , for some choice of , is necessarily a complex 5-limit tone; for clearly, if t is to be real in (Equation36(36) (36) ), we must have a = b and d = e.
More generally, we write for if t can be expressed as a product of powers of prime factors of rational integers that are of magnitude less than or equal to n. Thus, for example, consists of intervals of the form (37) (37) It should be evident that has for each the structure of a free Abelian group and that is a subgroup of for .
This leads us to comment on the significance of units in our tonal language. The discussion above suggests that if represents a complex tone and ϵ is a unit, then represents the same tone as t. Thus, t represents the same musical value as its associates. In particular, ξ, , , and all represent the tone . But , so ξ and both represent .
One might wonder then if there is some redundancy in the expression for (Equation36(36) (36) ), since and are associates, as we have just learned, and hence represent the same prime factor up to a unit. There is indeed some redundancy, but we can use (38) (38) to put (Equation37(37) (37) ) in the form (39) (39) and to put (Equation36(36) (36) ) in the form (40) (40) where ϵ is a unit, since, depending on whether b = 0, 1, 2, 3 mod 4 in (Equation38(38) (38) ) we find that . Then, relabelling the exponents, we obtain (Equation39(39) (39) ) and (Equation40(40) (40) ). Depending on the context, we can use one or the other of the equivalent expressions (Equation36(36) (36) ) and (Equation40(40) (40) ) as a canonical representation for a given tone in the complex 5-limit case.
The latter arises more naturally in prime number theory; but the former arises more directly in music theory. In fact, we can take matters further, and apply a similar line of reasoning to the complex prime factors of the number five. As we have pointed out, since , we can regard each of the complex numbers and as representing a type of whole tone interval. But their quotient is given by the complex phase factor (41) (41) We see that and . Thus, if we assume that γ and represent the same tone, one can ask what the situation is if higher powers of ω are brought into play.
Proposition 6.1
A complex tone ζ will be said to be a phase factor if . A phase factor is complex 5-limit if and only if for some unit ϵ and some it is of the form (42) (42)
Proof.
If ζ is a complex 5-limit tone then it can be put into the form (Equation36(36) (36) ). Then for it to be a phase factor we require b = −a, c = 0 and e = −d. This implies that (43) (43) for some , which shows that any complex 5-limit phase factor is of the form (Equation42(42) (42) ). Conversely, any tone of the form (Equation42(42) (42) ) is indeed a complex 5-limit phase factor.
Remark 6.1
Thus, by virtue of (Equation36(36) (36) ) and (Equation43(43) (43) ) it follows that any complex 5-limit tone admits a unique factorization of the form (44) (44) modulo units and complex 5-limit phase factors.
Remark 6.2
The space , modulo such factors, forms a Generalized Interval System (GIS) in the sense of CitationLewin (1987). We have a triple , where modulo units and complex 5-limit phases is regarded as a space of complex tones, I is another copy of , modulo such factors, now regarded as a group of intervals, and is the map that takes the complex tones to the complex interval , mod units and complex 5-limit phase factors.
7. Complex chromatic scales
Remarkably, we can construct a single complex chromatic scale that can be used for the formation of all of the major scales given in the previous section, along with their modal variations. This scale is as follows: (45) (45) Moreover, this complex chromatic scale can be constructed by aggregation in a very simple way. Only three semitones are required and the aggregation sequence starting at C = 1 is (46) (46) where , , . Note that the first six intervals in the sequence are complex conjugate to the second six. Indeed, one can check that (47) (47) Thus, we have demonstrated the existence of a twelve-tone chromatic scale in the Gaussian rationals, constructed by aggregation from a set of three complex semitones such that (48) (48) and one can check that the chromatic scale (Equation45(45) (45) ) is consistent with the scheme of major and minor scales introduced in Section 5. We are then led to ask if any other complex tuning systems exist based on three semitones. The answer is yes, and surprisingly we find that exactly three such systems exist.
Theorem 7.1
Let S denote a set of three complex 5-limit semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from the elements of S and their complex conjugates such that C = 1, , , , , , , and (b) the aggregation is conjugate symmetric, in the sense that the first six elements of the chromatic series are conjugates of the second six elements, in the same order. Then three and only three such sets exist, given by
Complex System I: , , , ,
Complex System II: ,
Complex System III: .
Proof.
Let the sequence of aggregation take the form (49) (49) where each of the Xs belong to S. Conjugate symmetry implies the sequence takes the form (50) (50) Then we impose the following constraints to fix the designated notes: (51) (51) These relations give , , , , and , where and . Thus, S must contain α and Z alongside another particle. This remaining particle is determined by the constraint , and there are three possibilities:
Complex System I. and are both new particles in this case, and hence complex conjugates, since there can be no more than three particles altogether. Thus we set and for some new semitone β. The relation then implies (52) (52) for some phase factor ζ. Imposing the condition that β should be complex 5-limit and using Proposition 6.1, one finds the solution for β is by (53) (53) where ϵ is a unit and k is an integer. Thus for System I we obtain (54) (54) This is identical to the example we considered earlier in this section, only now with the inclusion of a phase factor parametrized by ϵ and k. If tones correspond to equivalence classes of elements of modulo units and phase factors, then S is uniquely determined.
Complex System II. In this case one of the semitones and is taken to be α or . Let us say . Then implies . But if we express as a multiple of α, then we obtain , and we see that the syntonic comma makes an appearance. Thus for System II we have (55) (55) where . Note that the two complex semitones are proportional in this case, with a real factor of proportionality. The relation between α and is analogous to that between X and Y in the just system, where . Note that instead of , we could have taken , or , or . Thus, System II occurs in four variants. But if tones are equivalence classes of elements of modulo units and complex 5-limit phase factors, then there are only two variants, since α and are equivalent modulo units.
Complex System III. Here one of the semitones and is taken to be . Let us say . Then which gives . Thus for System III we have (56) (56) where . The interval ρ is well known in the theory of tuning and is variously referred to as the minor chromatic semitone or the small half-tone. Again, we could have taken and , so we see that System III occurs in two variants.
We have already presented the complex chromatic scale and the associated aggregation sequence for System I at (Equation45(45) (45) ) and (Equation46(46) (46) ). These values are recorded in Table along with the details of the other two systems of complex tonalities with three semitones. Under System II, a calculation shows that the aggregation sequence starting at C = 1 is (57) (57) and the chromatic scale is given by (58) (58) This scale is characterized by the fact that the notes C, D, E, F, G, A, take on just values, whereas the remaining notes are given by multiples of ξ. In the variant shown, we have taken , which leads to . In the variant where , we obtain . In both cases there is a wolf interval, either between D and A or between D and G. See Table for further details of the variants of System II. Turning now to System III, we see that under its first variant the aggregation sequence takes the form (59) (59) and for the chromatic scale we obtain (60) (60) Note that we have the simple ratio . We also have , which is the just value of this note. In the second variant of System III, we have and , the latter being the so-called augmented fifth, which can be thought of as a compounding of two intervals of a third. In both variants of System III one has .
8. Complex Pythagorean systems
It is not unreasonable to ask whether one can construct complex chromatic systems based on two semitones, rather than three. Whereas the point of departure for the three-semitone complex system was the just scale, the case of two complex semitones begins with consideration of the ancient Pythagorean system, the elements of which we briefly recall.
We use the term “Pythagorean” to refer broadly to the entire class of tuning systems for which intervals are products of powers of the numbers two and three (CitationHaluška 1999; CitationThirring 1990). If we begin at C = 1, then an ascending sequence of tones for can be constructed using the cycle of fifths by multiplying successive terms by and then normalizing with a power of two to bring the pitch back into the original octave. We obtain (61) (61) and so on up the cycle of fifths, but lowering the tones where necessary by one or more octaves to ensure that they all fall within the same octave. A descending sequence can be similarly constructed by multiplying successive terms by , and normalizing, to give (62) (62) and so on down the cycle of fifths. The resulting is different from the , but if we choose one or the other of them then we merge the two series to obtain a Pythagorean version of the chromatic scale, given (in this case, with the ) by (63) (63) It is straightforward to verify that this scale is generated by aggregation from a pair of semitones, given by (64) (64) corresponding respectively to the of the descending series and the of the ascending series (one step beyond the that we have shown). The aggregation sequence is (65) (65) The fact that the diesis σ appears seven times and apotome τ appears five times is reflected in the identity , which is easily checked by use of the relations and . See Table . The origins of the Pythagorean system are shrouded in mystery, but it should be evident that both the pentatonic scale, (66) (66) and the heptatonic scale (67) (67) can be generated aggregatively within this system by pairs of certain atoms, where each atom has a certain particle composition.
In the case of the pentatonic scale, we have the whole tone given by and the minor third given by . The associated aggregation sequence starting at C = 1 is 1 : X, W, X, X, W. But in the case of the heptatonic scale, the system is generated by the whole tone X and the semitone , with the aggregation sequence (68) (68) One observes that if we sharpen the notes E, A and B of Ptolemy's intense diatonic scale (Equation1(1) (1) ) by applying the syntonic comma, then we obtain the Pythagorean heptatonic scale. In particular, we have (69) (69) Once we have selected a twelve-tone chromatic Pythagorean scale by taking a strip of twelve successive notes from the infinite sequence rising above and descending below C, we can for convenience re-write our chromatic scale in such a way that the black keys are represented by notes with sharps, and we have (70) (70) This expression for the chromatic scale is equivalent to that of (Equation63(63) (63) ), but is rather more well suited for a comparison to the scales we have developed based on three semitones. As in the case of three semitones, one sees that the is composed of high powers of primes.
Thus we are prompted to search for a complex Pythagorean system involving lower powers of primes. Following the method of Theorem 7.1, our approach will be (i) to take as given the real Pythagorean tones of the pentatonic scale, (ii) to supplement these with , and (iii) to impose conjugate symmetry. We obtain the following:
Theorem 8.1
Let S denote a system of two complex semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from elements of S and their complex conjugates such that C = 1, , , , , , and (b) the aggregation is conjugate symmetric, in the sense that the first six elements of the chromatic series are conjugates of the second six, in the same order. Then }.
Proof.
Let the aggregation sequence take the form (Equation49(49) (49) ). When we impose conjugate symmetry the sequence reduces to (Equation50(50) (50) ), as in the case of three semitones. Then we impose the following constraints to fix the notes of the pentatonic scale together with : (71) (71) These relations can be solved to deduce that , , , , and . Taking into account that we need to have two semitones in total, we find that the system is uniquely determined to be , where and .
There are two variants: (i) , , and (ii) , . The resulting chromatic scale thus takes two different possible forms, depending on the choice of variant. This affects the notes E and . For variant (i) we obtain (72) (72) with the aggregation sequence (73) (73) Then for variant (ii) we obtain (74) (74) with the aggregation sequence (75) (75)
Remark 8.1
It is interesting to note that it was not necessary to assume that the semitones are complex 3-limit in the formulation of Theorem 8.1. Rather, this feature emerges as a consequence of the weaker conditions that we have imposed, in contrast with Theorem 7.1, where it was assumed that the semitones are complex 5-limit.
9. Back to reality
One can ask whether the complex tone systems that we have considered give rise to real “shadow” systems, where the magnitudes of the complex numbers under consideration might play a role. It would thus seem natural to look for an analogue of Theorem 7.1 in which the role of ξ is replaced by . This takes us out of the rational framework, but is nevertheless of interest since the results run in parallel with the complex system in a nice way. Since one has in equal temperament, as well as in historic tuning systems (for example, that of CitationGrammateus 1518) where arises as the geometric mean between and (CitationBarbour 1953), it makes sense to pursue the matter. We obtain the following:
Theorem 9.1
Let S denote a system of three real semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from elements of S such that C = 1, , , , , , , and (b) the aggregation is symmetric, in the sense that the first six elements of the chromatic series equal the second six elements, in the same order. Then there are three such systems, given by
Shadow System I: ,
Shadow System II: ,
Shadow System III: .
Proof.
The aggregation sequence is evidently of the form (76) (76) with real entries, and the constraints are given by (77) (77) It follows that , , and , where . The need for three semitones in total implies that either (i) , or (ii) , or else (iii) , where η is another semitone. Then by solution of the constraint we are led to the three systems in the claimed result.
As in Theorem 7.1, the first and third semitones are the same in all three systems, and it is the second semitone that differs. Note that Systems II and III each have two variants, depending on the order of the semitones and .
An analogous result can be obtained for the real shadow of the complex Pythagorean system, where we look at the construction of a real two-semitone system matching the Pythagorean pentatonic scale and satisfying .
Theorem 9.2
Let S denote a system of two real semitones satisfying (a) there exists a chromatic scale of twelve notes constructed by aggregation from elements of S such that C = 1, , , , , , and (b) the aggregation is symmetric in the sense that the first six elements of the chromatic series equal the second six elements in the same order. Then there are two such systems, given by
Pythagorean Shadow System I: ,
Pythagorean Shadow System II: .
Proof.
The aggregation sequence takes the form (Equation76(76) (76) ), with real entries, and the constraints (78) (78) It follows that and . That there should be two semitones implies that either , where and is the second semitone, or , where is the second semitone, and this gives the systems stated in the theorem. Note that for System I there are two variants: (a) , , (b) , .
Remark 9.1
Pythagorean Shadow System I is based on the magnitudes of the complex Pythagorean system presented in Theorem 8.1, but Pythagorean Shadow System II arises independently as a second solution of the conditions of Theorem 9.2.
10. Applications, conjectures, and conclusions
Complex tonality opens up the possibility of genuinely new types of musical structures and perhaps also helps us to understand existing structures. This becomes evident in the various harmonic relations holding between scales and between chords. In our Complex System I, all the major scales are different from one another. Thus if we play a simple melody in one key and then transpose it to a different key, the new melody is usually not quite the same as the original. In other words, the various major keys are not transpositionally equivalent. The same is true of the minor keys. In the key of C major, the tonic and subdominant triads are equivalent (they have the same interval structure) but the tonic and the dominant triads are not, on account of the complex values for B and D. Thus for C major and F major we have the triads and , but for G major we have the triad .
The hallmark of the dominant in this case is that the interval B/G is given by and that the interval D/G is given by , whereas the interval D/B is given by , as in the just system. This closeness of relationship between the F-major and the C-major triads may contribute to the perceived gentleness of the plagal resolution from F major to C major in contrast to the more robust authentic resolution from G major to C major.
A well-known problem in the theory of harmony is the question of why major and minor chords should be so different in character from one another to the ears. After all, they are composed of the same intervals, and , merely stacked in the opposite order. There is no obvious acoustic, physiological or psychological reason why they should produce such a different effect on the listener. One might argue along the lines of cultural conditioning, and to be sure, such elements may come in; but surely that cannot be the whole story. In Complex System I, the C-major chord takes the familiar form , whereas by (Equation45(45) (45) ) the parallel C-minor chord is , which can be contrasted to the just Aeolian . But for the relative minor in the complex system, we have , in which the intervals are the same as those of the relative minor in the just system. Thus, in the complex system there is a clear distinction, in the key of C major, between the structure of the parallel minor (with its complex minor third interval) and the relative minor (with its real minor third interval), whereas in the just system these two minor chords have the same structure.
Such distinctions may help us better understand, for example, the startling contrasts in the relations between C major and C minor, on the one hand, and between C major and A minor, on the other hand, in the two movements of Beethoven's Opus 111 Piano Sonata. That the first relation is like that of heaven and hell, and the second like that of acceptance and resignation. One recalls Kretschmar's lecture in Thomas Mann's Doctor Faustus.
Our remarks cannot do justice to the profundity of Beethoven's composition; nonetheless, the relations between these two different means of forming a minor chord from a major chord, and the nature of the mental states induced by these chords, are so important that any insight into the way the mind takes on board these distinctions may be of some interest.
Indeed, if one looks at the Opus 111 Sonata from the point of view of its embedding in the complex system, more of its structure becomes apparent. Structural relations between major and minor chords are evident more generally if one views the neo-Riemannian theories (CitationCohn 1998, Citation2012) from this angle. For instance, the system of hexatonic relations beginning at C major involves a progression from the real to the complex and back under Complex System I. The well-known hexatonic cycle (79) (79) obtained by successive parallel and leading-tone shifts, is given in the Keplerian system by (80) (80) But in Complex System I this sequence of transformations takes the following form: (81) (81) One sees in particular that the starting chord C major has no complex elements, whereas the associated polar chord minor is constructed entirely from complex elements. Again, more structure is revealed.
Let us consider another example, once more involving the complex -minor chord. This example concerns the role of the tritone in so-called vagrant chords (CitationSchoenberg 1978). Ordinarily, the tritone B/F is taken to be a dissonant interval, but there are settings in which it has the opposite effect. Such a setting occurs notably, to varying degrees, in the context of the minor sixth chord and its permutations. The most well known of these harmonious chords involving the tritone is the -minor sixth, which after permutations we recognize as the Tristan chord, which in just intonation takes the form .
The remarkable aspect of this chord is that in the presence of the two higher notes the tritone seems to lose its dissonance in such a way that the chord as an entièreté acquires that uncanny sense of beauty tinged with tragedy that permeates the whole of the opera Tristan und Isolde. We can only speculate (among many others – see, e.g., CitationCohn 2012; CitationSchoenberg 1978) how this comes about, but it may be that in isolation the tritone is perceived by the ear as the dissonant interval , with its multiplicity of prime powers; but when nudged by the presence of the other elements of the chord the mind flips (as with optical illusions) to a perception of this interval in its representation by the most basic of the Gaussian primes.
It is interesting in this context to recall that Lewin (1993) in a notable passage considers the harmonic intuition associated with the stimulus of the interval . In essence he argues (quite persuasively) that the intuition of the interval takes the to be the mediant of the dominant of the dominant, and thus interprets the as a leading tone:
Observe that the number arises here not as the product of and and 5, which is the most natural mathematical factorization. Rather, arises as the product of the four factors 2, , and , reflecting its “natural” way of measuring an intuited chain of intuitions in the given situation.
These arguments appear in the context of a wider discussion of the just system, in which musical intervals take the form for , in connection with which Lewin asserts the following:
It is not immediately clear what intuitions of “distance” or “motion” we are measuring by these intervals. Personally, I am convinced that our intuitions are highly conditioned by cultural factors. In particular, I do not think that the acoustics of harmonically vibrating bodies provide in themselves an adequate basis for grounding those intuitions.
Thus it may be that complex intervals map to certain intuitions of chains of intuitions, just as do the intervals of the just system in Lewin's analysis – that an acoustic stimulus may indeed trigger this intuition, in a setting where cultural factors play a significant role. Pursuing the idea further one could then conjecture that the setting of the tritone in the environment of the two higher notes of the Tristan chord is sufficient as an acoustic trigger for the interval to map to the complex interval ξ rather, than, say, a leading tone represented by the mediant of the dominant of a dominant – the latter would suggest a resolution into C major, which is clearly not what one observes, despite the many tactics that Wagner does employ in his score to achieve what one might call tentative resolutions.
In short, we propose that when the tritone is heard in isolation it is registered by the ear as harsh, that when it is heard in the context of, say, an eighteenth century contrapuntal development, it triggers a leading-tone interpretation along the lines of Lewin's gestalt; but that in the setting of the minor sixth chord the mind identifies this interval as the complex prime , which it treats as pleasurable, luxurious, even exotic – thus perhaps justifying the term diabolus in musica which has been applied to this most unstable of intervals.
According to this view, acoustic phenomena can trigger certain intuitions already established in the mind – either innately or through cultural conditioning – which can then be identified with certain mathematical objects. These mathematical objects are constructed from sets of complex 5-limit intervals, which we recognize as melodies, harmonies, rhythms, and so forth. The complex interval system itself can be viewed as an abstraction, and hence the precise form taken by this map from acoustic phenomena to the interval system will inevitably vary somewhat from person to person, from piece to piece, from culture to culture, and over time, even if the existence of such a map can be regarded as a universal.
Under the Complex System I, the Tristan chord takes the form . Let us put this array under the microscope and deconstruct it into its constituent intervals, of which there are six in all, comparing these to the corresponding intervals arising in the just version of the chord. For the interval B/F we obtain ξ in the complex case, instead of the Keplerian tritone , as we have already noted. We take ξ to be consonant since it is a low power of a prime. For the interval /B we obtain a perfect third in the complex system, as opposed to the fraction that arises in the just system. For the dissonant interval /F in the just system we obtain . The corresponding interval in the complex system takes the form , which involves fewer primes (if we do not count powers of 2) and hence can be viewed as no more dissonant that . The interval is given by in the just system. In the complex system, we use the prime factorization to get , more complicated than the just expression, but involving low powers of primes, and hence consonant. We can parse this interval as , showing it can be interpreted as the product of a minor third and a complex whole tone. This may explain why the fourth between the and the is softened, and seems darker in the context of the Tristan chord than is does on its own. The interval is given by in the just system, whereas it takes the form in the complex system, again with low powers of primes.
Finally, we observe that the sixth between B and is given by fraction in the just system, whereas it takes the form in the complex system, involving only a small number of primes. All the intervals within the Tristan chord arising in the complex system are harmonious (or are at least as harmonious as one might wish), whereas in the just system three of the intervals have large powers of primes and are unacceptable as candidates for consonance. In equal temperament all the intervals are out of tune, though it is difficult to say they are approximately in tune, since one cannot say what form “exact” tuning might take. The equally tempered version of the chord sounds acceptable – after all, this is what we hear when we attend the opera or play passages from it on the piano – much as it would have in Wagner's day. It follows that the conventionally performed version of the chord in the setting of the score is sufficient to act as a stimulus for the mind to receive it as a signal for the relevant tones of the complex system, along the lines we have discussed. We remark, incidentally, that all of the tritone intervals in Complex System I take the value ξ, modulo units and 5-limit phase factors. This is consistent with the observation that even under Wagner's many modulations, the Tristan chord does not lose its magic.
In conclusion, it may be appropriate to make a few brief remarks concerning our research methodology, even if these remarks may be irrelevant to the question of how one might further progress the work. It is not unusual in science for new ideas to arrive in a flash, after lengthy consideration of a body of earlier material, in reaction to which the earlier material can be reorganized and better understood. It may even then look to the outsider as if these developments of the earlier material somehow led to the new ideas, though in reality the sequence of events was opposite. In the present work, our realization that setting led rather naturally to the construction of complex scales with interesting properties prompted fresh investigations into the foundations of the just system, which in turn led to Theorem 4.1. It is not surprising therefore that our Theorem 4.1 does not quite have the character of an incremental development based on established ideas in music analysis. This is because its methods are fresh and offer new insights, as is evidenced in our analysis of the historical just scales catalogued by CitationBarbour (1953). With Theorem 4.1 at hand, we were then in a position to investigate the complex system more systematically, leading to Theorem 7.1. This required bringing in more mathematics, in the form of various results from the theory of Gaussian integers. The connections to word theory, generalized musical intervals, and the theory of scales came later, and although these latter areas have influenced the presentation of our work, they were not involved in its initial formulation.
It would be disingenuous to suggest that the results we have reported here came about simply as the product of a methodical research program aimed at improving on the just system, somehow eventually leading to the introduction of complex tonality. Nonetheless, going forward, it would make sense to pursue the approach that we have introduced more systematically in the investigation of both real and complex n-limit tone systems. That the prime decompositions and lead to such a wealth of musical structure may come as a surprise. But we can take the view that musical cognition entails the formation of a map from a class of acoustic phenomena to a suitably established tone system, the latter being essentially a mental construct. And if the tuning of our ears cannot stretch beyond the confines of 5-limit tones, whether real or complex, then our minds can. In Nietzsche's words, “Neue Ohren für neue Musik.” Hence, from the perspective of the mathematician, the complex n-limit tone systems may well constitute a natural framework within which music of the future can be composed.
Acknowledgements
We are grateful for comments from participants at RealTime Seminars in the Department of Computing at Goldsmiths University of London, June 2022 and May 2023, and the IMA Conference on Mathematics in Music at the Royal College of Music, London, July 2022, where drafts of this work were presented. We thank J. Armstrong, T. Blackwell, D. M. Blasius, T. Crawford, J. Forth, G. Di Graziano, D. P. Hewett, K. Rietsch and two anonymous referees for helpful comments and suggestions.
Disclosure statement
No potential conflict of interest was reported by the authors.
References
- Barbour, J. M. 1953. Tuning and Temperament: A Historical Survey. 2nd ed. East Lansing: Michigan State College Press.
- Barker, A. 1984, 1989. Greek Musical Writings. Vols. I, II. Cambridge: CUP.
- Barker, A. 1994. “Greek Musicologists in the Roman Empire.” Apeiron 27 (4): 53–74. https://doi.org/10.1515/APEIRON.1994.27.4.53.
- Bibby, N. 2003. “Tuning and Temperament: Closing the Spiral.” In Music and Mathematics, edited by J. Fauvel, R. Flood, and R. Wilson, Oxford: OUP.
- Clampitt, D. 2017. “Lexicographic Orderings of Modes and Morphisms.” In The Musical-Mathematical Mind: Patterns and Transformations, edited by G. Pareyón, S. Pina-Romero, O. A. Agustín-Aquino, and E. Lluis-Puebla, 89–97. Cham: Springer.
- Clampitt, D. 2019. “An Overview of Scale Theory Via Word Theory.” Brazilian Journal of Music and Mathematics 3 (2): 1–17.
- Clampitt, D., and T. Noll. 2018. “Naming and Ordering the Modes, in Light of Combinatorics on Words.” Journal of Mathematics and Music 12 (3): 134–153. https://doi.org/10.1080/17459737.2018.1519729.
- Cohn, R. 1998. “Introduction to Neo–Riemannian Theory: A Survey and a Historical Perspective.” Journal of Music Theory 42 (2): 167–180. https://doi.org/10.2307/843871.
- Cohn, R. 2012. Audacious Euphony: Chromaticism and the Consonant Triad's Second Nature. Oxford Studies in Music Theory. Oxford: OUP.
- Dominguez, M., D. Clampitt, and T. Noll. 2009. “WF Scales, ME Sets, and Christoffel Words.” In Mathematics and Computation in Music, edited by T. Klouche and T. Noll, 477–488. Berlin: Springer.
- Ellis, A. J. 1864. “On the Conditions, Extent, and Realization of a Perfect Musical Scale on Instruments with Fixed Tones.” Proceedings of the Royal Society 13 (60): 93–108.
- Euler, L. 1739. “Tentamen Novae Theoriae Musicae Ex Certissismis Harmoniae Principiis Dilucide Expositae. Petropoli, Ex Typographia Academiae Scientiarum.” Opera Omnia: Series 3, 1:197–427. Eneström E33
- Euler, L. 1774. “De Harmoniae Veris Principiis Per Speculum Musicum Repraesentatis.” Novi Commentarii Academiae Scientiarum Petropolitanae 18:330–353. Opera Omnia: Series 3, 1: 568–586. Eneström E457
- Fogliano, Lodovico. 1529. Musica Theorica. Venice.
- Grammateus, Henricus. 1518. “Arithmetica Applicirt Oder Gezogen Auff die edel Kunst Musica.” In Ayn New Kunstlich Beuch. Nürnberg.
- Haluška, J. 1999. “Searching the Frontier of the Pythagorean System.” Tatra Mountains Mathematical Publications 16:273–282.
- Haluška, J. 2004. The Mathematical Theory of Tone Systems. Boca Raton, Florida: CRC, Taylor & Francis.
- Hardy, G. H., and E. M. Wright. 1938. An Introduction to the Theory of Numbers. Oxford: Clarendon Press.
- Helmholtz, H. 1885. On the Sensation of Tone as a Physiological Basis for the Theory of Music. Translated by A. J. Ellis. London: Longmans, Green and Co.
- Klopp, G. C. 1974. Harpsichord Tuning. Translated by Glenn Wilson. Garderen, Holland: Werkplaats Voor Clavecimbelbouw.
- Lewin, D. 1987. Generalized Musical Intervals and Transformations. New Haven: Yale University Press.
- Longuet-Higgins, H. C. 1987. Mental Processes: Studies in Cognitive Science. Cambridge, Massachusetts: MIT Press.
- Orbach, J. 1999. Sound and Music. Lanham, Maryland: University Press of America.
- Penrose, R. 2000. “The Heritage of Pythagoras: Nineteen to the Dozen.” In Musicology and Sister Disciplines, edited by D. Greer. Oxford: OUP.
- Ramis de Parejo, Bartolomeus. 1482. Musica Practica. Bologna.
- Rousseau, Jean-Jacques. 1768. Dictionnaire de Musique. Paris: Chez la Veuve Duchesne.
- Schoenberg, A. 1978. Theory of Harmony. Translated by Roy E. Carter. Berkeley: University of California Press.
- Scholes, P. A. 1955. Oxford Companion to Music. 9th ed. Oxford: OUP.
- Solomon, J. 2000. Ptolemy Harmonics: Translation and Commentary. Leiden: Brill.
- Thirring, W. 1990. “Über Vollkommene Tonsysteme.” Annalen der Physik 502 (2–3): 245–250. https://doi.org/10.1002/(ISSN)1521-3889.
- Tovey, D. 1929. “Harmony.” In Encyclopedia Brittanica. 14th ed. J. L. Garvin (editor). London: Encyclopedia Brittanica.
- Žabka, M. 2011. “Introduction to Scale Theory over Words in Two Dimensions.” In Mathematics and Computation in Music, edited by C. Agon, M. Andreatta, G. Assayag, E. Amiot, J. Bresson, and J. Mandereau. Lecture Notes in Computer Science (LNCS 6726). Berlin: Springer.
Appendix
List of frequently occurring intervals
Table