![MathJax Logo](/templates/jsp/_style2/_tandf/pb2/images/math-jax.gif)
ABSTRACT
In the following article, we construct an interaction model of general language change. This contributes in particular to quantitative studies on reversible language change initiated by G. Altmann by adding explanatory character in tracing global features of general language change back to the individual interaction of speakers. Although the corresponding coupled differential equations are (presumably) non-integrable, we use methods from the theory of dynamical systems to deduce the long-term behaviour (depending on four interaction parameters) of the model for any given initial constellation of speakers. Subsequent numerical analysis of real data on language change is used to justify the relevance of the constructed model for the practicing quantitative linguist. We show how data-fitting methods can be used to determine the four interaction parameters and predict from them the long-term behaviour of the system, i.e. if complete language change or reversible language change will take place.
1. Introduction and Historical Context
Since its discovery, Piotrowski’s law (Piotrovskaja & Piotrovskij, Citation1974) has been tested and verified in various contexts of Quantitative Linguistics. The widespread use of this law, often called the ‘S-law’, has several reasons. Firstly, the similarity to the simplest mathematical model of the spread of contagious diseases, the SIR model (for an explanation see below), is evident (or more precisely, the Piotrowski-Altmann law can even be deduced from that analogy) and puts language change on a sound footing: the interaction of individual speakers. Secondly, the corresponding dynamical system is exactly soluble, thus there exists a mathematical function with two parameters (later three parameters) which can be fitted data-analytically to an empirically obtained dataset resulting in the possibility to immediately judge the quality of the modelling procedure. There has been some effort to apply the Piotrowski-Altmann law to situations like reversible language change, where for example a word initially gains popularity before later growing out of fashion (cf. the well-known paper by Altmann (Citation1983)). Another situation where variants of the Piotrowski-Altmann law have been successfully applied is the two-stage change Vulanović and Baayen (Citation2007). Since Piotrowski’s law (Piotrovskaja & Piotrovskij, Citation1974) is a priori either monotonically increasing or decreasing, it is principally not suitable for modelling such situations. In Altmann (Citation1983) and Vulanović and Baayen (Citation2007), the constant coefficient in the logistic differential equation is replaced by a linear or a higher-order polynomial in the time variable to account for reversible language change or the two-stage change. This is therefore of quantitative value but not of explanatory value for the underlying processes. As Altmann states in (Altmann, Citation1983) on pages 87–89, the next step should be a model of language change which comprises reversible language change in such a way that it can be traced back to the genuine interactions of speakers. This is the first step to connect language change to social, economic or historic reasoning.
The motivation of this paper lies in the fact that we wish to construct such an interaction model for language change (the so-called PLC model) for speakers of some language, which contains Piotrowski-Altmann’s law as a special case but which also contains other forms of language change like the reversible language change.
If we assume that the widespread use of the Piotrowski-Altmann law rests on the two criteria that it explains language change as an interaction process and that it is completely integrable, we meet the first of these two criteria. This is a starting point for the PLC model to become a model of value to a practising quantitative linguist. Even though our model is not exactly soluble, it can be analysed with fairly straightforward mathematical methods, leading to a complete classification of the resulting scenarios of language change depending on four interaction parameters. The model is also accessible to numerical analysis, which we carry out doing in the last part of the paper to show how empirical data-fitting can be employed to obtain the interaction parameters in order to be able to make predictions about language change in special situations. We provide the used Code (in Python) in the form of a jupyter-notebook so that the interested reader has an interactive format to check the validity of the individual steps or to apply the described methods to own datasets.
Crucial to showing the described classification of language change is the non-existence of periodic orbits of the underlying dynamical system, which we show in this paper. This is of mathematical interest in its own right.
For the sake of completeness, we wish to point out that another approach, in spirit very similar to ours, is Wheeler (Citation2007), who proposes similar techniques for a different model and with a different focus.
2. The Description of the PLC Model
2.1. The Motivational Case: SIR Model
The PLC model (short for Progressive, Liberal, Conservative) is set up in analogy to the SIR model, which is often used to study the spread of contagious diseases like measles. Here stands for the number of ‘Susceptible Individuals’,
for the number of ‘Infectious Individuals’ and
for the number of ‘Resistant Individuals’. The SIR model is a system of coupled differential equations which mimic the interaction of the described groups. If for example a susceptible and an infected individual meet, there is a certain probability that the number of infected individuals increases by one and the number of susceptible individuals decreases by one (i.e. the susceptible individual has been infected). The model has gained some popularity in the Corona pandemic, cf (Dönges et al., Citation0000). We are interested in a slightly different model adopted to the current situation of language change, hence we are not going to delve into the SIR model, which has only been named for motivational reasons (the interested reader is referred to (Wikipedia, Citation2023b)). Language change bears a strong resemblance: if two speakers meet (an ‘infected’ one who already uses some new linguistic construct like a new word and a ‘susceptible’ one, who is prone to learn the new word), there is a certain probability that the susceptible speaker will adopt the new word. We wish to point out that Piotrowski-Altmann’s law can be deduced from a special case of the SIR-model (where no ‘Resistant’ individuals are present) and results in the so-called ‘logistic equation’, which arises in various areas of mathematical modelling.
2.2. Assumptions of the PLC Model
In the following, we discuss the assumptions made to obtain a meaningful model. Firstly we are, as in the case of the Piotrowski-Altmann law, taking no local variations like borders or other language boundaries into account. There are also models that do, but they are of a very different flavour, cf (Vogl & Prochazka, Citation2020). We therefore assume a homogeneous speaker community of total size distributed evenly over some fictional country. We further assume that our country is divided into equally sized ‘interaction’ spaces (imagine an equally spaced grid), meaning that if two individuals accidentally meet in such a space, there is a certain probability of interaction (expressed below by a Greek letter). Since we assume evenly distributed populations, the probability for an individual of group A to be in such an interaction space is proportional to the number
of individuals of population A. Assuming the independence of the spatial distribution of two groups A and B, the probability of an individual of A and one of B to meet in a certain interaction space is proportional to
. We assume further that the total population of speakers has been exposed to a new linguistic construct like a new word which exists for a pre-existing one in the considered language (hence there is an old and a new version: ward, wurde in German, cf (Altmann, Citation1983).
Instead of the above-mentioned three groups (S,I,R) in the context of infectious diseases, we consider in the context of language change:
• (P): progressive speakers, who only use the new feature (e.g. word) and try to spread its use.
• (L): liberal speakers, who are indifferent towards using the new feature (but do not speak it)
• (C): conservative speakers, who refrain from using the new feature and try to convince others to use the old feature.
As in the SIR model, describes the number of individuals in the population of the progressive speakers at time
. Analogously for
From the assumption about the total size, it follows for all times:
Now we describe the interactions:
If a P and an L meet: There is a certain probability that L is converted to P (encoded in
).
If a C and an L meet: There is a certain probability that L is converted to C (encoded in
).
If a P and a C meet: There is a certain, individual probability that each is converted to L (encoded in
).
Remark 2.1. Please note that the Greek parameters introduced above are not actually the probabilities, since for the sake of simplicity additional aspects are incorporated into these parameters (e.g. number of interaction spaces and the like). Nevertheless, they are proportional to the interaction probabilities.
Thus, the coupled system of differential equations of the PLC model reads:
A prime indicates differentiation with respect to time (the independent variable in the above system of coupled differential equations). In the following, we are going to deduce properties of the described dynamical system and set them into perspective with regard to different scenarios of language change.
3. Mathematical Analysis of the Model
A priori, the PLC model is three-dimensional, but since the total number of speakers is preserved, it can be reduced to a two-dimensional dynamical system.
3.1. Reduction to Two Dimensions
Applying to eliminate the variable
, we obtain the system:
For convenience, it is best to normalize the involved quantities and express the system in terms of the lower-case letters and in the renamed parameters:
A short calculation shows that the PLC model is equivalent to the following dynamical system:
The dynamical variables describe the fraction of the total population of progressive and conservative speakers, respectively. Hence the range of
is restricted to the unit interval
and
. Moreover, since the sum of the fractions cannot exceed 1:
. Thus, the dynamics of the system is restricted to the triangle:
In the sequel, we refer to the above dynamical system ( together with the equations) as the PC system.
3.1.1. Remark on the Redefined Parameters
As can be seen directly from the definition as well as
(if generically
). Since this is crucial for the analysis in the following we generically deduce:
. Further, it follows straight away that
For a discussion of the non-generic cases, see Section 3.5.
3.1.2. Remark on the Relation to the Piotrowski-Altmann Law
What immediately springs to mind is the relation to the logistic growth. In each of the two equations, the first term precisely describes the logistic growth which in the second term is reduced proportionally to interactions.
Thus, we can deduce instantaneously that the Piotrowski-Altmann law is comprised within the PC model. To be more precise, if we start out with no conservative speakers at all, then their number remains zero for all times and we end up with:
which is the logistic differential equation or in relative terms:
For the sake of completeness, we include the solution to the differential equation 13 (the usual form of the well-known Piotrowski-Altmann law):
To include incomplete language change, the law is generalized to:
3.1.3. Remark on Possible Generalizations and Their Implications
It is not hard to show that the model can also be applied to certain situations where the interaction parameters are also negative. A necessary condition is given by:
This condition assures that the flow of the dynamical system points inwards of . The interpretation of the interactions is then as follows: A negative
for example can be interpreted as the possibility that progressive speakers can turn conservative speakers into progressive ones without having to change them to liberal speakers first. The actual process is more involved but can be seen in exactly that way. For example, if
is negative and
, it is assured on the one hand that the number of liberal speakers will not change in time if progressive and conservative speakers meet. On the other hand, meeting of progressive and conservative speakers results in a decline in the number of conservative speakers and in an increase in the number of progressive speakers.
Linguistically, this case may reflect a new loanword that does not seem really alien to a conservative speaker from the very beginning because its phonological and phonotactic structure is also typical of words of their own language, e.g. German cool, Trend, Flirt or Trick, taken in from English. (So in a way, the conservative speaker behaves progressively, possibly without intending to.)
A very interesting issue arises if one allows more than one negative parameter. As it turns out in this case, given the right choice of parameters, the dynamical system even allows periodic orbits. This gives rise to a periodically repeating process! We plan to investigate this matter further in future research but refrain from doing so in the current paper.
A possible numerical example with one negative parameter (curve-fitting for the e-epithesis 4) is given by: . More details about the mathematical analysis in this case are given in Section 3.5.
The methods described in the subsequent sections are also applicable to a much wider class of models describing the interactions of speakers. To give an inspiration of possible generalizations, we wish to point out that the conclusions of the paper carry over almost unaltered to models governed by systems of coupled differential equations of the form:
where are arbitrary positive, smooth functions and where
are real numbers.
3.2. The Critical Points
A short calculation shows that the critical points of the PC system (where both ,
vanish, i.e. where the dynamical system becomes stationary) lie at:
It is not hard to see that the point always lies within
. This corresponds to the following scenarios:
At
: in our speaker community, there are only liberal speakers, but with no exposure to the new feature (so only the old feature is used).
At
: in our speaker community, there are only progressive speakers, so everyone adopts the new feature (complete language change).
At
: in our speaker community, there are only conservative speakers, so nobody adopts the new feature.
At
: depending on the parameters, there is a fixed share of speakers using the new feature and the rest not using it (incomplete language change).
3.3. Outlook and Putting into Context
We are going to show that in the long term, the scenarios described above are the only possible outcomes allowed by the PC model. By this we mean that given the parameters , any imaginable speaker constellation will for long times approach one of the described scenarios and we are able to tell what the outcome will be. We therefore expect great predictive power of our model when combined with suitable data-fitting procedures. To be more precise, we expect that given a dataset of how the usage of some new feature has developed over some period of time, after applying a data-fitting procedure to obtain estimates for the parameters, we can predict how the usage of the feature under consideration will terminate, i.e. if it becomes extinct, will be used by all speakers or only by a certain share of speakers. This gives a precise meaning to the classification of scenarios of language change claimed in the introduction.
3.4. Classification of Long-Term Behaviour Depending on Generic Parameters
Dynamical systems, even in the supposedly easiest cases, show an intricate complexion of possible behaviour. The study of dynamical systems started with famous researchers like Newton, Lagrange and Poincaré trying to understand problems of celestial mechanics. Even two-dimensional dynamical systems can show chaotic behaviour and are in general hopelessly difficult to analyse (cf. Hilberts 16.th problem (Wikipedia, Citation2023a)). A recurring tool which often proves to be successful in the analysis of dynamical systems is the number of periodic orbits and their relation. A periodic orbit describes a specific configuration which returns to its initial position after evolving for some time (the period) according to the rules of the dynamical system. An infamous example in the solar system is given by the trajectory of the Earth around the Sun due to the gravitational law. Its period is obviously given by one year. We are going to show in the following that the PC system does not allow any periodic orbits. This will be the starting point for further analysis of the long-term behaviour of the PC system. This section deals with the generic case if all interaction parameters
are greater than zero. In Section 3.5 we will discuss the model in a singular setting. In the following, it is therefore understood that:
.
Theorem 3.1.
The PC System does not allow periodic orbits.
The idea of the proof of Theorem 3.1 is as follows: we will argue by contradiction and assume that there exists such a periodic orbit of period
. We proceed by showing that according to the rules of the PC system, the average position
of the periodic orbit coincides with the critical point . The desired contradiction is then obtained by showing that a potentially existing periodic orbit cannot encircle the critical point but has to stay on one side of the critical point and would thus pull the average in
away from
towards that side.
3.4.1. Fish-Trapping in the PC System
In order to formalize the last step in the outlined sketch of the proof, we start by proving a proposition called the fish trap in the following. To this end, we firstly calculate the locus of vanishing derivative in the -direction and the
-direction, respectively. Setting
to zero, a short calculation shows that
are the lines of vanishing and
, respectively.
Obviously, along the line , the dynamical system has to flow in the
-direction (any flowing in the
-direction is prohibited by its very definition) and along the line
the dynamical system flows in the
-direction. The intersection of both lines distinguishes the critical point
at which all derivatives vanish. More can be said about the flow: above the line
, the
-derivative is negative, below the line
, the
-derivative is positive. Mutatis mutandis, the same is true for
. The area between the two lines together with the critical lines
except point
(cf. grey area in ) make up the fish trap, which traps any flow-line for all times (hence the nomenclature).
Definition 3.2.
The sectors and
() together with the lines
but without the critical point
make up an area referred to as the fish trap.
Proposition 3.3
(Fish trap). If a trajectory enters the fish trap, it will stay there for all times.
Proof.
In order to avoid distraction from the main line of argumentation, the proof is deferred to the appendix.□
3.4.2. Non-Existence of Periodic Orbits
Now we can turn towards proving theorem 3.1.
Proof.
Assume to the contrary that there exists a non-trivial periodic orbit of period , which is notated by
. By non-trivial we mean that it lies completely in the interior of
and that there exist times
such that
. In particular, no critical point lies on the trajectory of the periodic orbit, since it would have to stay there for all times. By the assumptions of non-triviality
, we can thus integrate
Since the integral of equals
, by the periodicity, it follows straight away that the left hand side vanishes. Hence, we find:
Dividing by yields:
where denotes the mean value of
, analogously for
. The same can be done with
, yielding:
Solving the two equations for , we can conclude that the point with coordinates
coincides with the critical point
. By non-triviality
is not any of the critical points, thus assume first that
lies on or above the line
.
If stays above
for all times (for example if it stays in sector
), then
for all
. Integrating over the period
and dividing by
gives
showing that the second equation obtained above for the mean values is violated. Thus
has to cross
eventually. By proposition 3.3,
is then confined to sector
for all times coming. Since it cannot leave the fish trap anymore, being periodic, it already had to start there (it cannot reach the potential starting point anymore). But then it had been below
already for all times, leading to an analogous contradiction as before. This shows that the assumption about the existence of a periodic orbit had been wrong from the beginning, proving the assertion of the theorem.□
3.4.3. Convergence of Trajectories
Due to H. Poincaré and I. Bendixson, there is a strong result about the behaviour of two-dimensional dynamical systems related to their periodic orbits. In the case, we are concerned, it basically asserts that any trajectory converges either towards a periodic orbit or to a critical point. Having previously excluded the existence of periodic orbits, using the Poincaré-Bendixson theorem, we can deduce strong conclusions for the PC system about the long-term behaviour:
Theorem 3.4.
Every trajectory in the PC system converges to one of the critical points.
Proof.
In order to avoid distraction from the intended application to linguistics, we defer the proof together with the required prerequisites to the appendix.
We summarize what we have obtained: Given any Point , if we follow the trajectory through
in the PC system long enough, we will end up arbitrarily close to one of the critical points
. Which one is determined by the parameters
. Hence, the long-term behaviour of the language change modelled by the PC system is completely determined by the parameters.
We are going to describe this dependence more precisely. To this end, consider the flow of the dynamical system in the vicinity of a critical point at a specific time
and at a time
infinitesimally later. Since the flow fixes the critical point
(the flow is stationary there), points near
are flowing to points near
. Hence, the flow between time
and time
amounts to a linear map of neighbourhoods of
, the linearization of the flow which captures its essential features. The linear map is given by the
matrix
of partial derivatives (‘linearization of the flow’) of the equations of motion. Now a positive eigenvalue of
corresponds to a point keeping its direction but flowing away from the critical point (trajectories close to this one are thus repelled by the critical point), whereas a negative eigenvalue of
corresponds to point keeping its direction but flowing towards the critical point (thus nearby trajectories are attracted). To briefly describe the remaining cases: if an eigenvalue is zero, then the flow stagnates in the corresponding direction (consists of fixed points), whereas for a complex eigenvalue of
the flow in the vicinity of the corresponding critical point would show rotatory character. Thus by calculating the eigenvalues of
for all critical points of the PC system, we can determine the behaviour of the flow. More details can be found in any standard textbook on dynamical systems or differential equations, for example in Zill and Cullen (Citation2001).
Therefore, we differentiate both defining equations of the PC system with respect to and form the matrix:
We need to substitute the critical points for into
and then find the eigenvalues. A positive eigenvalue corresponds to a repelling eigen direction, a negative eigenvalue to an attracting eigendirection.
1.
obviously has two repelling eigendirections (the - and
-axes).
2.
3.
4.
3.4.4. Description of the Resulting Scenario
Please note that is a repelling critical point. Thus we discard it from the list below, because the trajectories will never end there. Further note that away from the
-axis, the trajectories are pushed into
. From the previous deduction, we find the following scenario: each of the critical points
is attracting. Moreover,
is a saddle point (has a repelling and an attracting eigendirection). Thus except for two trajectories coming in exactly in the attracting eigendirection, all other trajectories end in either
or
. These two trajectories constitute the so-called separatrix of the PC system since they separate the space of trajectories in those converging to
or
.
3.5. Classification of Long-Term Behaviour in the Singular Cases
In this section, we are going to repeat the analysis above for special cases of the interaction parameters and discuss their implications. We therefore recall the definitions
In general, the critical point is situated in the interior
of the triangle
. We call this the generic situation. The situation where the critical point
does not exist or is situated on the boundary
of the triangle
is called singular. The above analysis deals therefore with the generic case. To analyse the singular cases, we discuss the different possible loci which the critical point
can have on the boundary
. Therefore, consider for nomenclature:
1. Generic case
We included this point only to show that (our definition of being generic) is equivalent to the conditions on the parameters used in the previous section. Indeed, from
in follows straight away that:
From this we directly conclude: and
(by definition of
it is clear that
) and analogously
and
. Thus, the generic case is equivalent to
and
together with
.
2. One negative parameter
It must be assured that the flow of the dynamical system points not outwards of . Otherwise the flow could generate a negative proportion of liberal speakers. The condition
(by calculation of the dot-product
) ensures this. A short inspection reveals that the critical point
has at least one negative component, so that it is not situated within
. Thus, the flow of the dynamical system in
is reduced to two of the sectors
in . The conclusions are the same as before in such a situation (convergence to one of the critical points as in proposition 7.8).
3.
The line is bound to go through
whereas the line
is bound to go through
. Since
, we see that
forces
to be identical with the x-axis. Hence
and
in all of
(in our terminology from above, only the sectors
survive in
since
lie beneath the line
). But on the x-axis (
) the flow can in general only be horizontal, but since
is defined as the line where
vanishes, it follows straight away that all points on
are critical. The methods of the generic case go through unaltered, except that we have to show that any limit-set can contain only one critical point (which follows straight away from a monotonicity argument). Thus we get the same conclusions: any trajectory ends at one of the critical points (here
or a point of the segment
). As before consider the matrix
obtained from linearization of the flow. At the point
we find:
For , the quantity
is positive, whereas for
it is negative. Hence, to the left of the critical point
, the critical points of the
-axis are repelling whereas to the right of
they are attracting. Again, the fish-trapping lemma remains valid, thus any trajectory entering sector II can only converge to the critical point
, thus realizing reversible language change as no progressive speakers exist. All other trajectories which never enter sector II will end up at one of the attractive critical points with
. This is an instance of incomplete language change since in the long term, we end up with a percentage
of progressive speakers and the rest being liberal speakers (no conservative speakers left) not using the new feature.
4.
Exactly analogue to the previous case by exchanging . The interpretation is as follows: any trajectory either ends on
, in the case of which we have complete language change, or they end at one of the critical points
with
. In this case, no progressive speakers are present and hence we have reversible language change (the new feature becomes extinct).
5. in the third case
This means that and that
(we already established
before). Thus, we get the implications of both previous singular cases: All points on either
are critical and attracting. A short calculation shows that the convergence to these critical points flows along the straight lines with equations:
. The interpretations from above carry over: either incomplete or reversible language change.
6. in fourth case
Now all trajectories are lying in sector and are thus converging to
. This is another instance of reversible language change with the new feature becoming extinct.
7. in third case
Now all trajectories are converging to and we find complete language change in all cases.
8. Points on become critical
In this case, we must have that both equal the line with equation
. This implies that
and
. But then, all points on
are critical (being on
they satisfy
and by the same argument
.) An argument similar to the one before shows that the points
are attractive critical points and all trajectories will end at one of these critical points. Again this is an instance of incomplete language change, so that in this case we find only progressive (percentage
) and conservative speakers (percentage
) whereas no liberal speakers are present anymore.
4. Application to General Language Change
4.1. General Conclusions from the Mathematical Section
In this section, we only discuss the generic case, since for convenience we included the interpretation of the singular cases in section 3.5. The application to language change is now clear if we recall the meaning of the variable (the percentage of progressive speakers in the speaking community) and the variable
(the percentage of conservative speakers in the speaking community). Hence, we find a very specific choice of initial data (ratios of progressive to conservative speakers), which results in an unstable equilibrium. Even the slightest alteration of this ratio leads to a totally different long-term behaviour, either to complete language change or to the extinction of the new feature (reversible language change). Thus, we have now qualitatively solved the PLC model for language change completely, given the interaction parameters
. Provided that the model proves successful to describe real data of language change, the gain from such a model is a considerable improvement to older approaches due to the fact that as soon as any data-fitting procedure has produced estimates for the interaction parameters, the PLC model allows for predictions about the long-term behaviour of the system without having to do any further calculations. In the prevailing majority of modelled situations, fairly strong conclusions about the stability of the predictions can be concluded from the interaction parameters.
4.2. Testing the PLC Model on Empirical Data
The aim of this section is to describe how we approached the numerical testing of the PLC model. We are not going to repeat well-known facts about data-fitting procedures which have been described extensively, for example in the context of language change in Altmann (Citation1983). We use the relevant packages of the scipy, numpy modules of the programming language Python. To get better accessibility for the reader, we use a jupyter-notebook which allows for a hybrid environment consisting of the executable Code and explanations thereof. The files used below can be downloaded from GitLab under the following link https://gitlab.fosbos-rosenheim.de/pub/.
Only one aspect which to our knowledge is not (obviously) standard material on data-fitting is the fact that due to the (presumed) non-integrability of the PLC model we had to obtain the family of functions allowed for the data-fitting procedure by numerical integration. To this end, we applied another Python module using a suitable Runge-Kutta solver. We start the numerical analysis by comparing the PLC model to the well-studied Piotrowski-Altmann law.
4.2.1. Comparison to the Well-Studied Piotrowski-Altmann Law
In the following, we compare the Piotrowski-Altmann law to the PLC model. We take data from Best and Kohlhase on page 97 in (Best & Kohlhase, Citation1983), which was also used by Altmann in (Altmann, Citation1983). The data below show the development over time of the percentage of usage of the (new) word wurde opposed to the (old) word ward in German.
Using Piotrowski’s law, Altmann obtained as optimal function: . The graphic shows the solution from the PLC model (red) in comparison to Altmann’s optimal function (blue) on the dataset described above. The fitting parameters for both models can be found in the Appendix 7.3.
4.2.2. Testing on the e-Epithesis
The e-epithesis is, according to Imsiepen(Citation1983), an example of reversible language change. By e-epithesis one understands the phenomenon in Early Modern High German to put an additional -e to the end of strong verbs in the preterite, for example sahe opposed to sah. Over time, this tendency initially started to increase before eventually growing completely out of fashion. We use the data from Imsiepen(Citation1983).
In , we compare the fitting of the model from Altmann in (Altmann, Citation1983), the generalized logistic curve from (Vulanović & Baayen, Citation2007) using a third-order polynomial in the time variable and the PLC model. The graph of Altmann’s attempt is orange, the generalized logistic curve is green whereas ours is red. The parameters of each model are given in the Appendix 7.3. Although the data is quite spread out, one can see that our model does show an improvement to the older models.
4.2.3. Testing on the Development of Periphrastic Do
According to Vulanović and Baayen (Citation2007), there is another form of language change, exemplified by the proportion of periphrastic-do constructions around 1560 which can be viewed as a two-stage change. By this it is meant that the data shows a slow-down (or even a decrease) before it starts increasing again. Vulanović and Baayen used a generalized logistic function in (Vulanović & Baayen, Citation2007), where they applied (besides another approach) a polynomial of order in the time-variable. We compare the generalized logistic function to the PLC model. Even though the PLC model cannot cope with an actual decrease (under the assumption of time-independent coefficients!), the following example shows how fitting of a slow-down is realized by the PLC model. In (Vulanović & Baayen, Citation2007) six different sentence types are analysed, here we consider only the two most interesting ones for our application and refer the interested reader to https://gitlab.fosbos-rosenheim.de/pub/ for fitting of the other examples. Originally, the data has been collected by Ellegåard (Citation1953) (Table 7). We use the data as presented in (Vulanović & Baayen, Citation2007) (Table 1). There the 13 periods in (Ellegåard, Citation1953) have been reduced to 11 to compensate for the different number of texts in the periods considered (cf (Vulanović & Baayen, Citation2007). p. 2 for further discussion).
The graphs show good fitting results for both models (parameters are given in 4), but the PLC model shows less tendency to overshooting on the (time-)ends of the datasets ().
4.2.4. Prediction Capabilities of the PLC Model
In this section, we want to examine the capability of the PLC model to predict the future development of processes of language change. We therefore left out the last six data points in the case of the e-epithesis and the last three data points in the case of affirmative declarative do and applied the fitting procedures to the remaining datasets (parameters are given in 7.3). In the graphics below the missed out data points are displayed in red so that it becomes possible to judge how well the model approximates the future development of the process under consideration.
5. Conclusion
As stated, the PLC model is not limited to the context of linguistics. Since it abstractly describes the interplay between progressive, liberal and conservative influences, it should be also applicable to various different settings situated in sociology, economics or political sciences. This is subject to further study.
ADpredict.eps
Download EPS Image (72.2 KB)delta.eps
Download EPS Image (18.2 KB)pio.eps
Download EPS Image (52.6 KB)fish_trap_1.eps
Download EPS Image (42 KB)ephpredict.eps
Download EPS Image (69.1 KB)eph.eps
Download EPS Image (61.4 KB)propft.eps
Download EPS Image (31.6 KB)Sprachwandel_end.tex
Download Latex File (67.2 KB)Acknowledgments
We are deeply grateful to Professor Gabriel Altmann, without whose positive feedback we would have never considered to prepare this work for publication. This article grew out of a semester work in 2020 of one of us (E.S.) and an email exchange with Prof. Altmann. Sadly, during the preparation, Prof. Altmann passed away. We also like to thank the anonymous referees for valuable suggestions which considerably improved the content and the presentation of the manuscript.
Disclosure Statement
No potential conflict of interest was reported by the author(s).
Supplementary Material
Supplemental data for this article can be accessed online at https://doi.org/10.1080/09296174.2024.2344946
Additional information
Funding
References
- Altmann, G. (1983). Das Piotrowski-Gesetz und seine Verallgemeinerungen. In K. H. Best & J. Kohlhase, (eds) Exakte Sprachwandelforschung (pp. 59–90). Göttinger Schriften zur Sprach- und Literaturwissenschaft, GöSSL 2.
- Beardon, A. (1997). Limits, a new approach to Real Analysis. Springer Verlag.
- Best, K. H., & Kohlhase, J. (1983). Der Wandel von ward zu wurde. In K. H. Best and J. Kohlhase (Eds.), Exakte Sprachwandelforschung (pp. 91–102). Göttinger Schriften zur Sprach- und Literaturwissenschaft, GöSSL 2.
- Dönges, P., Götz, T., Krüger, T., Niedzielewski, K., Priesemann, V., & Schäfer, M. SIR-Model for households. https://arxiv.org/pdf/2301.04355.pdf
- Ellegåard, A. (1953). The auxiliary do: The establishment and regulation of its use in English. Almquist&Wiksell.
- Imsiepen, U. (1983). Die e-Epithese bei starken Verben im Deutschen. In K. H. Best & J. Kohlhase (Eds.), Exakte Sprachwandelforschung. Göttinger Schriften zur Sprach- und Literaturwissenschaft, GöSSL 2.
- Piotrovskaja, A. A., & Piotrovskij, R. G. (1974). Mathematičeskie modeli v diachronii i tekstoobrazovanii. In Statistika reči i avtomatičeskij analiz teksta (pp. 361–400). Leningrad, Nauka 1974.
- Teschl, G. (2012). Ordinary differential equations and dynamical systems (Vol. 140). Graduate Studies in Mathematics, Amer. Math. Soc.
- Verhulst, F. (1996). Nonlinear differential equations and dynamical systems, Universitext (2nd ed.). Springer-Verlag.
- Vogl, G., & Prochazka, K. (2020). Was treibt den Sprachwechsel? Physik Journal, 19(2), 35–39.
- Vulanović, R., & Baayen, H. (2007). Fitting the development of periphrastic do in all sentence types. In P. Grzybek & R. Köhler (Eds.), Exact methods in the study of language and text (pp. 679–688). Mouton de Gruyter.
- E. S. Wheeler. (2007). Language change in a communication network. In P. Grzybek & R. Köhler (Eds.), Exact methods in the study of language and text (pp. 689–698). Mouton de Gruyter.
- Wikipedia. (2023a). Hilberts 16.Tes Problem. Retrieved June 18, 2023, from. https://de.wikipedia.org/wiki/Hilbertsche_Probleme#Hilberts_sechzehntes_Problem
- Wikipedia. (2023b). SIR-Modell. Retrieved September 5, 2023, from https://de.wikipedia.org/wiki/SIR-Modell
- D. G. Zill, & M. R. Cullen. (2001). Differential equations with Boundary-Value Problems (5th ed.). Brooks/Cole.
Appendix
In this appendix, we give the detailed proofs of some results in the sections above which are necessary for exactness but which are not needed on a first reading to grasp the main ideas this work is based upon. Furthermore, we present the parameters of the fitting curves of Section 4.2.
7.1. Proof of the Fish-trap
Proposition 7.1
(Fish trap). If a trajectory enters the fish trap, it will stay there for all times.
Proof.
Without loss of generality, the trajectory starts in sector
and enters the fish trap on the boundary of sector
at time 0. Hence
Now assume the opposite to the statement of the proposition, i.e. there exists a time such that
is outside the fish trap either in sector
or in sector
. By the Intermediate Value Theorem (IVT) reference to Beardon (Citation1997) (the page number of the reference is 52) there exists a time
where
leaves the fish trap into the relevant sector, say without loss of generality, sector
. Thus
. To leave the fish trap, the velocity vector
has to point out of the fish trap and thus has to satisfy:
![](/cms/asset/a2c1f7de-96fb-4ab0-8713-a62032df23d8/njql_a_2344946_uf0001_oc.jpg)
Hence . Now on the line
, it follows by definition that
, hence
. With
, this yields
. But this contradicts the fact that
is negative along that part of the line
lying above
. Hence, the assumption was wrong, and
cannot leave the fish trap into sector
. The argument carries over verbatim to the case of entering into sector
or with the trajectory starting in sector
. If it started already in sectors
or
, it could not leave by the same arguments.
7.2. Proof of the Main Theorem
To state the theorem (and deduce our conclusions), we must first fix some notation.
Definition
7.2 (Dynamical system). Let be a smooth function from an open set
into
. Then, the differential equation
defines a dynamical system on .
Definition
7.3 (Trajectory of a dynamical system). Given a dynamical system on the set . A smooth curve
is called an integral of the dynamical system if its velocity vector
coincides with
for all times. The set of points traced out in
by
is called the corresponding trajectory.
Remark 7.4. By abuse of notation, if there is no danger of confusion, we will sometimes use the terminology trajectory and integral interchangeably.
Definition
7.5 (limit set). Given a dynamical system on the set . A point
belongs to the limit set of the point
if there exists a sequence of times
going to
such that the integral
of the dynamical system through
satisfies
We denote the limit set (the set of all points sharing the property above) for by
Theorem 7.6
(Poincaré-Bendixson). (Theorem 7.16 on page 223 in Citation2012) Let a dynamical system on be given, fix a point
and suppose
is compact, connected and contains finitely many critical points. Then one of the following cases holds:
Now we can state:
is a critical point
is a non-trivial periodic orbit
consists of finitely many critical points
and non-closed trajectories connecting them.
To deal with assumptions in theorem 7.6 we quote:
Theorem 7.7.
(Theorem 4.2 in Citation1996) If a trajectory through is confined to some bounded region, then the set
is compact, connected and non-empty.
We need one more result for the proof:
Proposition 7.8.
If is not a critical point, then there does not exist an integral
and a sequence
such that
Proof.
Assume that there is such a point . Since the argument carries over mutatis mutandis to the other sectors, we will assume without loss of generality that
is in sector
. It is clear that
has to stay in sector
for all times, because once it would enter sectors
or
it will never be able to return close to
in sector
due to proposition 3.3. But then
will be above
and
for all times. By the defining property of
this implies that for
we have:
Hence as well as
are monotonically decreasing sequences of real numbers bounded below by zero. By the Archimedean axiom and the assumptions made about
, they converge as follows:
Note that it must hold
Due to the monotonicity it also follows for
Hence for it follows that
Now we are going to evaluate the mean values as in the proof of theorem 3.1 and deduce the desired contradiction. Given any , choose a
such that for
within distance
of
and
within distance
of
we have:
Such a exists by the continuity of
in
.
Now choose so big such that
hence we find for all :
As before consider the integrals
On the other hand, we have from the PC system:
Integrating as before shows
Consider and note, as was discussed before, that
which yields
We abbreviate . Hence, we find that
lies in the
-neighborhood of
.
By choosing small enough and
big enough, we find that the right hand side
of the inequality (54) becomes arbitrarily small. This in turn implies that
closely satisfies the linear equation
which is the line . The analogous argument for
implies that
closely satisfies the equation of the line
, hence
are arbitrarily close to the critical point
. However, we have shown that
converge to the point
, which is different from
by assumption, hence for
small enough and
big enough we get the desired contradiction.□
With these prerequisites we can now turn towards the classification:
Theorem 7.9.
Every trajectory in the PC system converges to one of the critical points.
Proof.
Any trajectory of the PC system is bounded since it is confined to , theorem 7.7 implies that the limit set of any point is then non-empty, compact and connected. Together with the fact that there exist only four critical points we can apply theorem 7.6. Since theorem 3.1 prohibits the existence of periodic orbits, the limit-set of any point in
is either a critical point or a connected set consisting of some critical points together with trajectories between them. But proposition 7.8 also excludes non-critical points on a trajectory connecting the critical points. Thus, we are left with only critical points as limit-sets. The same argument as in the proof of proposition 7.8 guarantees that the convergence is actually true in the continuous sense (to exclude the possibility that a trajectory might recede from a limit point between times
). We briefly recall the argument. Either
is completely contained in sectors
or
, then we are either above or below both of
and the convergence is monotonic. If it enters
or
, by proposition 3.3 it will stay there for all times and also experience monotonic convergence.
7.3. The parameters of the Fitting Functions
For the sake of completeness, we present the parameters of the fitting functions in Section 4.2. First recall the definition of parameters of the fitting functions:
Piotrowski-Altmann
Altmann,
Altmann,
PLC model
solving
• Piotrowski-Altmann;
- Piotrowski-Altmann
- PLC model
• e-epithesis;
- Altmann
- Altmann
- PLC model
• affirmative declarative (AD);
- Altmann
- Altmann
- PLC model
• negative declarative (ND);
- Altmann
- Altmann
- PLC model
• e-epithesis-prediction;
- Altmann
- Altmann -predict
- PLC model
- PLC model-predict
• affirmative declarative-prediction;
- Altmann
- Altmann -predict
- PLC model
- PLC model-predict