Full article: Games of entangled agents

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

The paper extends the mathematical formalism of quantum physics to include games of intelligent agents that communicate due to entanglement. The novelty of the approach is based upon a human factor-based behavioral model of an intelligent agent. The model is quantum inspired: it is represented by a modified Madelung equation in which the gradient of quantum potential is replaced by a specially chosen information force. It consists of motor dynamics simulating actual behavior of the agent, and mental dynamics representing evolution of the corresponding knowledge base and incorporating this knowledge in the form of information flows into the motor dynamics. Due to feedback from mental dynamics, the motor dynamics attains quantum-like properties: its trajectory splits into a family of different trajectories, and each of those trajectories can be chosen with the probability prescribed by the mental dynamics; each agent is entangled (in a quantum way) to other agents and makes calculated predictions for future actions; human factor is associated with violation of the second law of thermodynamics: the system can move from disorder to order without external help, and that represents intrinsic intelligence. All of these departures actually extend and complement the classical methods making them especially successful in analysis of communications of agents represented by new mathematical formalism, and in particular, in agent-based economics with a human factor.

Keywords:

Public Interest Statement

The novelties introduced in the manuscript, and in particular, departures from Newtonian approach actually extend and complement the classical methods making them especially successful in analysis of communications of agents represented by new mathematical formalism, and in particular, in agent-based economics with a human factor.

1. Introduction

This paper is devoted to a new approach to differential games, (Isaacs, Citation1965), i.e. to a group of problems related to the modeling and analysis of conflict in the context of a dynamical system. In corresponding agent-based models, the “agents” are “computational objects modeled as interacting according to rules” over space and time, not real people. The rules are formulated to model behavior and social interactions based on incentives and information. Such rules could also be the result of optimization, realized through use of AI methods.

We will concentrate on the non-Newtonian properties of dynamics describing a psychology-based behavior of intelligent agents as players. In other words, we will introduce dynamical systems with a human factor.

1.1. Justification for non-Newtonian approach

All the previous attempts to develop models for so-called active systems (i.e. systems that possess certain degree of autonomy from the environment that allows them to perform motions that are not directly controlled from outside) have been based upon the principles of Newtonian and statistical mechanics. These models appear to be so general that they predict not only physical, but also some biological and economical, as well as social patterns of behavior exploiting such fundamental properties of non-linear dynamics as attractors. Notwithstanding indisputable successes of that approach (neural networks, distributed active systems, etc.), there is still a fundamental limitation that characterizes these models on a dynamical level of description: they propose no difference between a solar system, a swarm of insects, and a stock market. Such a phenomenological reductionism is incompatible with the first principle of progressive biological evolution associated with Darwin. According to this principle, the evolution of living systems is directed toward the highest levels of complexity if the complexity is measured by an irreducible number of different parts, which interact in a well-regulated fashion (although in some particular cases deviations from this general tendency are possible). At the same time, the solutions to the models based upon dissipative Newtonian dynamics eventually approach attractors where the evolution stops while these attractors dwell on the subspaces of lower dimensionality, and therefore, of the lower complexity (until a “master” reprograms the model). Therefore, such models fail to provide an autonomous progressive evolution of living systems (i.e. evolution leading to increase in complexity). Let us now extend the dynamical picture to include thermal forces. That will correspond to the stochastic extension of Newtonian models, while the Liouville equation will extend to the Fokker–Planck equation that includes thermal force effects through the diffusion term. Actually, it is a well-established fact that evolution of life has a diffusion-based stochastic nature as a result of the multi-choice character of behavior of living systems. Such an extended thermodynamics-based approach is more relevant to model of living systems, and therefore, the simplest living species must obey the second law of thermodynamics as physical particles do. However, then the evolution of living systems (during periods of their isolation) will be regressive since their entropy will increase. Therefore, Newtonian physics is not sufficient for simulation the specific properties typical for intelligence.

There is another argument in favor of a non-Newtonian approach to modeling intelligence. As pointed out by Penrose (Citation1955), the Gödel’s famous theorem has the clear implication that mathematical understanding cannot be reduced to a set of known computational rules. That means that no knowable set of purely computational procedures could lead to a computer-control robot that possesses genuine mathematical understanding. In other words, such privileged properties of intelligent systems as common sense, intuition, or consciousness are non-computable within the framework of classical models. That is why a fundamentally new physics is needed to capture these “mysterious” aspects of intelligence, and in particular, to decision-making process.

2. Dynamical model for simulations

In this section, we review and discuss a behavioral model of intelligent agents, or players. The model is based upon departure from Newtonian dynamics to quantum inspired dynamics that was first introduced in Zak (Citation1998, Citation2007, Citation2008, Citation2014a, Citation2014b), Figure .

Figure 1. Classical physics, quantum physics, and physics of life.

2.1. Destabilizing effect of Liouville feedback

We will start with the derivation of an auxiliary result that illuminates departure from Newtonian dynamics. For mathematical clarity, we will consider here a one-dimensional motion of a unit mass under action of a force f depending upon the velocity v and time t and present it in a dimensionless form(1) $\dot{v} = f (v, t)$ (1)

referring all the variables to their representative values v₀, t₀, etc.

If initial conditions are not deterministic, and their probability density is given in the form(2) $ρ_{0} = ρ_{0} (V), where ρ \geq 0, and \int_{- \infty}^{\infty} ρ d V = 1$ (2)

while ρ is a single-valued function, then the evolution of this density is expressed by the corresponding Liouville equation(3) $\frac{\partial ρ}{\partial t} + \frac{\partial}{\partial V} (ρ f) = 0$ (3)

The solution of this equation subject to initial conditions and normalization constraints (2) determines probability density as a function of V and t:(4) $ρ = ρ (V, t)$ (4)

Remark. Here and below we make distinction between the random variable v(t) and its values V in probability space.

In order to deal with the constraint (2), let us integrate Equation (3) over the whole space assuming that ρ → 0 at |V| → ∞ and |f| < ∞. Then(5) $\frac{\partial}{\partial t} \int_{- \infty}^{\infty} ρ d V = 0, \int_{- \infty}^{\infty} ρ d V = const,$ (5)

Hence, the constraint (2) is satisfied for t > 0 if it is satisfied for t = 0.

Let us now specify the force f as a feedback from the Liouville equation

(6) $f (v, t) = ϕ [ρ (v, t)]$ (6)

and analyze the motion after substituting the force (6) into Equation (2)(7) $\dot{v} = ϕ [ρ (v, t)],$ (7)

This is a fundamental step in our approach. Although the theory of ODE does not impose any restrictions upon the force as a function of space coordinates, the Newtonian physics does: equations of motion are never coupled with the corresponding Liouville equation. Moreover, it can be shown that such a coupling leads to non-Newtonian properties of the underlying model. Indeed, substituting the force f from Equation (6) into Equation (3), one arrives at the non-linear equation of evolution of the probability density(8) $\frac{\partial ρ}{\partial t} + \frac{\partial}{\partial V} {ρ ϕ [ρ (V, t)]} = 0$ (8)

Let us now demonstrate the destabilizing effect of the feedback (6). For that purpose, it should be noticed that the derivative ∂ρ/∂v must change its sign at least once, within the interval −∞ < v < ∞, in order to satisfy the normalization constraint (2).

But since(9) $S i g n \frac{\partial \dot{v}}{\partial v} = S i g n \frac{d ϕ}{d ρ} S i g n \frac{\partial ρ}{\partial v}$ (9)

there will be regions of v where the motion is unstable, and this instability generates randomness with the probability distribution guided by the Liouville equation (8). It should be noticed that the condition (9) may lead to exponential or polynomial growth of v (in the last case the motion is called neutrally stable, however, as will be shown below, it causes the emergence of randomness as well if prior to the polynomial growth, the Lipcshitz condition is violated).

2.2. Emergence of self-generated stochasticity

In order to illustrate mathematical aspects of the concepts of Liouville feedback in systems under consideration as well as associated with it instability and randomness, let us take the feedback (6) in the form(10) $f = - σ^{2} \frac{\partial}{\partial v} ln ρ,$ (10)

to obtain the following equation of motion(11) $\dot{v} = - σ^{2} \frac{\partial}{\partial v} ln ρ,$ (11)

This equation should be complemented by the corresponding Liouville equation (in this particular case, the Liouville equation takes the form of the Fokker–Planck equation)(12) $\frac{\partial ρ}{\partial t} = σ^{2} \frac{\partial^{2} ρ}{\partial V^{2}}$ (12)

Here v stands for a particle velocity, and σ² is the constant diffusion coefficient.

The solution of Equation (12) subjects to the sharp initial condition(13) $ρ = \frac{1}{2 σ \sqrt{π t}} exp (- \frac{V^{2}}{4 σ^{2} t})$ (13)

describes diffusion of the probability density, and that is why the feedback (10) will be called a diffusion feedback.

Substituting this solution with Equation (11) at V = v one arrives at the differential equation with respect to v (t)(14) $\dot{v} = \frac{v}{2 t}$ (14)

and therefore,(15) $v = C \sqrt{t}$ (15)

where C is an arbitrary constant. Since v = 0 at t = 0 for any value of C, the solution (15) is consistent with the sharp initial condition for the solution (13) of the corresponding Liouvile equation (12). The solution (15) describes the simplest irreversible motion: it is characterized by the “beginning of time” where all the trajectories intersect (that results from the violation of Lipcsitz condition at t = 0, Figure ), while the backward motion obtained by replacement of t with (−t) leads to imaginary values of velocities. One can notice that the probability density (13) possesses the same properties.

Figure 2. Stochastic process and probability density.

It is easily verifiable that the solution (15) has the same structure as the solution of the Madelung equation (Zak, Citation2014b), although the dynamical system (11), (12) is not quantum! The explanation of such a “coincidence” is very simple: the system (11), (12) has the same dynamical topology as that of the Madelung equation where the equation of conservation of the probability is coupled with the equation of conservation of the momentum. As will be shown below, the systems (11), (12) neither quantum nor Newtonian, and we will call such systems quantum-inspired, or self-supervised.

Further analysis of the solution (15) demonstrates that the solution (15) is unstable since

(16) $\frac{d \dot{v}}{d v} = \frac{1}{2 t} > 0$ (16)

and therefore, an initial error always grows generating randomness. Initially, at t = 0, this growth is of infinite rate since the Lipchitz condition at this point is violated

(17) $\frac{d \dot{v}}{d v} \to \infty at t \to 0$ (17)

This type of instability has been introduced and analyzed in Zak (Citation1992). The unstable equilibrium point (v = 0) has been called a terminal attractor, and the instability triggered by the violation of the Lipchitz condition—a non-Lipchitz instability. The basic property of the non- Lipchitz instability is the following: if the initial condition is infinitely close to the repeller, the transient solution will escape the repeller during a bounded time while for a regular repeller the time would be unbounded. Indeed, an escape from the simplest regular repeller can be described by the exponent v = v₀e^t. Obviously, v → 0 if v₀ → 0, unless the time period is unbounded. On the contrary, the period of escape from the terminal attractor (15) is bounded (and even infinitesimal) if the initial condition is infinitely small, (see Equation (17)).

Considering first Equation (15) at fixed C as a sample of the underlying stochastic process (13), and then varying C, one arrives at the whole ensemble characterizing that process (see Figure ). One can verify that, as follows from Equation (13), (Risken, Citation1989), the expectation and the variance of this process are, respectively,(18) $\bar{v} = 0, \tilde{v} = 2 σ^{2} t$ (18)

The same results follow from the ensemble (15) at −∞ ≤ C ≤ ∞. Indeed, the first equality in (18) results from symmetry of the ensemble with respect to v = 0; the second one follows from the fact that(19) $\tilde{v} \propto v^{2} \propto t$ (19)

It is interesting to notice that the stochastic process (15) is an alternative to the following Langevin equation, (Risken, Citation1989)(20) $\dot{v} = Γ (t), \bar{Γ} = 0, \tilde{Γ} = σ$ (20)

that corresponds to the same Fokker–Planck Equation (12). Here, Γ(t) is the Langevin (random) force with zero mean and constant variance σ.

Thus, the emergence of self-generated stochasticity is the first basic non-Newtonian property of the dynamics with the Liouville feedback.

2.3. Second law of thermodynamics

In order to demonstrate another non-Newtonian property of the systems considered above, let us start with the dimensionless form of the Langevin equation for a one-dimensional Brownian motion of a particle subjected to a random force, (Risken, Citation1989)(21) $\dot{v} = - k v + Γ (t), < Γ (t) > = 0, < Γ (t) Γ (t^{'}) > = 2 σ δ (t - t^{'}), [Γ] = 1 / s$ (21)

Here, v is the dimensionless velocity of the particle (referred to a representative velocity v₀), k is the coefficient of a linear damping force, Γ(t) is the Langevin (random) force per unit mass, σ > 0 is the noise strength. The representative velocity v₀ can be chosen, for instance, as the initial velocity of the motion under consideration.

The corresponding continuity equation for the probability density ρ is the following Fokker–Planck equation(22) $\frac{\partial ρ}{\partial t} = k \frac{\partial (V ρ)}{\partial V} + σ \frac{\partial^{2} ρ}{\partial V^{2}}, \int_{- \infty}^{\infty} ρ d V = 1$ (22)

Obviously without external control, the particle cannot escape from the Brownian motion.

Let us now introduce a new force (referred to unit mass and divided by v₀) as a Liouville feedback(23) $f = σ exp \sqrt{D} \frac{\partial}{\partial v} ln ρ, [f] = 1 / s$ (23)

Here D is the dimensionless variance of the stochastic process $D (t) = \int_{- \infty}^{\infty} ρ V^{2} d V$ ,

Then the new equation of motion takes the form(24) $\dot{v} = - k v + Γ (t) + σ exp \sqrt{D} \frac{\partial}{\partial v} ln ρ,$ (24)

and the corresponding Fokker–Planck equation becomes non-linear(25) $\frac{\partial ρ}{\partial t} = k \frac{\partial (V ρ)}{\partial V} + σ (1 - exp \sqrt{D}) \frac{\partial^{2} ρ}{\partial V^{2}}, \int_{- \infty}^{\infty} ρ d V = 1$ (25)

Obviously, the diffusion coefficient in Equation (25) is negative. Multiplying Equation (25) by V², then integrating it with respect to V over the whole space, one arrives at ODE for the variance $\tilde{v}$ (t)(26) $\dot{D} = 2 [σ (1 - exp \sqrt{D}) - k D]$ (26)

Thus, as a result of negative diffusion, the variance D monotonously vanishes regardless of the initial value D(0). It is interesting to note that the time T of approaching D = 0 is finite(27) $T = \frac{1}{2} \int_{D (0)}^{0} \frac{d D}{σ (1 - exp \sqrt{D}) - k D} \leq \frac{1}{2 σ} \int_{0}^{\infty} \frac{d D}{exp \sqrt{D} - 1} = \frac{π}{6 σ}$ (27)

This terminal effect is due to violation of the Lipchitz condition, at D = 0, (Zak, Citation2014a).

Let us review the structure of the force (23): it is composed only of the probability density and its variance, i.e. out of the components of the conservation Equation (25); at the same time, Equation (25) itself is generated by the equation of motion (24). Consequently, the force (23) is not an external force. Nevertheless, it allows the particle escape from the Brownian motion using its own “internal effort.” It would be reasonable to call the force (23) an information force since it links to information rather than to energy.

Thus, we came across the phenomenon that violates the second law of thermodynamics when the dynamical system moves from disorder to the order without external interactions due to a feedback from the equation of conservation of the probability to the equation of conservation of the momentum. One may ask why the negative diffusion was chosen to be non-linear. Let us turn to a linear version of Equation (26)(28) $\frac{\partial ρ}{\partial t} = - σ^{2} \frac{\partial^{2} ρ}{\partial V^{2}}, \int_{- \infty}^{\infty} ρ d V = 1$ (28)

and discuss a negative diffusion in more detail. As follows from the linear equivalent of Equation (26)(29) $\dot{D} = - 2 σ, i.e. D = D_{0} - 2 σ t < 0 at t > D_{0} / (2 σ)$ (29)

Thus, eventually the variance becomes negative, and that disqualifies Equation (29) from being meaningful. As shown in Zak (Citation2014a), the initial value problem for this equation is ill-posed: its solution is not differentiable at any point. Therefore, a negative diffusion must be non-linear in order to protect the variance from becoming negative, Figure .

Figure 3. Negative diffusion.

It should be emphasized that negative diffusion represents a major departure from both Newtonian mechanics and classical thermodynamics by providing a progressive evolution of complexity against the second law of thermodynamics.

In the next subsection, we will demonstrate again that formally the dynamics introduced above does not belong to the Newtonian world; nevertheless, its self-supervising capability may associate such a dynamics with a potential model for intelligent behavior. For that purpose, we will turn to even simpler version of this dynamics by removing the external Langevin force and simplifying the information force.

In 1945, Schrödinger wrote in his book “What is life”: “Life is to create order in the disordered environment against the second law of thermodynamics.” The self-supervised dynamical system introduced above is fully consistent with this statement. Indeed, consider a simplified version of Equations (21) and (22)

(30) $\dot{v} = σ \sqrt{D} \frac{\partial}{\partial v} ln ρ,$ (30)

(31) $\frac{\partial ρ}{\partial t} = - σ \sqrt{D} \frac{\partial^{2} ρ}{\partial V^{2}}, \int_{- \infty}^{\infty} ρ d V = 1$ (31)

Removal of the Langevin forces makes the particle isolated. Nevertheless, the particle has a capability of moving from disorder to order. For demonstration of this property, we will assume that the Langevin force was suddenly removed at t = 0 so that the initial variance D₀ > 0. Then,(32) $\dot{D} = - 2 σ \sqrt{D}$ (32) (33) $w h e n c e D = {(\sqrt{D_{0}} - σ t)}^{2}$ (33)

As follows from Equation (33), as a result of internal, self-generated force(34) $F = σ \sqrt{D} \frac{\partial}{\partial v} ln ρ,$ (34)

the Brownian motion gradually disappears and then vanishes abruptly:(35) $D \to 0, \dot{D} \to 0, \frac{d \dot{D}}{d D} \to \infty a t t \to \frac{\sqrt{D_{0}}}{σ}$ (35)

Thus, the probability density shrinks to a delta-function at $t = \frac{\sqrt{D_{0}}}{σ}$ . Consequently, the entropy $H (t) = - \int_{V} ρ ln ρ d V$ decreases down to zero, and that violates the second law of thermodynamics.

Another non-Newtonian property is entanglement.

2.4. Entanglement

In this subsection, we will introduce a fundamental and still mysterious property that was predicted theoretically and corroborated experimentally in quantum systems: entanglement. Quantum entanglement is a phenomenon in which the quantum states of two or more objects have to be described with reference to each other, even though the individual objects may be spatially separated. This leads to correlations between observable physical properties of the systems. As a result, measurements performed on one system seem to be instantaneously influencing other systems entangled with it. Different views of what is actually occurring in the process of quantum entanglement give rise to different interpretations of quantum mechanics. Here, we will demonstrate that entanglement is not a prerogative of quantum systems: it occurs in quantum-inspired (QI) systems that are under consideration in this paper. That will shed light on the concept of entanglement as a special type of global constraint imposed upon a broad class of dynamical systems that include quantum as well as QI systems.

In order to introduce entanglement in QI system, we will start with Equations (11) and (12) and generalize them to the two-dimensional case(36) ${\dot{v}}_{1} = - a_{11} \frac{\partial}{\partial v_{1}} ln ρ - a_{12} \frac{\partial}{\partial v_{2}} ln ρ,$ (36)

(37) ${\dot{v}}_{2} = - a_{21} \frac{\partial}{\partial v_{1}} ln ρ - a_{22} \frac{\partial}{\partial v_{2}} ln ρ,$ (37)

(38) $\frac{\partial ρ}{\partial t} = a_{11} \frac{\partial^{2} ρ}{\partial V^{2}} + (a_{12} + a_{21}) \frac{\partial^{2} ρ}{\partial V_{1} \partial V_{2}} + a_{22} \frac{\partial^{2} ρ}{\partial V_{2}},$ (38)

As in the one- dimensional case, this system describes diffusion without a drift

The solution to Equation (38) has a closed form(39) $ρ = \frac{1}{\sqrt{2 π det [{\hat{a}}_{i j}] t}} exp (- \frac{1}{4 t} b_{i j}^{'} V_{i} V_{j}), i = 1, 2 .$ (39)

Here,(40) $[b_{i j}^{'}] = {[{\hat{a}}_{i j}]}^{- 1}, {\hat{a}}_{11} = a_{11}, {\hat{a}}_{22} = a_{22}, {\hat{a}}_{12} = {\hat{a}}_{21} = a_{12} + a_{21}, {\hat{a}}_{i j} = {\hat{a}}_{j i}, b_{i j}^{'} = b_{j i}^{'},$ (40)

Substituting the solution (39) with Equations (36) and (37), one obtains(41) ${\dot{v}}_{1} = \frac{b_{11} v_{1} + b_{12} v_{2}}{2 t}$ (41) (42) ${\dot{v}}_{2} = \frac{b_{21} v_{1} + b_{22} v_{2}}{2 t}, b_{i j} = b_{i j}^{'} {\hat{a}}_{i j}$ (42)

Eliminating t from these equations, one arrives at an ODE in the configuration space(43) $\frac{d v_{2}}{d v_{1}} = \frac{b_{21} v_{1} + b_{22} v_{2}}{b_{11} v_{1} + b_{12} v_{2}}, v_{2} \to 0 a t v_{1} \to 0,$ (43)

This is a classical singular point treated in text books on ODE.

Its solution depends upon the roots of the characteristic equation(44) $λ^{2} - 2 b_{12} λ + b_{12}^{2} - b_{11} b_{22} = 0$ (44)

Since both the roots are real in our case, let us assume for concreteness that they are of the same sign, for instance, $λ_{1} = 1, λ_{2} = 1$ . Then the solution to Equation (43) is represented by the family of straight lines(45) $v_{2} = \tilde{C} v_{1}, \tilde{C} = c o n s t .$ (45)

Substituting this solution into Equation (41) yields (45)(46) $v_{1} = C t^{\frac{1}{2} (b_{11} + \tilde{C} b_{12})} v_{2} = \tilde{C} C t^{\frac{1}{2} (b_{11} + \tilde{C} b_{12})}$ (46)

Thus, the solutions to Equations (36) and (37) are represented by two-parametrical families of random samples, as expected, while the randomness enters through the time-independent parameters C and $\tilde{C}$ that can take any real numbers. Let us now find such a combination of the variables that is deterministic. Obviously, such a combination should not include the random parameters C or $\tilde{C}$ . It is easily verifiable that(47) $\frac{d}{d t} (ln v_{1}) = \frac{d}{d t} (ln v_{2}) = \frac{b_{11} + \tilde{C} b_{12}}{2 t}$ (47)

and therefore,(48) $(\frac{d}{d t} ln v_{1}) / (\frac{d}{d t} ln v_{2}) \equiv 1$ (48)

Thus, the ratio (48) is deterministic although both the numerator and denominator are random, (see Equation (47)). This is a fundamental non-classical effect representing a global constraint. Indeed, in theory of stochastic processes, two random functions are considered statistically equal if they have the same statistical invariants, but their point-to-point equalities are not required (although it can happen with a vanishingly small probability). As demonstrated above, the diversion of determinism into randomness via instability (due to a Liouville feedback), and then conversion of randomness to partial determinism (or coordinated randomness) via entanglement is the fundamental non-classical paradigm that may lead to instantaneous transmission of conditional information on remote distance that to be discussed below.

2.5. Relevance to model of intelligent agents

The model under discussion was inspired by E. Schrödinger, the creator of quantum mechanics who wrote in his book “What is Life”: “Life is to create order in the disordered environment against the second law of thermodynamics.” The proposed model illuminates the “border line” between living and non-living systems. The model introduces a biological particle that, in addition to Newtonian properties, possesses the ability to process information. The probability density can be associated with the self-image of the biological particle as a member of the class to which this particle belongs, while its ability to convert the density into the information force—with the self-awareness (both these concepts are adopted from psychology). Continuing this line of associations, the equation of motion (such as Equations (1) or (7)) can be identified with a motor dynamics, while the evolution of density (see Equations (3) or (8)—with a mental dynamics. Actually, the mental dynamics plays the role of the Maxwell sorting demon: it rearranges the probability distribution by creating the information potential and converting it into a force that is applied to the particle. One should notice that mental dynamics describes evolution of the whole class of state variables (differed from each other only by initial conditions), and that can be associated with the ability to generalize that is a privilege of living systems. Continuing our biologically inspired interpretation, it should be recalled that the second law of thermodynamics states that the entropy of an isolated system can only increase. This law has a clear probabilistic interpretation: increase of entropy corresponds to the passage of the system from less probable to more probable states, while the highest probability of the most disordered state (that is the state with the highest entropy) follows from a simple combinatorial analysis. However, this statement is correct only if there is no Maxwell’ sorting demon, i.e. nobody inside the system is rearranging the probability distributions. But this is precisely what the Liouville feedback is doing: it takes the probability density ρ from Equation (3), creates functionals and functions of this density, converts them into a force and applies this force to the equation of motion (1). As already mentioned above, because of that property of the model, the evolution of the probability density becomes non-linear, and the entropy may decrease “against the second law of thermodynamics,” Figure .

Figure 4. Living system: Deviation from classical thermodynamics.

Obviously the last statement should not be taken literary; indeed, the proposed model captures only those aspects of the living systems that are associated with their behavior, and in particular, with their motor–mental dynamics, since other properties are beyond the dynamical formalism. Therefore, such physiological processes that are needed for the metabolism, reproduction, est., are not included into the model. That is why this model is in a formal disagreement with the first and second laws of thermodynamics while the living systems are not. Indeed, applying the first law of thermodynamics we imply violation of conservation of mechanical energy since other types of energies (chemical, electro-magnetic, etc.) are beyond our mathematical formalism. Applying the second law of thermodynamics, we consider our system as isolated one while the underlying real system is open due to other activities of livings that were not included in our model. Nevertheless, despite these limitations, the proposed model captures the “magic” of life: the ability to create self-image and self-awareness, and that fits perfectly to the concept of intelligent agent. Actually, the proposed model represents governing equations for interactions of intelligent agents. In order to emphasize the autonomy of the agents’ decision-making process, we will associate the proposed models with self-supervised (SS) active systems.

By an active system, we will understand here a set of interacting intelligent agents capable of processing information, while an intelligent agent is an autonomous entity, which observes and acts upon an environment and directs its activity toward achieving goals. The active system is not derivable from the Lagrange or Hamilton principles, but it is rather created for information processing. One of the specific differences between active and physical systems is that the former are supposed to act in uncertainties originated from incompleteness of information. Indeed, an intelligent agent almost never has access to the whole truth of its environment. Uncertainty can also arise because of incompleteness and incorrectness in the agent’s understanding of the properties of the environment. That is why QI SS systems are well suited for representation of active systems.

2.6. Self-supervised active systems with integral feedback

In this subsection, we will introduce a feedback from the mental to motor dynamics that is different from the feedback (6) discussed above. This feedback will make easier to formulate new principles of the competitive mode of agents associated with game theory.

Let us introduce the following feedback, (Zak, Citation2008)(49) $f = \frac{1}{ρ (v, t)} \int_{- \infty}^{v} [ρ (ζ, t) - ρ^{*} (ζ)] d v$ (49)

With the feedback (49), Equations (7) and (8) take the form, respectively,(50) $f = \frac{1}{ρ (v, t)} \int_{- \infty}^{v} [ρ (ζ, t) - ρ^{*} (ζ)] d v$ (50)

(51) $\frac{\partial ρ}{\partial t} + ρ (t) - ρ^{*} = 0$ (51)

The last equation has the analytical solution(52) $ρ = (ρ_{0} - ρ^{*}) e^{- t} + ρ^{*}$ (52)

Subject to the initial condition(53) $ρ (t = 0) = ρ_{0}$ (53)

This solution converges to a prescribed, or target, stationary distribution ρ^*(V). Obviously the normalization condition for ρ is satisfied if it is satisfied for ρ₀ and ρ^*. This means that Equation (51) has an attractor in the probability space, and this attractor is stochastic. Substituting the solution (52) with Equation (50), one arrives at the ODE that simulates the stochastic process with the probability distribution (52)(54) $\dot{v} = \frac{e^{- t}}{[ρ_{0} (v) - ρ^{*} (v)] e^{- t} + ρ^{*} (v)} \int_{- \infty}^{v} [ρ_{0} (ζ) - ρ^{*} (ζ)] d ζ$ (54)

As notices above, the randomness of the solution to Equation (54) is caused by instability that is controlled by the corresponding Liouville equation. It should be emphasized that in order to run the stochastic process started with the initial distribution ρ₀ and approaching a stationary process with the distribution ρ^*, one should substitute into Equation (54) the analytical expressions for these functions.

It is reasonable to assume that the solution (4) starts with sharp initial condition(55) $ρ_{0} (V) = δ (V)$ (55)

As a result of that assumption, all the randomness is supposed to be generated only by the controlled instability of Equation (54). Substitution of Equation (55) into Equation (54) leads to two different domains of v: v ≠ 0 and v = 0 where the solution has two different forms, respectively,(56) $\int_{- \infty}^{v} ρ^{*} (ζ) d ζ = \frac{C_{1}}{e^{- t} - 1}, v \neq 0$ (56)

(57) $v \equiv 0$ (57)

Equation (57) represents a singular solution, while Equation (56) is a regular solution that include arbitrary constant C. The regular solutions is unstable at t = 0, |v| → 0 where the Lipschitz condition is violated(58) $\frac{d \dot{v}}{d v} \to \infty a t t \to 0, | v | \to 0$ (58)

and therefore, an initial error always grows generating randomness.

Let us analyze the behavior of the solution (56) in more detail. As follows from this solution, all the particular solutions intersect at the same point v = 0 at t = 0, and that leads to non-uniqueness of the solution due to violation of the Lipcshitz condition. Therefore, the same initial condition v = 0 at t = 0 yields infinite number of different solutions forming a family (56); each solution of this family appears with a certain probability guided by the corresponding Liouville equation (51). For instance, in cases plotted in Figures and , the “winner” solution is, respectively,(59) $v_{1} = ε \to 0, ρ (v_{1}) = ρ_{max}, and v = v_{2}, ρ (v_{2}) = sup {ρ}$ (59)

Figure 5. Stochastic process and probability density.

Figure 6. Global maximum.

since it passes through the maximum of the probability density (51). However, with lower probabilities, other solutions of the family (53) can appear as well. Obviously, this is a non-classical effect. Qualitatively, this property is similar to those of quantum mechanics: the system keeps all the solutions simultaneously and displays each of them “by a chance,” while that chance is controlled by the evolution of probability density (51).

The approach is generalized to n-dimensional case simply by replacing v with a vector v = v₁, v₂, … v_n since Equation (51) does not include space derivatives.

Examples. Let us start with the following normal distribution(60) $ρ^{*} (V) = \frac{1}{\sqrt{2 π}} e^{- \frac{V^{2}}{2}}$ (60)

Substituting the expression (60) and (55) with Equation (56) at V = v, one obtains(61) $v = e r f^{- 1} (\frac{C_{1}}{e^{- t} - 1}), v \neq 0$ (61)

As another example, let us choose the target density ρ^* as the student’s distribution, or so called power law distribution(62) $ρ^{*} (V) = \frac{Γ (\frac{ν + 1}{2})}{\sqrt{ν π} Γ (\frac{ν}{2})} {(1 + \frac{V^{2}}{ν})}^{- (ν + 1) / 2}$ (62)

Substituting the expression (62) with Equation (56) at V = v, and ν = 1, one obtains(63) $v = tan (\frac{C}{e^{- t} - 1}) f o r v \neq 0$ (63)

The 3D plot of the solutions of Equations (61) and (63), are presented in Figures , and respectively.

Figure 7. Dynamics driving random events to normal distribution.

Figure 8. Dynamics driving random events to power law.

2.7. Finding global maximum

Based upon the proposed model with integral feedback, a simple algorithm for finding a global maximum of an n-dimensional function can be formulated. The idea of the proposed algorithm is very simple: based upon the model with integral feedback (50), and (51), introduce a positive function $ψ (v_{1}, v_{2}, . . . v_{n}), | v_{i} | < \infty$ to be maximized as the probability density ρ^*(v₁, v₂, ...v_n) to which the solution of Equation (50) is attracted. Then the larger value of this function will have the higher probability to appear. The following steps are needed to implement this algorithm.

(1)	Build and implement the n-dimensional version of the model Equations (50), and (51), as an analog devise(64) ${\dot{v}}_{i} = \frac{e^{- t}}{n {[ρ_{0} (v) - ρ^{} (v)] e^{- t} + ρ^{} (v)}} \int_{- \infty}^{v_{i}} [ρ_{0} (ζ) - ρ^{*} (ζ {)]}^{} d ζ, i = 1, 2, \dots n .$ (64)
(2)	Normalize the function to be maximized(65) $\bar{ψ} ({v}) = \frac{ψ ({v})}{\int_{- \infty}^{\infty} ψ ({v}) d {v}}$ (65)
(3)	Using Equation (51), evaluate time τ of approaching the stationary process to accuracy ε(66) $τ \approx ln \frac{1 - \bar{ψ}}{ε \bar{ψ}}$ (66)
(4)	Substitute $\bar{ψ}$ instead of ρ^* with Equations (64) and run the system during the time interval τ.
(5)	The solution will “collapse” into one of possible solutions with the probability $\bar{ψ}$ .Observing (measuring) the corresponding values of {v^*}, find the first approximation to the optimal solution.
(6)	Switching the device to the initial state and then starting again, arrive at the next approximations.
(7)	The sequence of the approximations represents Bernoulli trials that exponentially improve the chances of the optimal solution to become a winner. Indeed, the probability of success ρ_s and failure ρ_f after the first trial are, respectively,(67) $ρ_{s} = {\bar{ψ}}_{1}, ρ_{f} = 1 - {\bar{ψ}}_{1}$ (67) Then the probability of success after M trials is(68) $ρ_{s M} = 1 - {(1 - \bar{ψ})}^{M} \to 1 a t M \to \infty$ (68) Therefore, after polynomial number of trials, one arrived at the solution to the problem (unless the function ψ is flat).

The main advantage of the proposed methodology is in a weak restriction imposed upon the space structure of the function $\bar{ψ} ({v})$ : it should be only integrable since there is no space derivatives included in the model (64). This means that $\bar{ψ} ({v})$ is not necessarily to be differentiable. For instance, it can be represented by a Weierstrass-like function $f (v) = \sum_{0}^{\infty} a^{n} cos (b^{n} π v)$ , where 0 < a < 1, b is a positive odd integer, and ab > 1 + 1.5π.

In a particular case when $\bar{ψ} ({x})$ is twice differentiable, the algorithm is insensitive to local maxima because it is driven not by gradients, but by the values of this function.

2.8. Entanglement in QI active systems with integral feedback

We will continue the analysis of the QI system with integral feedback introduced above proceeding with the two-dimensional case(69) ${\dot{v}}_{1} = \frac{1}{2 ρ (v_{1}, v_{2}, t)} \int_{- \infty}^{v_{1}} [ρ (η, v_{2}, t) - ρ^{*} (η, v_{2})] d η$ (69) (70) ${\dot{v}}_{2} = \frac{1}{2 ρ (v_{1}, v_{2}, t)} \int_{- \infty}^{v_{2}} [ρ (v_{1}, η, t) - ρ^{*} (v_{1}, η)] d η$ (70) (71) $\frac{\partial ρ (V, t)}{\partial t} + ρ (V, t) - ρ^{*} (V) = 0$ (71)

The solution of Equation (71) has the same form as for one-dimensional case, (see Equation (52))(72) $ρ = ρ_{0} + ρ^{*} (1 - e^{- t})$ (72)

Substitution this solution into Equations (69) and (70), yields, respectively,(73) ${\dot{v}}_{1} = \frac{e^{- t}}{2 [ρ_{0} (v_{1}, v_{2}) - ρ^{*} (v_{1}, v_{2})] e^{- t} + ρ^{*} (v_{1}, v_{2})} \int_{- \infty}^{v_{1}} [ρ_{0} (η, v_{2}) - ρ^{*} (η, v_{2})] d η$ (73)

(74) ${\dot{v}}_{2} = \frac{e^{- t}}{2 [ρ_{0} (v_{1}, v_{2}) - ρ^{*} (v_{1}, v_{2})] e^{- t} + ρ^{*} (v_{1}, v_{2})} \int_{- \infty}^{v_{2}} [ρ_{0} (v_{1}, η) - ρ^{*} (v_{1}, η)] d η$ (74)

that are similar to Equation (54). Following the same steps as in one-dimensional case, one arrives at the following solutions of Equations (73) and (74), respectively,(75) $\int_{- \infty}^{v_{1}} ρ^{*} (η, v_{2}) d η = \frac{C_{1}}{e^{- t} - 1}, v_{1} \neq 0$ (75) (76) $\int_{- \infty}^{v_{2}} ρ^{*} (η, v_{1}) d η = \frac{C_{2}}{e^{- t} - 1}, v_{2} \neq 0$ (76)

that are similar to the solution (56). Since ρ(v₁, v₂) is the known (preset) function, Equations (75) and (76) implicitly define v₁ and v₂ as functions of time. Eliminating time t and orbitary constants C₁, C₂, one obtains(77) $\frac{d}{d t} [l n \int_{- \infty}^{v_{1}} ρ^{*} (η, v_{2}) d η] / \frac{d}{d t} [l n \int_{- \infty}^{v_{2}} ρ^{*} (v_{1}, η) d η] \equiv 1$ (77)

Thus, the ratio (77) is deterministic although both the numerator and denominator are random, (see Equations (75) and (66)).

3. Application to games of entangled agents

In this section, we will address a situation when agents are competing. That means that they have different objectives. Turning to Equation (75), (76), one can rewrite them for the case of competing agents(78) ${\dot{v}}_{i} = \frac{1}{n ρ (v_{1,} \dots v_{n}, t)} \int_{- \infty}^{v_{i}} [ρ (ζ_{i}, v_{j \neq i}, t) - ρ_{i}^{*} (ζ_{i}, v_{j \neq i})] d ζ_{i}$ (78)

(79) $\frac{d ρ}{d t} + ρ (t) - \frac{1}{n} \sum_{1}^{n} a_{k} ρ_{k}^{*} = 0, \sum_{k = 1}^{n} a_{k} = n ρ_{i}^{*} \neq ρ_{j}^{*} i f i \neq j .$ (79)

where $ρ_{k}^{*}$ is the preset density of the kth agent that can be considered as his objective, a_k is a constant weight of the kth agent’s effort to approach his objective.

Thus, each kth agent is trying to establish his own static attractor $ρ_{k}^{*}$ , but due to entanglement, the whole system will approach the weighted average(80) $ρ = [ρ^{0} - \frac{1}{n} \sum_{i = 1}^{n} (a_{i} ρ_{i}^{*})] e^{- t} + \frac{1}{n} \sum_{i = 1}^{n} a_{i} ρ_{i}^{*}$ (80)

$ρ (t) = \frac{1}{n} \sum_{1}^{n} a_{k} ρ_{k}^{*} at t \to \infty$

Substituting the solution (80) with Equation (78), one arrives at a coupled system of n ODE with respect to n state variables v_i. Although a closed form analytical solution of the system (78) and (79) is not available, its property of the Lipcshitz instability at t = 0 could be verified. This means that the solution to the system (78) and (79) is random, and if the system is run many times, the statistical properties of the whole ensemble will be described by Equation (80). Obviously, those agents who have chosen density with a sharp maximum are playing more risky game. Here, we have assumed that competing agents are still entangled, and therefore, their information about each other is complete. More complex situation when the agents are not entangled, and exchanged information is incomplete is address in the next section. The simplest way to formalize the incompleteness of information possessed by competing agent is to include the “vortex” terms into Equation (77): these terms could change each particular trajectory of the agent motion, but they would not change the statistical invariants that remain available to the competing agents(81) ${\dot{v}}_{i} = \frac{1}{n ρ (v_{1,} \dots v_{n}, t)} [\int_{- \infty}^{v_{i}} [ρ (ζ_{i}, v_{j \neq i}, t) - ρ^{*} (ζ_{i}, v_{j \neq i})] d ζ_{i} + \sum_{j \neq i}^{n} T_{i j} tanh v_{j}]$ (81)

It is easily verifiable that the augmented neural net-like terms do not effect the corresponding Liouville equation, and therefore, they do not change the static attractor in the probability space described by Equation (76). However, they may significantly change the configuration of the random trajectories in physical space making their entanglement more sophisticated. Another way to formalize uncertainty is to introduce a complex joint probability density where its imaginary part represents a measure of uncertainty in density distribution. This case will be considered below in more details.

3.1. Problem formulation

In this section, we will present a draft of application of self-supervised active dynamical systems to differential games. Following von-Neuman, and Isaaks, (Isaacs, Citation1965), we will introduce a two-player zero-sum (antagonistic) differential game that is described by dynamical Equation (64) rewritten for i = 1,2(82) ${\dot{v}}_{i} = \frac{1}{2 ρ (v_{1,} . . . v_{n}, t)} \int_{- \infty}^{v_{i}} [ρ (ζ_{i}, v_{j \neq i}, t) - a_{i} C_{i} f_{i} (ζ_{i}, v_{j \neq i})] d ζ_{i} f_{1} = f, f_{2} = f^{- 1}, a_{1} + a_{2} = 2 .$ (82)

where v is the state vector, f₁ is the control vector of the maximizing player E (evader), f₂ is the control vector of the minimizing player P (pursuer), and C₁, C₂are the normalizing factors. Obviously, f is a known function of both state variables.

However, the rules of the game we propose is slightly different from those introduced by Isaaks, namely: the player P tries to minimize the function f, (i.e. maximize the function f⁻¹) while the player E tries to maximize f in the same manner as it was described in the previous subsection i.e. via entanglement. The Liouville equation for the system (69) follows from Equation (79)

(83) $\frac{d ρ}{d t} + ρ (t) - \frac{1}{2} (a_{1} C_{1} f + a_{2} C_{2} f^{- 1}) = 0$ (83)

whence(84) $ρ = [ρ^{0} - \frac{1}{2} (a_{1} C_{1} f + a_{2} C_{2} f^{- 1})] e^{- t} + \frac{1}{2} (a_{1} C_{1} f + a_{2} C_{2} f^{- 1}),$ (84)

We will now give a description of the game.

The game starts with zero initial conditions:(85) $v_{1} = 0, v_{2} = 0, ρ^{0} = δ (v_{1}, v_{2}) a t t = 0$ (85)

It is assumed that each player has access to the systems (82), (84), and therefore, he has complete information about its state. The substitution of Equation (84) with Equation (82) closes the system (82). However, because of a failure of the Lipcshitz condition at t = 0 (see Equation (58)), the solution of Equation (82) is random, and each player can predict it only in terms of probability. As follows from Equation (84), the highest probability to appear has the solution that delivers the global maximum to the payoff function(86) $F = a_{1} C_{1} f + a_{2} C_{2} f^{- 1}$ (86)

Obviously, the player that has higher weight a_i would have better chances to win since the global maximum of Equation (86) is closer to the global maximum of his goal function. With reference to Equation (86), a player can evaluate time τ of approaching the stationary process to accuracy ε as(87) $τ \approx ln \frac{1 - F}{ε F}$ (87)

and introduce(88) $v_{1}^{1} = v_{1} (τ), v_{2}^{1} (τ), f_{1}^{1} = f_{1} (v_{1}^{1}, v_{2}^{2}), f_{2}^{1} = f_{2} (v_{1}^{1}, v_{2}^{1})$ (88)

This is the end of the first move. After that, each player updates his weight as following(89) $a_{1}^{1} = a_{1} + \frac{f_{1}^{1} - f_{1}^{0}}{f_{1}^{0}}, a_{2}^{1} = a_{2} - \frac{f_{1}^{1} - f_{1}^{0}}{f_{1}^{0}}, f_{1}^{0} \neq 0,$ (89)

and starts the next move with the same initial conditions (85). But the system (82), (84) is different now: the control functions f₁, f₂ are to be replaced by their updated values $f_{1}^{1}, f_{2}^{1}$ , respectively. Thus, during the first move, the potential winner is selected by a chance, and during the next move, his chances are increased due to favorable update of the weights. However, the role of the chance is still significant even during the subsequent moves; indeed, if the global maximum of the control function F is sharp, the initially selected potential winner still can lose.

The game ends when one of the players achieves his goal my maximizing his control function to a preset level, for instance, if(90) $f_{1} - f_{2} > A^{2}$ (90)

3.2. Games with incomplete information

The theory presented above includes applications to such problems as battle games, games with moving craft, pursuit games, etc. However, the main limitation of this theory, as well as the most of the game theories, is that it requires complete information about the state variables available to both players. This limitation significantly diminishes the applicability of the theory to real-life games where the complete information is not available. That is why the extension of this theory to cases of incomplete information is of vital importance. In our application, we will assume that each player knows only his own state variables, while he has to guess about the state variables of his adversary. For that case, the mathematical formalism of QI systems can offer a convenient tool to replace unknown value of a state variable by its expected value. Such a possibility is available due to players’ dependence (but not necessarily entanglement) via the joint probability density: since each player possesses the joint density, he can, at any moment, compute the expected value of the state variable of the other player.

We assume that the players follow the strategy: “what do you think I think you think …?” and we will start with the assumption that each player takes a conservative view by thinking that although he does not know the values of the state variable of his adversary, the adversary does know the values of his state variable. Then the governing equation for the Evader will be(91) ${\dot{v}}_{1} = \frac{1}{2 ρ (v_{1,} {\bar{V}}_{2}, t)} \int_{- \infty}^{v_{1}} ρ (ζ_{1}, {\bar{V}}_{2}, t) - a_{1} C_{1} f_{1} (ζ_{1}, {\bar{V}}_{2}) d ζ_{1}$ (91)

Here ${\bar{V}}_{2}$ is the expected value of V₂(92) ${\bar{V}}_{2} = \int_{- \infty}^{\infty} V_{2} ρ (V_{1}, V_{2}) d V_{1} d V_{2}$ (92)

Now the Evader has to create the image of the Pursuer using the expected value of his state variable(93) ${\dot{v}}_{2 | 1} = \frac{1}{2 ρ (v_{1,} {\bar{V}}_{2}, t)} \int_{- \infty}^{v_{2}} [ρ (v_{1}, ζ_{2}, t) - a_{2} C_{2} f_{2} (v_{1}, ζ_{2})] d ζ_{2}$ (93)

where v_2|1 is the state variable of the Pursuer in view of the Evader.

The corresponding Liouville equation that governs the joint probability equation is not changed: it is still given by Equation (79). Its solution (84) should be substituted with Equations (91) and (93) along with the Equation (92). Obviously, the expected value (92) is found from the solution (84)(94) ${\bar{V}}_{i} = [{\bar{V}}_{i}^{0} - \int_{- \infty}^{\infty} d ζ_{1} \int_{- \infty}^{\infty} d ζ_{2} {[\frac{1}{2} (a_{1} C_{1} f + a_{2} C_{2} f^{- 1})] e^{- t} + \frac{1}{2} (a_{1} C_{1} f + a_{2} C_{2} f^{- 1})] {\bar{V}}_{i}}, i = 1, 2 .$ (94)

The system of Equations (91), (93), and (79) with reference to Equations (92), (94) is closed.

Similar system can be obtained for governing equation of the Pursuer coupled with the governing equation of his image of the Evador:(95) ${\dot{v}}_{2} = \frac{1}{2 ρ ({\bar{V}}_{1}, v_{2}, t)} \int_{- \infty}^{v_{1}} ρ ({\bar{V}}_{1}, ζ_{2}, t) - a_{2} C_{2} f_{2} ({\bar{V}}_{1}, ζ_{2}) d ζ_{2}$ (95)

(96) ${\dot{v}}_{1 | 2} = \frac{1}{2 ρ ({\bar{V}}_{1}, v_{2}, t)} \int_{- \infty}^{v_{1}} ρ (ζ_{1}, v_{2}, t) - a_{2} C_{2} f_{2} (ζ_{1}, v_{2}) d ζ_{1}$ (96)

(97) ${\bar{V}}_{1} = \int_{- \infty}^{\infty} V_{1} ρ (V_{1}, V_{2}) d V_{1} d V_{2}$ (97)

After substitution of Equation (79) with Equations (95) and (96), with reference to Equations (97) and (94), one arrives at the closed system.

Thus, we obtain two independent systems of ODE describing entanglement of the player with the image of his adversary. Each system has random solutions that appear with the probability described by Equation (79). After time interval τ (see Equation (87)), each player gets access to the real values of the functions f_i to be maximized, and based upon that, he can update the state variables and weights for the next move, (see Equations (88) and (89)).

Let us consider now the case when the players do not know the values of state variables of their adversary.

Then instead of the systems (91), (93), and (95), (96) we have, respectively(98) ${\dot{v}}_{1} = \frac{1}{2 ρ (v_{1,} {\bar{V}}_{2}, t)} \int_{- \infty}^{v_{1}} ρ (ζ_{1}, {\bar{V}}_{2}, t) - a_{1} C_{1} f_{1} (ζ_{1}, {\bar{V}}_{2}) d ζ_{1}$ (98)

(99) ${\dot{v}}_{2 | 1 | 1} = \frac{1}{2 ρ ({\bar{V}}_{1,} {\bar{V}}_{2}, t)} \int_{- \infty}^{v_{2}} [ρ ({\bar{V}}_{1}, ζ_{2}, t) - a_{2} C_{2} f_{2} ({\bar{V}}_{1}, ζ_{2})] d ζ_{2}$ (99)

(100) ${\dot{v}}_{2} = \frac{1}{2 ρ ({\bar{V}}_{1}, v_{2}, t)} \int_{- \infty}^{v_{1}} ρ ({\bar{V}}_{1}, ζ_{2}, t) - a_{2} C_{2} f_{2} ({\bar{V}}_{1}, ζ_{2}) d ζ_{2}$ (100)

(101) ${\dot{v}}_{1 | | 2 | 2} = \frac{1}{2 ρ ({\bar{V}}_{1}, {\bar{V}}_{2}, t)} \int_{- \infty}^{v_{1}} ρ (ζ_{1}, {\bar{V}}_{2}, t) - a_{2} C_{2} f_{2} (ζ_{1}, {\bar{V}}_{2}) d ζ_{1}$ (101)

Here, v_2|1|1 is the state variable of the Pursuer’s view on the Evader in view of the Pursuer, and v_1|2|2 is the state variable of the Evader’s view on the Pursuer in view of the Evader.

It is easy to conclude that the image Equations (99) and (101) can be solved independently(102) $v_{2 | 1 | 1} = \int_{0}^{t} d t \frac{1}{2 ρ ({\bar{V}}_{1,} {\bar{V}}_{2}, t)} \int_{- \infty}^{v_{2}} [ρ ({\bar{V}}_{1}, ζ_{2}, t) - a_{2} C_{2} f_{2} ({\bar{V}}_{1}, ζ_{2})] d ζ_{2}$ (102)

(103) $v_{1 | 2 | 2} = \int_{0}^{t} d t \frac{1}{2 ρ ({\bar{V}}_{1}, {\bar{V}}_{2}, t)} \int_{- \infty}^{v_{1}} ρ (ζ_{1}, {\bar{V}}_{2}, t) - a_{2} C_{2} f_{2} (ζ_{1}, {\bar{V}}_{2}) d ζ_{1}$ (103)

Now replacing ${\bar{V}}_{2}, {\bar{V}}_{1}$ in Equations (98) and (100) by the solutions for v_2|1|1 and v_1|2|2, respectively, one arrives at two independent ODE describing behaviors of the players. Therefore, at this level of incompleteness of information, the entanglement disappears.

The games with incomplete information give a reason to distinguish two type of dependence between the agents described by the variables v_i in the iQ systems. The first type of dependence is entanglement that has been introduced and discussed above. One should recall that in order to be entangled, the agents are supposed to run the system jointly during some initial period of time. But what happens if the agents had never been in contact? Obviously they are not entangled, i.e. they cannot predict each other’s motions. However, they are not completely independent: they can make random decisions, but the probability of these decisions will be correlated via the joint probability. As a result, the agents will be able to predict expected decisions of each other. We will call such correlation a weak entanglement. As follows from the games with incomplete information considered above, weak entanglement was presented as entanglement of an agent with the probabilistic image of another agent.

4. Games of partially entangled agents

In this section we introduce a new, more sophisticated entanglement that does not exist in quantum mechanics, but can be found in QI models. This finding is based upon existence of incompatible stochastic processes that are considered below.

4.1. Incompatible stochastic processes

Classical probability theory defines conditional probability densities based upon the existence of a joint probability density. However, one can construct correlated stochastic processes that are represented only by conditional densities since a joint probability density does not exist. For that purpose, consider two coupled Langevin equations (Risken, Citation1989)(104) ${\dot{x}}_{1} = g_{11} (x_{2}) L_{1} (t)$ (104) (105) ${\dot{x}}_{2} = g_{22} (x_{1}) L_{2} (t)$ (105)

where the Langevin forces L₁(t) and L₂(t) satisfy the conditions(106) $〈L_{i} (t)〉 = 0, 〈L_{i} (t) L_{i} (t^{'})〉 = 2 g_{i i} δ (t - t^{'})$ (106)

Then the joint probability density ρ(X₁, X₂) describing uncertainties in values of the random variables x₁and x₂ evolves according to the following Fokker–Planck equation(107) $\frac{\partial ρ}{\partial t} = g_{11}^{2} (X_{2}) \frac{\partial^{2} ρ}{\partial X_{1}^{2}} + g_{22}^{2} (X_{1}) \frac{\partial^{2} ρ}{\partial X_{2}^{2}}$ (107)

Let us now modify Equations (104) and (105) as following(108) ${\dot{x}}_{1} = g_{11}^{2} (x_{2}^{*}) L_{1} (t)$ (108)

(109) ${\dot{x}}_{2} = g_{22}^{2} (x_{1}^{*}) L_{2} (t)$ (109)

where $x_{1}^{*}$ and $x_{1}^{*}$ are fixed values of x₁ and x₂ that play role of parameters in Equations (108) and (109), respectively. Now the uncertainties of x₁and x₂ are characterized by conditional probability densities ρ₁(X₁|X₂) and ρ₂(X₂|X₁) while each of these densities is governed by its own Fokker–Planck equation(110) $\frac{\partial ρ_{1}}{\partial t} = g_{11}^{2} (X_{2}) \frac{\partial^{2} ρ_{1}}{\partial X_{1}^{2}}$ (110)

(111) $\frac{\partial ρ_{2}}{\partial t} = g_{22}^{2} (X_{1}) \frac{\partial^{2} ρ_{2}}{\partial X_{2}}$ (111)

The solutions of these equations subject to sharp initial conditions(112) $ρ_{i} (X_{i}, t | X_{i}^{'}, t^{'}) = δ (X_{i} - X_{i}^{'}), i = 1, 2 .$ (112)

for t > t′ read(113) $ρ_{1} (X_{1} | X_{2}) = \frac{1}{\sqrt{4 π g_{11}^{2} (X_{2}) (t - t^{'})}} exp (- \frac{{(X_{1} - X_{1}^{'})}^{2}}{4 g_{11}^{2} (X_{2}) (t - t^{'})}$ (113)

(114) $ρ_{2} (X_{2} | X_{1}) = \frac{1}{\sqrt{4 π g_{22}^{2} (X_{1}) (t - t^{'})}} exp (- \frac{{(X_{2} - X_{2}^{'})}^{2}}{4 g_{22}^{2} (X_{1}) (t - t^{'})}$ (114)

As shown in Zak (Citation1998), a joint density for the conditional densities (113) and (114) exists only in special cases of the diffusion coefficients g₁₁ and g₂₂ when the conditional probabilities are compatible. These conditions are(115) $i n k (ρ_{1}, ρ_{2}) = \frac{\partial^{}}{\partial X_{1} \partial X_{2}} ln \frac{ρ_{1} (X_{1} | X_{2})}{ρ_{2} (X_{2} | X_{1})} \equiv 0$ (115)

Indeed(116) $ρ (X_{1}, X_{2}) = ρ_{1} (X_{1} | X_{2}) \int_{- \infty}^{\infty} ρ (ζ, X_{2}) d ζ = ρ_{2} (X_{2} | X_{1}) \int_{- \infty}^{\infty} ρ (X_{1}, ζ) d ζ,$ (116)

whence(117) $ln \frac{ρ_{1} (X_{1} | X_{2})}{ρ_{2} (X_{2} | X_{1})} = ln \int_{- \infty}^{\infty} ρ (X_{1}, ζ) d ζ - ln \int_{- \infty}^{\infty} ρ (ζ, X_{2}) d ζ$ (117)

and that leads to Equation (115).

Thus, the existence of the join density ρ(X₁, X₂) for the conditional densities ρ₁(X₁|X₂), and ρ₂(X₂|X₁) requires that(118) $\frac{\partial^{2}}{\partial X_{1} \partial X_{2}} [\frac{{(X_{1} - X_{1}^{'})}^{2}}{4 g_{11}^{2} (X_{2})} - \frac{(X_{2} - X_{2}^{'})}{4 g_{22}^{2} (X_{1})}] \equiv 0$ (118)

Obviously the identity (118) holds only for specially selected functions g₁₁(X₂) and g₂₂(X₁), and therefore, existence of the joint density is an exception rather than a rule.

4.2. Partial entanglement

In order to prove existence of a new form of entanglement, let us modify the system Equations (36), (37), and (38) as following(119) ${\dot{v}}_{1} = - a_{11} (v_{2}) \frac{\partial}{\partial v_{1}} ln ρ_{1} (v_{1} | v_{2})$ (119) (120) $\frac{\partial ρ_{1} (V_{1} | V_{2})}{\partial t} = a_{11} (V_{2}) \frac{\partial^{2} ρ_{1} (V_{1} | V_{2})}{\partial V_{1}^{2}}$ (120) (121) ${\dot{v}}_{2} = - a_{22} (v_{1}) \frac{\partial}{\partial v_{1}} ln ρ_{2} (v_{2} | v_{1})$ (121) (122) $\frac{\partial ρ_{2} (V_{2} | V_{1})}{\partial t} = a_{22} (V_{1}) \frac{\partial^{2} ρ_{2} (V_{2} | V_{1})}{\partial V_{2}^{2}}$ (122)

Since here we do not postulate existence of a joint density, the system is written in terms of conditional densities, while Equations (120) and (121) are similar to Equations (110) and (111). The solutions of these PDE can be written in the form similar to the solutions (113) and (114)(123) $ρ_{1} (V_{1} | V_{2}) = \frac{1}{\sqrt{4 π a_{11} (V_{2}) (t - t^{'})}} exp (- \frac{{(V_{1} - V_{1}^{'})}^{2}}{4 a_{11} (V_{2}) (t - t^{'})}$ (123)

(124) $ρ_{2} (V_{2} | V_{1}) = \frac{1}{\sqrt{4 π a_{22} (V_{1}) (t - t^{'})}} exp (- \frac{{(V_{2} - V_{2}^{'})}^{2}}{4 a_{11} (V_{1}) (t - t^{'})}$ (124)

As noticed in the previous subsection, the existence of the joint density ρ(V₁, V₂) for the conditional densities ρ₁(V₁|V₂) and ρ₂(V₂|V₁) require that(125) $\frac{\partial^{2}}{\partial V_{1} \partial V_{2}} [\frac{{(V_{1} - V_{1}^{'})}^{2}}{4 a_{11} (V_{2})} - \frac{(V_{2} - V_{2}^{'})}{4 a_{22} (V_{1})}] \equiv 0$ (125)

In this case, the joint density exists (although its finding is not trivial, Zak, Citation1998), and the system (119)–(122) can be reduced to a system similar to (36)–(38). But here we will be interested in case when the joint density does not exist. It is much easier to find such functions $a_{11} (V_{2}), a_{22} (V_{1})$ for which the identity (125) does not hold, and we assume that(126) $\frac{\partial^{2}}{\partial V_{1} \partial V_{2}} [\frac{{(V_{1} - V_{1}^{'})}^{2}}{4 a_{11} (V_{2})} - \frac{(V_{2} - V_{2}^{'})}{4 a_{22} (V_{1})}] \neq 0$ (126)

In this case, the system (119)–(122) cannot be simplified. In order to analyze this system in detail, lets substitute the solutions (123) and (124) with Equations (119) and (121), respectively. Then with reference to Equation (14), one obtains(127) ${\dot{v}}_{1} = \frac{v_{1}}{2 t}$ (127)

(128) ${\dot{v}}_{2} = \frac{v_{2}}{2 t}$ (128)

and therefore(129) $v_{1} = C_{1} \sqrt{t}$ (129)

(130) $v_{2} = C_{2} \sqrt{t}$ (130)

It should be recalled that according to the terminology introduced in Section 3, the systems (119)–(120) and the systems (121)–(122) can be considered as dynamical models for interaction of two communicating agents where Equations (119) and (121) describe their motor dynamics, and Equations (120) and (122)—mental dynamics, respectively. Also, it should be reminded that the solutions (129) and (130) are represented by one parametrical family of random samples, as in Equation (15), while the randomness enters through the time-independent parameters C₁ and C₂ that can take any real numbers. As follows from Figure , all the particular solutions (129) and (130) intersect at the same point v_1,2 = 0 at t = 0, and that leads to non-uniqueness of the solution due to violation of the Lipcshitz condition. Therefore, the same initial condition v_1,2 = 0 at t = 0 yields infinite number of different solutions forming a family; each solution of this family appears with a certain probability guided by the corresponding Fokker–Planck Equations (120) and (122), respectively. Similar scenario was described in the Section 2 of this paper. But what is unusual in the system (119)–(121) is correlations: although Equations (120) and (122) are correlated, and therefore, mental dynamics are entangled, Equations (119) and (121) are not correlated (since they can be presented in the form of independent Equations (127) and (128), respectively), and therefore, the motor dynamics are not entangled. This means that in the course of communications, each agent “selects” a certain pattern of behavior from the family of solutions (129) and (130), respectively, and these patterns are independent; but the probabilities of these “selections” are entangled via Equations (120) and (122). Such sophisticated correlations cannot be found in physical world, and they obviously represent a “human touch.” Unlike the entanglement in system with joint density (such as that in Equations (36)–(38)) here the agents do not share any deterministic invariants (compare to Equation 48). Instead, the agents can communicate via “best guesses” based upon known conditional probability densities distributions.

In order to quantify the amount of uncertainty due to incompatibility of the conditional probability densities (123) and (124), let us introduce a concept of complex probability, (Zak, Citation1998),(131) $f (V_{1}, V_{2}) = a (V_{1}, V_{2}) + i b (V_{1}, V_{2})$ (131)

Then the marginal densities are(132) $f_{1} (V_{1}) = \int_{- \infty}^{\infty} a (V_{1}, V_{2}) d V_{2} + i \int_{- \infty}^{\infty} b (V_{1}, V_{2}) d V_{2} = a_{1} (V_{1}) + i b_{1} (V_{1})$ (132)

(133) $f_{2} (V_{2}) = \int_{- \infty}^{\infty} a (V_{1}, V_{2}) d V_{1} + i \int_{- \infty}^{\infty} b (V_{1}, V_{2}) d V_{1} = a_{2} (V_{2}) + i b_{2} (V_{2})$ (133)

Following the formalism of conditional probabilities, the conditional densities will be defined as(134) $f_{1 | 2} = \frac{f (V_{1}, V_{2})}{f_{2} (V_{2})} = \frac{a (V_{1}, V_{2}) + i b (V_{1}, V_{2})}{a_{2} (V_{2}) + i b_{2} (V_{2})} = \frac{a a_{2} + b b_{2}}{a_{2}^{2} + b_{2}^{2}} + i \frac{a_{2} b - a b_{2}}{a_{2}^{2} + b_{2}^{2}}$ (134)

(135) $f_{2 | 1} = \frac{f (V_{1}, V_{2})}{f_{1} (V_{1})} = \frac{a (V_{1}, V_{2}) + i b (V_{1}, V_{2})}{a_{1} (V_{1}) + i b_{1} (V_{1})} = \frac{a a_{1} + b b_{1}}{a_{1}^{2} + b_{1}^{2}} + i \frac{a_{1} b - a b_{1}}{a_{1}^{2} + b_{1}^{2}}$ (135)

with the normalization constraint(136) $\int_{- \infty}^{\infty} \int_{- \infty}^{\infty} (a^{2} + b^{2})^{1 / 2} d V_{1} d V_{2} = 1$ (136)

This constraint can be enforced by introducing a normalizing multiplier in Equation (131) which will not affect the conditional densities (134) and (135).

Clearly,(137) $A \leq {(a^{2} + b^{2})}^{1 / 2}, and \int_{- \infty}^{\infty} \int_{- \infty}^{\infty} a d V_{1} d V_{2} \leq 1$ (137)

Now our problem can be reformulated in the following manner: given two conditional probability densities (123) and (124), and considering them as real parts of (unknown) complex densities (134) and (135), find the corresponding complex joint density (131), and therefore, all the marginal (132) and (133), as well as the imaginary parts of the conditional densities. In this case, one arrives at two coupled integral equations with respect to two unknowns a(V₁, V₂) and b(V₁, V₂) (while the formulations of a₁(V₁, V₂), a₂(V₁, V₂), b₁(V₁, V₂) and b₂(V₁, V₂) follow from Equations (132) and (133)). These equations are(138) $ρ_{1} (V_{1}, V_{2}) = \frac{a a_{2} + b b_{2}}{a_{2}^{2} + b_{2}^{2}}, ρ_{2} (V_{1}, V_{2}) = \frac{a a_{1} + b b_{1}}{a_{2}^{2} + b_{2}^{2}},$ (138)

The system (138) is non-linear, and very little can be said about general property of its solution without detailed analysis. Omitting such an analysis, let us start with a trivial case when(139) $b = 0$ (139)

In this case, the system (138) reduces to the following two integral equations with respect to one unknown that is a(V₁, V₂):(140) $ρ_{1} (V_{1}, V_{2}) = \frac{a (V_{1}, V_{2})}{\int_{- \infty}^{\infty} a (V_{1}, V_{2}) d V_{2}}, ρ_{2} (V_{1}, V_{2}) = \frac{a (V_{1}, V_{2})}{\int_{- \infty}^{\infty} a (V_{1}, V_{2}) d V_{1}},$ (140)

This system is overdetermined unless the compatibility conditions (115) are satisfied.

As known from classical mechanics, the incompatibility conditions are usually associated with a fundamentally new concept or a physical phenomenon. For instance, incompatibility of velocities in fluid (caused by non-existence of velocity potential) introduces vorticity in rotational flows, and incompatibility in strains describes continua with dislocations. In order to interpret the incompatibility (115), let us return to the system (138). Discretizing the functions in Equation (138) and replacing the integrals by the corresponding sums, one reduces Equation (138) to a system of n algebraic equations with respect to n unknowns. This means that the system is closed, and cases when a solution does not exist are exceptions rather than a rule. Therefore, in most cases, for any arbitrarily chosen conditional densities, for instant, for those given by Equations (123) and (124), the system (138) defines the complex joint density in the form (131).

Now we are ready to discuss a physical meaning of the imaginary component of the complex probability density. Firstly, as follows from comparison of Equations (138) and (140), the imaginary part of the probability density appears as a response to incompatibility of the conditional probabilities, and therefore, it can be considered as a “compensation” for the incompatibility. Secondly, as follows from the inequalities (137), the imaginary part consumes a portion of the “probability mass” increasing thereby the degree of uncertainty in the real part of the complex probability density. Hence, the imaginary part of the probability density can be defined as a measure of the uncertainty “inflicted” by the incompatibility into the real part of this density.

In order to avoid solving the system of integral equations (138), we can reformulate the problem in an inverse fashion by assuming that the complex joint density is given. Then the real parts of the conditional probabilities that drive Equations (119) and (120) can be found from simple formulas (134) and (135).

Let us illustrate this new paradigm, and consider two players assuming that each player knows his own state but does not know the state of his adversary. In order to formalize the degree of initial incompleteness of information, introduce the complex joint probability density, $ρ_{0} (V_{1}, V_{2}) = a_{0} δ (V_{1}, V_{2}) + i b_{0} δ (V_{1}, V_{2})$

that shows how much the players know and how much they do not know about each other when the game starts. With reference to the normalization constraint (136),(142) ${(a_{0}^{2} + b_{0}^{2})}^{1 / 2} = 1$ (142)

The structure of the real part of the joint probability density can be chosen the same as in Equation (84)(143) $Re ρ = a_{0} {[δ - \frac{1}{2} (α_{1} C_{1} f + α_{2} C_{2} f^{- 1})] e^{- t} + \frac{1}{2} (α_{1} C_{1} f + α_{2} C_{2} f^{- 1})},$ (143)

However, since here a₀ < 1, the real part of the joint probability density is reduced due to a “leak” of the probability “mass” from the real to the imaginary part, and this makes predictions less certain for the both players. Otherwise, the formal structure of the motor dynamics is similar to that described by Equations (83) and (84).

The imaginary part can be preset as(144) $Im ρ = b_{0} [(δ - C_{3} ρ^{*}) e^{- t} + C_{3} ρ^{*}]$ (144)

where ρ^* is the probability density characterizing the degree of uncertainty of information that the players have about each other, while the larger ρ^* the more the probability leak from the real to imaginary part of the complex probability density. The arbitrary constants C₁, C₂, and C₃ couples the real and the imaginary parts via the normalization constraint (136)(145) $\int_{- \infty}^{\infty} \int_{- \infty}^{\infty} [{(Re ρ)}^{2} + (Im ρ^{2})]^{1 / 2} d V_{1} d V_{2} = 1$ (145)

The motor dynamics has a slight change compare to Equation (82)(146) ${\dot{v}}_{1} = Re {\frac{e^{- t}}{2 [δ (v_{1}, v_{2}) - ρ^{*} (v_{1}, v_{2})] e^{- t} + ρ^{*} (v_{1}, v_{2})} \int_{- \infty}^{v_{1}} [δ (ζ, v_{2}) - ρ^{*} (ζ, v_{2})]^{} d ζ}$ (146) (147) ${\dot{v}}_{2} = Re {\frac{e^{- t}}{2 [δ (v_{1}, v_{2}) - ρ^{*} (v_{1}, v_{2})] e^{- t} + ρ^{*} (v_{1}, v_{2})} \int_{- \infty}^{v_{2}} [δ (v_{1}, ζ) - ρ^{*} (v_{1}, ζ)]^{} d ζ}$ (147)

Thus, both players rely only upon the real part of the complex joint density instead of a real joint density (that may not exist in this case). But as follows from the inequalities (137), the values of density of the real part are lowered due to loss of the probability mass, and this increases the amount of uncertainty in player’s predictions. In order to minimize that limitation, the players can invoke the imaginary part of the joint density that gives them qualitative information about the amount of uncertainty at the selected maxima.

It should be noticed that the game starts with a significant amount of uncertainties that will grow with next moves. Such subtle and sophisticated relationship is typical for communications between humans, and the proposed model captures it via partial entanglement introduced above.

Remark. So far we considered the imaginary part of a joint probability density as a result of incompatibility of conditional densities of the players. However, this part can have a different origin: it can also represent a degree of deception that the players apply in real-life games. As in the previous example, in games with deception the imaginary part of the joint probability density increases uncertainty of the players’ prediction capabilities. The mathematical formalism of the game with deception is similar to that discussed above.

5. Passive period of players’ performance

In this section, we will discuss the capacity of mathematical formalism that provides an extension of the proposed model to a new space with imaginary time where players exhibit virtual motions such as dreams and memories. In order to demonstrate that, let us replace Equation (54) by the following(148) $f = \frac{ζ}{ρ (v, t)} \int_{- \infty}^{v} [ρ (ζ, t) - ρ^{*} (ζ)] d v$ (148)

where(149) $ζ = \frac{\sqrt{T - t}}{\sqrt{| T - t |}}$ (149)

and T is the period of active performance of the player.

Then at 0 < t < T(150) $ζ = 1$ (150)

the player is active, and its activity is described by the governing Equations (50) and (51).

For t = T $ζ = 0$

the player is at rest, and its state is described by a simple Newtonian state.

But for t > T the feedback (148)(151) $f = \frac{i}{ρ (v, t)} \int_{- \infty}^{v} [ρ (η, t) - ρ^{*} (η)] d η$ (151)

as well as Equations (50), (61) and (52)(152) $\dot{v} = i \frac{ζ_{0}}{ρ (v, t)} \int_{- \infty}^{v} [ρ (η, t) - ρ^{*} (η)] d η$ (152)

(153) $\frac{\partial ρ}{\partial t} + i ζ_{0} [ρ (t) - ρ^{*}] = 0$ (153)

become complex. For better interpretation, it will be more convenient to introduce an imaginary time(154) $\tilde{t} = i t$ (154)

Then, the formal solutions of these equations are(155) $ρ = [(ρ_{0} - ρ^{*}) e^{- \tilde{t}} + ρ^{*}]$ (155) (166) $\int_{- \infty}^{v} ρ^{*} (η) d η = (\frac{C}{e^{- \tilde{t}} - 1}), v \neq 0$ (166)

Thus, the velocity v and the probability density ρ become real functions of imaginary time. It is reasonable to assume that the family of trajectories in the solution (63) describes virtual motions evolving in imaginary time with the probability (62), while the time scale of these motions could be different from the real one. Such a surrealistic activity can be associated with memories and dreams, i.e. with the period of passive performance during which a player has an opportunity to enrich his information with help of memories, and to plan and test his future performance.

6. Discussion and conclusion

We start the discussion with outlining the mathematical novelties of this work. Actually a new class of ODE that are coupled with their Liouville equation is introduced. The leading ideas came from the Madelung equation that is a hydrodynamics version of the Schrödinger equation(167) $\frac{\partial ρ}{\partial t} + \nabla ∙ (\frac{ρ}{m} \nabla S) = 0$ (167) (168) $\frac{\partial S}{\partial t} + {(\nabla S)}^{2} + F - \frac{ħ^{2} \nabla^{2} \sqrt{ρ}}{2 m \sqrt{ρ}} = 0$ (168)

Here ρ and S are the components of the wave function $ψ = \sqrt{ρ} e^{i S / ħ}$ , and ℏ is the Planck constant divided by 2π. The last term in Equation (2) is known as quantum potential. From the viewpoint of Newtonian mechanics, Equation (167) expresses continuity of the flow of probability density, and Equation (168) is the Hamilton–Jacobi equation for the action S of the particle. Actually the quantum potential in Equation (168), as a feedback from Equation (167) to Equation (168), represents the difference between the Newtonian and quantum mechanics, and therefore, it is solely responsible for fundamental quantum properties.

Our approach is based upon replacing the quantum potential with a different Liouville feedback, Figure .

In Newtonian physics, the concept of probability ρ is introduced via the Liouville equation(169) $\frac{\partial ρ}{\partial t} + \nabla ∙ (ρ F) = 0$ (169)

generated by the system of ODE(170) $\frac{d v}{d t} = F [v_{1} (t), \dots v_{n} (t), t]$ (170)

where v is velocity vector.

It describes the continuity of the probability density flow originated by the error distribution(171) $ρ_{0} = ρ (t = 0)$ (171)

in the initial condition of ODE (171).

Let us rewrite Equation (168) in the following form(172) $\frac{d v}{d t} = F [ρ (v)]$ (172)

where v is a velocity of a hypothetical particle.

This is a fundamental step in our approach: in Newtonian dynamics, the probability never explicitly enters the equation of motion. In addition to that, the Liouville equation generated by Equation (172) is non-linear with respect to the probability density ρ(173) $\frac{\partial ρ}{\partial t} + \nabla ∙ {ρ F [ρ (V)]} = 0$ (173)

and therefore, the system (172), (173) departs from Newtonian dynamics. However, although it has the same topology as quantum mechanics (since now the equation of motion is coupled with the equation of continuity of probability density), it does not belong to it either. Indeed Equation (172) is more general than the Hamilton–Jacoby Equation (2): it is not necessarily conservative, and F is not necessarily the quantum potential although further we will impose some restriction upon it that links F to the concept of information. The relation of the system (172), (123) to Newtonian and quantum physics is illustrated in Figure . Two different types of feedbacks replacing the quantum potential (see Equation (23) and Equation (49)) are introduced and analyzed. Both of these feedbacks lead to different approaches to psychological games. As demonstrated, this new class of ODE includes mathematical formalization of human factor. Therefore, the paper extends the mathematical formalism of quantum physics to include games and economics with a component of psychology. The novelty of the approach is based upon a human factor-based behavioral model of an intelligent agent. The model is quantum inspired: it is represented by a modified Madelung equation in which the gradient of quantum potential is replaced by a specially chosen information force. It consists of motor dynamics simulating actual behavior of the agent, and mental dynamics representing evolution of the corresponding knowledge base and incorporating this knowledge in the form of information flows into the motor dynamics. Due to feedback from mental dynamics, the motor dynamics attains quantum-like properties: its trajectory splits into a family of different trajectories, and each of those trajectories can be chosen with the probability prescribed by the mental dynamics; each agent entangled (in a quantum way) to other agents and makes calculated predictions for future actions. In case of a complex density, its imaginary part represents a measure of uncertainty of the density distribution. Human factor is associated with violation of the second law of thermodynamics: the system can move from disorder to order without external help, and that represent intrinsic intelligence. All of these departures actually extend and complement the classical methods making them especially successful in analysis of communications of agents represented by new mathematical formalism. Special attention is concentrated on new approach to theory of differential games, and in particular, on behavioral properties of players as intelligent subjects possessing self-image and self-awareness. Due to quantum-like entanglement they are capable to predict and influence actions of their adversaries. The model addresses a new type of entanglement that correlates the probabilities of actions of livings rather than the actions themselves.

There are several differences between the proposed and conventional game theories. Firstly, in the proposed game, the players are entangled: they cannot make independent deterministic decisions; instead, they make coordinated random decisions such that, at least, the probabilities of these decisions are dependent. Therefore, the proposed game represents a special case of non-determine symmetric simultaneous zero-sum game. Secondly, the maximization of the pay-off function here does not require any special methods (like gradient ascend) since it is “built-in” into the dynamical model. Indeed, the pay-off function (86) is represented by the probability density of the stochastic attractor, and therefore, its maximum value has the highest probability to appear as a random solution of the underlying dynamical model (82). Moreover, the payoff function (86) is not required to be differentiable at all (although it must be integrable).

This work opens up the way to development of a fundamentally new approach to mathematical formalism of economics that is inspired by the formalism of quantum mechanics. Mathematical treatment of economics has a relatively short history. Formal economic modeling began in the nineteenth century with the use of differential calculus to represent and explain economic behavior, such as utility maximization, an early economic application of mathematical optimization. Economics became more mathematical as a discipline throughout the first half of the twentieth century, but introduction of new and generalized techniques in the period around the World War II, as in game theory, (Isaacs, Citation1965), would greatly broaden the use of mathematical formulations in economics. However, not withstanding undisputable success of mathematical methods in economics, there were alarmed critics of the discipline, as well as some noted economists. John Maynard Keynes, Robert Heilbroner, Friedrich Hayek, and others have criticized the broad use of mathematical models for human behavior, arguing that some human choices are irreducible to mathematics. Actually the alert was expressed much earlier by Newton who stated “I can calculate the motion of heavenly bodies, but not the madness of people.”

Limitations of modern mathematical methods in economics are especially transparent in the area of agent-based computational economics. It is a relatively recent field, dating from about the 1990s as to published work. It studies economic processes, including whole economies, as dynamic systems of interacting agents over time. As such, it falls in the paradigm of complex adaptive systems. In corresponding agent-based models, agents are not real people but “computational objects modeled as interacting according to rules” … “whose micro-level interactions create emergent patterns” in space and time. The rules are formulated to predict behavior and social interactions based on incentives and information. The theoretical assumption of mathematical optimization by agents markets is replaced by the less restrictive postulate of agents with bounded rationality adapting to market forces.

In this context, the presented mathematical formalism incorporates a human factor in agent-based economics while departing from physics formalism toward physics of life.

Additional information

Funding

Funding. The author received no direct funding for this research.

Notes on contributors

Michail Zak

This work opens up the way to development of a fundamentally new approach to mathematical formalism of economics that is inspired by the formalism of quantum mechanics. Mathematical treatment of economics has a relatively short history. Recently developed Newtonian models have been criticized for application to human behavior, arguing that some human choices are irreducible to Newtonian physics. Actually, the alert was expressed much earlier by Newton who stated “I can calculate the motion of heavenly bodies, but not the madness of people.” Limitations of modern mathematical methods in economics are especially transparent in the area of agent-based computational economics. In this context, the presented mathematical formalism incorporates a human factor in agent-based economics while departing from physics formalism toward physics of life.

The area of recent research activity of the author is: Mathematical physics, artificial intelligence, and theory of turbulence.

References

Isaacs, R. (1965). Differential games. New York, NY: Wiley.
Google Scholar
Penrose, R. (1955). Why new physics is needed to understand the mind. Cambridge: University Press.
Google Scholar
Risken, H. (1989). The Fokker-Planck equation. New York, NY: Springer.10.1007/978-3-642-61544-3
Google Scholar
Zak, M. (1992). Terminal model of Newtonian dynamics. International Journal of Theoretical Physics, 32, 159–190.
Web of Science ®Google Scholar
Zak, M. (1998). Incompatible stochastic processes and complex probabilities. Physics Letters A, 238, 1–7.10.1016/S0375-9601(97)00763-9
Web of Science ®Google Scholar
Zak, M. (2007). Physics of life from first principles. EJTP 4, 16, 11–96.
Google Scholar
Zak, M. (2008). Quantum-inspired maximizer. Journal of Mathematical Physics, 49, 042702.10.1063/1.2908281
Web of Science ®Google Scholar
Zak, M. (2014a). Particle of life: Mathematical abstraction or reality? New York, NY: Nova.
Google Scholar
Zak, M. (2014b). Origin of randomness in quantum mechanics. Electronic Journal of Theoretical Physics, 11, 149–164.
Google Scholar

Games of entangled agents

Abstract

Public Interest Statement

1. Introduction

1.1. Justification for non-Newtonian approach

2. Dynamical model for simulations

2.1. Destabilizing effect of Liouville feedback

2.2. Emergence of self-generated stochasticity

2.3. Second law of thermodynamics

2.4. Entanglement

2.5. Relevance to model of intelligent agents

2.6. Self-supervised active systems with integral feedback

2.7. Finding global maximum

2.8. Entanglement in QI active systems with integral feedback

3. Application to games of entangled agents

3.1. Problem formulation

3.2. Games with incomplete information

4. Games of partially entangled agents

4.1. Incompatible stochastic processes

4.2. Partial entanglement

5. Passive period of players’ performance

6. Discussion and conclusion

Notes on contributors

Michail Zak

References

Information for

Open access

Opportunities

Help and information

Games of entangled agents

Abstract

Public Interest Statement

1. Introduction

1.1. Justification for non-Newtonian approach

2. Dynamical model for simulations

2.1. Destabilizing effect of Liouville feedback

2.2. Emergence of self-generated stochasticity

2.3. Second law of thermodynamics

2.4. Entanglement

2.5. Relevance to model of intelligent agents

2.6. Self-supervised active systems with integral feedback

2.7. Finding global maximum

2.8. Entanglement in QI active systems with integral feedback

3. Application to games of entangled agents

3.1. Problem formulation

3.2. Games with incomplete information

4. Games of partially entangled agents

4.1. Incompatible stochastic processes

4.2. Partial entanglement

5. Passive period of players’ performance

6. Discussion and conclusion

Additional information

Funding

Notes on contributors

Michail Zak

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date