Full article: Approximate reduction of linear population models governed by stochastic differential equations: application to multiregional models

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

In this work we develop approximate aggregation techniques in the context of slow-fast linear population models governed by stochastic differential equations and apply the results to the treatment of populations with spatial heterogeneity. Approximate aggregation techniques allow one to transform a complex system involving many coupled variables and in which there are processes with different time scales, by a simpler reduced model with a fewer number of ‘global’ variables, in such a way that the dynamics of the former can be approximated by that of the latter. In our model we contemplate a linear fast deterministic process together with a linear slow process in which the parameters are affected by additive noise, and give conditions for the solutions corresponding to positive initial conditions to remain positive for all times. By letting the fast process reach equilibrium we build a reduced system with a lesser number of variables, and provide results relating the asymptotic behaviour of the first- and second-order moments of the population vector for the original and the reduced system. The general technique is illustrated by analysing a multiregional stochastic system in which dispersal is deterministic and the rate growth of the populations in each patch is affected by additive noise.

KEYWORDS:

2010 MATHEMATICS SUBJECT CLASSIFICATION:

1. Introduction

Nature offers many examples of systems with an inherent complexity whose study leads to mathematical models with a large number of state variables whose analytical study is, in most cases, not feasible. In order to be able to extract important information about the behaviour of some of these complex models, one can resort to ‘approximate aggregation methods’. These are mathematical techniques, which are usually applied in systems governed by processes with different time scales, in which appropriate approximations are introduced in order to transform the system under consideration into a reduced system with a lower number of variables, called ‘global variables’. In this way, the behaviour of the original system can be approximated, but not known with exactitude, in terms of the knowledge of the behaviour of the reduced system.

Approximate aggregation techniques in population dynamics have been widely studied in the context of deterministic systems with different time scales both in continuous and discrete time (see the review in [Citation2] and the references therein), as well as in discrete stochastic models incorporating either environmental [Citation19] or demographic stochasticity [Citation21].

Stochastic differential equations (SDEs) can be thought of as resulting from the introduction of environmental stochasticity in the coefficients of deterministic ODEs. In spite of the difficulty of their analytical treatment, they have become popular as population modeling tools (see for example [Citation6, Citation8] for the scalar case and [Citation13, Citation24] for applications to competition models and epidemiology respectively).

The aim of this work is to formulate an approximate aggregation technique valid to reduce fast-slow linear population models governed by SDEs, to give sufficient conditions for the solutions of the models to be positive and to relate the asymptotic behaviour of the first- and second-order moments of the population vector for the original and the reduced system. The original model is built by considering a fast deterministic process that converges to an equilibrium, together with a slow process in which the parameters are affected by additive noise. In the resulting system the quotient between the rate of diffusion squared and the speed of drift is $O (ε)$ , $ε \to 0$ , where ϵ is a measure of the difference of time scales between the fast and the slow process. This is in contrast with most previous works in the field [Citation5, Citation10, Citation11, Citation16] which are valid in other situations. An exception is [Citation14] which covers our case of interest but which only deals with the relationships between the original and the reduced systems in finite time intervals.

The existing literature on the field of the approximate reduction of SDEs has been developed mainly in the context of physics and control theory. Our approach, although general in nature, is aimed to population dynamics applications. Specifically, we will employ it to study a multiregional model consisting of a single population living in a multipatch environment in such a way that dispersal is deterministic and the growth rate of the population in each patch is affected by additive noise. Models with spatial heterogeneity have been widely studied through aggregation techniques (see [Citation3] for the continuous time case and [Citation2] for discrete time), normally making use of the fact that in many practical situations the dispersal of individuals amongst the different patches is fast with respect to other processes like demography or competition (an exception to this is [Citation20]). In this work we contemplate a situation that has not been dealt with so far: dispersal between some spatial patches can be fast with respect to demography whereas migration between some other patches can happen at the same time scale of demography. For example we can think of a population of birds living in different islands in such a way that inter-island movements happen at the scale of demography but, in comparison, intra-island migrations are fast.

The manuscript is organized as follows: in Section 2 we present the general formulation of linear SDE models and, in order to be able to use them in population dynamics applications, we give sufficient conditions that guarantee that if initial conditions are positive, the solution to such a system remains positive for all times. Section 3 presents the general formulation of a linear two time scale population model with stochasticity affecting the slow process. Section 4 carries out the reduction of the system, which is accomplished by letting the fast process reach equilibrium and defining and adequate set of new variables. In Section 5 we apply the general technique to the case of a stochastic multiregional model, with special attention to the case in which all migrations are fast with respect to demography, since it is the most frequent situation in practice and moreover in that case the reduced system is scalar. Section 6 presents a result that allows one to relate the asymptotic behaviour of the first and second statistical moments of the population vector for the original and the reduced system, and applies it to study the asymptotic behaviour of the multiregional models of Section 5. A brief discussion of results and the Appendix with mathematical proofs complete the manuscript.

2. Linear population models with two time scales

Throughout the paper we assume we are working in a complete probability space $(Ω, F, {F_{t}}_{t \geq 0}, P)$ where the filtration ${F_{t}}_{t \geq 0}$ satisfies the usual conditions [Citation18]. Let us consider a structured population modelled by an linear autonomous homogeneous stochastic differential equation. The model has the form (1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) where $X (t) = (X^{1} (t), X^{2} (t), \dots, X^{d} (t)) \in R^{d}$ is the population vector, $A = [a_{i j}] \in R^{d \times d}$ , $B^{h} = [b_{i j}^{h}] \in R^{d \times d},$ $h = 1, 2, \dots, m$ and $W (t) = (W^{1} (t), W^{2} (t), \dots, W^{m} (t))^{T}$ is a m-dimensional standard Wiener process defined in the previous probability space. Moreover, we assume that $X_{0} \in R^{d}$ is a non-random vector. Models of the kind (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) are obtained from a linear deterministic model (2) $\frac{d X (t)}{d t} = A X (t), X (0) = X_{0}$ (2) if we add noise to the population vital rates $a_{i j}$ . Indeed, system (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) can be written in the form $d X^{i} (t) = \overset{d}{\sum_{j = 1}} (a_{i j} d t + b_{i j}^{1} d W^{1} (t) + \dots + b_{i j}^{m} d W^{m} (t)) X^{j} (t), i = 1, \dots, d$ from where we see that each coefficient $b_{i j}^{h}$ characterizes the intensity of the noise $d W^{h}$ affecting $a_{i j}$ . Note that the case in which the noises $d W^{j} (t)$ are correlated can be reduced to this setting through an appropriate transformation [Citation1, p. 126]. Systems of the kind (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) can be interpreted in the sense of Ito or in the sense of Stratonovich, and the results obtained in the two cases differ [Citation23]. However, one can choose any of the two interpretations as long as one defines appropriately the parameters of the model [Citation7]. In this work we will make use of the Ito interpretation. It is well known [Citation1, p. 126] that in the previous conditions there exists a unique solution to (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) which is continuous with probability one.

In order to be a valid model for a population, the solution of (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) for any positive initial condition must remain non-negative for all times, so we turn our attention to this property. Given a vector or matrix $x$ we will write $x \geq 0$ (resp. $x > 0$ ) to denote that all the components of $x$ are non-negative (resp. positive). Let us recall that a square matrix $A$ is said to be a Metzler matrix or a essentially non-negative matrix when it has non-negative off-diagonal entries, that is, $a_{i j} \geq 0$ for $i \neq j$ [Citation22]. It is well known that if $A$ is a Metzler matrix then solutions of the deterministic system (Equation2(2) $\frac{d X (t)}{d t} = A X (t), X (0) = X_{0}$ (2) ) meet the desired property. However, in model (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) additional requirements are needed in order to ensure positivity of solutions. The next result gives sufficient conditions:

Theorem 2.1

If matrix $A$ is a Metzler matrix and matrices $B^{1}, \dots, B^{k}$ are diagonal, then given $X (0) = X_{0} > 0,$ the solution of (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) verifies that $P (X (t) > 0$ for all $t \geq 0) = 1$ .

Proof.

See Appendix.

3. A model with two time scales

We suppose a stage-structured population in which individuals are classified into stages or groups attending to any characteristic of the life cycle. Moreover, each of these groups is divided into several subgroups that can correspond to different spatial patches, different individual activities or any other characteristic that could change the life cycle parameters. The model is therefore general in the sense that we do not state in detail the nature of the population or the subpopulations.

We consider the population being subdivided in q populations (or groups). Each group is subdivided in subpopulations (subgroups) in such a way that for each $i = 1, 2, \dots, q$ , group i has $N_{i}$ subgroups. Therefore, the total number of subgroups is $N := N_{1} + N_{2} + \dots + N_{q}$ . We denote the fast time as τ, while the slow time will be denoted by t. In this way we have $t = ε τ$ where $ε << 1$ is a small number that represents the ratio between the slow and the fast times.

Le $x^{i j} (τ)$ be the density of subpopulation j of population i at time τ, with $i = 1, 2, \dots, q$ and $j = 1, 2, \dots, N_{i}$ . In order to describe the population of group i at time τ we will use vector $x^{i} (τ) = (x^{i 1} (τ), x^{i 2} (τ), \dots, x^{i N_{i}} (τ))^{T} \in R^{N_{i}},$ $i = 1, 2, \dots, q,$ where T denotes transposition. The composition of the total population is then given by vector $X (τ) = (x^{1} (τ)^{T}, x^{2} (τ)^{T}, \dots, x^{1} (τ)^{T})^{T} \in R^{N}$ .

In the evolution of the population we will consider two linear processes whose corresponding characteristic time scales, are very different from each other. In order to include in our model both time scales we will model these two processes, to which we will refer as the fast and the slow dynamics, by two different matrices.

In principle, we will make no special assumptions regarding the characteristics of the slow dynamics other than linearity and restrictions to guarantee positivity of solutions to the slow process. Thus, we will assume that the parameters of the slow process are defined by (3) $S + \overset{k}{\sum_{h = 1}} B^{h} \frac{d W^{h} (t)}{d t},$ (3) where: S.1. Matrix $S \in R^{N \times N}$ models the deterministic part of the slow process and has non-negative off-diagonal entries, that is, $S$ is a Metzler matrix. We consider $S$ divided into blocks $S_{i j}, 1 \leq i, j \leq q$ in such a way that $S = [\begin{matrix} S_{11} & \dots & S_{1 q} \\ ⋮ & ⋱ & ⋮ \\ S_{q 1} & \dots & S_{q q} \end{matrix}] .$ Each block $S_{i j} = [s_{i j}^{α β}]$ has dimensions $N_{i} \times N_{j}$ and characterizes the rates of transference of individuals from the subgroups of group j to the subgroups of group i. More specifically, for each $α = 1, 2, \dots, N_{i}$ and each $β = 1, 2, \dots, N_{j}$ , entry $s_{i j}^{α β}$ represents the rate of transference of individuals from subgroup β of group j to subgroup α of group i.

S.2. $W^{1}, \dots, W^{k}$ are independent standard Wiener processes (so that by $d W^{1} (t) / d t, \dots, d W^{k} (t) / d t$ we denote the associated weakly defined gaussian white noises) and, for each $h = 1, \dots, k$ , matrix $B^{h} \in R^{N \times N}$ models the contribution of noise $d W^{h} (t)$ to the dynamics of the slow process. In order to be able to guarantee positivity of solutions (Theorem 2.1), we assume that matrices $B^{h}$ are diagonal, that is, $B^{h} = diag (B_{1}^{h}, \dots, B_{q}^{h}), h = 1, \dots, k with B_{i}^{h} = diag (b_{i 1}^{h}, \dots, b_{i N_{i}}^{h}), i = 1, \dots, q .$ Note from Equation (Equation3(3) $S + \overset{k}{\sum_{h = 1}} B^{h} \frac{d W^{h} (t)}{d t},$ (3) ) that $b_{i α}^{h}$ models the intensity of the noise $d W^{h} (t)$ affecting coefficient $s_{i i}^{α α}$ .

In this context, we say that an eigenvalue λ of a certain square matrix $A$ is strictly dominant when the real part of λ is strictly larger than the real part of the rest of the eigenvalues of $A$ .

As far as the behaviour of the fast dynamics is concerned, we will make the following three assumptions:

F.1. The fast process is deterministic.

F.2. The fast dynamics is an internal process for each group, that is, there is no transference of individuals from one group to a different one. Therefore, for each $i = 1, \dots, q$ , the fast dynamics of group i will be represented by a Metzler matrix $F_{i}$ of dimensions $N_{i} \times N_{i}$ . We will assume that $F_{i}$ is irreducible in the sense that there exists r>0 such that $r I + F_{i}$ is a primitive non-negative matrix [Citation22]. Therefore, the matrix that governs the fast dynamics for the whole population is (4) $F = diag (F_{1}, F_{2}, \dots, F_{q}) .$ (4)

F.3. The fast process in each group has a non-trivial equilibrium point which is asymptotically stable. More specifically, for each $i = 1, \dots, q$ matrix $F_{i}$ has eigenvalue 0 and it is strictly dominant so that the rest of the eigenvalues of $F_{i}$ have negative real parts. Since $F_{i}$ is a Metzler irreducible matrix, eigenvalue 0 is simple [Citation22, Theorem 2.6] and moreover, there exist positive right and left eigenvectors $v_{i}$ and $u_{i}$ of $F_{i}$ associated to eigenvalue 0 for which we choose the following normalization conditions (5) $\begin{aligned} F_{i} v_{i} & = 0; u_{i}^{T} F_{i} = 0^{T} \\ ∥ v_{i} ∥_{1} & = 1; u_{i}^{T} v_{i} = 1, \end{aligned}$ (5) where $∥ * ∥_{1}$ denotes the 1-vector norm.

In order to incorporate both time scales in our model, we will make use of parameter $ε = t / τ$ . The model, to which we will refer as original system, has the following form in the slow time (6) $d X (t) = \frac{1}{ε} F X (t) d t + S X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t) .$ (6) Alternatively, using the fast time τ we have (7) $d X (τ) = F X (τ) d τ + ε S X (τ) d τ + \sqrt{ε} \overset{k}{\sum_{h = 1}} B^{h} X (τ) d W^{h} (τ),$ (7) where we have made an abuse of notation and kept the same notation for $X$ and the $W^{h}$ in the new time, and we have used [Citation1, p. 47] that if $W (t)$ is a standard Wiener process then so is $\sqrt{ε} W (t / ε)$ . We stress that the quotient between the rate of diffusion squared and the speed of drift is $O (ε)$ , $ε \to 0$ , which is in contrast with most approaches in the analysis of two-time scales systems [Citation5, Citation10, Citation11, Citation16] which are valid in other situations.

4. Approximate reduction of the model

In order to reduce the original system (Equation7(7) $d X (τ) = F X (τ) d τ + ε S X (τ) d τ + \sqrt{ε} \overset{k}{\sum_{h = 1}} B^{h} X (τ) d W^{h} (τ),$ (7) ), we will use the fact that the fast process has an asymptotic stable equilibrium, and we will approximate this system by another one in which the fast process has reached equilibrium.

Let $i = 1, \dots, q$ be fixed and let ${\hat{F}}_{i} := v_{i} u_{i}^{T}$ . From Equation (Equation5(5) $\begin{aligned} F_{i} v_{i} & = 0; u_{i}^{T} F_{i} = 0^{T} \\ ∥ v_{i} ∥_{1} & = 1; u_{i}^{T} v_{i} = 1, \end{aligned}$ (5) ), we have that if the system were governed by the fast process exclusively, for any initial condition $x^{i} \in R_{+}^{N_{i}}$ the population of each group i would tend to vector ${\hat{x}}^{i} := {\hat{F}}_{i} x^{i} = (u_{i}^{T} x^{i}) v_{i}$ . From this expression we note that vector $v_{i}$ defines the equilibrium population structure for group i and $u_{i}$ is a vector of reproductive values, that is, the larger $u_{i}^{j}$ , the higher the contribution of the j-th subgroup of the ith group to the equilibrium population. Therefore $u_{i}$ characterizes the size of the equilibrium population.

We define matrix $\hat{F}$ in the following way, $\hat{F} = diag ({\hat{F}}_{1}, {\hat{F}}_{2}, \dots, {\hat{F}}_{q}),$ and then for any initial condition $X \in R_{+}^{N}$ , the equilibrium population for the fast dynamics in the whole system is given by $\hat{X} = \hat{F} X$ . Let us define the non-negative matrices $V : = diag (v_{1}, v_{2}, \dots, v_{q}) \in R^{N \times q}, U : = diag (u_{1}^{T}, u_{2}^{T}, \dots, u_{q}^{T}) \in R^{q \times N}$ whose interpretation is immediate bearing in mind what we pointed out about $v_{i}$ and $u_{i}$ .

Some of the properties of these matrices are gathered in the following lemma, whose proof is straightforward:

Lemma 4.1

Matrices $F, \hat{F}, V$ and $U$ verify:

$\hat{F} = {V U, U V = I}_{q}$ and the columns of $V$ are independent and constitute a basis of $Im \hat{F}$ .
$F \hat{F} = 0$ .

Now, from Equation (Equation6(6) $d X (t) = \frac{1}{ε} F X (t) d t + S X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t) .$ (6) ) we will build an auxiliary system replacing the state variables in the right side by its equilibrium values for the fast process and use that $F \hat{F} = 0$ , (8) $\begin{aligned} d X (t) & = \frac{1}{ε} F \hat{F} X (t) d t + S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) \\ = S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) . \end{aligned}$ (8) Now we define the vector of global variables as (9) $Y = (y^{1}, \dots, y^{q})^{T} : = U X \in R^{q}$ (9) and, multiplying Equation (Equation8(8) $\begin{aligned} d X (t) & = \frac{1}{ε} F \hat{F} X (t) d t + S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) \\ = S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) . \end{aligned}$ (8) ) on the left by $U$ and using that $\hat{F} = V U$ and Equation (Equation9(9) $Y = (y^{1}, \dots, y^{q})^{T} : = U X \in R^{q}$ (9) ) we obtain the aggregated system (10) $d Y (t) = \bar{S} Y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{B}}^{h} Y (t) d W^{h} (t),$ (10) where we have defined (11) $\begin{aligned} \bar{S} & = [{\bar{s}}_{i j}] := U S V = [\begin{matrix} u_{1}^{T} S_{11} v_{1} & \dots & u_{1}^{T} S_{1 q} v_{q} \\ ⋮ & ⋱ & ⋮ \\ u_{q}^{T} S_{q 1} v_{1} & \dots & u_{q}^{T} S_{q q} v_{q} \end{matrix}] \in R^{q \times q} \end{aligned}$ (11) (12) $\begin{aligned} {\bar{B}}^{h} & := {U B}^{h} V = diag (\overset{N_{1}}{\sum_{α = 1}} v_{1}^{α} u_{1}^{α} b_{1 α}^{h}, \dots, \overset{N_{q}}{\sum_{α = 1}} v_{q}^{α} u_{q}^{α} b_{q α}^{h}) \in R^{q}, h = 1, \dots, k . \end{aligned}$ (12) Note that the global variables $Y (t)$ defined by Equation (Equation9(9) $Y = (y^{1}, \dots, y^{q})^{T} : = U X \in R^{q}$ (9) ) have the following expression in terms of the variables $X (t)$ of the auxiliary system: $y^{i} (t) = u_{i}^{T} x^{i} (t) = u_{i}^{1} x^{i 1} (t) + u_{i}^{2} x^{i 2} (t) + \dots + u_{i}^{N_{i}} x^{i N_{i}} (t), i = 1, \dots, q .$ Note that:

$y^{i} (t)$ is a linear combination of the variables corresponding to group i, being the coefficients of the combination the components of vector $u_{i}$ . Recall that $u_{i}$ is a vector of reproductive values for the fast process in group i. Therefore, for each $j = 1, \dots, N_{i}$ , variable $x^{i j} (t)$ has a relative weight in $y^{i} (t)$ which is proportional to $u_{i}^{j}$ , that is, proportional to the contribution to the total equilibrium population that an individual initially present in group i and subgroup j would have in the case that the system were governed by the fast process exclusively.
The global variables are conservative for the fast process. Indeed, suppose that the fast process is the only one acting in the system. Then we would have $\dot{X} (t) = F X (t) / ε$ and using that $U F = 0$ , $\dot{Y} (t) = U \dot{X} (t) = U F X (t) / ε = 0$ .
The components of the matrices representing the drift and the diffusion for the reduced system are certain linear combinations of their analogues for the original system, where the coefficients of the combination are determined by the equilibrium characteristics of the fast process.

The next result together with Theorem 2.1 guarantees that the original and the aggregated systems have positive solutions for any positive initial conditions.

Proposition 4.2

$F + ε S$ and $\bar{S}$ are Metzler matrices and matrices ${\bar{B}}^{h},$ $h = 1, \dots, k$ are diagonal. Therefore, according to Theorem 2.1 both the original system (Equation7(7) $d X (τ) = F X (τ) d τ + ε S X (τ) d τ + \sqrt{ε} \overset{k}{\sum_{h = 1}} B^{h} X (τ) d W^{h} (τ),$ (7) ) and the aggregated system (Equation10(10) $d Y (t) = \bar{S} Y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{B}}^{h} Y (t) d W^{h} (t),$ (10) ) verify that for any positive initial condition the solution remains positive for all t>0 with probability one.

Proof.

$F + ε S$ is clearly a Metzler matrix for it is the sum of Metzler matrices. Now let $i \neq j$ . Since $S$ is a Metzler matrix we have that $S_{i j} \geq 0$ and using the fact that $u_{i}$ and $v_{j}$ are positive vectors, from Equation (Equation11(11) $\begin{aligned} \bar{S} & = [{\bar{s}}_{i j}] := U S V = [\begin{matrix} u_{1}^{T} S_{11} v_{1} & \dots & u_{1}^{T} S_{1 q} v_{q} \\ ⋮ & ⋱ & ⋮ \\ u_{q}^{T} S_{q 1} v_{1} & \dots & u_{q}^{T} S_{q q} v_{q} \end{matrix}] \in R^{q \times q} \end{aligned}$ (11) ) it follows that ${\bar{s}}_{i j} = u_{i}^{T} S_{i j} v_{j} \geq 0$ and so $\bar{S}$ is a Metzler matrix. Moreover, from Equation (Equation12(12) $\begin{aligned} {\bar{B}}^{h} & := {U B}^{h} V = diag (\overset{N_{1}}{\sum_{α = 1}} v_{1}^{α} u_{1}^{α} b_{1 α}^{h}, \dots, \overset{N_{q}}{\sum_{α = 1}} v_{q}^{α} u_{q}^{α} b_{q α}^{h}) \in R^{q}, h = 1, \dots, k . \end{aligned}$ (12) ) it is clear that ${\bar{B}}^{h}$ is a diagonal matrix for each $h = 1, \dots, k$ .

5. Multiregional models with two time scales

In this section we will illustrate the reduction technique by applying it to a multiregional model.

5.1. Model setting

We consider a population living in a multipatch system. We assume that there are a number N of different patches among which the individuals can migrate. The growth of the population in each patch is linear and is affected by stochasticity. Migration among patches is assumed to be deterministic.

We number the patches in the form $(i, α)$ , i=1, 2, $α = 1, \dots, N_{i}$ , where $N := N_{1} + N_{2}$ . Coming back to the terminology of Section 3, the first index, i, defines the ‘group’ of patches and the second, α, the ‘subgroup’ within that group. Note that in our setting we have chosen that the number q of groups is 2 just for the sake of simplicity in the expression of the matrices involved, but the generalization of the model to an arbitrary number q of groups is straightforward. Let $x^{i α} (t)$ denote the population in group i and subgroup j at time t and let $X = (x^{11}, \dots, x^{1 N_{1}}, x^{21}, \dots, x^{2 N_{2}})^{T} \in R^{N}$ be the population vector.

We assume that migration is fast within each group of patches, that is, from any patch $(i, α)$ to any other patch of the form $(i, β)$ , $β \neq α$ . Migration is assumed to be slow between patches belonging to different groups, that is, from any patch $(i, α)$ to any other patch of the form $(j, β)$ , with $j \neq i$ . We can think of each group of patches as spatial regions located close to each other so that migration between them is easy, whereas different groups of patches correspond to regions amongst which migration is more difficult. For example, in a population of birds each group of patches can correspond to an island and the subgroups can correspond to the different spatial locations within an island, so that intra-island migrations are fast with respect to inter-island movements.

Let τ and t denote, respectively, the times corresponding to the fast and the slow migrations and let $ε := τ / t$ be the ratio between both. We assume that the growth of the population takes place in the slow time scale. For each pair $(i, α)$ , i=1, 2, $α = 1, \dots, N_{i}$ , let $r_{i}^{α}$ be the deterministic population growth rate in patch $(i, α)$ and let us assume that this growth rate is affected by a noise defined by a certain linear combination $σ_{i α}^{1} d W^{1} (t) / d t + \dots + σ_{i α}^{k} d W^{k} (t) / d t$ of (weakly defined) independent white noise processes, where $σ_{i α}^{h} \geq 0$ for each $h = 1, \dots, k$ . Now we define matrices $\begin{aligned} R_{i} & := diag (r_{i}^{1}, \dots, r_{i}^{N_{i}}) \in R^{N_{i} \times N_{i}}, i = 1, \dots, q \\ R & := diag (R_{1}, R_{2}) \in R^{N \times N} \\ G_{i}^{h} & := diag (σ_{i 1}^{h}, \dots, σ_{i N_{i}}^{h}) \in R^{N_{i} \times N_{i}}, i = 1, \dots, q, h = 1, \dots, k \\ G^{h} & := diag (G_{1}^{h}, G_{2}^{h}) \in R^{N \times N}, h = 1, \dots, k . \end{aligned}$ Regarding the slow migration between different groups of patches, for each $i \neq j$ and each $α = 1, \dots, N_{i}$ , $β = 1, \dots, N_{j}$ , we define $l_{i j}^{a β} \geq 0$ as the (slow) migration coefficient from patch $(j, β)$ to patch $(i, α)$ . Similarly we define $l_{i i}^{a β} = 0$ for each $α, β = 1, \dots, N_{i}$ with $α \neq β$ (as there is no slow migration within a group of patches) and $l_{i i}^{α α} := - \overset{q}{\sum_{j = 1, j \neq i}} \overset{N_{j}}{\sum_{β = 1, β \neq α}} l_{j i}^{β α}, i = 1, 2, α = 1, \dots, N_{i} .$ Let us define matrices $L_{i j} := [l_{i j}^{α β}] \in R^{N_{i} \times N_{j}}$ , $i, j = 1, \dots, q$ and $L : = [\begin{matrix} L_{11} & L_{12} \\ L_{21} & L_{22} \end{matrix}] .$ Then the slow process, that is, the joint effect of the growth process and of the slow migration between patches of different groups, can be modelled by the following system of SDEs $d X (t) = (L + R) X (t) d t + \overset{k}{\sum_{h = 1}} G^{h} X (t) d W^{h} (t), X (0) = X_{0} .$ Regarding the fast migration between patches of the same group, for each i=1, 2 and $α, β = 1, \dots, N_{i}$ with $α \neq β,$ let $m_{i}^{α β} \geq 0$ be the (fast) migration rate from patch $(i, β)$ to patch $(i, α)$ . For $β = α$ we define (13) $m_{i}^{α α} := - \overset{N_{i}}{\sum_{β = 1, β \neq α}} m_{i}^{β α} .$ (13) Now let $M_{i} := [m_{i}^{α β}] \in R^{N_{i} \times N_{i}}$ , i=1, 2. We assume that the $m_{i}^{α β}$ are such that $M_{i}$ is irreducible for each i=1, 2. From Equation (Equation13(13) $m_{i}^{α α} := - \overset{N_{i}}{\sum_{β = 1, β \neq α}} m_{i}^{β α} .$ (13) ) we have that the columns of each $M_{i}$ add up to zero and so matrix $M_{i} + I$ is a non-negative primitive column stochastic matrix. Therefore 0 is the (strictly) dominant eigenvalue of $M_{i}$ and moreover it is simple. Let $u_{i} := 1_{i} = (1, \dots, 1)^{T} \in R^{N_{i}}$ and $v_{i} = (v_{i}^{1}, \dots, v_{i}^{N_{i}})^{T} \in R^{N_{i}}$ be its associated positive left and right eigenvectors, where we assume the normalization condition $1_{i}^{T} v_{i} = 1$ . Note that vector $v_{i}$ defines the equilibrium distribution between the different patches of group i when we consider the fast migration as the only process acting on the system. Now we define matrices $\begin{aligned} M & := diag (M_{1}, M_{2}) \in R^{N \times N} \\ V & := diag (v_{1}, v_{2}) \in R^{N \times 2}, U := diag (1_{1}^{T}, 1_{2}^{T}) \in R^{2 \times N} \\ {\hat{M}}_{i} & := v_{i} 1_{i}^{T} \in R^{N_{i} \times N_{i}}, i = 1, 2, \hat{M} := diag ({\hat{M}}_{1}, {\hat{M}}_{2}) \in R^{N \times N} . \end{aligned}$ Then, the complete model that takes into account the joint effect of the slow population growth in each patch and the slow and fast migrations between patches is given by $d X (t) = (\frac{1}{ε} M + L + R) X (t) d t + \overset{k}{\sum_{h = 1}} G^{h} X (t) d W^{h} (t)$ or, using the fast time τ, (14) $d X (τ) = (M + ε (L + R)) X (τ) d τ + \sqrt{ε} \overset{k}{\sum_{h = 1}} G^{h} X (τ) d W^{h} (τ), X (0) = X_{0},$ (14) which constitutes a system of N linear SDEs.

Note that under the above conditions we are in the general setting of Section 3 by taking $S = L + R, F = M, \hat{F} = \hat{M}, B^{h} = G^{h}, h = 1, \dots, k$ and Hypotheses S1, S2, F1, F2 and F3 for the slow and fast processes are met.

5.2. Model reduction

Therefore we can proceed to the aggregation of the original system following the procedure developed in Section 4. We define the global variables $Y := U X$ , that is, $y^{i} (t) = 1_{i}^{T} x^{i} (t) = x^{i 1} (t) + x^{i 2} (t) + \dots + x^{i N_{i}} (t), i = 1, 2,$ so that $y^{i}$ is the total population in all patches of group i, and the reduced aggregated system is the two dimensional SDE (15) $d Y (t) = \bar{S} Y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{G}}^{h} Y (t) d W^{h} (t), Y (0) = {U X}_{0},$ (15) where $\begin{aligned} \bar{S} & := U (L + R) V = (\begin{matrix} 1_{1}^{T} (L_{11} + R_{1}) v_{1} & 1_{1}^{T} L_{12} v_{2} \\ 1_{2}^{T} L_{21} v_{1} & 1_{1}^{T} (L_{22} + R_{2}) v_{2} \end{matrix}) \in R^{2 \times 2} \\ {\bar{G}}^{h} & := {U G}^{h} V = diag (1_{1}^{T} G_{1}^{h} v_{1}, 1_{2}^{T} G_{2}^{h} v_{2}), h = 1, \dots, k . \end{aligned}$ In order to be more specific we will consider the particular case in which $N_{1} = N_{2} = 2$ , that is, there are 4 patches $(1, 1)$ , $(1, 2)$ , $(2, 1)$ and $(2, 2),$ and the population vector is $X = (x^{11}, x^{12}, x^{21}, x^{22})^{T}$ . Dispersal is fast between patches $(1, 1)$ and $(1, 2)$ and between $(2, 1)$ and $(2, 2)$ , and is slow in the rest of cases.

Let i=1, 2 be fixed. We will keep the notation introduced above except in the case of the fast migrations, where in order to simplify it we will denote by $p_{i}$ and $q_{i}$ the (fast) migration rates from $(i, 1)$ to $(i, 2)$ and from $(i, 2)$ to $(i, 1)$ respectively. Therefore $M_{i} = (\begin{matrix} - p_{i} & q_{i} \\ p_{i} & - q_{i} \end{matrix}), i = 1, 2.$ We assume $0 < p_{i}, q_{i} < 1$ so that matrix $M_{i}$ is irreducible and vector $v_{i}$ has the form $v_{i} = (q_{i}, p_{i})^{T} / (p_{i} + q_{i})$ . Then the complete model has the form (Equation14(14) $d X (τ) = (M + ε (L + R)) X (τ) d τ + \sqrt{ε} \overset{k}{\sum_{h = 1}} G^{h} X (τ) d W^{h} (τ), X (0) = X_{0},$ (14) ) with $G^{h} = diag (σ_{11}^{h}, σ_{12}^{h}, σ_{21}^{h}, σ_{22}^{h})$ , $h = 1, \dots, k$ and $M + ε (L + R)$ given by $(\begin{matrix} - p_{1} - ε (l_{21}^{11} + l_{21}^{21} - r_{1}^{1}) & q_{1} & ε l_{12}^{11} & ε l_{12}^{12} \\ p_{1} & - q_{1} - ε (l_{21}^{12} + l_{21}^{22} - r_{1}^{2}) & ε l_{12}^{21} & ε l_{12}^{22} \\ ε l_{21}^{11} & ε l_{21}^{12} & - p_{2} - ε (l_{12}^{11} + l_{12}^{21} - r_{2}^{1}) & q_{2} \\ ε l_{21}^{21} & ε l_{21}^{22} & p_{2} & - q_{2} - ε (l_{12}^{12} + l_{12}^{22} - r_{2}^{2}) \end{matrix}) .$ The global variables are $Y = (y^{1}, y^{2})^{T} = (x^{11} + x^{12}, x^{21} + x^{22})^{T}$ and the reduced system has the form (Equation15(15) $d Y (t) = \bar{S} Y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{G}}^{h} Y (t) d W^{h} (t), Y (0) = {U X}_{0},$ (15) ) with (16) $\begin{aligned} \bar{S} & = (\begin{matrix} {\bar{s}}_{11} & {\bar{s}}_{12} \\ {\bar{s}}_{21} & {\bar{s}}_{22} \end{matrix}) \\ = (\begin{matrix} \frac{q_{1} (- l_{21}^{11} - l_{21}^{21} + r_{1}^{1}) + p_{1} (- l_{21}^{12} - l_{21}^{22} + r_{1}^{2})}{p_{1} + q_{1}} & \frac{q_{1} (l_{12}^{11} + l_{12}^{21}) + p_{1} (l_{12}^{12} + l_{12}^{22})}{p_{2} + q_{2}} \\ \frac{q_{1} (l_{21}^{11} + l_{21}^{21}) + p_{1} (l_{21}^{12} + l_{21}^{22})}{p_{1} + q_{1}} & \frac{q_{2} (- l_{12}^{11} - l_{12}^{21} + r_{2}^{1}) + p_{2} (- l_{12}^{12} - l_{12}^{22} + r_{2}^{2})}{p_{2} + q_{2}} \end{matrix}) \\ {\bar{G}}^{h} & = diag ({\bar{σ}}_{1}^{h}, {\bar{σ}}_{2}^{h}) = diag (\frac{q_{1} σ_{11}^{h} + p_{1} σ_{12}^{h}}{p_{1} + q_{1}}, \frac{q_{2} σ_{21}^{h} + p_{2} σ_{22}^{h}}{p_{2} + q_{2}}), h = 1, \dots, k . \end{aligned}$ (16)

5.3. Case in which migration is fast

Let us now consider the particular but relevant case in which there is only one group of patches, and therefore all the migrations among the patches are fast with respect to demography. This has been the usual setting in the literature of approximate aggregation techniques [Citation2, Citation3]. We can consider this case as a particular instance of the setting in Section 5.1 when we take q equal to 1 instead of equal to 2. In order to simplify the notation, we omit the index 1 regarding the group of patches. Therefore, we denote the patches as $j = 1, \dots, N$ , and the vectors and matrices have the form $X = (x^{1}, \dots, x^{N})^{T}$ , $R := diag (r^{1}, \dots, r^{N})$ , $L = 0$ (there are no slow migrations), $M := [m^{α β}]$ and $G^{h} := diag (σ_{1}^{h}, \dots, σ_{N}^{h})$ . Therefore, the original system (Equation14(14) $d X (τ) = (M + ε (L + R)) X (τ) d τ + \sqrt{ε} \overset{k}{\sum_{h = 1}} G^{h} X (τ) d W^{h} (τ), X (0) = X_{0},$ (14) ) takes the form $d X (τ) = (M + ε R) X (τ) d τ + \sqrt{ε} \overset{k}{\sum_{h = 1}} G^{h} X (τ) d W^{h} (τ) .$ In this case there is only one global variable $y (t) = x^{1} (t) + x^{2} (t) + \dots + x^{N} (t)$ and if $v = (v^{1}, \dots, v^{N})^{T}$ and $1 = (1, \dots, 1)^{T}$ are the positive right and left eigenvectors of $M$ associated to eigenvalue 0 scaled so that $1^{T} v = 1$ , we have $V = v,$ $U = 1^{T}$ . Therefore, the reduced system is the scalar SDE $d y (t) = \bar{s} y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{g}}^{h} y (t) d W^{h} (t), y (0) = \overset{N}{\sum_{j = 1}} x^{j} (0),$ where (17) $\bar{s} := 1^{T} R v = \overset{N}{\sum_{j = 1}} v^{j} r^{j}, {\bar{g}}^{h} = 1^{T} G^{h} v = \overset{N}{\sum_{j = 1}} v^{j} σ_{j}^{h} .$ (17)

6. Relationships between the original and the reduced systems

The aim of this Section is to relate the asymptotic behaviour of the first- and second-order moments of the solution of the original and the reduced systems (Equation8(8) $\begin{aligned} d X (t) & = \frac{1}{ε} F \hat{F} X (t) d t + S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) \\ = S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) . \end{aligned}$ (8) ) and (Equation10(10) $d Y (t) = \bar{S} Y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{B}}^{h} Y (t) d W^{h} (t),$ (10) ) introduced in Section 3. The two systems fit in the framework of [Citation14] in which the quotient between the rate of diffusion squared and the speed of drift is $O (ε^{p})$ , $ε \to 0$ , with $p > 1 / 2$ , but the results obtained in that reference, essentially a stochastic extension of the Tikhonov theory of singular perturbations, are valid only for finite time intervals.

The first- and second-order moments of the solution of general linear stochastic equations with a deterministic initial condition are finite and can be calculated as the solution of certain linear ordinary differential equations. Specifically [Citation1], for system (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) $E X (t)$ is the unique solution to the equation $\frac{d}{d t} E X (t) = A E X (t), E X (0) = X_{0},$ whereas the matrix of second-order moments $H (t) := E (X (t) X (t)^{T}) \in R^{d \times d}$ is the unique non-negative-definite symmetric solution of the equation (18) $\frac{d H (t)}{d t} = A H (t) + H (t) A^{T} + \overset{k}{\sum_{h = 1}} B^{h} H (t) (B^{h})^{T}, H (0) = X_{0} X_{0}^{T} .$ (18) Regarding Equation (Equation18(18) $\frac{d H (t)}{d t} = A H (t) + H (t) A^{T} + \overset{k}{\sum_{h = 1}} B^{h} H (t) (B^{h})^{T}, H (0) = X_{0} X_{0}^{T} .$ (18) ), in order to work with a more tractable expression, we will make use of the Kronecker matrix product and the ‘vec’ operator (see [Citation9] for details). The Kronecker matrix product for two matrices $A = [a_{i j}] \in C^{m \times n}$ and $B = [b_{i j}] \in C^{r \times s}$ is defined as a matrix of size $m r \times n s$ with mn blocks in which the block in position $(i, j)$ has the form $a_{i j} B$ . For any matrix $A \in C^{m \times n}$ , vec $A$ is defined as the column vector that contains, in order, the columns of $A$ . For any matrices $A$ , $Y$ , and $B$ for which the product $A Y B$ makes sense, one has vec $(A Y B) = (B^{T} \otimes A)$ vec $Y$ [Citation9, Theorem 6.8], and so applying the ‘vec’ operator to both sides of Equation (Equation18(18) $\frac{d H (t)}{d t} = A H (t) + H (t) A^{T} + \overset{k}{\sum_{h = 1}} B^{h} H (t) (B^{h})^{T}, H (0) = X_{0} X_{0}^{T} .$ (18) ) we obtain that the second-order moments verify $\frac{d}{d t} vec H (t) = A_{2} vec H (t), vec H (0) = vec (X_{0} X_{0}^{T}),$

where $A_{2} {= I}_{N} \otimes A + A \otimes I_{N} + \overset{k}{\sum_{h = 1}} B^{h} \otimes B^{h} \in R^{d^{2} \times d^{2}}$ and $I_{N}$ denotes the $N \times N$ identity matrix.

Therefore, the asymptotic behaviour of the first- and second-order moments of system (Equation1(1) $d X (t) = A X (t) d t + \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t), X (0) = X_{0},$ (1) ) can be characterized in terms of the dominant eigenvalue and eigenvectors of matrices $A$ and $A_{2}$ . We will now use this fact to study the moments of systems (Equation8(8) $\begin{aligned} d X (t) & = \frac{1}{ε} F \hat{F} X (t) d t + S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) \\ = S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) . \end{aligned}$ (8) ) and (Equation10(10) $d Y (t) = \bar{S} Y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{B}}^{h} Y (t) d W^{h} (t),$ (10) ).

In the case of the original system (Equation8(8) $\begin{aligned} d X (t) & = \frac{1}{ε} F \hat{F} X (t) d t + S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) \\ = S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) . \end{aligned}$ (8) ), the analogous to matrices $A$ and $A_{2}$ in the previous reasonings are matrices $C \in R^{N \times N}$ and $C_{2} \in R^{N^{2} \times N^{2}}$ given by (19) $C (ε) := F + ε S, C_{2} (ε) := F_{2} + ε S_{2},$ (19) where we have defined $F_{2} := I_{N} \otimes {F + F \otimes I}_{N} {, S}_{2} := I_{N} \otimes {S + S \otimes I}_{N} + \overset{k}{\sum_{h = 1}} B^{h} \otimes B^{h} .$ In the case of the aggregated system (Equation10(10) $d Y (t) = \bar{S} Y (t) d t + \overset{k}{\sum_{h = 1}} {\bar{B}}^{h} Y (t) d W^{h} (t),$ (10) ) the corresponding matrices are $\bar{C} (ε) = ε \bar{S} \in R^{q \times q}, {\bar{C}}_{2} (ε) = ε {\bar{S}}_{2} \in R^{q^{2} \times q^{2}}$ where ${\bar{S}}_{2} := I_{q} \otimes \bar{S} + \bar{S} \otimes I_{q} + \overset{k}{\sum_{h = 1}} {\bar{B}}^{h} \otimes {\bar{B}}^{h} .$ We introduce the following two hypotheses:

Hypothesis 6.1

Matrix $\bar{S}$ has a simple and strictly dominant real eigenvalue μ associated to non-negative right and left eigenvectors $r$ and $l$ , respectively.

Hypothesis 6.2

Matrix ${\bar{S}}_{2}$ has a simple and strictly dominant real eigenvalue $μ_{2}$ associated to non-negative right and left eigenvectors $r_{2}$ and $l_{2}$ , respectively.

From Hypothesis 6.1 we have that the first-order moments of the reduced system have the following asymptotic behaviour $lim_{t \to \infty} \frac{E Y (t)}{\exp (μ t)} = \frac{⟨ l, y_{0} ⟩}{⟨ l, r ⟩} r,$ where $y_{0} := {U X}_{0}$ and $⟨ *, * ⟩$ denotes the standard vector scalar product. Analogously, from Hypothesis 6.2 we have, for the second-order moments, $lim_{t \to \infty} \frac{vec E (Y (t) Y (t)^{T})}{\exp (μ_{2} t)} = \frac{⟨ l_{2}, y_{0} \otimes y_{0} ⟩}{⟨ l_{2}, r_{2} ⟩} r_{2} .$ The next result relates the asymptotic behaviour of the first- and second-order moments of the original system (Equation8(8) $\begin{aligned} d X (t) & = \frac{1}{ε} F \hat{F} X (t) d t + S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) \\ = S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) . \end{aligned}$ (8) ) with that of the reduced system when ϵ is small, that is, when the separation between the time scales of migration and of demography is large enough:

Theorem 6.1

(a) Let Hypothesis 6.1 hold. Then for small enough $ε > 0,$ matrix $C (ε) = F + ε S$ has a simple and strictly dominant eigenvalue $λ (ε)$ that can be written in the form $λ (ε) = ε μ + O (ε^{2})$ with associated right and left non-negative eigenvectors $r (ε)$ and $l (ε)$ such that $r (ε) = V r + O (ε); l (ε) = U^{T} l + O (ε) .$ Consequently, for the original system (Equation8(8) $\begin{aligned} d X (t) & = \frac{1}{ε} F \hat{F} X (t) d t + S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) \\ = S \hat{F} X (t) d t + \overset{k}{\sum_{h = 1}} B^{h} \hat{F} X (t) d W^{h} (t) . \end{aligned}$ (8) ) we have: (20) $lim_{τ \to \infty} \frac{E X (τ)}{\exp (ε μ + O (ε^{2}))} = \frac{⟨ l, {U X}_{0} ⟩}{⟨ l, r ⟩} V r + O (ε) .$ (20) (b) Let Hypothesis 6.2 hold. Then for small enough $ε > 0,$ matrix $C_{2} (ε) = F_{2} + ε S_{2}$ has a simple and strictly dominant eigenvalue $λ_{2} (ε)$ that can be written in the form (21) $λ_{2} (ε) = ε μ_{2} + O (ε^{2})$ (21) with associated right and left non-negative eigenvectors $r_{2} (ε)$ and $l_{2} (ε)$ such that (22) $r_{2} (ε) = (V \otimes V) r_{2} + O (ε); l_{2} (ε) = (U \otimes U)^{T} l_{2} + O (ε) .$ (22) Consequently, the asymptotic behaviour of the second-order moment of the original system is characterized by: (23) $lim_{τ \to \infty} \frac{vec E (X (t) X (t)^{T})}{\exp (ε μ_{2} + O (ε^{2}))} = \frac{⟨ l_{2}, (U \otimes U) vec (X_{0} X_{0}^{T}) ⟩}{⟨ l_{2}, r_{2} ⟩} (V \otimes V) r_{2} + O (ε) .$ (23)

Proof.

See Appendix.

From the last theorem it follows in particular that if $μ < 0$ (resp. $μ > 0$ ), then $λ (ε) < 0$ (resp. $λ (ε) > 0$ ) for small enough ϵ, so that if the expected value of the population vector tends to zero (resp. infinity) in the reduced system the same happens for the original one when ϵ is small. Something analogous happens for the second-order moments regarding $μ_{2}$ and $λ_{2} (ε)$ .

Let us apply this result to relate the behaviour of the multiregional models of Section 5. In the first place we will consider the model with q=2, $N_{1} = N_{2} = 2$ . Note from (Equation16(16) $\begin{aligned} \bar{S} & = (\begin{matrix} {\bar{s}}_{11} & {\bar{s}}_{12} \\ {\bar{s}}_{21} & {\bar{s}}_{22} \end{matrix}) \\ = (\begin{matrix} \frac{q_{1} (- l_{21}^{11} - l_{21}^{21} + r_{1}^{1}) + p_{1} (- l_{21}^{12} - l_{21}^{22} + r_{1}^{2})}{p_{1} + q_{1}} & \frac{q_{1} (l_{12}^{11} + l_{12}^{21}) + p_{1} (l_{12}^{12} + l_{12}^{22})}{p_{2} + q_{2}} \\ \frac{q_{1} (l_{21}^{11} + l_{21}^{21}) + p_{1} (l_{21}^{12} + l_{21}^{22})}{p_{1} + q_{1}} & \frac{q_{2} (- l_{12}^{11} - l_{12}^{21} + r_{2}^{1}) + p_{2} (- l_{12}^{12} - l_{12}^{22} + r_{2}^{2})}{p_{2} + q_{2}} \end{matrix}) \\ {\bar{G}}^{h} & = diag ({\bar{σ}}_{1}^{h}, {\bar{σ}}_{2}^{h}) = diag (\frac{q_{1} σ_{11}^{h} + p_{1} σ_{12}^{h}}{p_{1} + q_{1}}, \frac{q_{2} σ_{21}^{h} + p_{2} σ_{22}^{h}}{p_{2} + q_{2}}), h = 1, \dots, k . \end{aligned}$ (16) ) that if there is at least a non-zero coefficient $l_{i j}^{α β}$ , matrix $\bar{S}$ is irreducible and then Hypothesis 6.1 holds. Its dominant eigenvalue is given by $μ = \frac{{\bar{s}}_{11} + {\bar{s}}_{22} + \sqrt{({\bar{s}}_{11} - {\bar{s}}_{22})^{2} + 4 {\bar{s}}_{12} {\bar{s}}_{21}}}{2}$ and so the rate of growth of the first-order moments for the original system is $ε μ + O (ε^{2})$ for small enough ϵ. This expression allows one to study how the different combinations of the migration and growth parameters affect the asymptotic behaviour of the expected value of the population. In particular, $μ < 0$ if and only if $tr \bar{S} < 0$ and $det \bar{S} > 0$ . Since $\bar{S}$ has order two we can calculate right and left eigenvectors $r$ and $l$ associated to μ and then apply Equations (Equation20(20) $lim_{τ \to \infty} \frac{E X (τ)}{\exp (ε μ + O (ε^{2}))} = \frac{⟨ l, {U X}_{0} ⟩}{⟨ l, r ⟩} V r + O (ε) .$ (20) ) and (Equation23(23) $lim_{τ \to \infty} \frac{vec E (X (t) X (t)^{T})}{\exp (ε μ_{2} + O (ε^{2}))} = \frac{⟨ l_{2}, (U \otimes U) vec (X_{0} X_{0}^{T}) ⟩}{⟨ l_{2}, r_{2} ⟩} (V \otimes V) r_{2} + O (ε) .$ (23) ) to obtain the full information about the asymptotic behaviour of the first-order moments of the system.

We could argue analogously for the second-order moments: in this case matrix ${\bar{S}}_{2}$ is of order four and so an analytical computation of $μ_{2}$ , $r_{2}$ and $l_{2}$ is non-feasible, but for instance we can use the Routh–Hurwitz criterion [Citation17] to obtain conditions for $μ_{2}$ to be negative, so that $ε μ_{2} + O (ε^{2})$ will also be negative for small ϵ.

In the case of the system of Section 5.3, the analysis is simpler cause the aggregated system is scalar and so are $\bar{S}$ and ${\bar{S}}_{2}$ , so that Hypotheses 6.1 and 6.2 hold trivially. Moreover, $μ = \bar{s}$ and $μ_{2} = 2 \bar{s} + \sum_{h = 1}^{k} ({\bar{g}}^{h})^{2}$ where $\bar{s}$ and the ${\bar{g}}^{h}$ are given by Equation (Equation17(17) $\bar{s} := 1^{T} R v = \overset{N}{\sum_{j = 1}} v^{j} r^{j}, {\bar{g}}^{h} = 1^{T} G^{h} v = \overset{N}{\sum_{j = 1}} v^{j} σ_{j}^{h} .$ (17) ) and we can take $r = l = r_{2} = l_{2} = 1$ . Therefore from Equations (Equation20(20) $lim_{τ \to \infty} \frac{E X (τ)}{\exp (ε μ + O (ε^{2}))} = \frac{⟨ l, {U X}_{0} ⟩}{⟨ l, r ⟩} V r + O (ε) .$ (20) ) and (Equation23(23) $lim_{τ \to \infty} \frac{vec E (X (t) X (t)^{T})}{\exp (ε μ_{2} + O (ε^{2}))} = \frac{⟨ l_{2}, (U \otimes U) vec (X_{0} X_{0}^{T}) ⟩}{⟨ l_{2}, r_{2} ⟩} (V \otimes V) r_{2} + O (ε) .$ (23) ) we have $\begin{aligned} lim_{τ \to \infty} \frac{E X (τ)}{\exp (ε μ + O (ε^{2}))} & = y (0) v + O (ε), \\ lim_{τ \to \infty} \frac{vec E (X (t) X (t)^{T})}{\exp (ε μ_{2} + O (ε^{2}))} & = y (0)^{2} (v \otimes v) + O (ε) . \end{aligned}$ Note in particular that, when ϵ is small enough, the expected value of the population vector tends to zero if $\bar{s} < 0$ , and the second-order moments also tend to zero if $\bar{s} < - \sum_{h = 1}^{k} ({\bar{g}}^{h})^{2} / 2$ .

7. Discussion

In this work we have presented a technique to carry out the reduction of linear two-scale population models governed by SDEs, therefore allowing one to simplify the treatment of these models. The reduction of the original model with N variables is carried out by letting the fast process reach equilibrium and defining an appropriate set of q global variables, q<N, which are linear combinations of the state variables and are conservative for the fast process. Moreover, we have obtained conditions that guarantee the positivity of solutions to the model.

We have also presented a result that allows one to know the the asymptotic behaviour of the first- and second-order statistical moments of the population vector of the original system through the computation of the dominant eigenvalues and eigenvectors of certain matrices of dimensions $q \times q$ and $q^{2} \times q^{2}$ associated to the reduced system.

The aggregation technique has been applied to study stochastic multiregional models in which migration between some patches can be fast with respect to demography whereas dispersal between other patches happens at the scale of demography. In the simpler case in which all migrations are fast, the reduced system is a scalar SDE whose analysis is straightforward, therefore allowing one to easily characterize the asymptotic behaviour of those moments for the original multiregional model.

This work suggests possible lines of future development: on the first hand, and still within the linear setting, work needs to be done to try to relate the stochastic stability [Citation15] of the origin in the original and the reduced system, for this will provide more information regarding the persistence-extinction of the population. Secondly, in order to be able to study more realistic population models, the technique should be extended to nonlinear settings.

Disclosure statement

No potential conflict of interest was reported by the authors.

ORCID

Luis Sanz http://orcid.org/0000-0002-1054-4568

Additional information

Funding

Authors are supported by MINECO (Ministerio de Economía, Industria y Competitividad), Spain, project MTM2014-56022-C2-1-P.

References

L. Arnold, Stochastic Differential Equations: Theory and Applications, Wiley Interscience, New York, 1974.
Google Scholar
P. Auger, R.B. de La Parra, J.-C. Poggiale, E. Sánchez, and L. Sanz, Aggregation methods in dynamical systems and applications in population and community dynamics, Phys. Life Rev. 5(2) (2008), pp. 79–105. doi: 10.1016/j.plrev.2008.02.001
Google Scholar
P. Auger, J. Poggiale, and E. Sánchez, A review on spatial aggregation methods involving several time scales, Ecol. Complex. 10 (2012), pp. 12–25. doi: 10.1016/j.ecocom.2011.09.001
Google Scholar
H. Baumgärtel, Analytic Perturbation Theory for Matrices and Operators, Vol. 15, Springer, New York, 1985.
Google Scholar
N. Berglund and B. Gentz, Geometric singular perturbation theory for stochastic differential equations, J. Differential Equations 191(1) (2003), pp. 1–54. doi: 10.1016/S0022-0396(03)00020-2
Google Scholar
C.A. Braumann, Variable effort fishing models in random environments, Math. Biosci. 156(1) (1999), pp. 1–19. doi: 10.1016/S0025-5564(98)10058-5
Google Scholar
C.A. Braumann, Harvesting in a random environment: Itô or stratonovich calculus? J. Theoret. Biol. 244(3) (2007), pp. 424–432. doi: 10.1016/j.jtbi.2006.08.029
Google Scholar
C. Carlos and C.A. Braumann, General population growth models with allee effects in a random environment, Ecol. Complex. 30 (2016), pp. 26–33. doi: 10.1016/j.ecocom.2016.09.003
Google Scholar
M. Fiedler, Special Matrices and their Applications in Numerical Mathematics, Kluwer Boston, Inc., 1986.
Google Scholar
N. Herath and D. Del Vecchio, Model order reduction for linear noise approximation using time-scale separation, in 2016 IEEE 55th Conference on Decision and Control (CDC), IEEE, Chicago, IL, USA, 2016, pp. 5875–5880.
Google Scholar
N. Herath and D. Del Vecchio, Model reduction for a class of singularly perturbed stochastic differential equations: Fast variable approximation, in American Control Conference (ACC), 2016, IEEE, Boston, MA, USA, 2016, pp. 3674–3679.
Google Scholar
R.A. Horn and C.R. Johnson, Matrix Analysis, Cambridge University Press, Cambridge, 2012.
Google Scholar
D. Jiang, J. Yu, C. Ji, and N. Shi, Asymptotic behavior of global positive solution to a stochastic sir model, Math. Comput. Modelling 54(1) (2011), pp. 221–232. doi: 10.1016/j.mcm.2011.02.004
Google Scholar
Y. Kabanov and S. Pergamenshchikov, Two-Scale Stochastic Systems: Asymptotic Analysis and Control, Vol. 49, Springer, Berlin, 2003.
Google Scholar
R. Khasminskii, Stochastic Stability of Differential Equations, Vol. 66, Springer, Berlin, 2012.
Google Scholar
H.J. Kushner, Weak Convergence Methods and Singularly Perturbed Stochastic Control and Filtering Problems, Vol. 3, Birkhäuser, Boston, 1990.
Google Scholar
J.A. Linda, An Introduction to Mathematical Biology, Pearson, Upper Saddle River, NJ, 2007.
Google Scholar
X. Mao, Stochastic Differential Equations and their Applications, Woodhead Publishing; 2nd ed. ( January 13, 2008), Oxford, 1997.
Google Scholar
L. Sanz and J. Alonso, Approximate aggregation methods in discrete time stochastic population models, Math. Model. Nat. Pheno. 5(6) (2010), pp. 38–69. doi: 10.1051/mmnp/20105603
Google Scholar
L. Sanz and R. Bravo de la Parra, Variables aggregation in a time discrete linear model, Math. Biosci. 157(1) (1999), pp. 111–146. doi: 10.1016/S0025-5564(98)10079-2
Google Scholar
L. Sanz, A. Blasco, and R. Bravo de la Parra, Approximate reduction of multi-type Galton–Watson processes with two time scales, Math. Models Methods Appl. Sci. 13(04) (2003), pp. 491–525. doi: 10.1142/S0218202503002659
Google Scholar
E. Seneta, Non-Negative Matrices and Markov Chains, Springer Science & Business Media, New York, 2006.
Google Scholar
M. Turelli, Random environments and stochastic calculus, Theoret. Popul. Biol. 12(2) (1977), pp. 140–178. doi: 10.1016/0040-5809(77)90040-5
Google Scholar
C. Xu and S. Yuan, Competition in the chemostat: A stochastic multi-species model and its asymptotic behavior, Math. Biosci. 280 (2016), pp. 1–9. doi: 10.1016/j.mbs.2016.07.008
Google Scholar

Appendix

Proof

Proof of Theorem 2.1.

Let us fix $X_{0} > 0$ and let $k_{0}$ be sufficiently large so that for every $i = 1, \dots, d$ , the ith component of $X_{0}$ verifies $X_{0}^{i} > 1 / k_{0}$ . For each integer $k \geq k_{0}$ we define the stopping time $τ_{k} := inf {t \geq 0 : X^{i} (t) \leq 1 / k for some i = 1, \dots, d}$ . Clearly $τ_{k}$ is increasing as $k \to \infty$ . Let us set $τ_{\infty} := lim_{k \to \infty} τ_{k}$ . We want to show that $τ_{\infty} = \infty$ with probability one, and to do so we will proceed by contradiction. Let us assume that the previous statement is false. Then there exists T>0 and $ε \in (0, 1)$ such that $P (τ_{\infty} \leq T) > ε$ , and therefore there exists an integer $k_{1} \geq k_{0}$ such that (A1) $P (τ_{k} \leq T) \geq ε forall k \geq k_{1} .$ (A1) Let ${\overset{˚}{R}}_{+}^{d} = (0, \infty) \times \overset{(d)}{\dots} \times (0, \infty)$ , let $v : {\overset{˚}{R}}_{+}^{d} \to R$ be any $C^{2}$ function and let $V (t) := v (X (t))$ . From Ito's theorem [Citation1] we have (A2) $d V (t) = L v (X (t)) d t + \frac{\partial v}{\partial x} (X (t)) \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t),$ (A2) where $\begin{aligned} L v (x) & = \overset{d}{\sum_{i, j = 1}} \frac{\partial v}{\partial x_{i}} a_{i j} x_{j} + \frac{1}{2} \overset{m}{\sum_{h = 1}} \overset{d}{\sum_{i, j = 1}} (B^{h} {x x}^{T} (B^{h})^{T})_{i j} x_{i} x_{j} \frac{\partial^{2} v}{\partial x_{i} \partial x_{j}} \\ = \overset{d}{\sum_{i, j = 1}} \frac{\partial v}{\partial x_{i}} a_{i j} x_{j} + \frac{1}{2} \overset{m}{\sum_{h = 1}} \overset{d}{\sum_{i = 1}} b_{i}^{h} b_{j}^{h} x_{i} x_{j} \frac{\partial^{2} v}{\partial x_{i} \partial x_{j}} \end{aligned}$ and we have used that the $B^{h}$ are diagonal, $B^{h} = diag (b_{1}^{h}, \dots, b_{d}^{h})$ . Let us choose $v (x) := - \sum_{i = 1}^{d} \log x_{i} / (1 + x_{i})$ , which is positive and $C^{2} ({\overset{˚}{R}}_{+}^{d})$ . Straightforward calculations show that (A3) $L v (x) = - \overset{d}{\sum_{i = 1}} \frac{a_{i i}}{1 + x_{i}} - \overset{d}{\sum_{i = 1}} \sum_{j \neq i} \frac{a_{i j} x_{j}}{1 + x_{i}} + \frac{1}{2} \overset{m}{\sum_{h = 1}} \overset{d}{\sum_{i = 1}} (b_{i}^{h})^{2} \frac{1 + 2 x_{i}}{(1 + x_{i})^{2}} .$ (A3) Note that, if $x_{i} > 0$ for all $i = 1, \dots, d$ , then the first and the third term in Equation (EquationA3(A3) $L v (x) = - \overset{d}{\sum_{i = 1}} \frac{a_{i i}}{1 + x_{i}} - \overset{d}{\sum_{i = 1}} \sum_{j \neq i} \frac{a_{i j} x_{j}}{1 + x_{i}} + \frac{1}{2} \overset{m}{\sum_{h = 1}} \overset{d}{\sum_{i = 1}} (b_{i}^{h})^{2} \frac{1 + 2 x_{i}}{(1 + x_{i})^{2}} .$ (A3) ) are bounded from above in ${\overset{˚}{R}}_{+}^{d}$ , and moreover, since $A$ is a Metzler matrix, the second term is negative in ${\overset{˚}{R}}_{+}^{d}$ . Therefore there exists $K \geq 0$ such that $L v (x) \leq K$ in ${\overset{˚}{R}}_{+}^{d}$ . Clearly $X (t \land τ_{k})$ , where we define $a \land b := min {a, b}$ , belongs to ${\overset{˚}{R}}_{+}^{d}$ for all $t \geq 0,$ and therefore, using the previous bound we have from Equation (EquationA2(A2) $d V (t) = L v (X (t)) d t + \frac{\partial v}{\partial x} (X (t)) \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t),$ (A2) ) $V (X (T \land τ_{k})) \leq V (X_{0}) + K (T \land τ_{k}) + \int_{0}^{T \land τ_{k}} \frac{\partial v}{\partial x} (X (t)) \overset{m}{\sum_{h = 1}} B^{h} X (t) d W^{h} (t)$ and therefore, using that the expected value of the stochastic integral is zero (A4) $E V (X (T \land τ_{k})) \leq V (X_{0}) + K E (T \land τ_{k}) \leq V (X (0)) + K T .$ (A4) On the other hand, let us define $Ω_{k} = {τ_{k} \leq T}$ for $k \geq k_{1}$ , so that Equation (A1) means that $P (Ω_{k}) \geq ε$ . Now for all $ω \in Ω_{k}$ there exists some $i = 1, \dots, d$ such that $X^{i} (t, ω) = 1 / k$ and, since v is a sum of positive summands, $V (X (τ_{k}, ω)) \geq - \log (1 / k) / (1 + 1 / k)$ . Since V is non-negative, if we denote by $1_{Ω_{k}}$ the indicator function of $Ω_{k}$ we can write $E V (X (T \land τ_{k})) \geq E [1_{Ω_{k}} V (X (τ_{k}))] \geq - ε \log (1 / k) / (1 + 1 / k)$ and taking the limit $k \to \infty$ we obtain $lim_{k \to \infty} E V (X (T \land τ_{k})) = \infty$ which clearly contradicts (EquationA4(A4) $E V (X (T \land τ_{k})) \leq V (X_{0}) + K E (T \land τ_{k}) \leq V (X (0)) + K T .$ (A4) ) and therefore it must be $τ_{\infty} = \infty$ a.s. as we wanted to prove.

We state as a Theorem a collection of results from Baumgärtel [Citation4] about perturbation of semisimple eigenvalues of matrices that we will use in the proof of Theorem 6.1:

Theorem A.1

Essentially Theorems 2 and 3 in [Citation4], pp. 267–269

Let E be a complex linear space of finite dimension N and let $A (ε)$ be a linear operator defined on E that admits the following holomorphic expansion $A (ε) = A_{0} + ε A_{1} + ε^{2} A_{2} + \dots, | ε | < R .$ Let $λ_{0}$ be a semi-simple eigenvalue of $A_{0}$ and let $P$ be its associated eigenprojection.

Let $λ (ε)$ be a perturbed eigenvalue such that $λ (ε) \to λ_{0}$ when $ε \to 0$ . Then $λ (ε)$ has the form $λ (ε) = λ_{0} + μ_{j} ε + o (ε), ε \to 0,$ for some $j = 1, \dots, q,$ where $μ_{j}, j = 1, \dots, q$ are the eigenvalues of the restriction $T ∣_{Im P}$ of operator $T := {P A}_{1} P$ to $Im P$ . Moreover, associated to $λ (ε)$ we can choose a $($ right $)$ eigenvector $x (ε)$ that is continuous at $ε = 0$ and such that $x_{0} := x (0) \neq 0$ verifies ${T x}_{0} = μ_{j} x_{0}$ .
In the case that μ is a simple eigenvalue of $T,$ $λ (ε)$ and $x (ε)$ are holomorphic functions of ϵ.

Proof

Proof of Theorem 6.1

For small ϵ we will consider matrix $C (ε) = F + ε S$ as a perturbation of matrix $F$ . From Equation (Equation4(4) $F = diag (F_{1}, F_{2}, \dots, F_{q}) .$ (4) ) and the properties of each matrix $F_{i}$ , we have that 0 is a semisimple eigenvalue of $F$ and that it is strictly dominant. Therefore (A5) $F = (V ∣ V^{'}) diag (0_{q}, Σ^{'}) (\frac{U}{U^{'}})$ (A5) is a Jordan canonical decomposition of $F$ [Citation12] where $V^{'} \in R^{N \times (N - q)}$ , $U^{'} \in R^{N \times (N - q)}$ and $Σ^{'} \in R^{(N - q) \times (N - q)}$ are certain matrices and $Σ^{'}$ is associated to eigenvalues with strictly negative real part. From Equation (EquationA5(A5) $F = (V ∣ V^{'}) diag (0_{q}, Σ^{'}) (\frac{U}{U^{'}})$ (A5) ) and Lemma 4.1, we have that $\hat{F} = V U$ is the eigenprojection matrix of $F$ associated to eigenvalue 0.

Using the continuity of eigenvalues on matrix entries, the dominant eigenvalue of $C (ε)$ for small enough ϵ necessarily must belong to the set of eigenvalues of $C (ε)$ that tend to 0 when $ε \to 0$ . Using part 1 of Theorem A.1 applied to $C (ε) = F + ε S$ with $A_{0} = F$ , $A_{1} = S,$ $λ_{0} = 0$ and $P = \hat{F}$ , we conclude that this set of eigenvalues has the form (A6) $0 + ε μ_{j} + o (ε), j = 1, \dots, q,$ (A6) where the $μ_{j}$ are the eigenvalues of the restriction $T ∣_{Im \hat{F}}$ of $T := \hat{F} S \hat{F}$ to $Im \hat{F}$ . Clearly $Im \hat{F}$ is invariant for $T$ and, using Equation (Equation11(11) $\begin{aligned} \bar{S} & = [{\bar{s}}_{i j}] := U S V = [\begin{matrix} u_{1}^{T} S_{11} v_{1} & \dots & u_{1}^{T} S_{1 q} v_{q} \\ ⋮ & ⋱ & ⋮ \\ u_{q}^{T} S_{q 1} v_{1} & \dots & u_{q}^{T} S_{q q} v_{q} \end{matrix}] \in R^{q \times q} \end{aligned}$ (11) ) and Lemma 4.1, we have $T = \hat{F} S \hat{F} = V U S V U = V \bar{S} U$ , so that $\bar{S}$ is the matrix of the restriction $T ∣_{Im \hat{F}}$ expressed in the basis of $Im \hat{F}$ defined by the columns of $V$ . Therefore, $T ∣_{Im \hat{F}}$ and $\bar{S}$ have the same eigenvalues and, moreover, if $x$ and $y$ are, respectively, right and left eigenvectors of $\bar{S}$ associated to a certain eigenvalue λ, then $V x \neq 0$ and $U^{T} y \neq 0$ are right and left eigenvectors of $T ∣_{Im \hat{F}}$ associated to eigenvalue λ.

Therefore, using Hypothesis 6.1 it follows that μ is a simple eigenvalue for $T ∣_{Im \hat{F}}$ and is associated to right and left eigenvectors $V r \neq 0$ and $U^{T} l \neq 0$ , respectively. Moreover, μ is strictly dominant for $T ∣_{Im \hat{F}}$ so that, using Equation (EquationA6(A6) $0 + ε μ_{j} + o (ε), j = 1, \dots, q,$ (A6) ), $μ (ε) := ε μ + o (ε)$ is a simple eigenvalue for $C (ε)$ and strictly dominant when ϵ is small enough.

Now, using part 2 of Theorem A.1 and the fact that μ is simple for $T ∣_{Im \hat{F}}$ , it follows that $μ (ε)$ has the form $μ (ε) := ε μ + O (ε^{2})$ and that is associated to right and left eigenvectors which can be written in the form $V r + O (ε)$ and $U^{T} l + O (ε)$ respectively. Then, for small enough ϵ $lim_{τ \to \infty} \frac{E X (τ)}{\exp (ε μ + O (ε^{2}))} = \frac{(l^{T} U + O (ε)) X_{0}}{(l^{T} U + O (ε)) ({V r}^{T} + O (ε))} (V r + O (ε)) = \frac{⟨ l, {U X}_{0} ⟩}{⟨ l, r ⟩} V r + O (ε)$ and so Equation (Equation20(20) $lim_{τ \to \infty} \frac{E X (τ)}{\exp (ε μ + O (ε^{2}))} = \frac{⟨ l, {U X}_{0} ⟩}{⟨ l, r ⟩} V r + O (ε) .$ (20) ) is proved.

(b) The proof is analogous to that of part (a). A notable property of the Kronecker product that we will use in the sequel is [Citation9, Theorem 6.1] $(A_{1} \otimes A_{2} \otimes \dots \otimes A_{m}) (B_{1} \otimes B_{2} \otimes \dots \otimes B_{m}) = (A_{1} B_{1}) \otimes (A_{2} B_{2}) \otimes \dots \otimes (A_{m} B_{m}) .$ We will use Equation (Equation19(19) $C (ε) := F + ε S, C_{2} (ε) := F_{2} + ε S_{2},$ (19) ) and for small ϵ will consider matrix $C (ε)$ as a perturbation of matrix $F_{2} = I_{N} \otimes F + F \otimes I_{N}$ . First, let us study the spectral properties of $F_{2}$ . Let ${\hat{v}}_{i}$ , $i = 1, \dots, N$ , denote the ith. column of matrix $(V ∣ V^{'})$ , that is, ${\hat{v}}_{i}$ is an eigenvector of $F$ associated to eigenvalue 0 if $1 \leq i \leq q$ or to an eigenvalue with strictly negative real part if $q + 1 \leq i \leq N$ .

From [Citation9, Theorem 6.5] we know that the eigenvalues of $I_{N} \otimes {F + F \otimes I}_{N}$ are the set ${λ_{i} + λ_{j} : λ_{i} \in σ (F), i = 1, \dots, N}$ and that, associated to each eigenvalue $λ_{i} + λ_{j}$ we have right and left eigenvectors ${\hat{v}}_{i} \otimes {\hat{v}}_{j}$ and ${\hat{u}}_{i} \otimes {\hat{u}}_{j}$ where ${\hat{v}}_{i}$ and ${\hat{u}}_{i}$ are, respectively, right and left eigenvectors of $F$ associated to eigenvalue $λ_{i}$ , $i, j = 1, \dots, N$ . Taking into account the spectrum of $F$ we conclude that $σ (F_{2})$ consists of eigenvalue 0 with multiplicity $q^{2}$ associated to right (resp. left) left eigenvectors ${\hat{v}}_{i} \otimes {\hat{v}}_{j}$ , (resp. ${\hat{u}}_{i} \otimes {\hat{u}}_{j}$ ) $i, j = 1, \dots, q$ , and $N^{2} - q^{2}$ eigenvalues with strictly negative real parts associated to right (resp. left) eigenvectors ${\hat{v}}_{i} \otimes {\hat{v}}_{j}$ (resp. ${\hat{u}}_{i} \otimes {\hat{u}}_{j}$ ), where either i or j belong to the set ${q + 1, \dots, N}$ . Clearly, eigenvalue $λ = 0$ is semisimple and is strictly dominant for $F_{2}$ .

It is easy to check that if $A$ and $B$ are any matrices with the block structure $A = (a_{1} ∣ \dots ∣ a_{k})$ , $B = (B_{1} ∣ \dots ∣ B_{s})$ where the $a_{i}$ are column vectors, then $A \otimes B = (a_{1} \otimes B_{1} ∣ \dots ∣ a_{1} \otimes B_{s} ∣ \dots ∣ a_{k} \otimes B_{1} ∣ \dots ∣ a_{k} \otimes B_{s}) .$ Applying this result we can write $\begin{aligned} ({\hat{v}}_{1} \otimes {\hat{v}}_{1} ∣ \dots ∣ {\hat{v}}_{1} \otimes {\hat{v}}_{q} ∣ \dots ∣ {\hat{v}}_{q} \otimes {\hat{v}}_{1} ∣ \dots ∣ {\hat{v}}_{q} \otimes {\hat{v}}_{q}) & = V \otimes V \\ ({\hat{u}}_{1} \otimes {\hat{u}}_{1} ∣ \dots ∣ {\hat{u}}_{1} \otimes {\hat{u}}_{q} ∣ \dots ∣ {\hat{u}}_{q} \otimes {\hat{u}}_{1} ∣ \dots ∣ {\hat{u}}_{q} \otimes {\hat{u}}_{q}) & = U^{T} \otimes U^{T} \end{aligned}$ so that we can write a Jordan canonical decomposition of $F_{2}$ in the form $F_{2} = (V \otimes V ∣ V_{2}^{'}) diag (0_{q^{2}}, Σ_{2}^{'}) (\frac{U \otimes U}{U_{2}^{'}}),$ where $Σ_{2}^{'}$ , $V_{2}^{'}$ and $U_{2}^{'}$ are appropriate matrices. From this expression we see that ${\hat{F}}_{2} := (V \otimes V) (U \otimes U) = (V U) \otimes (V U) = \hat{F} \otimes \hat{F}$ is the eigenprojection of $F_{2}$ corresponding to eigenvalue 0.

Like in part (a), we know that, for small enough ϵ, the dominant eigenvalue of $C_{2} (ε)$ necessarily must belong to the set of eigenvalues of $C_{2} (ε)$ that tend to 0 when $ε \to 0$ . Let us consider the restriction $T_{2} ∣_{Im {\hat{F}}_{2}}$ of matrix $T_{2} := {\hat{F}}_{2} S_{2} {\hat{F}}_{2}$ to $Im {\hat{F}}_{2}$ . Clearly $Im {\hat{F}}_{2}$ is invariant for $T_{2}$ and, using Equation (Equation11(11) $\begin{aligned} \bar{S} & = [{\bar{s}}_{i j}] := U S V = [\begin{matrix} u_{1}^{T} S_{11} v_{1} & \dots & u_{1}^{T} S_{1 q} v_{q} \\ ⋮ & ⋱ & ⋮ \\ u_{q}^{T} S_{q 1} v_{1} & \dots & u_{q}^{T} S_{q q} v_{q} \end{matrix}] \in R^{q \times q} \end{aligned}$ (11) ), Lemma 4.1 and the properties of the Kronecker product we have $T_{2} = {\hat{F}}_{2} S_{2} {\hat{F}}_{2} = (V \otimes V) {\bar{S}}_{2} (U \otimes U)$ so that ${\bar{S}}_{2}$ is the matrix of $T_{2} ∣_{Im {\hat{F}}_{2}}$ expressed in the basis of $Im {\hat{F}}_{2}$ defined by the columns of $V \otimes V$ . Therefore, $T_{2} ∣_{Im {\hat{F}}_{2}}$ and ${\bar{S}}_{2}$ have the same eigenvalues and, moreover, if $x$ and $y$ are, respectively, right and left eigenvectors of ${\bar{S}}_{2}$ associated to a certain eigenvalue λ, then $(V \otimes V) x \neq 0$ and $(U \otimes U)^{T} y \neq 0$ are right and left eigenvectors of $T_{2} ∣_{Im {\hat{F}}_{2}}$ associated to eigenvalue λ.

Now let us assume Hypothesis 6.2. Then we can reason exactly as we did in part (a), substituting $C (ε)$ , $F$ , $S$ , $\bar{S}$ , $\hat{F}$ , $V$ and $U$ for $C_{2} (ε)$ , $F_{2}$ , $S_{2}$ , ${\bar{S}}_{2}$ , ${\hat{F}}_{2}$ , $V \otimes V$ and $U \otimes U,$ respectively, and obtain that for small enough ϵ, $C_{2} (ε)$ has a simple and strictly dominant eigenvalue that can be written in the form (Equation21(21) $λ_{2} (ε) = ε μ_{2} + O (ε^{2})$ (21) ) and associated to eigenvectors with the form (Equation22(22) $r_{2} (ε) = (V \otimes V) r_{2} + O (ε); l_{2} (ε) = (U \otimes U)^{T} l_{2} + O (ε) .$ (22) ), from where the asymptotic behaviour (Equation23(23) $lim_{τ \to \infty} \frac{vec E (X (t) X (t)^{T})}{\exp (ε μ_{2} + O (ε^{2}))} = \frac{⟨ l_{2}, (U \otimes U) vec (X_{0} X_{0}^{T}) ⟩}{⟨ l_{2}, r_{2} ⟩} (V \otimes V) r_{2} + O (ε) .$ (23) ) follows.

Approximate reduction of linear population models governed by stochastic differential equations: application to multiregional models

ABSTRACT

1. Introduction

2. Linear population models with two time scales

3. A model with two time scales

4. Approximate reduction of the model

5. Multiregional models with two time scales

5.1. Model setting

5.2. Model reduction

5.3. Case in which migration is fast

6. Relationships between the original and the reduced systems

7. Discussion

Disclosure statement

References

Appendix

Proof of Theorem 2.1.

Essentially Theorems 2 and 3 in [Citation4], pp. 267–269

Proof of Theorem 6.1

Information for

Open access

Opportunities

Help and information

Approximate reduction of linear population models governed by stochastic differential equations: application to multiregional models

ABSTRACT

1. Introduction

2. Linear population models with two time scales

3. A model with two time scales

4. Approximate reduction of the model

5. Multiregional models with two time scales

5.1. Model setting

5.2. Model reduction

5.3. Case in which migration is fast

6. Relationships between the original and the reduced systems

7. Discussion

Disclosure statement

ORCID

Additional information

Funding

References

Appendix

Proof of Theorem 2.1.

Essentially Theorems 2 and 3 in [Citation4], pp. 267–269

Proof of Theorem 6.1

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date