257
Views
0
CrossRef citations to date
0
Altmetric
Articles

Estimation of parameter functions in ordinary differential equations with a stage structure: a linear case

, , &
Pages 235-255 | Received 01 Oct 2013, Accepted 10 Jan 2014, Published online: 06 Mar 2014

Abstract

In many mathematical models with complex parameter functions, it is critical to suggest a strategy for estimating those functions. This paper considers the problem of fitting parameter functions to given data by using a finite number of piecewise functions. Here, the primary aim is to propose several necessary and sufficient conditions that can guarantee the existence of parameter functions. With these criteria, the paper provides an algorithm for solving the parameter estimation problem efficiently. Numerical examples verify that the proposed algorithm is effective in estimating parameter functions in differential equations.

AMS Subject Classifications:

1 Introduction

Mathematical modelling using a system of differential equations with a set of parameters has been widely used in most scientific fields, including physics, chemistry, biology, earth and environmental sciences, and engineering. Mathematical problems related with such parameter-dependent differential equations can be classified into forward and inverse problems. Forward problems need the analysis and simulation of the model for given parameter values, whereas inverse problems require the estimation of parameters based on the measurement of output variables. The latter have received considerable attention in the last three decades (see e.g. [Citation1, Citation2] and the references therein). As statistical approaches to parameter estimation problems, nonlinear regression and other data-fitting methods have been widely used.[Citation3Citation7]

Among various important problems related to parameter estimation, this paper is particularly interested in the problem of fitting parameter functions to given data by using a finite number of piecewise functions. The primary aim of the paper is to propose some conditions for the existence of parameter functions and introduce an efficient algorithm for estimating these functions. The number of subintervals and the type of basis function in each subinterval are not known a priori, and therefore the main concern is how the number of subintervals and the basis function can be determined and how time nodes can be optimally found for each subinterval.

This paper begins with linear stage-structured differential equations whose parameter functions can be approximated by piecewise constants, and other types of basis functions for approximating parameter functions and nonlinear differential equations with noisy data are left to future research. For a given data-set, the focus is on finding a piecewise constant parameter function in a differential equation to interpolate the data. A necessary and sufficient condition for the existence and uniqueness of the piecewise constant parameter function when subintervals are known is addressed first. In addition, several criteria for determining time nodes and then parameter functions when subintervals are yet to be determined are proposed. These criteria are provided by a C-test function in Section 2. An algorithm is suggested to determine time nodes and coefficients for each subinterval optimally with the given data-set and the number of subintervals. Several numerical experiments demonstrate that the algorithm is effective in estimating parameter functions. Further, the solution of the linear stage-structured differential equation with the estimated parameter function provides a good approximation for the solution of the nonlinear differential equation.

The rest of this paper proceeds as follows: Section 2 starts with a formulation of the problem of estimating parameter functions with a linear stage-structured differential equation. A C-test function is defined to determine whether there exist suitable solutions of the differential equation to interpolate a set of given data. Section 3 presents the necessary and sufficient conditions for the given data-set to solve the problem of estimating parameter functions when time nodes of subintervals are known. Section 4 derives the conditions by using the C-test function for determining parameter functions when time nodes of subintervals are yet to be determined. Section 5 introduces an algorithm for determining parameter functions based on an optimization problem, and Section 6 presents the numerical results for estimated parameter functions and their corresponding solutions based on the proposed algorithm.

2 Parameter estimation problem

2.1 Motivation

Numerous biological phenomena can be described by stage-structured modelling. For instance, growth rates of most fish species are sensitive to seasonal effects, mainly to temperature changes, and it is well known that there is a strong relationship between specific growth rates and temperature for most populations of bacteria (see e.g. [Citation8Citation10] and the references therein). In addition, many delay differential equations with a stage structure can be found in the literature.[Citation11Citation15]

For the given data-set {(pj,qj)}j=1J,p1<p2<<pJ, the following stage-structured differential equation is considered:dudt=f(t,u;θ(t)),t(p1,pJ);u(pj)=qjforj=1,,J.with the parameter function1 θ(t)=i=1Iθiχ[ti-1,ti)(t),1 where the number of intervals I, the node ti and the parameter θi, (i=1,,I) are to be determined by assuming that t0=p1. Here, I,ti, and θi depend on the differential equation and the given data-set {(pj,qj)}j=1J.

2.2 Linear case

This paper focuses on linear stage-structured differential equations by emphasizing the relationship between the parameter function and the data-set. Nonlinear differential equations are left to future research.

Now consider the following estimation problem: For a set of given data {(pj,qj)}j=1J, find θ(t) in the form of (Equation1) such that2a dudt=θ(t)u,t(p1,pJ),2a 2b u(pj)=qjforj=1,,J.2b

Note that the solution u(t) of (2) can be given by3 u(t)=i=1Iu(ti-1)eθi(t-ti-1)χ[ti-1,ti)(t),3 when the parameter function θ(t) in (Equation1) is known.

Because most physical data have positive values, solutions of Equation (Equation2a) are assumed to be positive and data-sets are restricted to cases such as those described in Definition 2.1.

Definition 2.1

The data-set D={(pj,qj)}j=1J is called positively well-posed if D has the following two properties:

(i)

{pj}j=1J is a strictly increasing sequence,

(ii)

qj>0forj=1,,J.

If there is only a single subinterval, then the necessary and sufficient condition for the constant function θ to be uniquely determined in the interval is summarized in the following lemma:

Lemma 2.2

Suppose that there is a given data set {(pj,qj)}j=1J that is positively well-posed in a single interval [t0,t1]. Then the constant function θ and the solution u(t) satisfying (2) can be uniquely determined if and only ifdim(span{(1,pj,logqj)j=1,,J})=2.

Proof

Suppose that u(t) satisfiesu(t)=u(t0)eθ(t-t0),u(pj)=qjforj=1,,J.Then the given data-set {(pj,qj)}j=1J must satisfylogqj=logu(t0)+θ(pj-t0).That is, {(pj,logqj)}j=1J are collinear. The existence and uniqueness of u(t0) and θ are equivalent todim(span{(1,pj,logqj)j=1,,J})=2.

Before describing the relationship between data points, the C-test function is first defined.

Definition 2.3

Define the C-test function C:(R×R+)3R byC(x1,y1,x2,y2,x3,y3):=(x1-x2)logy3+(x2-x3)logy1+(x3-x1)logy2.

Proposition 2.4

Suppose that x1<x2<x3. If the C-test function C is positive (or negative), then there exists a log-concave (or log-convex) function that passes through (x1,y1), (x2,y2) and (x3,y3). In addition, the following properties hold:4 C(x1,y1,x2,y2,x3,y3)=-C(x1,y1,x3,y3,x2,y2)=C(x2,y2,x3,y3,x1,y1)=-C(x2,y2,x1,y1,x3,y3)=C(x3,y3,x1,y1,x2,y2)=-C(x3,y3,x2,y2,x1,y1).4

Proof

Note that the C-test function C can be rewritten asC(x1,y1,x2,y2,x3,y3)=(x1-x2)(x2-x3)[logy1-logy2x1-x2-logy2-logy3x2-x3].Suppose that C(x1,y1,x2,y2,x3,y3)>0. Then (logy1-logy2)/(x1-x2)>(logy2-logy3)/(x2-x3) which implies the existence of a concave quadratic function that passes through (x1,logy1),(x2,logy2) and (x3,logy3). The case of C(x1,y1,x2,y2,x3,y3)<0 is similar. Equation (Equation4) follows immediately from the definition of the C-test function C.

Remark 2.5

C(x1,y1,x2,y2,x3,y3)=0 implies that (x1,y1),(x2,y2) and (x3,y3) lie on one exponential curve: y=aeθx, wherea=yjyjykxjxk-xj,θ=logyk-logyjxk-xj,j,k=1,2,3,jk.Equivalently, the exponential curve can be rewritten asy=yjykyjx-xjxk-xj,j,k=1,2,3,jk.

Definition 2.6

The data-set D={(pj,qj)}j=1J is called reducible for the parameter estimation problem if and only if there exist three adjacent data points {(pj,qj),(pj+1,qj+1),(pj+2,qj+2)},(j=1,,J-2) such that C(pj,qj,pj+1,qj+1,pj+2,qj+2)=0. Otherwise, the set of data D={(pj,qj)}j=1J is called irreducible.

Remark 2.7

Suppose that the given data-set {(pj,qj)}j=1J is positively well-posed and irreducible in the interval [t0,tI] with the partition i=1I[ti-1,ti). Then every subinterval contains no more than two data points if the data-set determines the parameter function θ(t) to be uniquely fulfilling (2).

3 The case of known subintervals

This section starts by looking at the case in which ti, (i=1,,I), are given. The following theorem provides the conditions for the given data-set to guarantee the existence and uniqueness of the parameter function θ(t) and the corresponding solution u(t) of (2).

Theorem 3.1

Let I=J-1. Suppose that the given data set {(pj,qj)}j=1J is positively well-posed and let t0<t1<<tI be given time nodes of [t0,tI]. Then there exists a unique parameter function θ(t) and its solution u(t) that satisfy (2) if and only if the following condition holds:

If any interval [tk-1,tk) contains two data points, then k-1 data points are located in [t0,tk-1) for any k{1,,I}.

Proof

Rearrange data points as follows:

(i)

For 1iI, let mi be the number of data points in the interval [ti-1,ti) so that m1++mI=J and mi{0,1,2},

(ii)

For 1iI, denote {(pji,qji)}j=1mi by the set of data points in the interval [ti-1,ti),

(iii)

For 1iI, {pji}j=1mi is a strictly increasing sequence.

Suppose that there is a continuous function u(t) satisfyingu(t)=i=1Iu(ti-1)eθi(t-ti-1)χ[ti-1,ti)(t).u(pji)=qjij=1,,mi,i=1,,I.Then the given data (pji,qji) must satisfy5 logqji=logu(ti-1)+θi(pji-ti-1)for1jmi,1iI.5 And by the continuity conditions, we obtain6 logu(ti)=logu(ti-1)+θi(ti-ti-1)for1iI-1.6 Setx=logu(t0),θ1,logu(t1),θ2,,logu(tI-1),θItR2I,b=logq11,,logqm11,0,logq12,,logqm22,0,,0,logq1I,,logqmIItRJ+I-1. Combining (Equation5) and (Equation6), we haveAx=b,where the (J+I-1)×2I matrix A is explicitly given byA=1(p11-t0)00000001(pm11-t0)00000001(t1-t0)-1000000001(p12-t1)00000001(pm22-t1)00000001(t2-t1)-10000000001(pI-1-tI-1)00000001(pI-1-tI-1)00000001(tI-tI-1)-1000000001(p1I-tI-1)00000001(pmII-tI-1).It then follows that the existence of a unique parameter function θ(t) and the corresponding function u(t) that satisfy (2) is equivalent to the invertibility of the matrix A. Because mi2 (i=1,,I) and I=J-1, there exists at least one k{1,,I} such that mk=2 by the pigeon hole principle.

With some elementary operations, A can be transformed into the following matrixAA1000A2000A3,where 0 denotes a zero block matrix andA1=1(p11-t0)0001(pm11-t0)0001(t1-t0)-1000001(p1k-1-tk-2)0001(pmk-1k-1-tk-2)0001(tk-1-tk-2),A2=1001,andA3=-100001(p1k+1-tk)0001(pmk+1k+1-tk)0001(tk+1-tk)-1000001(p1I-tI-1)0001(pmII-tI-1).Sincerank(A1)min{m1++mk-1+(k-1),2(k-1)},rank(A2)=2,rank(A3)min{mk+1++mI+(I-k),2(I-k)},the following can be obtained by recalling I=J+1=j=1Imj+1 :rank(A)=rank(A1)+rank(A2)+rank(A3)min{m1++mk-1+(k-1),2(k-1)}+2+min{mk+1++mI+(I-k),2(I-k)}=min{m1++mk-1+(k-1),2(k-1)}+2+min{(2I-k-1)-(m1++mk-1),2(I-k)}=min{m1++mk-1,k-1}+(k-1)+2-max{m1++mk-1,k-1}+(2I-k-1).Based on the above, in the case of m1++mk-1k-1, rank(A)<2I, and thus A is not invertible. Therefore, there is a need for the condition m1++mk-1=k-1 with mk=2 to make A invertible.

Remark 3.2

   

(i)

Theorem 3.1 provides the necessary and sufficient condition for data and subintervals for the determination of a parameter function θ(t) uniquely to fulfil (2). That is, if a subinterval contains two data points, then the number of data points should equal that of subintervals in the whole left side of the interval.

(ii)

In the case of I>J-1,rank(A)J+I-1<2I, and therefore A is not invertible. Although the parameter function θ(t) can be determined, the uniqueness of θ(t) and its corresponding function u(t) satisfying (2) cannot be guaranteed.

(iii)

In the case of I<J-1, there is an over-determined problem. The only case in which the over-determined system Ax=b has a unique solution is that the linear system can be reduced to the system Ax=b with rank(A)=2I.

Remark 3.3

Let {(pj,qj)}j=1J a given data-set that is positively well posed and t0<t1<<tI be given time nodes in [t0,tI] with I=J-1. Suppose that there are two consecutive subintervals [tk-1,tk) and [tk,tk+1) containing two data points in each subinterval. Then the parameter function θ(t) cannot be determined uniquely. Indeed, for θ(t) to be determined uniquely, Theorem 3.1 requires that [t0,tk-1) and [t0,tk) contain k-1 and k data points, respectively, which leads to a contradiction.

4 The case of subintervals not yet determined

The previous section describes the necessary and sufficient condition for a data-set to determine a parameter function and its corresponding solution uniquely when I=J-1. The parameter function estimation problem is over-determined in the case of I<J-1. If the problem is over-determined with too much information, then a particular solution may not exist. As one way to address this problem, the number of unknown parameters can be increased by assuming that time nodes {ti}i=1I-1 need to be determined. Suppose a given data-set {(pj,qj)}j=1J is positively well-posed and irreducible. Then every subinterval should have at most two data points. Note that in each subinterval, the constant value of the parameter θ and its corresponding solution u(t) is uniquely determined by two data points. Therefore, to determine the parameter function for the whole interval, the following types of contiguous subintervals need to be considered:

(T1)

The first and last subintervals in this category both contain two data points, whereas all other subintervals contain only one data point,

(T2)

Only the first or last subinterval in this category contains two data points, whereas all other subintervals contain only one data point.

In the case of T2, the existence of ti and the corresponding parameter θi is always guaranteed, and therefore this section focuses on the case of T1. Note that I=J-2 in the case of T1.

Figure 1. For given data with I=2,t1 can be uniquely determined in case (a), whereas t1 cannot be determined in case (b).

Figure 1. For given data with I=2,t1 can be uniquely determined in case (a), whereas t1 cannot be determined in case (b).

Lemma 4.1

Assume that x1,x2 and x3 are any nonzero real numbers. Then the following two conditions are equivalent:

(i)

x1·x3<0 or x2·x3>0,

(ii)

x1·x2>0 or x2·x3>0.

Proof

Suppose (ii) is not true, that is, x1·x2<0andx2·x3<0. Then x1·x3>0andx2·x3>0, which implies that (i) is not true. By contrapositive, this completes the proof.

Lemma 4.2

Suppose that I=2 and the given data set {(pj,qj)}j=14 is positively well-posed and irreducible over the whole interval [t0,t2]. Then there exist unique t1[p2,p3] and the corresponding parameters θ1 and θ2 such that the function u(t) given by (Equation3) satisfies (2) if and only ifC(p1,q1,p2,q2,p3,q3)×C(p2,q2,p3,q3,p4,q4)0.

Proof

Suppose that there exist t1[p2,p3] and the corresponding parameters θ1 and θ2 such that the function u(t) given by (Equation3) satisfies (2). Then we haveu(t)=u(t0)eθ1(t-t0)χ[t0,t1)(t)+u(t1)eθ2(t-t1)χ[t1,t2)(t)C0(t0,t2),and (Equation2b) also requires the following relationships:u(t0)eθ1(p1-t0)=q1,u(t0)eθ1(p2-t0)=q2,u(t1)eθ2(p3-t1)=q3,u(t1)eθ2(p4-t1)=q4.The parameter values of θ1 and θ2 and the logarithmic values of u(t0) and u(t1) can be determined in terms of {(pj,qj)}j=14 by solving the above equations:7 θ1=logq1-logq2p1-p2,θ2=logq3-logq4p3-p4,logu(t0)=logq1-logq2p1-p2t0+p1logq2-p2logq1p1-p2,logu(t1)=logq3-logq4p3-p4t1+p3logq4-p4logq3p3-p4.7 From the continuity condition limtt1-u(t)=u(t1), we have8 u(t1)=u(t0)eθ1(t1-t0).8 Combining (Equation7) and (Equation8) gives9 logq3-logq4p3-p4t1+p3logq4-p4logq3p3-p4=logq1-logq2p1-p2t1+p1logq2-p2logq1p1-p2,9 and finally,t1=-p1logq2-p2logq1p1-p2-p3logq4-p4logq3p3-p4/logq1-logq2p1-p2-logq3-logq4p3-p4. Now the condition in which t1 lies between p2 and p3 needs to be derived (see Figure ). This condition can be restated as (t1-p2)(t1-p3)0, and a direct calculation gives0(t1-p2)(t1-p3)=-C(p1,q1,p2,q2,p3,q3)×C(p2,q2,p3,q3,p4,q4)(p1-p2)(p3-p4)(logq1-logq2p1-p2-logq3-logq4p3-p4)2.Note that the denominator is not zero because data points are irreducible. Therefore, t1[p2,p3] if and only if C(p1,q1,p2,q2,p3,q3)×C(p2,q2,p3,q3,p4,q4)0.

Lemma 4.3

Suppose that I=3 and the given data set {(pj,qj)}j=15 is positively well-posed and irreducible over the whole interval [t0,t3]. Then there exist t1[p2,p3],t2[p3,p4] and corresponding parameters θ1,θ2, and θ3 such that the function u(t) given by (Equation3) satisfies (2) if and only ifC(p1,q1,p2,q2,p3,q3)×C(p2,q2,p3,q3,p4,q4)0orC(p2,q2,p3,q3,p4,q4)×C(p3,q3,p4,q4,p5,q5)0.

Figure 2. An example of the case in which five data points satisfy the C-test condition.

Figure 2. An example of the case in which five data points satisfy the C-test condition.

Proof

Suppose that there exist two free nodes t1 and t2 such that following conditions hold (see Figure ):10 u(t0)eθ1(p1-t0)=q1,u(t0)eθ1(p2-t0)=q2,u(t1)eθ2(p3-t1)=q3,u(t2)eθ3(p4-t2)=q4,u(t2)eθ3(p5-t2)=q5.10 For this problem, we need to determine eight variables θ1,θ2,θ3,u(t0),u(t1),u(t2),t1 and t2. From (Equation10), it is clear that11 θ1=logq1-logq2p1-p2,θ3=logq4-logq5p4-p5,logu(t0)=logq1-logq2p1-p2t0+p1logq2-p2logq1p1-p2,logu(t2)=logq4-logq5p4-p5t2+p4logq5-p5logq4p4-p5.11 By the continuity condition of the solution12 logu(t1)=logq1-logq2p1-p2t1+p1logq2-p2logq1p1-p2,u(t1)eθ2(t2-t1)=u(t2).12 Therefore, θ1,θ3,u(t0),u(t1) and u(t2) can be determined by t1,t2,θ2, and the data-set {(pj,qj)}j=15. However, θ2 is related to t1 and t2, and therefore showing the existence of θ2 is equivalent to showing that the algebraic systemlogq3-logu(t1)=θ2(p3-t1),logu(t2)-logu(t1)=θ2(t2-t1)has at least one solution θ2. This leads to the relationshiplogq3-logu(t1)p3-t1=logu(t2)-logu(t1)t2-t1,which is equivalent to13 (p3-t2)logu(t1)-(t1-t2)logq3+(t1-p3)logu(t2)=0.13 Combining Equations (Equation11), (Equation12) and (Equation13) gives14 0=logq1-logq2p1-p2-logq4-logq5p4-p5t1t2-logq1-logq2p1-p2p3-logq3+p4logq5-p5logq4p4-p5t1+p1logq2-p2logq1p1-p2-logq3+logq4-logq5p4-p5p3t2-p1logq2-p2logq1p1-p2-p4logq5-p5logq4p4-p5p3.14 The RHS of (Equation14) is denoted as f(t1,t2). Then the roots of the function f in the rectangular domain [p2,p3]×[p3,p4] are considered. Since f is a polynomial of t1,t2 and f(p3,p3)=0, the function f(t1,t2) has roots in the domain [p2,p3]×[p3,p4], provided thatf(p2,p3)×f(p2,p4)0orf(p2,p4)×f(p3,p4)0.Representing the above conditions by using the C-function (see the Appendix) givesC(p1,q1,p2,q2,p3,q3)×C(p2,q2,p3,q3,p4,q4)0orC(p2,q2,p3,q3,p4,q4)×C(p3,q3,p4,q4,p5,q5)0,which completes the proof.

Lemma 4.3 can be extended to general cases.

Theorem 4.4

(C-test)   Suppose that I=n+2 (n is a positive integer) and the given data set {(pj,qj)}j=1n+4 is positively well-posed and irreducible over the whole interval [t0,tn+2]. Then there exist ti[pi+1,pi+2] for all i=1,,n+1, and corresponding parameters θ1,,θn+2 such that the function u(t) given by (Equation3) satisfies (2) if and only if15 C(pi,qi,pi+1,qi+1,pi+2,qi+2)×C(pi+1,qi+1,pi+2,qi+2,pi+3,qi+3)015 for some i{1,,n+1}.

Proof

As in Lemma 4.3, the theorem holds for n=1. Assume that it also holds for n-1.

()Suppose that there exist ti[pi+1,pi+2] for all i=1,,n+1, and corresponding parameters θ1,,θn+2 such that the function u(t) given by (Equation3) satisfies (2). By the continuity condition as in Equation (Equation9), we havelogu(t1)=logq1-logq2p1-p2t1+p1logq2-p2logq1p1-p2.Now consider the new data-set {(t1,u(t1))}{(pj,qj)}j=3n+4. Since the theorem holds for n-1, the following condition holds:16 C(t1,u(t1),p3,q3,p4,q4)×C(p3,q3,p4,q4,p5,q5)0orC(pi,qi,pi+1,qi+1,pi+2,qi+2)×C(pi+1,qi+1,pi+2,qi+2,pi+3,qi+3)016 for some i{3,,n+1}. Define L:RR by L(t1):=C(t1,u(t1),p3,q3,p4,q4), that is,L(t1)=(p3-p4)logu(t1)+(p4-t1)logq3+(t1-p3)logq4=(p3-p4)(logq1-logq2p1-p2t1+p1logq2-p2logq1p1-p2)MYAMP]+(p4-t1)logq3+(t1-p3)logq4.Since L is a linear function of t1 and p2t1p3, the condition (Equation16) impliesL(p2)×C(p3,q3,p4,q4,p5,q5)0orL(p3)×C(p3,q3,p4,q4,p5,q5)0.From the calculation, we getL(p2)×C(p3,q3,p4,q4,p5,q5)=C(p2,q2,p3,q3,p4,q4)×C(p3,q3,p4,q4,p5,q5)andL(p3)×C(p3,q3,p4,q4,p5,q5)=-p3-p4p1-p2×C(p1,q1,p2,q2,p3,q3)×C(p3,q3,p4,q4,p5,q5).Therefore,C(p2,q2,p3,q3,p4,q4)×C(p3,q3,p4,q4,p5,q5)0orC(p1,q1,p2,q2,p3,q3)×C(p3,q3,p4,q4,p5,q5)0.By Lemma 4.1, we obtainC(p1,q1,p2,q2,p3,q3)×C(p2,q2,p3,q3,p4,q4)0orC(p2,q2,p3,q3,p4,q4)×C(p3,q3,p4,q4,p5,q5)0.This means that there exists i{1,,n+1} such that (Equation15) holds.

()Suppose that (Equation15) holds for some i{1,,n}. Then consider the new data-set {(pj,qj)}j=1n+3. Since the theorem holds for n-1, there exist ti[pi+1,pi+2] for all i=1,,n, and corresponding parameters θ1,,θn+1 such that the function u(t) given by (Equation3) satisfies (2). If tn+1=pn+3 is chosen, then θn+2 can be determined such that the function u(t) in the subinterval [tn+1,tn+2] passes the data point (pn+4,qn+4). Suppose that (Equation15) holds for some i=n+1. Then consider the new data-set {(pj,qj)}j=2n+4. By similar arguments, ti[pi+1,pi+2] can be constructed for all i=1,,n+1, and corresponding parameters θ1,,θn+2.

Figure 3. The first and last intervals both contain two data points, whereas the others, only one data point (T1).

Figure 3. The first and last intervals both contain two data points, whereas the others, only one data point (T1).

Remark 4.5

Under the assumption of the irreducibility of data points, the value of the C-function is always nonzero. From this fact, the condition (Equation15) can be rewritten as (Equation17).

Theorem 4.6

(modified C-test)    Suppose that I=n+2 (n is a positive integer) and the given data set {(pj,qj)}j=1n+4 is positively well-posed and irreducible over the whole interval [t0,tn+2]. Then there exist ti[pi+1,pi+2] for all i=1,,n+1, and corresponding parameters θ1,,θn+2 such that the function u(t) given by (Equation3) satisfies (2) if and only if17 C(pi,qi,pi+1,qi+1,pi+2,qi+2)×C(pi+1,qi+1,pi+2,qi+2,pi+3,qi+3)>017 for some i{1,,n+1}.

Then the existence of a solution of Equation (2) can be easily checked by the above theorem when I=J-2. However, this theorem cannot guarantee the uniqueness of ti, the parameter θi and the solution u(t). In this regard, a meaningful algorithm is now introduced to determine the time nodes, the parameter function and its corresponding solution u(t) by formulating an appropriate optimization problem.

5 An algorithm for estimating parameter functions

This section introduces an algorithm for estimating a piecewise constant parameter function in the case of I<J-1. If input information (data points, the number of subintervals and the structure of subintervals) is given, then the whole interval can be divided into several subgroups that are of the form T1 or T2. As discussed earlier, the modified C-test is conducted only on T1 because the existence of ti and θi is always guaranteed in the case of T2. If the modified C-test fails, then the existence of ti and θi cannot be guaranteed, and therefore another structure of subintervals needs to be chosen.

Assume that the modified C-test is successful in T1. Then ti can be found by minimizing the total variation of the constant parameter θi in the whole interval. For this, the objective functional can be defined as18 f(t1,,tn-1)=i=1n-1(θi-1-θi)2,18 where θi is a constant parameter in the subinterval depending on {ti}i=1n-1. Here, n is the number of intervals in T1 or T2. In addition, there are following inequality constraints for the form of T1(see Figure ):19 p1<p2t1p3t2tn-2pntn-1pn+1<pn+2.19 Since tn-1 can be expressed uniquely in terms of (tn-2,u(tn-2)),(pn,qn),(pn+1,qn+1) and (pn+2,qn+2), the domain variables of the functional (Equation18) can be reduced by one. Then the explicit formulation for the functional f can be rewritten as follows:f(t1,,tn-2)=(logq1-logq2p1-p2-logu(t1)-logq1t1-p1)2+j=1n-3(logu(tj)-logqj+2tj-pj+2-logu(tj+1)-logqj+3tj+1-pj+3)2+(logu(tn-2)-logqntn-2-pn-logqn+1-logqn+2pn+1-pn+2)2,whereu(t1)=q1exp}logq1-logq2p1-p2(t1-p1){=q2exp}logq1-logq2p1-p2(t1-p2){andu(tk)=qk+1exp}logu(tk-1)-logqk+1tk-1-pk+1(tk-pk+1){for k=2,,n-2. Combining these recursive relationships giveslogu(tk)=(tk-pk+1)(t2-p3)(tk-1-pk+1)(t1-p3)logu(t1)-(tk-pk+1)(t3-p4)(t2-t1)(tk-1-pk+1)(t1-p3)logq3--(tk-pk+1)(tk-1-tk-2)(tk-1-pk+1)(tk-2-pk)logqk-tk-tk-1tk-1-pk+1logqk+1for k=2,,n-2. This implies that the term u(tk) can be represented by time nodes {ti}i=1k and data points.

In a similar manner, the constraint pntn-1pn+1 in the condition (Equation19) can be expressed for the variables t1,,tn-2 as(logqn-logqn+1pn-pn+1-logqn+1-logqn+2pn+1-pn+2)(logqn-logqn+1pn-pn+1-logu(tn-2)-logqntn-2-pn)0.This is obtained by applying Lemma 4.2 to the following set:{(tn-2,u(tn-2)),(pn,qn),(pn+1,qn+1),(pn+2,qn+2)}.Here, the numerical method proposed in [Citation16] is employed to find {ti}i=1I-1 that can minimize the objective functional subject to inequality constraints. Now the proposed algorithm is briefly introduced (see Algorithm  1).

6 Numerical simulation

6.1 Data-set 1

Data-set 1 deals with a set of 10 data points listed in Table .

Table 1. Simulated data in data set 1.

These data points are generated from the analytic solution to the following logistic differential equation.20 dudt=ru(1-uK)fort(p1,p10)andu(p1)=u0,20 where the parameters K=375, r=9.547976955 and u0=2.121772657 are from [Citation17]. Note that this is a positively well-posed data-set.

Here, the goal is to find a piecewise parameter function for the stage-structured differential Equation (2) that minimizes the functional (Equation18). Before proceeding, choose an appropriate number of subintervals and the structure of the subinterval. Because 10 data points are considered, the appropriate number of subintervals, I, ranging from 5 to 9. If I is less than 5, then it goes against to the definition of an irreducible data-set. If I is greater than 9, then its subinterval contains only one or zero data point, and this does not properly reflect on the dependency of data points. For this reason, select I{5,6,7,8,9}.

In this example, assume that the number of subintervals is seven, that is, I=7, and the structure of subintervals is (2, 2, 1, 1, 1, 1, 2), which means that the first, second and seventh intervals have two data points, whereas the others, one data point. As shown in Figure , the whole interval is composed of two subgroups (G1 and G2) that are rectangled to identify which data points belong to which subgroups. This decomposition is based on Lemma 2.2 and determines parameters uniquely by using two data points in a single subinterval.

Figure 4. Given information in data-set 1.

Figure 4. Given information in data-set 1.

First, consider the subgroup G1. Note that this subgroup has two subintervals and is classified as T1. According to Algorithm 1, the modified C-test needs to be conducted for G1. From the simple calculationC(p1,q1,p2,q2,p3,q3)×C(p2,q2,p3,q3,p4,q4)=6.8894×10-7>0,there exist t1,θ1 and θ2 for the solution u(t) of Equation (2), as shown in Lemma 4.2.

Similarly, the second group G2 has six subintervals and is classified as T1. Here, of course, two data points (p3,q3) and (p4,q4) are common elements of G1 and G2. The subgroup G2 is also T1, and therefore the modified C-test is again needed. SinceC(p4,q4,p5,q5,p6,q6)×C(p5,q5,p6,q6,p7,q7)=2.7138×10-7>0,the existence of time nodes and parameters in G2 is guaranteed. Then the results shown in Table can be obtained by applying a minimization scheme to minimize the functional (Equation18).

Table 2. Minimization results for the functional (18) for data-set 1.

With estimated time nodes and parameters, the L2 error between the derived and exact solutions of (2) is 1.7055×10-2. Figure shows the graph of the solutions.

Figure 5. The dashed line indicates the exact solution of (Equation20), the solid line is the derived solution and dots represent data points.

Figure 5. The dashed line indicates the exact solution of (Equation2020 dudt=ru(1-uK)fort∈(p1,p10)andu(p1)=u0,20 ), the solid line is the derived solution and dots represent data points.

Table 3. Simulated data in data-set 2.

Figure 6. Given information in data-set 2 (dots represent data points).

Figure 6. Given information in data-set 2 (dots represent data points).

Table 4. Minimization results for the functional (18) for data set 2.

Figure 7. The dashed line indicates the exact solution of (Equation21), the solid line is the derived solution and dots represent data points.

Figure 7. The dashed line indicates the exact solution of (Equation2121 dudt=t2uin(p1,p10)andu(p1)=exp(p13/3).21 ), the solid line is the derived solution and dots represent data points.

6.2 Data-set 2

Data-set 2 generates {pj}j=110 as random numbers in [-1,1] and {qj}j=110 by solving the following differential equation at pj:21 dudt=t2uin(p1,p10)andu(p1)=exp(p13/3).21 The generated data-set {(pj,qj)}j=110 is shown in Table . Data-set 2 considers seven subintervals (I=7), and the structure of subintervals is (1, 1, 1, 2, 2, 1, 2). In other words, the fourth, fifth and seventh subintervals have two data points, whereas the others, one data point. As depicted in Figure , the whole interval can be decomposed by three subgroups G1, G2 and G3. The first subgroup G1 has four subintervals and is classified as T2. In this case, there is no need to conduct the modified C-test because it is T2, and therefore the existence of time nodes and corresponding parameters are always guaranteed. On the other hand, G2 and G3 have two and three subintervals, respectively, and both are classified as T1. Therefore, the modified C-test needs to be conducted. In G2, however, the value ofC(p4,q4,p5,q5,p6,q6)×C(p5,q5,p6,q6,p7,q7)=-1.8281×10-6is negative, implying the nonexistence of t4. Accordingly, it is not possible to determine a piecewise constant parameter function θ(t) satisfying (2) with this structure.

Consider another structure of subintervals. Suppose that seven subintervals (I=7) are given and the structure of subintervals is (1, 1, 2, 1, 2, 1, 2). In this case, the third, fifth and seventh subintervals have two data points, whereas the others, one data point. The whole interval can be decomposed into three subgroups G1, G2 and G3. As in the cases of the other data-sets, the modified C-test needs to be conducted for G2 and G3, which are of T1 type. Here simple calculations indicate that the modified C-test is successful, and therefore the optimal parameter function can be calculated. Table shows the results. In addition, the L2 error between the derived and exact solutions of (Equation21) is 4.9836×10-3. Figure shows the graph of the solutions with estimated time nodes and parameters.

7 Funding

This research was supported by NRF of Korea [grant number 2011-0000344], [grant number 2009-0065241], [grant number 2008-C00043].

References

  • Beck JV, Arnold KJ. Parameter estimation in engineering and science. New York (NY): John Wiley & Sons; 1977.
  • Tarantola A. Inverse problem theory. Methods for data fitting and model parameter estimation. Amsterdam: Elsevier; 1987.
  • Banks HT, Bihari KL. Modelling and estimating uncertainty in parameter estimation. Inverse Probl. 2001;17:95–111.
  • Banks HT, Fitzpatrick BG. Statistical methods for model comparison in parameter estimation problems for distributed systems. J. Math. Biol. 1990;28:501–527.
  • Hvala N, Zec M, Strmc̆nik S. Non-linear model parameter estimation-estimating a feasible parameter set with respect to model use. Math. Comput. Modell. Dyn. Syst. Meth. Tools Appl. Eng. Related Sci. 2008;14:587–605.
  • Martinsons CD. Optimal parameterization of a mathematical model for solving parameter estimation problems. Inverse Probl. Sci. Eng. 2005;13:109–131.
  • Seber GAF, Wild CJ. Nonlinear regression. Hoboken (NJ): Wiley-Interscience; 2003.
  • Beverton RJH, Holt SJ. A review of the lifespans and mortality rates of fish in nature, and their relation to growth and other physiological characteristics. Chichester: John Wiley & Sons Ltd; 2008. p. 142–180.
  • Cren EDL. The length-weight relationship and seasonal cycle in gonad weight and condition in the perch (perca fluviatilis). J. Animal Ecol. 1951;20:201–219.
  • White P, Kalff J, Rasmussen J, Gasol J. The effect of temperature and algal biomass on bacterial production and specific growth rate in freshwater and marine habitats. Microb. Ecol. 1991;21:99–118.
  • Aiello WG, Freedman H. A time-delay model of single-species growth with stage structure. Math. Biosci. 1990;101:139–153.
  • Aiello WG, Freedman HI, Wu J. Analysis of a model representing stage-structured population growth with state-dependent time delay. SIAM J. Appl. Math. 1992;52:855–869.
  • Gourley SA, Kuang. Y. A stage structured predator-prey model and its dependence on maturation delay and death rate. J. Math. Biol. 2004;49:188–200.
  • Gurney WSC, Nisbet RM. Fluctuation periodicity, generation separation, and the expression of larval competition. Theoret. Population Biol. 1985;28:150–180.
  • Nisbet RM, Blythe SP, Gurney WSC, Metz JAJ. Stage-structure models of populations with distinct growth and development processes. IMA J. Math. Appl. Med. Biol. 1985;2:57–68.
  • Rockafellar RT. The multiplier method of Hestenes and Powell applied to convex programming. J. Optim. Theory Appl. 1973;12:555–562.
  • Gauze GF. The struggle for existence. Baltimore (MD): The Williams & Wilkins company; 1934.

Details of calculation in Lemma 4.3

 f(p2,p3)=[logq1-logq2p1-p2-logq4-logq5p4-p5]p2p3-[logq1-logq2p1-p2p3-logq3+p4logq5-p5logq4p4-p5]p2+[p1logq2-p2logq1p1-p2-logq3+logq4-logq5p4-p5p3]p3-[p1logq2-p2logq1p1-p2-p4logq5-p5logq4p4-p5]p3=p2p3-p2p3-p2p3+p2p3p1-p2logq1-p2p3-p2p3-p1p3-p1p3p1-p2logq2-(p3-p2)logq3-p2p3-p2p5-p32+p3p5p4-p5logq4+p2p3-p2p4-p32+p3p4p4-p5logq5=(p2-p3)logq3-(p2-p3)(p3-p5)p4-p5logq4+(p2-p3)(p3-p4)p4-p5logq5=p2-p3p4-p5×C(p3,q3,p4,q4,p5,q5),f(p2,p4)=[logq1-logq2p1-p2-logq4-logq5p4-p5]p2p4-[logq1-logq2p1-p2p3-logq3+p4logq5-p5logq4p4-p5]p2+[p1logq2-p2logq1p1-p2-logq3+logq4-logq5p4-p5p3]p4-[p1logq2-p2logq1p1-p2-p4logq5-p5logq4p4-p5]p3=p2p4-p2p3-p2p4+p2p3p1-p2logq1-p2p4-p2p3-p1p4+p1p3p1-p2logq2-(p4-p2)logq3-p2p4-p2p5-p3p4+p3p5p4-p5logq4+p2p4-p2p4-p3p4+p3p4p4-p5logq5=-(p3-p4)logq2-(p4-p2)logq3-(p2-p3)logq4=-C(p2,q2,p3,q3,p4,q4), 

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.