764
Views
4
CrossRef citations to date
0
Altmetric
Research Articles

COMBOS2: an algorithm to the input–output equations of dynamic biosystems via Gaussian elimination

ORCID Icon, &
Pages 896-907 | Received 08 Mar 2020, Accepted 14 Apr 2020, Published online: 07 Jul 2020

Abstract

Differential algebra (DA) methods are currently being exploited for analyzing dynamic biosystem models for their structural identifiability (SI) properties. An early step in this approach entails finding an equivalent input–output (I/O) model. A recent approach for finding these equations, based on Grbner bases and imbedded in the app COMBOS is relatively slow and sometimes unsuccessful, even for moderate size models. In this paper, we propose a faster algorithm, using a Gaussian elimination approach for analyzing the subclass of linear dynamic biosystem models. We apply this methodology to find the simplest set of globally SI parameter combinations of these models and show that it works effectively for linear biosystem models of simple to moderate complexity.

2010 MSC:

1. Introduction

Structural identifiability (SI) analysis deals with the problem of finding the solutions from input–output {u(t),y(t)} data [Citation1,Citation2] for the unknown parameters p of a dynamic system model of the form(Equation1) with state and output equations: (1) x˙(t,p)=f(x(t,p),u(t),t;p),t[t0,T]y(t,p)=g(x(t,p);p),(1) where p is a k-dimensional parameter vector, x is an n-dimensional state vector, y is an m-dimensional output vector, u is an r-dimensional input vector, and f and g are rational polynomial functions of their arguments, a reasonable assumption in most applications [Citation3]. Dynamic biosystem models typically have this form, with function f being polynomial, ratios of polynomials, or representative of linear compartmental biomodels in, e.g. tracer perturbation experiments on nonlinear biosystems.

Parameter pp is said to be SI locally if there are finite number of distinct solutions for it, and globally SI if there is only one solution. It's unidentifiable if there are uncountably number of solutions. Nevertheless, even when a model has unidentifiable parameters, finding useful combinations of parameters that are structurally identifiable is always possible. Furthermore, these combinations of parameters can often be utilized to reparameterize the original model into a structurally identifiable one. Practically speaking, SI is a prerequisite for finding parameter values (or values for parameter combinations) in a real “noisy” data problem, usually known as the numerical identifiability problem.

Differential algebra (DA) methods have been illustrated quite applicable in dealing with global and local SI properties of dynamic system models [Citation4–7]; and many software tools have been elaborated and implemented using many differential algebra algorithms [Citation7,Citation8]. but, they are restricted by complexity of computational algebraic or other difficulties, and are then far limited in application to models of low to moderate complexity.

An early and critical step in DA identifiability analysis is finding the model input–output (I/O) equations, by first eliminating its state variables. To make the I/O equations monic, the coefficients c(p) of them are fixed uniquely by normalizing the Equation [Citation7]. Structural identifiability is then recognized by checking the injectivity of the coefficients c(p), in other words, the model (Equation1) is globally identifiable if and only if c(p)=c(p) implies p=p for arbitrary p [Citation7]. Therefore, recognizing structural identifiability is facilitated to finding the solutions to c(p)=c(p) using many differential algebra algorithms.

Several methods are available for finding the I/O equations, including Ritt's pseudodivision [Citation9], a Gröbner Basis alternative to Ritt's pseudodivision (GRA), developed by Meshkat et al. [Citation3], and some other primary elimination methods [Citation10–13]. All are encumbered by the computational algebraic complexity encountered in finding the I/O equations and are thus limited to relatively small to moderately complex models [Citation14–16].

Gröbner Basis computations in current algorithmic approaches are a major encumbrance to solution speed, in some cases causing one symbolic algebra programme (COMBOS Footnote1) to hang, thereby providing no solutions, or running on for several hours in another DA programme (DAISY Footnote2) before providing SI solution both for a not-so-complex linear biosystem model (see Section 8 below). Gröbner Basis computations can be quite slow, even for relatively small problems; and increasing model complexity by even a small amount (the number of equations, equation terms, or unknown parameters) can have a great impact on speed and efficiency, especially for finding the I/O equations.

In this paper, we describe an algorithm that simplifies computation of and accelerates the speed of determining the I/O equations, limited only to linear ODE models. In brief, we use a Gaussian elimination approach directly applicable to linear biosystem and other ODE models [Citation17–20], replacing the Gröbner Basis-based GRA Method in COMBOS for obtaining I/O equations when the ODEs are linear. We also extend some results associated with the GRA algorithm, finding a strict bound on the minimum number of derivatives of the model equations required for finding the I/O equations of linear ODEs. The new methodology is admittedly limited, but it is a step forward, because even some linear models present major encumbrances to ID analysis overcome by the updated algorithms in this new version of COMBOS.

2. Algorithm for finding the I/O equations

To obtain the I/O equations for (Equation1), we need to first eliminate the state variables and their derivatives. For linear ODE models, if the number of equations (Neq) is more than the number of state variables and their derivatives (Nx), and at least two equations have dependent state variables, we can use one of them to eliminate the other, and can eliminate all state variables and their derivatives by repeating this step.

The condition (Neq > Nx) ensures that at least one input–output equation is found. To find the input–output equations involving all output variables with the highest order, we also show and prove that condition Neq ≥ Nx + m is sufficient (m is the number of output equations). This does not work for nonlinear ODE models [Citation3].

Finding the I/O equations is done in three steps: (1) Eliminate all state variables and their derivatives. (2) Assemble the coefficient matrix of the new system of equations. (3) Apply the Gaussian elimination algorithm to the coefficient matrix.

Step (1). In model (1), the equations y(t,p)=g(x(t,p);p) without derivatives of x is called the output equations. First take the derivative of the output equations, to eliminate the first derivative of x from the state variable equations, by back substitution. Every generated unknown variable then should exist in two equations of the new system. If Neq ≥ Nx + m, this additional equation is enough to eliminate the state variables, otherwise, the second derivative of the output equations is required. Differentiation of the corresponding state variable equations is continued until Neq ≥ Nx + m.

In the GRA algorithm, taking derivatives (nm)+1 times is sufficient. In Section 6, we show that this boundary is not always reached, and usually differentiating again generates more I/O equations.

Step (2). For using Gaussian elimination, we need to write all equations in matrix form, thereby assembling a coefficient matrix for the new system in which we can summarize all the information of the new system. This matrix must have as many columns as there are variables and their corresponding derivatives, and as many rows as equations. The order must be specific -- first state variables and their derivatives, next output variables and input variables and their derivatives. The matrix is constructed one row at a time, writing a 1 for the coefficient if the corresponding variable is present and 0 (zero) if it is absent.

Step (3). Following Gaussian elimination of the coefficient matrix, we generate a matrix in reduced row-echelon form. Rows of the new matrix, in which elements corresponding to state variables and their derivatives are zero, are input–output equations coefficients.

Theorem 2.1

Consider a linear ODE system of the form (Equation1) with m output equations. To find m I/O equations involving m output variables with highest order k, (the number of differentiations), we must differentiate k times where the condition Neq ≥ Nx + m is satisfied.

Proof.

As shown by Meshkat [Citation3], after differentiating k times, we have Neq ≥ Nx + m for the new system. Since all equations of a linear ODE system of the form (Equation1) are linearly independent, the new system remains linearly independent. Then, when we apply the Gaussian elimination algorithm on the new system, we obtain the reduced row-echelon matrix (Rrem) with non-zero elements shown in Figure . In each row of Rrem, the leading entry is the only non-zero entry in its column. In addition, Neq ≥ Nx + m, and we conclude that we have Nx + m rows involving only leading entries that correspond to one member of the set {x1(k),x2(k),,x˙1,x˙2,,y1(k),,ym(k)}. Finally, to find m I/O equations involving m output variables with order k, choose the m rows that correspond to {y1(k),,ym(k)}.

Figure 1. Reduced row-echelon form of a linear ODE system of the form (Equation1) with m output equations. Rows Nx+1 to Nx+m in this matrix shows the coefficients of the I/O equations.

Figure 1. Reduced row-echelon form of a linear ODE system of the form (Equation1(1) x˙(t,p)=f(x(t,p),u(t),t;p),t∈[t0,T]y(t,p)=g(x(t,p);p),(1) ) with m output equations. Rows Nx+1 to Nx+m in this matrix shows the coefficients of the I/O equations.

3. Linear compartment models and illustrative examples

We use linear compartment models [Citation21,Citation22] in several examples below to illustrate our novel algorithm for finding I/O equations and identifiable parameters combinations. We also compare this with results obtained using the GRA algorithm, where available. (2) x˙(t)=K(p)x(t)+B(p)u(t),y(t,p)=C(p)x(t),(2) where u is an r-dimensional input vector, x is an n-dimensional state vector, y is an m-dimensional output vector, and K is n×n compartmental matrix with rate constants kij that satisfy: (3) kij0ij,kii=j=0nkjiij,(3) where k0i is the transfer rate constant from compartment i to the external environment; B is n×r input matrix; C is m×n output matrix, p is a p-dimensional parameter vector containing the K, B and C parameters (a positive subset of the real numbers).

4. 2-Compartment model example

SI analysis of a simple 2-compartment model (Figure ) demonstrates all of the steps in our new algorithm. (4) x˙1=(k01+k21)x1+k12x2+u,x˙2=k21x1(k02+k12)x2,y=x1/V1.(4) We first use GRA and then our algorithm for finding the I/O equations and show that the results are the same. Then we use the Meshkat algorithm [Citation3] for finding identifiable parameters combinations (combos) and reparametrize the model in terms of these combos. There are n = 2 state variables {x1,x2}, m = 1 output equation y=x1/v1 and Neq = 3, Nx = 4. Then, according to the GRA Algorithm, taking derivatives n(m1)=2(11)=2 times is sufficient to find the input–output equation.

Figure 2. 2-Compartmental ODE Model. u is the input to compartment x1, the dashed line with a circle is the output inform compartment x2. The same notation will be used in the following figures.

Figure 2. 2-Compartmental ODE Model. u is the input to compartment x1, the dashed line with a circle is the output inform compartment x2. The same notation will be used in the following figures.

1: We first differentiate x1V1y (output equation) and then add the result to the system (Neq = 4, Nx = 4): (5) x˙1V1y˙,(5)

2: Differentiate x˙1V1y˙ to obtain y¨, and the corresponding state variable equation to obtain x1¨ and then add the result to the system (Neq = 6, Nx = 5): (6) x¨1V1y¨,x¨1(k01+k21)x˙1+k12x˙2+u˙,(6) The new system has 6 equations in 5 unknown state variables and their derivatives, {x1,x2,x˙1,x˙2,x¨1}, enough to eliminate all unknowns. According to the GRA algorithm and the ranking {x¨1,x˙2,x˙1,x2,x1,y¨,y˙,u˙,y,u}, we find the Gröbner basis of it using Mathematica.Then I/O equation is an equation only in {y¨,y˙,u˙,y,u} of the following form: (7) (k02+k12)u+(k01k02+k01k12+k02k21)V1yu˙+(k01+k02+k12+k21)V1y˙+V1y¨=0,(7) The coefficient matrix for model (Equation4) is the following form:

we now use Gaussian elimination on this coefficient matrix and obtain the following matrix in the reduced row-echelon form:

We select the rows of the new matrix which all elements corresponding to state variables and their derivatives are 0 (zero). Multiplying these rows with variables, we obtain the I/O equations (Figures  and ).

Figure 3. Coefficient matrix of 2-compartment model.

Figure 3. Coefficient matrix of 2-compartment model.

Figure 4. Reduced row-echelon form of 2-compartment model. The last row shows the coefficients of the I/O equations of this model.

Figure 4. Reduced row-echelon form of 2-compartment model. The last row shows the coefficients of the I/O equations of this model.

The I/O equation for model (Equation4) is: (8) (k02+k12)V1u+(k01k02+k01k12+k02k21)y1V1u˙+(k01+k02+k12+k21)y˙+y¨=0,(8) Multiplying Equation (Equation8) by V1, we see our I/O equation is to the same as the GRA algorithm I/O Equation (Equation7). Now we investigate the parameter identifiability of the system (Equation4) using Gröbner bases for the solutions to c(p)=c(p) and show V1,k12k21,k02k12,k01k21 are uniquely identifiable. Finally, the most appropriate identifiable combinations are {q1=k01k21,q2=k12k21,q3=k02k12,q4=V1}. Let x2=k21X2. Then the reparameterized equations are: (9) x˙1=q1x1+q2X2+u,X˙2=x1+q3X2,y=x1/q4.(9)

5. 3-Compartment model example

We analyse the 3-compartment model with one output equation shown in Figure , to show that we cannot find I/O equations when the number of equations (Neq) is equal to the number of state variables and their derivative (Nx). (10) x˙1=(k21+k31)x1+k12x2+k13x3+u,x˙2=k21x1(k02+k12)x2,x˙3=k31x1(k03+k13)x3,y=x1/V1.(10)

Figure 5. 3-Compartment model.

Figure 5. 3-Compartment model.
Here we have n = 3 state variables {x1,x2,x3}, m = 1 output equation y=x1/v1 and Neq=4, Nx = 6. Then, according to the GRA Algorithm, taking derivatives n(m1)=3(11)=3 times is sufficient to find the input–output equation.

1: Differentiate x1V1y and then add the result to the system (Neq = 5, Nx = 6): (11) x˙1V1y˙,(11)

2: Differentiate x˙1V1y˙ to obtain y¨, and the corresponding state variable equation to obtain x1¨ and then add the result to the system (Neq=7, Nx=7): (12) x¨1V1y¨,x¨1(k21+k31)x˙1+k12x˙2+k13x˙3+u˙,(12) The new system now has 7 equations in 7 unknown state variables and their derivatives.

3: Differentiate x¨1V1y¨ to obtain y, and the corresponding state variable equation to obtain x1, x2¨ and x3¨. Then add the result to the system (Neq = 11, Nx = 10): (13) x1V1y,x1(k21+k31)x¨1+k12x¨2+k13x¨3+u¨,x¨2+k21x˙1(k02+k12)x˙2,x¨3+k31x˙1(k03+k13)x˙3,(13) The new system now has 11 equations in 10 unknown state variables and their derivatives, {x1,x2,x3,x˙1,x˙2,x˙3,x¨1,x¨2,x¨3,x1}, enough to eliminate all unknowns. According to the GRA algorithm and the ranking {x1,x¨3,x¨2,x¨1,x˙3,x˙2,x˙1,x3,x2,x1,y,y¨,y˙,u¨,u˙,y,u}, we find the Gröbner Basis of it using Mathematica.Then I/O equation is: (14) (k02k03+k03k12+k02k13+k12k13)u+(k02k03k21+k02k13k21+k02k03k31+k03k12k31)V1y(k02+k03+k12+k13)u˙u¨+(k02k03+k03k12+k02k13+k12k13+k02k21+k03k21+k13k21+k02k31+k03k31+k12k31)V1y˙+(k02+k03+k12+k13+k21+k31)V1y¨+V1y=0(14) The coefficient matrix for (Equation10) is then:

We then use Gaussian elimination on this matrix and obtain a reduced row echelon form (not shown, too large). We select the only row with zero elements corresponding state variables and their derivatives (Figure ). (15) {0,0,0,0,0,0,0,0,0,0,1,k02+k03+k12+k13+k21+k31,k02k03+k03k12+k02k13+k12k13+k02k21+k03k21+k13k21+k02k31+k03k31+k12k31,(1/V1),(k02k03k12k13)/V1,k02k03k21+k02k13k21+k02k03k31+k03k12k31,(k02k03k03k12k02k13k12k13)/V1}(15) Multiplying this row with variables, we obtain the I/O equation. This equation is the same as that obtained with the GRA algorithm (Equation14), after multiplying by V1.

Figure 6. Coefficient matrix of 3-compartment model.

Figure 6. Coefficient matrix of 3-compartment model.

We then use Gröbner bases to find the solutions to c(p)=c(p) and show V1,k21+k31 are uniquely identifiable and k02+k12,k03+k13,k12k21,k13k31 are locally identifiable with 2 (two) solutions. Finally, the simplest identifiable combinations {q1=k21k31,q2=k12k21,q3=k13k31,q4=k02k12,q5=k03k13,q6=V1} are choosen so all model terms are positive after reparametrizing. Let x2=k21X2,x3=k31X3. Then the reparameterized equations are: (16) x˙1=q1x1+q2X2+q3X3+u,X˙2=x1+q4X2,X˙3=x1+q5X3,y=x1/q6.(16)

6. 4-Compartment thyroid system model example

The model shown in Figure , which represents the linear dynamics of two thyroid hormones (T3 and T4) in blood (1 and 2) and tissues (3 and 4), with conversion of T4 into T3 in the lumped tissue compartment. It has one input and two output. For this example, we find sufficient input–output equations for finding identifiable combinations, with fewer derivatives than with the GRA algorithm. (17) x˙1=k31x1+k13x3+u,x˙2=k42x2+k24x4,x˙3=k31x1(k03+k13+k43)x3,x˙4=k42x2+k43x3(k04+k24)x4,y1=x1,y2=x2.(17) We use the same method as above, but the minimum bound for the number of taking derivatives is 2 using our algorithm, while it is 3 in GRA algorithm. There are n = 4 state variables {x1,x2,x3,x4} and m = 2 output equations {y1=x1,y2=x2}. Then, according to Meshkat [Citation14], taking derivatives n(m1)=4(21)=3 times is sufficient for finding the I/O equation.

Figure 7. 4-Compartment model.

Figure 7. 4-Compartment model.

1: Differentiate y1x1,y2x2 (output equations) and then add the result to the system (Neq=8, Nx=8): (18) y˙1x˙1,y˙2x˙2,(18) The number of equations is still not enough to eliminate all unknowns.

2: Differentiate y1˙ and y2˙ to obtain y1¨ and y2¨, and the corresponding state variable equation to obtain x1¨ and x2¨. Then add the result to the system (Neq=12, Nx=10): (19) y¨1x¨1,y¨2x¨2,x¨1k31x˙1+k13x˙3+u,x¨2k42x˙2+k24x˙4,(19) Then new system now has 12 equations in 10 unknown state variables and their derivatives, {x1,x2,x3,x4,x˙1,x˙2,x˙3,x˙4,x¨1,x¨2}, enough to eliminate all unknowns. So, we do not need to take another derivative and the algorithm terminates here. According to the GRA algorithm and the ranking {x¨2,x¨1,x˙4,x˙3,x˙2,x˙1,x4,x3,x2,x1,y¨2,y¨1,y˙2,y˙1,u˙,y2,y1,u}, we find the Gröbner Basis of it using Mathematica.Then I/O equations have the following forms: (20) 1)k24k43uk24k31k43y1+k04k13k42y2k24k43y˙1+(k04k13+k13k24+k13k42)y˙2+k13y¨2=02)(k03+k13+k43)u+(k03k31+k31k43)y1u˙+(k03+k13+k31+k43)y1˙+y1¨=0(20) Coefficient matrix for model (Equation17) is:

We then use Gaussian elimination and acquire a reduced row-echelon form (not shown, too big). We select two rows with 0 (zero) elements corresponding to state variables and their derivatives (Figure ). 1){0,0,0,0,0,0,0,0,0,0,1,0,k04+k24+k42,k24k43k13,0,k04k42,k24k31k43k13,k24k43k13}2){0,0,0,0,0,0,0,0,0,0,0,1,0,k03+k13+k31+k43,1,0,k03k31+k31k43,k03k13k43} Multiplying these rows with variables, we obtain two I/O equations: 1)k24k43k13uk24k31k43k13y1+k04k42y2k24k43k13y˙1+(k04+k24+k42)y˙2+y¨2=02)(k03+k13+k43)u+(k03k31+k31k43)y1u˙+(k03+k13+k31+k43)y1˙+y1¨=0 Our input–output equations are the same as GRA algorithm I/O Equation (Equation20). Finally, we investigate parameter identifiability of the system (Equation17) using Gröbner bases for solutions to c(p)=c(p) and show k13,k31,k03+k43,k04k42,k24k43 and k04+k24+k42 are uniquely identifiable.

Figure 8. Coefficient matrix of 4-compartmental model.

Figure 8. Coefficient matrix of 4-compartmental model.

7. Evaluation of the third derivative

We investigate the result of obtaining a third derivative using the GRA algorithm. we show that the third derivative gives more input–output equations and, consequently, more coefficients. But identifiable parameters combination results are the same as using the Meshkat alghorithm [Citation3].

Let's explore the relationships among the coefficients of the I/O equations obtained from the third and second steps, which do not change the results. A coefficient table of these input–output equations for the second derivative are:

We did not repeat the same coefficients because they do not affect the exhaustive summary or the Gröbner Basis.

For the third derivative, we differentiate y1¨ and y2¨ to obtain y1 and y2, and the corresponding state variable equation to acquire x1, x2, x3¨ and x4¨. Add these to the system (Neq = 18, Nx = 14): (21) y1x1,y2x2,x1k31x¨1+k13x¨3+u,x2k42x¨2+k24x¨4,x¨3+k31x˙1(k03+k13+k43)x˙3,x¨4+k42x˙2+k43x˙3(k04+k24)x˙4,(21) Using the GRA algorithm and according to the ranking {x2,x1,x¨4,x¨3,x¨2,x¨1,x˙4,x˙3,x˙2,x˙1,x4,x3,x2,x1,y2,y1,y2¨,y1¨,y2˙,y1˙,u¨,u˙,y2,y1,u}, we find the Gröbner Basis of the new system.Then I/O equations are: (22) 1)k24k43uk24k31k43y1+k04k13k42y2k24k43y˙1+(k04k13+k13k24+k13k42)y˙2+k13y¨2=02)(k03+k13+k43)u+(k03k31+k31k43)y1u˙+(k03+k13+k31+k43)y1˙+y1¨=03)(k032+2k03k13+k132+k03k31+k13k31+2k03k43+2k13k43+k31k43+k432)u(k032k31+k03k13k31+k03k312+2k03k31k43+k13k31k43+k312k43+k31k432)y1+k31u˙u¨(k032+2k03k13+k132+k03k31+2k13k31+k312+2k03k43+2k13k43+k31k43+k432)y1˙+y1¨=04)(k24k43)u+(k03k04k42+k04k42k43)y2+(k24k43)y˙1+(k03k04+k03k24+k03k42+k04k42+k04k43+k24k43+k42k43)y˙2+(k03+k04+k24+k42+k43)y¨2+y2=0(22) Here we summarize some differences in the procedure of finding identifiable parameter combinations using the second and third derivative. For the second derivative, we have equations (Neq = 12) and state variables and their derivatives (Nx = 10), while for three, Neq=18, Nx=14. Because of this, use of the Grbner Basis or other methods becomes more computationally complex. Also we have 2 (two) input–output equations after two differentiations., After three, we have 4 input–output equations two of which are the same as after two differentiations. The two other input–output equations do not effect the final answer and can be eliminated. We should show that eliminating their coefficient does not effect the exhaustive summary and Grbner Basis. We use a coefficient table of input–output equations, where we write distinct coefficients, because repeating a coefficient does not effect the exhaustive summary and Grbner Basis.

Theorem 7.1

Let a1,a2,,an,b1,b2,,bm,p1,p2,,pn+mR{0}, and n,mN, then solution set, {a1a2anb1b2bm=p1p2pn+m,a1=p1,a2=p2,,an=pn} is equal to solution set, {b1b2bm=pn+1pn+2pn+m,a1=p1,a2=p2,,an=pn}. {a1a2anb1b2bm=p1p2pn+ma1=p1a2=p2an=pn{b1b2bm=pn+1pn+2pn+ma1=p1a2=p2an=pn

Proof.

By induction.

Theorem 7.2

Let a1,a2,,an,b1,b2,,bm,p1,p2,,pn+mR{0}, and n,mN, then solution set, {a1+a2++an+b1+b2++bm=p1+p2++pn+m,a1=p1,a2=p2,,an=pn} is equal to solution set, {b1+b2++bm=pn+1+pn+2++pn+m,a1=p1,a2=p2,,an=pn}. {a1+a2++an+b1+b2++bm=p1+p2++pn+ma1=p1a2=p2an=pn{b1+b2++bm=pn+1+pn+2++pn+ma1=p1a2=p2an=pn

Proof.

By induction.

Theorem 7.3

Let A, B and C be subsets of the exhaustive summary in linear model, if AB=C and AA,BB, then CC=AB (The symbol ≅ means that two sets have the same distinct solutions).

Proof.

Let SA,SA,SB,SB,SC and SC denote the sets of solutions of A,A,B,B,C and C respectively. We should show SCSC and SCSC. For SCSC, Suppose sSC=SASB. Then sSA or sSB; if it is in SA, then by relation AA, we have sSASCSCSC, by symmetry sSB leads to the same conclusion. We also can show SCSC by using this method.

Theorems 7.1, 7.2 and 7.3 are true for Gröbner Basis of the same model, that is, we can perform Gröbner Basis on each side of above expressions without changing the equality [Citation23]. For theorem 7.1, we need to get the coefficient domain of polynomials as rational functions instead of integers numbers to simplify the solutions. We use these theorems for comparing between the obtained coefficients of I/O equations by taking derivatives two times (Table ) and three times (Table ).

Table 1. Coefficients of the I/O Equations for the second derivative.

Table 2. Coefficients of the I/O Equations for the third derivative.

We introduce a method to show the relation between the coefficients in Table  (T1) and Table  (T2).

First, we specify each coefficient according to its position number in T1 or T2. For example, the relation number of coefficient k03+k13+k43 is 7 in T1.

Second, the coefficient in each table may have a different relation number. For example, the relation number of coefficient k03+k13+k31+k43 is 9 in T1 and 11 in T2.

Third, Suppose coefficient α is in T2, but not in T1. To find relation number in T1, we find a subset A including α of T2 and a subset B of T1 such that AB. that is: IfαT2T1αAT1andBT2SuchthatAB The relation number of αA in T1 is equal to all the relation number of the coefficients in B. For example (23) {k03+k13+k43(7),k13(2)}{k03+k43(2,7),k13(2)}{k03+k43(2,7),k24k43(3)}{k03k04k42+k04k42k43(2,3,7),k24k43(3)}(23) Then the relation number of k03k04k42+k04k42k43 is 2, 3, 7 in T1. The relation number is not unique and we use them only to show related coefficients do not effect the final answer and can be eliminated. The best way to find these subsets is to write the coefficients in T2 as a combinations of coefficients in T1. Some of these combinations are in the following forms: (24) k03k04+k03k24+k03k42+k04k42+k04k43+k24k43+k42k43=(k03+k43)(k04+k24+k42)+k04k42k032+2k03k13+k132+k03k31+k13k31+2k03k43+2k13k43+k31k43+k432=(k03+k13+k43)(k03+k13+k31+k43)k032+2k03k13+k132+k03k31+2k13k31+k312+2k03k43+2k13k43+k31k43+k432=(k03+k13+k43)(k03+k13+k31+k43)+k31(k13+k31)k032k31+k03k13k31+k03k312+2k03k31k43+k13k31k43+k312k43+k31k432=k31(k03+k43)(k03+k13+k31+k43)(24)

8. Linear thyroid hormone distribution and metabolism (D&M) model example (Figure 9)

This more complex version of a well-studied biosystem model [Citation24–26] illustrates the power of our algorithm. (25) x˙1=k12x2+k13x3(k31+k21)x1+u1,x˙2=k21x1(k12+k02+k52)x2,x˙3=k31x1(k13+k03+k63)x3,x˙4=k45x5+k46x6(k64+k54)x4+u2,x˙5=k54x4+k52x2(k45+k05)x5,x˙6=k64x4+k63x3(k46+k06)x6,y1=x1,y2=x4.(25) We use the method of Section 4. This model has n = 6 state variables {x1,x2,x3,x4,x5,x6} and m = 2 output equations {y1=x1,y2=x4}. Then, according to the GRA algorithm, taking derivatives n(m1)=6(21)=5 times is sufficient to find the input–output equation (Figure ).

1: We differentiate y1x1 and y4x4 and then add the result to the system (Neq = 10, Nx = 12): (26) y˙1x˙1,y˙4x˙4,(26)

2: Differentiate y1˙ and y2˙ to obtain y1¨ and y2¨, and the corresponding state variable equation to acquire x1¨ and x4¨. Add these to the system (Neq = 14, Nx = 14): (27) y¨1x¨1,y¨4x¨4,x¨1+k12x˙2+k13x˙3(k31+k21)x˙1+u˙1,x¨4+k45x˙5+k46x˙6(k64+k54)x˙4+u˙2,(27)

Figure 9. Linear thyroid D&M model.

Figure 9. Linear thyroid D&M model.

3: Differentiate y1¨ and y2¨ to obtain y1 and y2, and the corresponding state variable equation to acquire x1, x4, x3¨ and x4¨. Add these to the system (Neq=22, Nx=20): (28) y1x1,y4x4,x1+k12x¨2+k13x¨3(k31+k21)x¨1+u¨1,x4+k45x¨5+k46x¨6(k64+k54)x¨4+u¨2,x¨2+k21x˙1(k12+k02+k52)x˙2,x¨3+k31x˙1(k13+k03+k63)x˙3,x¨5+k54x˙4+k52x˙2(k45+k05)x˙5,x¨6+k64x˙4+k63x˙3(k46+k06)x˙6,(28)

The new system now has 22 equations in 20 unknown state variables and their derivatives, {x1,x2,x3,x4,x5,x6,x˙1,x˙2,x˙3,x˙4,x˙5,x˙6,x¨1,x¨2,x¨3,x¨4,x¨5,x¨6,x1,x4}, enough to eliminate all unknowns. The algorithm terminates here. (In comparison the GRA algorithm requires 5 (instead of 3) steps, giving 38 equations in 32 unknown (instead of 22 in 20 unknowns) state variables and their derivatives). This gives two input–output equations.

To test structural identifiability of this model, we adopt the quasi-numerical method used in DAISY [Citation7]. We provide the exhaustive summary c(p), using coefficients of the I/O equations plus random numbers. For every function of the unknown p in the exhaustive summary we assign a pseudo-randomly chosen numerical value for the elements of vector p, providing a system of nonlinear algebraic equations in the unknown p. This quasi-numerical approach allows significantly reduced symbolic computations and hence broadens the class of testable models. For this example, the resulting nonlinear algebraic equations (with random numbers) were solved using Mathematica, which returns solutions for each unknown parameter. When these solutions indicate that the parameter is equal to its assigned number, the model is globally structurally identifiable, otherwise it is not. We chose numerical parameter values as follows: (29) {k022,k033,k055,k067,k1211,k1313,k2117,k3119,k4523,k4629,k5231,k5437,k6341,k6443}(29) After solving exhaustive summary, none are equal to their numerical value, so the model is not structurally identifiable (Appendix 1).

9. Implementation of method as web application

Our new algorithm is implemented as web application COMBOS2, available online at: http://biocyb1.cs.ucla.edu/combos2. To show the speed of our algorithm compared with the Gröbner Basis approach in COMBOS, we give some examples of analyzing linear models for identifiable combinations, which DAISY does not provide. COMBOS [Citation8] provides identifiable parameter combinations for the 4-Compartment biosystem model in about 17 seconds, decreasing to 5 seconds with COMBOS2. COMBOS2 shows that a 5-Compartment identifiable model [Citation27] is identifiable in about 9 seconds, whereas COMBOS needs 28 seconds. Each is at least 3 times faster with COMBOS2. For the 6-compartment linear thyroid model above, COMBOS did not provide any answer because it hung during the I/O phase of the computation. COMBOS2 took only 2 seconds for finding I/O equations, but it also hangs when attempting to find identifiable parameter combinations, because it uses the Gröbner Basis approach in its paramter combinations algorithm. An alternative method for solving this problem using Mathematica is described in the Appendix 2. DAISY does not compute identifiable parameters combinations, but it did analyse for structural identifiability of individual parameters, and this took several hours to complete.

10. Conclusion

We introduced a novel algorithm for finding the I/O equations for systems of linear biosystem models and other linear ODEs of the form (Equation1) with m output equations, by taking derivatives of the system and then using a Gaussian elimination approach. The new algorithm runs much faster and is simpler to implement than a Grobner basis approach. We also showed the number of derivatives needed for computing I/O equations is smaller when the condition Neq ≥ Nx + m is satisfied. We demonstated the utility of the new algorithmic approach in a new app COMBOS2, applying it to several example models and, in particular, to a 6-dimensional thyroid hormone dynamics model, shown in 2 seconds to be not structurally identifiable, while COMBOS provided no solution. Another app DASIY needed several hours to provide only a structural identifiability analysis for the original individual parameters, and no parameter combination results. The main limitation of our new algorithm is, of course, that it is not applicable to nonlinear biosystem models. But we believe it gets a bit closer.

Disclosure statement

No potential conflict of interest was reported by the authors.

Notes

References

  • DiStefano III J. Dynamic systems biology modeling and simulation. Waltham (MA): Academic Press; 2015; Jan 10.
  • Cobelli C, Distefano 3rd JJ. Parameter and structural identifiability concepts and ambiguities: a critical review and analysis. Am J Physiol Regul Integr Comp Physiol. 1980;239(1):R7–R24.
  • Meshkat N, Anderson C, DiStefanoIII JJ. Alternative to Ritt's pseudodivision for finding the input-output equations of multi-output models. Math Biosci. 2012;239(1):117–123.
  • Ljung L, Glad T. On global identifiability for arbitrary model parametrizations. Automatica. 1994;30(2):265–276.
  • Itu C, Ochsner A, Vlase S, et al. Improved rigidity of composite circular plates through radial ribs. Proc Inst Mech Eng Pt L J Mater Des Appl. 2019;233(8):1585–1593.
  • Marin M, Craciun EM, Pop N. Considerations on mixed initial-boundary value problems for micropolar porous bodies. Dyn Syst Appl. 2016;25(1–2):175–196.
  • Bellu G, Saccomani MP, Audoly S, et al. DAISY: a new software tool to test global identifiability of biological and physiological systems. Comput Methods Programs Biomed. 2007;88(1):52–61.
  • Meshkat N, Kuo CE, DiStefano III J. On finding and using identifiable parameter combinations in nonlinear dynamic systems biology models and COMBOS: a novel web implementation. PLoS ONE. 2014;9(10):e110261.
  • Ritt JF. Differential algebra. New York (NY): American Mathematical Society; 1950.
  • Jirstrand M. Algebraic methods for modeling and design in control. Division of Automatic Control, Department of Electrical Engineering, Linkping University; 1996.
  • Yazdi MK. An accurate relationship between frequency and amplitude to nonlinear oscillations. J Taibah Univ Sci. 2018;12(5):532–535.
  • Marin M, Vlase S, Ellahi R, et al. On the partition of energies for the backward in time problem of thermoelastic materials with a dipolar structure. Symmetry. 2019;11(7):863.
  • Bhatti MM, Shahid A, Abbas T, et al. Study of activation energy on the movement of gyrotactic microorganism in a magnetized nanofluids past a porous plate. Processes. 2020;8(3):328.
  • Meshkat N, Eisenberg M, DiStefano III JJ. An algorithm for finding globally identifiable parameter combinations of nonlinear ODE models using Grbner Bases. Math Biosci. 2009;222(2):61–72.
  • Meshkat NC. A differential algebra method for eliminating unidentifiability. Los Angeles: University of California; 2011.
  • Saccomani MP, Audoly S, Bellu G, et al. Examples of testing global identifiability of biological and biomedical models with the DAISY software. Comput Biol Med. 2010;40(4):402–407.
  • Bareiss EH. Sylvester's identity and multistep integer-preserving Gaussian elimination. Math Comput. 1968;22(103):565–578.
  • Marin M, Nicaise S. Existence and stability results for thermoelastic dipolar bodies with double porosity. Continuum Mech Thermodyn. 2016;28(6):1645–1657.
  • Abbas IA. Eigenvalue approach for an unbounded medium with a spherical cavity based upon two-temperature generalized thermoelastic theory. J Mech Sci Technol. 2014;28(10):4193–4198.
  • Marin M, Ellahi R, Chirila A. On solutions of Saint-Venant's problem for elastic dipolar bodies with voids. Carpathian J Math. 2017;33(2):219–232.
  • Jacquez JA. Compartmental analysis in biology and medicine, BioMedware. Ann Arbor, MI. 1996;512.
  • Audoly S, DAngio L, Saccomani MP, et al. Global identifiability of linear compartmental models-a computer algebra algorithm. IEEE Trans Biomed Eng. 1998;45(1):36–47.
  • Cox D, Little J, OShea D. Ideals, varieties, and algorithms: an introduction to computational algebraic geometry and commutative algebra. New York: Springer Science & Business Media; 2013.
  • Distefano III JJ, Wilson KC, Jang M, et al. Identification of the dynamics of thyroid hormone metabolism. Automatica. 1975;11(2):149–159.
  • DiStefano 3rd JJ, Mori FU. Parameter identifiability and experiment design: thyroid hormone metabolism parameters. Am J Physiol Regul Integr Comp Physiol. 1977;233(3):R134–R144.
  • Sefkow AJ, DiStefano III JJ, Himick BA, et al. Kinetic analysis of thyroid hormone secretion and interconversion in the 5-day-fasted rainbow trout, Oncorhynchus mykiss. Gen Comp Endocrinol. 1996;101(2):123–138.
  • Meshkat N, Sullivant S, Eisenberg M. Identifiability results for several classes of linear compartment models. Bull Math Biol. 2015;77(8):1620–1651.

Appendices

Appendix 1

(A1) 1)k021333k63516191517,k0344k63,k051204k635694943k633280,k0636k631189k63,k1211,k1313,k2117,k3119,k453489143k633280,k461189k63,k5231(43k633280)1517,k54141(328043k63),k6443k6341(A1) (A2) 2)k02385237k6314136513547637,k03117(72217k63),k0520468k631082031731k6362320,k06612k632259117k63,k1218719,k1324717,k2119,k3117,k45662929731k6362320,k462259117k63,k52527(731k6362320)547637,k541779(62320731k63),k64731k63779,(A2) (A3) 3)k0254612647k631473007k632023153647+58375263385237,k03119(19k63+11647+1034),k0538657647(2480647k63+44640k63+703)k63695826(2480647k63+44640k63+703)k63+36,k06532k63+713647+1283419k63,k121317(64718),k131119(64718),k21647+18,k3118647,k4538657(647k63+18k63)2480647k63+44640k63+703,k46713(647+18)19k63,k5241(1332647k6335927k6342160647+758880)385237,k541527(37647k63666k63+42160),k641527(37)(647k6318k63),(A3) (A4) 4)k0254612647k631473007k63+2023153647+58375263385237,k03119(19k6311647+1034),k0538657647(2480647k6344640k63703)k63+695826(2480647k6344640k63703)k63+36,k06532k63713647+1283419k63,k121317(64718),k131119(64718),k2118647,k31647+18,k4538657(647k6318k63)2480647k6344640k63703,k46713(64718)19k63,k5241(1332647k63+35927k6342160647758880)385237,k541527(37647k63666k63+42160),k6437527(647k63+18k63)(A4)

Appendix 2

In this section, we introduced a method for finding identifiable parameters combinations of the linear thyroid model by using Mathematica. COMBOS2 hangs when attempting to find identifiable parameter combinations. A technique for solving this problem is to reduce the number of parameters by substituting some of them with numbers.

First Step. According to the calculations for the linear thyroid model (EquationA1), (EquationA2), (EquationA3), (EquationA4), the best choice is as follows: (A5) {k1211,k1313,k2117,k3119}(A5) After substituting these parameters in c(p)=c(p), we investigate the parameter identifiability of the system (Equation25) using Gröbner bases for the solutions to new c(p)=c(p) and have: (A6) {k54+k64,k46k64,k46k63,k45k64,k45k52,k06+k46,k05+k45,k03+k63,k02+k52}(A6) Let k5464=k54+k64,k46T64=k46k64,k46T63=k46k63,k45T64=k45k64,k45T52=k45k52,k0646=k06+k46,k0545=k05+k45,k0363=k03+k63,k0252=k02+k52, and also X2=k12x2,X3=k13x3,X5=k45x5,X6=k46x6,k45T52T12=k45T52k12,k46T63T13=k46T63k13. Then the reparameterized equations are: (A7) x˙1=X2+X3(k21+k31)x1+u1,X˙2=k12T21x1(k12+k0252)X2,X˙3=k13T31x1(k13+k0363)X3,x˙4=X5+X6k6454x4+u2,X˙5=k45T54x4+k45T52T12X2k0545X5,X˙6=k46T64x4+k46T63T13X3k4606X6,y1=x1,y2=x4.(A7) Second Step. Using Mathematica we investigate the parameter identifiability of this new system (EquationA7) and find the following combinations: (A8) {k5464,k46T64,k46T63T13,k45T54,k45T52T12,k4505,k21+k31,k13T31,k12T21,k0646,k0363+k13,k0252+k12}(A8) By combining the identifiabile parameters combinations obtained from the first (EquationA6) and second (EquationA8) steps for the linear thyroid model, we have the following Table :

Table A1. The identifiabile parameters combinations of the linear thyroid model.

Then the reparameterized linear thyroid model equations are: (A9) x˙1=X2+X3+K11x1+u1,X˙2=T22x1+K22X2,X˙3=T33x1+K33X3,x˙4=X5+X6+K44x4+u2,X˙5=T55x4+T52X2+K55X5,X˙6=T66x4+T63X3+K66X6,y1=x1,y2=x4.(A9)