15,460
Views
78
CrossRef citations to date
0
Altmetric
Teacher’s Corner

Drawing Conclusions from Cross-Lagged Relationships: Re-Considering the Role of the Time-Interval

Abstract

The cross-lagged panel model (CLPM), a discrete-time (DT) SEM model, is frequently used to gather evidence for (reciprocal) Granger-causal relationships when lacking an experimental design. However, it is well known that CLPMs can lead to different parameter estimates depending on the time-interval of observation. Consequently, this can lead to researchers drawing conflicting conclusions regarding the sign and/or dominance of relationships. Multiple authors have suggested the use of continuous-time models to address this issue.

In this article, we demonstrate the exact circumstances under which such conflicting conclusions occur. Specifically, we show that such conflicts are only avoided in general in the case of bivariate, stable, nonoscillating, first-order systems, when comparing models with uniform time-intervals between observations. In addition, we provide a range of tools, proofs, and guidelines regarding the comparison of discrete- and continuous-time parameter estimates.

INTRODUCTION

The cross-lagged panel model (CLPM), a discrete-time (DT) structural equation modeling (SEM) model, is a popular method used to analyze longitudinal panel data consisting of multiple repeated measurements of a set of variables. The CLPM is conceptually similar to the first-order vector autoregressive (VAR(1)) model used in time series analysis (cf. Hamilton, Citation1994), which has seen a recent growth in popularity for the analysis of experience sampling data (cf. Bolger & Laurenceau, Citation2013; Chow, Ferrer, & Hsieh, Citation2011; Hamaker, Dolan, & Molenaar, Citation2005). This type of SEM model is particularly appealing in that it offers the opportunity to assess potentially Granger-causal (Granger, Citation1969) or predictive relationships between variables when lacking an experimental design, using cross-lagged regression parameters.

However, this type of lagged regression suffers from the well-known problem of time-interval dependency. That is, although this model takes into account the longitudinal structure of the data with respect to the order of measurement occasions, it does not explicitly account for the amount of time that elapses between measurement occasions. This has two major practical implications for researchers. First, if observations within a study are gathered with unequal time-intervals, the standard CLPM will lead to biased results, because it will model the unequal time-intervals as if they were equidistant. Second, even if all observations are equally-spaced in time, the estimated lagged effects will be specific to the time-interval used in the study. That is, the estimated lagged relationships between variables will vary as a function of the amount of time between measurements (a.o., Gollob & Reichardt, Citation1987). This means that researchers who study the same phenomenon with different uniform time-intervals will come across different estimates of seemingly the same lagged effects. Importantly, this can in some instances lead to researchers studying the same phenomena coming to opposite conclusions regarding the sign and relative strengths (i.e., order of predictive dominance) of lagged effects. To overcome both of the problems posed by the time-interval dependency issue, the use of a Continuous-Time (CT) Modeling approach has been suggested repeatedly in the literature (e.g., Boker, Citation2002; Chow et al., Citation2005; Oravecz, Tuerlinckx, & Vandekerckhove, Citation2009; Oud & Delsing, Citation2010a). In particular, both Voelkle, Oud, Davidov, and Schmidt (Citation2012) and Boker, Neale, and Rausch (Citation2004) have demonstrated how these models can be estimated in SEM-based software.

In the current article, we will focus specifically on the issue of comparability between CLPM effect estimates of the same phenomena when different uniform time-intervals are used in data collection. This is an issue that is particularly important in the social sciences at the moment, where issues such as replicability and meta-analyses have seen recent widespread critical attention. We offer an accessible account of the specific conditions under which conclusions regarding the sign and relative strengths of effects are invariant under different choices of uniform time-interval.

First, we will begin with an introduction to the key concepts behind the standard CLPM and its continuous-time equivalent, both SEM-based approaches, and a treatment of how these models are related to one another. This treatment includes a number of formulas and tools by which effect estimates based on another time-interval or time-scale, or using another model, can be made comparable, with corresponding R code in an Appendix. Second, we will examine the issue of researchers who use different uniform time-intervals to come to different conclusions with regard to the sign and relative strength of effects. Although previous authors have been at pains to point out that this can sometimes occur, we will elucidate the exact circumstances in which this will or will not occur. In the general case, within the first-order processes, only when researchers are studying univariate processes or stable bivariate processes with real eigenvalues, are such conclusions insensitive to the choice of uniform time-interval or choice of model. We conclude with some recommendations for substantive researchers and a discussion of related pertinent issues.

PRELIMINARIES

In this section, we will introduce the key concepts used throughout the rest of the article. The CLPM is briefly discussed, along with some shortfalls of this model which motivate the rest of the current article. Following this, we briefly introduce the continuous-time model (CTM) that has been suggested by various authors as a means of overcoming the two time-interval problems associated with the CLPM, a DT model. Finally, we will show how these two models can be related in a straightforward manner, a finding that is then utilized throughout the rest of the article.

The Discrete-Time (DT) Cross-Lagged Panel Model (CLPM)

The CLPM is a popular DT SEM-based model used for the analysis of longitudinal panel data, that is, data in which the same set of variables is measured repeatedly at several measurement occasions (cf. Finkel, Citation1995; Hamaker, Kuiper, & Grasman, Citation2015; Rogosa, Citation1980). The CLPM can be seen as an extension to panel data of the first-order vector autoregressive (VAR(1)) model from time-series analysis (cf. Hamilton, Citation1994). For the sake of simplicity, although we will discuss issues relevant to general VAR(1) models, we will use the CLPM term throughout. In these models, q variables (y) measured at each measurement occasion (m) are regressed on themselves and each other at the previous measurement occasion (m1). For the sake of notational simplicity, and without loss of generality, we will assume throughout that the variables y are standardized, and thus, so too are all parameter estimates. Notably, when comparing parameter estimates, one should examine the standardized parameters to allow for a fair comparison of the size of lagged effect parameters.

The CLPM can then be represented in the general case, using solely the structural equation, as:

(1) ym=Φym1+ϵm(1)

where ym are the q standardized variables of interest at measurement occasion m, which implies an intercept of zero and results in standardized parameters; ϵm represents a m -vector of errors that are independent and identically distributed but may be contemporaneously correlated; and Φ is the q×q matrix consisting of standardized autoregressive and cross-lagged parameters. The elements of the effect matrix Φ are denoted by ϕjk and represent the effect of variable j at measurement occasion m1 on variable k at measurement occasion m for j,k=1,,q, which is the same for all measurement occasions (m). In case of a stable and therefore stationary process, the modulus (i.e., the absolute value) of the eigenvalues are smaller than 1. This implies that the process is mean-reverting, that is, tends to drift toward its long-term mean, here, zero, because of the standardized variables.

An example of a bivariate CLPM is shown in . Researchers who apply the CLPM are generally interested in the standardized cross-lagged parameters, ϕ12 and ϕ21 in , and the comparison of their relative strengths. Stability of the constructs is controlled for through the inclusion of the autoregressive effects, that is, parameters ϕ11 and ϕ22 in (cf. Rogosa, Citation1980). Due to the fact that the CLPM controls for this stability, it is often believed that the cross-lagged regression parameters obtained with this model are the most appropriate measures for studying causality in longitudinal correlational data (Deary, Allerhand, & Der, Citation2009) and determining which variable has a stronger causal influence on the other (Bentler & Speckart, Citation1981). Such conclusions are based on the notion of Granger causality (Granger, Citation1969).Footnote1 The CLPM is a flexible model that can be used for many different numbers of variables. Adaptations of the CLPM have been proposed, for example, for the study of mediation (Maxwell, Cole, & Mitchell, Citation2011) and the estimation of dynamical networks (Bringmann et al., Citation2013).

FIGURE 1 Bivariate relationship between y1 (e.g., child health) and y2 (e.g., maternal health) for time points t=0,,(T1)Δt with autoregression parameters ϕ11 and ϕ22, and cross-lagged parameters ϕ12 and ϕ21. In case of a symmetric relationship, the cross-lagged/reciprocal parameters are equal, that is, ϕ12=ϕ21.

FIGURE 1 Bivariate relationship between y1 (e.g., child health) and y2 (e.g., maternal health) for time points t=0,…,(T−1)Δt with autoregression parameters ϕ11 and ϕ22, and cross-lagged parameters ϕ12 and ϕ21. In case of a symmetric relationship, the cross-lagged/reciprocal parameters are equal, that is, ϕ12=ϕ21.

The CLPM can be described as a DT model in that the model does not explicitly account for the length of the time-intervals between measurement occasions. Thus, the CLPM fails to account for the fact that: (i) the time-intervals between measurement occasions may be unequal, and (ii) the variables measured continue to exist and influence each other in-between measurement occasions. Although this may appear to be mainly theoretical or conceptual concerns, the DT nature of the CLPM has rather practical implications for researchers. As has been pointed out by numerous researchers (e.g., Gollob & Reichardt, Citation1987; Pelz & Lew, Citation1970; Voelkle & Oud, Citation2013), the estimated cross-lagged and autoregressive effect matrix Φ may vary as a function of the length of the time-interval, that is, the amount of time that elapses, between two consecutive measurement occasions: Δtm=tmtm1, where tm represents the exact time point at which measurement m is made. This means that two researchers studying the same phenomenon may come across completely different estimates of the same effects purely due to differences in the choice of which time-interval to use, even when they both have equal time-intervals between measurements within their studies.

To reflect the time-interval dependency of the CLPM, we can rewrite the DT effect matrix Φ as Φ(Δtm). Notably, in the case of unequal time-intervals, the estimated Φ matrix is a mix of Φ(Δtm) estimates for different time-intervals and thus most probably a biased estimate. Therefore, in this article, we will focus on applications of the CLPM in cases where there is a uniform time-interval of measurement, that is, Δtm=Δt for all m. Consequently, we will use Φ(Δt) throughout the article.

Continuous-Time Model (CTM) Equivalent of the CLPM

CTM overcomes the time-interval dependency problem by explicitly taking the time-interval dependent nature of the relationships between variables at different measurement occasions into account. Conceptually, the CTM assumes that the processes of interest take on values and, moreover, influence each other at every moment in time, not only on the occasions at which the researcher measures them. For a substantive rationale for using this type of model in a psychology or social science setting, we refer readers to Boker (Citation2002).

The CTM of interest in this case is the first-order stochastic differential equation model also referred to as the multivariate Ornstein-Uhlenbeck process, that is,

(2) dy(t)dt=Ay(t)+Ωdw(t)dt(2)

where the q×q matrix A, referred to as the drift matrix, contains the standardized parameters which relate the values of the standardized variables in y at a particular time t to the derivative (i.e., rate of change) in y(t) with respect to time (Oravecz, Tuerlinckx, & Vandekerckhove, Citation2011; Oud & Delsing, Citation2010b; Voelkle & Oud, Citation2013). The elements of A are denoted αjk, representing the standardized effect of variable j on the rate of change in variable k for j,k=1,,q. In case of a stable and therefore stationary process, the eigenvalues of A are negative. The last term on the right-hand side of Equation 2 is the continuous-time version of a random error process that is independent and identically distributed, but may be contemporaneously correlated. For a detailed discussion, see among others Oud and Delsing (Citation2010b) and, for details on how this model can be estimated in a SEM-framework, see e.g., Boker (Citation2002); Oud and Delsing (Citation2010b).

This CTM can be considered the continuous-time analoge of a DT VAR(1) process, that is, the CLPM shown in Equation 1. However, the differential equation at the heart of the CTM approach represents quite a different approach to modeling longitudinal processes than the CLPM. The effect matrix A at the core of the model, shown in Equation 2, does not represent the relationships between variables at different measurement occasions, but rather the relationship between the values of the variables y at time t and their instantaneous rate of change. In other words, the drift matrix represents how we would expect the values of the variables y to change over a very short time-interval after time t, given the values of those variables at time t. The drift matrix controls the strength and the direction of the centralizing tendency or dampening force (Oravecz & Tuerlinckx, Citation2011; Oravecz et al., Citation2009): the tendency to stay near the average position of the process (i.e., µ), which is zero in case of standardized variables. For more details regarding the interpretation of the drift matrix, see Ryan, Kuiper, and Hamaker (Citationin press).

Relating the CTM and the Discrete-Time Model (CLPM)

Although the DT CLPM and the CTM discussed above represent distinct ways of modeling longitudinal processes, they are related to each other. It can be shown that the DT effect matrix Φ(Δt) is actually a function of the drift matrix A:

(3) Φ(Δt)=eAΔt(3)

where Φ(Δt), A and Δt are as defined above, and e is the matrix exponential (if q > 1). Equality 3 will be key to the ideas and tools presented in this article, as well as the proofs supporting them. For an accessible introduction to the matrix exponential and the logic behind this equality, see Voelkle et al. (Citation2012).

Note that all CTMs have a unique DT-model equivalent; however, not all DT models have a unique CTM equivalent. Furthermore, when the DT process is not smooth and differentiable (Boker & Nesselroade, Citation2002), there is no CTM equivalent at all. An example is a univariate, first-order, autoregressive process with a negative autoregressive parameter, because an exponential is always positive.Footnote2 As such, we will refer to DT models with a CTM equivalent as ones that exhibit ‘positive autoregression’. Note that such ‘positive autoregression’, first-order processes are often plausible when analyzing psychological processes (cf. Koval & Kuppens, Citation2012; Oravecz & Tuerlinckx, Citation2011). In some cases, there may be many CTMs which generate the same DT-model effect matrix Φ(Δt). This occurs when Φ(Δt) has complex eigenvalues, meaning there are multiple periodic-equivalent or aliasing drift matrices A (cf. Hamerle, Nagl, & Singer, Citation1991; Yue, Thunberg, & Goncalves, Citation2016). In practice, this means that we cannot solve uniquely for A given Φ(Δt) with complex eigenvalues. Notably, the converse is possible, that is, one can solve uniquely for Φ(Δt) given an A with complex eigenvalues.

We present here some properties of the exponential relationship which will be used in the proofs found in later sections of the article. We make use of an eigenvalue decomposition of the effect matrix A, that is,

(4) A=VDAV1(4)

where V is a matrix of eigenvectors and DA is a diagonal matrix containing the eigenvalues of A. For more details on this, see Appendix A. The decomposition in (4) is important as it allows us to more explicitly define how we can solve for A and Φ(Δt) given one or the other, as will be discussed next, which is helpful in comparing effect estimates. Appendix B gives the accompanying proofs, as well as details on how to calculate the expressions in the statistical programming language R (R Core Team, Citation2013).

To find Φ(Δt) given A we can use,

(5) Φ(Δt)=eAΔt=VeDAΔtV1(5)

where eDAΔt(=DΦ(Δt)) represents the diagonal matrix containing the scalar exponentials of the eigenvalues of A multiplied by the scalar Δt. We can use this to relate any two Φ(Δt) matrices which use different uniform time-intervals, say, Φ(d1) and Φ(d2), with the equality,

(6) Φ(d2)=(Φ(d1))d2d1.(6)

Furthermore, we can solve for A given any Φ(Δt) by using,

(7) A=V(log(DΦ(Dt))/Δt)V1,(7)

where DΦ(Dt) is the diagonal matrix containing the eigenvalues of Φ(Δt). It is important to note that this expression should only be used when Φ(Δt) does not have complex eigenvalues.

With regard to the comparison of drift matrices A between researchers, different estimates can result from the same process or even the same data if researchers choose a different time-scale. That is, the estimate of A depends on the unit of time we choose for Δt (e.g., half-hours vs. hours or weeks vs. months). However, assuming each researcher is capturing the same underlying dynamics, no contradictory conclusions will result: see Appendix B for more details, including a useful transformation to use in case of different time-scales.

COMPARING RESULTS BASED ON DIFFERENT UNIFORM TIME-INTERVALS

Until now we have seen that the parameter estimates of the CLPM vary as a function of the time-interval between measurements, and introduced the alternative CTM that explains this phenomenon. We now wish to establish in what circumstances this time-interval dependency leads to misleading or inconsistent substantive conclusions. We first distinguish three types of substantive conclusions researchers may be interested in. Subsequently, we examine whether inconsistencies in these conclusions occur due to the use of different uniform time-intervals, in the context of bivariate and higher-variate CLPMs.

Three Types of Substantive Conclusions in Terms of Standardized Model Parameters

Here, we define three main types of conclusions which researchers are generally interested in, and which are based on the relative size or sign (positive vs. negative) of parameters of the CLPM. For each of these types, we end with an example of a misleading result in terms of matrix elements (p,s) (i.e., ϕps(Δt)) which can occur using two different uniform time-intervals, namely for Δt=d1 and d2 (both finite and positive). Notably, we express these inequalities in terms of the population parameters, and not estimates. The reason for this is that we wish to make clear that these are misleading conclusions which may occur even in scenarios in which the parameter estimates are highly precise.

I. Order of dominance of the standardized autoregression effects.

Although researchers are generally more interested in the cross-lagged effects, they may also be interested in determining which of the processes under investigation is most stable by comparing the size of the corresponding standardized autoregressive effects. For instance, Koval and Kuppens (Citation2012) study the autoregressive relationship of emotional inertia, and individual differences therein.

A misleading result is obtained if, for example,

ϕpp(d1) > ϕss(d1) ϕpp(d2) < ϕss(d2),forp  s.

II. Order of dominance of the cross-lagged effects.

Researchers are generally interested in whether the standardized cross-lagged effect of y1 on y2 is greater than the standardized cross-lagged effect of y2 on y1, or vice versa. For instance, Nohe, Meier, Sonntag, and Michel (Citation2015) conduct a meta-analysis in which they compare the lagged effect of work-family conflict on strain to the lagged effect of strain on work-family conflict. As another example, in a sample of persons diagnosed with burnout, Schuurman, Ferrer, De Boer-Sonnenschein, and Hamaker (Citation2016) investigated whether the effect of competence at occasion m1 on exhaustion at occasion m was stronger or weaker than the corresponding cross-lagged effect of exhaustion on competence.

A misleading result is obtained if, for example, for ‘corresponding’ cross-lagged effects,

(a) ϕps(d1) > ϕsp(d1) & ϕps(d2) < ϕsp(d2),for p  s

In the general multivariate case, we may also be interested in comparing any cross-lagged parameters we estimate. In this case, a misleading result is obtained if:

(b) ϕps(d1)>ϕuv(d1) & ϕps(d2)<ϕuv(d2),forpsanduv

III. Sign of the standardized cross-lagged effect.

Researchers are also generally interested in whether, for example, the cross-lagged effect of y1,m1 (e.g., physical well-being) on y2,m (e.g., social well-being) is positive or negative. For example, Schuurman et al. (Citation2016) inspect and interpret the sign of the cross-lagged effects before comparing the ordering.

A misleading result is obtained if, for example,

ϕps(d1) > 0 & ϕps(d2) < 0, for p  s.

These are three types of conclusions, regarding the ordering and sign of the lagged effects, which are commonly of interest to substantive researchers. The interested reader is referred to the Misleading Results section where we, among others, discuss other possible substantive conclusions researchers might wish to make.

No Misleading Conclusions: Bivariate, Stable, Nonoscillating, First-Order Systems

We first inspect the case of bivariate CLPMs, that is, bivariate first-order systems, see for an example. Let us assume, without loss of generalization, the following values for the 2×2 eigenvector-matrix V and the 2×2 eigenvalue-diagonal-matrix DΦ(d1) of Φ(Δt=d1):

V=v11v12v21v22
DΦ(d1)=λ100λ2,

with λ1 and λ2 the eigenvalues of Φ(Δt=d1).Footnote3 Consequently, V1 is given by:

V1=1|V|v22v12v21v11,where|V|=v11v22v12v21,

and Φ(d1) by

Φ(d1)=1Vv11λ1v22v12v21λ2(λ1λ2)v11v12(λ1λ2)v21v22v12v21λ1+v11v22λ2.

From Equation 6, using n=d2/d1 for ease of notation, it follows that:

Φ(d2)=(Φ(d1))n       =V(DΦ(d1))nV1
=1Vv11λ1nv22v12v21λ2n(λ1nλ2n)v11v12(λ1nλ2n)v21v22v12v21λ1n+v11v22λ2n,

with

(DΦ(d1))n=λ1n00λ2n.

We can assess whether it is possible for misleading conclusions I to III to occur in the bivariate case by expressing the associated parameter inequalities in terms of eigenvalues and eigenvectors. Namely, all three types of misleading results are obtained when the sign of (λ1λ2) differs from that of (λ1nλ2n).

Thus, to assess whether these types of misleading conclusions can occur in bivariate, first-order systems based on the use of a different uniform time-interval, it suffices to assess whether there exists an λ1, λ2 and n such that λ1 > λ2 and λ1n < λ2n or vice versa. For a stable, ‘positive autoregression’, first-order process with real eigenvalues, the eigenvalues λ1 and λ2 of Φ(d1) lie between 0 and 1. In that case, there exist no n such that the sign of (λ1λ2) differs from that of (λ1nλ2n). Therefore, misleading conclusions I to III cannot occur when inspecting bivariate, stable, ‘positive autoregression’, nonoscillating, first-order systems employing different uniform time-intervals. Notably, the conclusions one would draw from the continuous-time effect matrix A in a bivariate, stable, nonoscillating, first-order setting are the same as those one would draw from DT model at any uniform time-interval, which is shown in Appendix C.

In sum, researchers will not come to misleading or inconsistent conclusions I to III in the case of bivariate, stable, first-order systems with real eigenvalues due to the use of a different uniform time-interval; that is, it is not possible to: (i) obtain a different sign in cross-lagged effects; (ii) find a different dominant cross-lagged effect; or (iii) find a different dominant autoregression effect. We will illustrate this next with an example.

Example

We will use the artificial example of Voelkle et al. (Citation2012): Suppose, we are examining two variables (q=2), child’s health and maternal health. We are interested in the effect of child’s health (y1,m1) on maternal health one time period later (y2,m), as well as the effect maternal health (y2,m1) has on child health one time period later (y1,m), that is, ϕ12 and ϕ21 in . Assume that the underlying drift matrix is given as:

A=0.44720.23200.04350.1170.

Now, take that we collect data generated by this underlying continuous-time process at time-intervals of Δt=1 (e.g., months). Estimating the DT standardized effect matrix (asymptotically) yields:

Φ(Δt=1)=0.640.180.030.89.

Examining this effect matrix, we could make the following three conclusions: (i) maternal health is more stable, that is, has a stronger standardized autoregressive effect, than child’s health, that is, ϕ22=0.89 > ϕ11=0.64; (ii) the standardized effect of maternal health on child’s health is greater than the effect of child’s health on maternal health, that is, ϕ12=0.18 > ϕ21=0.03; (iii) both standardized cross-lagged effects are positive, that is, ϕ12=0.18 > 0 & ϕ21=0.03 > 0.

Now, suppose that we had chosen a time-interval for data collection that was twice as long, that is, Δt=2 (e.g., bi-monthly). In that case, we would have (asymptotically) come to the following DT standardized effect matrix:

Φ(Δt=2)=0.420.270.050.80.

We can see that, although the standardized estimates for each cross-lagged and autoregressive effects change with this new time-interval, the conclusions drawn from this effect matrix are same as for those drawn from the matrix with the shorter time-interval: (i) ϕ22=0.80 > ϕ11=0.42; (ii) ϕ12=0.27 > ϕ21=0.05; and (iii) ϕ12=0.27 > 0 & ϕ21=0.05 > 0.

Using Equation 5, we can derive the DT standardized effect matrix for any time-interval. This can be plotted, as done in , and is referred to as Φ(Δt) plot in this article. The Φ(Δt) in shows that, for all time-intervals, one will not find misleading conclusions: (i) & (ii) the autoregressive and cross-lagged effects do not cross each other, which implies no difference in order of dominance; and (iii) the cross-lagged effects do not cross the zero-line at any time-interval, which implies the same sign over the whole time range.

FIGURE 2 Φ(Δt)plot: A bivariate example of how the parameter values (i.e., the elements in Φ(Δt)) change as a function of the time-interval Δt.

FIGURE 2 Φ(Δt)plot: A bivariate example of how the parameter values (i.e., the elements in Φ(Δt)) change as a function of the time-interval Δt.

Furthermore, we can see that we reach the same conclusions when examining only the drift matrix A: (i) maternal health (y2) is more stable, that is, has an autoregressive effect closer to zero, than child’s health, that is, α22=0.1170 < α11=0.4472; because the diagonal elements will often be negative, we can also state: α22=0.1170 > α11=0.4472; (ii) the effect of maternal health on child’s health are greater than the effect of child’s health on maternal health, that is, α12=0.2320 > α21=0.0435; (iii) both cross-lagged effects are positive, that is, α12=0.2320 > 0 & α21=0.0435 > 0.

In the next subsection, we will discuss that equivalence of conclusions I to II drawn in the context of different uniform time-intervals of measurement does not hold in general when we are dealing with other types of bivariate systems or systems of more than two variables.

Misleading Conclusions

Although the substantive conclusions I to III are robust for using different uniform time-intervals in DT, first-order models that are bivariate, stable, and nonoscillating, this serves as the exception that proves the rule. To the best of our knowledge, the only equivalence that does exist in general is in the case of symmetric effects matrices, implying equal ‘corresponding’ cross-lagged effects, which is shown in Appendix D. Namely, if one finds a symmetric relationship, this will also be symmetric with every other choice of time-interval and model. Consequently, ϕps=ϕsp (and also αps=αsp) for all s,p=1,,q and thus misleading conclusion IIa will not occur. Nevertheless, misleading conclusions I, IIb, and III are still possible when modeling a system of more than two variables. Moreover, in case of a nonsymmetric effects matrix, even all substantive conclusions can be misleading, that is, I, IIa, IIb, and III. Multiple examples of misleading results in trivariate, first-order systems can be found in, for instance, Oud (Citation2002, pp. 13–15) and Reichardt (Citation2011).

In sum, except for DT, first-order models that are bivariate, stable, and nonoscillating misleading results can arise, also for the other bivariate, first-order systems. The remaining systems can be classified as follows:

  • bi- and higher-variate, first-order systems with complex eigenvalues (both stable and unstable);

  • bi- and higher-variate, unstable, first-order systems with real eigenvalues;

  • tri- and higher-variate, stable, first-order systems with real eigenvalues.

Next, we will illustrate that misleading results can arise in the above classes of systems, by giving a (counter-)example. Additionally, we will illustrate that misleading results can arise when inspecting other types of conclusions, already in case of bivariate, stable, nonoscillating, first-order systems.

Bivariate, first-order systems with complex eigenvalues counter-example

In case of bivariate, first-order systems with complex eigenvalues, researchers can draw misleading conclusions I to III from Φ(Δt) depending on the time-interval chosen. In Appendix E we show that it is possible to specify ranges of Δt for which the sign and order of dominance (of autoregressive and of cross-lagged parameters) are invariant. Furthermore, we show that for every other range the substantive conclusions I to III made from Φ(Δt) match those made from the drift matrix. displays the Φ(Δt) plot for a system with

A=1.790.793.930.21.

FIGURE 3 The Φ(Δt) plot for a bivariate Φ(Δt) with complex eigenvalues, where conclusions I to III may differ per time-interval Δt.

FIGURE 3 The Φ(Δt) plot for a bivariate Φ(Δt) with complex eigenvalues, where conclusions I to III may differ per time-interval Δt.

This drift matrix reflects an oscillating system with eigenvalues 1±ι0.5π and eigenvectors (1,1±ι2). From the Φ(Δt) plot, one can see that for Δt between 0 and 2 and between 4 and 6: (i) ϕ11 < ϕ22; (ii) ϕ12 < ϕ21; and (iii) ϕ12 > 0 & ϕ21 < 0. Thus, for time-intervals Δt between 0 and 2 and between 4 and 6, substantive conclusions I to III are not misleading. From the Φ(Δt) plot, one can also see that for Δt between 2 and 4: (i) ϕ11 < ϕ22; (ii) ϕ12 > ϕ21; and (iii) ϕ12 < 0 & ϕ21 < 0. Thus, for time-intervals Δt between 2 and 4, substantive conclusions I to III are not misleading. Note that the results for these ranges differ. Hence, for some combinations of Δt values, one will obtain misleading conclusions I to III. Additionally, from the drift matrix, one concludes that: (i) α11=1.79 > α22|=0.21; because the diagonal elements are negative, we can also state: α11=1.79 < α22=0.21; (ii) α12=0.79 < α21=3.93; (iii) α12=0.79 > 0 & α21=3.93 < 0. Thus, for time-intervals Δt between 0 and 2 and between 4 and 6, substantive conclusions I to III are not misleading when comparing it to the ones based on A. However, for time-intervals Δt between 2 and 4, conclusions I to III are misleading when comparing it to the ones based on A. Be aware that, when we are dealing with processes with complex eigenvalues, one needs to estimate the CTM directly, rather than apply the transformation in Equation 13. That is to say, given the nonunique solutions when solving for A from a Φ(Δt) with complex eigenvalues (e.g., using Equation 13), even the various possible (i.e., aliasing) drift matrices may lead to different conclusions regarding the signs and orderings of dominance. This can, for example, be seen from the aliasing drift matrices found in Hamerle et al. (Citation1991, pp. 203, 209).

In sum, in case of bi- or higher-variate systems with complex eigenvalues, conclusions I to III based on DT-model effect estimates may be contradictory depending on the time-interval of observation.

Bivariate, unstable, first-order systems with real eigenvalues counter-example

When the bi- or higher-variate, first-order process with real eigenvalues is not stable, conclusions I to III can be contradictory across different time-intervals. A first-order process is unstable if at least one eigenvalue has a positive real part. In the Φ(Δt) plot is depicted for a bivariate, first-order process with one positive and one negative eigenvalue. Here, we see that the ordering of effects changes for Δt > 2.

FIGURE 4 The Φ(Δt) plot for a unstable, bivariate Φ(Δt) with one negative and one positive real eigenvalue, where conclusions I to III may differ per time-interval Δt.

FIGURE 4 The Φ(Δt) plot for a unstable, bivariate Φ(Δt) with one negative and one positive real eigenvalue, where conclusions I to III may differ per time-interval Δt.

Trivariate, stable, first-order systems with real eigenvalues counter-example

Let us consider the following 3×3 drift matrix:

A=0.390.010.000.320.350.041.461.180.83,

where all three eigenvalues are real and negative: −0.67, −0.55, and −0.35, thus reflecting a stable and therefore stationary process with real eigenvalues. In , one can see how the elements in Φ(Δt) change as a function of Δt. When inspecting the autoregressive effects, one can see that the ordering for ϕ11 and ϕ22 changes as a function of the time-interval Δt, that is, for small time-intervals, ϕ11 is lower and, for longer time-intervals (approximately Δt > 2.5), it is higher than ϕ22. Hence, we see misleading conclusion I.

FIGURE 5 The Φ(Δt) plot for a trivariate Φ(Δt), with change in dominance and sign, that is, where conclusions I to III may differ per time-interval Δt.

FIGURE 5 The Φ(Δt) plot for a trivariate Φ(Δt), with change in dominance and sign, that is, where conclusions I to III may differ per time-interval Δt.

When inspecting the cross-lagged effects, one can see that the ordering for ϕ21, ϕ31, and ϕ32 changes depending on the employed time-interval. At first, the ordering from high to low is: ϕ32, ϕ21, and ϕ31, Then, just before Δt=6, the order of ϕ32 and ϕ21 changes. Furthermore, at approximately Δt=7 and Δt=9, the ϕ31-line crosses that of ϕ32 and ϕ21, respectively. Hence, we see misleading conclusion IIb. Consequently, the conclusions about predictive strength depends on the choice of time-interval. Notably, we expect that one mostly wants to compare the counterparts of each cross-lagged effect, as in conclusion type IIa. We can see that ϕ31 (green dotted line) crosses ϕ13 (black dotted line). Hence, we see also specifically misleading conclusion IIa.

Furthermore, we also see that for small time-intervals ϕ31 is negative; whereas, for larger time-intervals (higher than 5) the sign of this effect is positive.Footnote4 Hence, we see misleading conclusion III. Notably, this change in sign is because each lagged-effect parameter is a nonlinear function both of the time-interval Δt and every other element in the underlying drift matrix A. In terms of interpretation, Deboeck and Preacher (Citation2016) have suggested for this reason that, in a mediation context, the elements of ϕ(Δt) should be interpreted as total effects rather than direct effects. For a further discussion of this issue in the case of more-than-bivariate CT models, see Aalen, Røysland, Gran, and Ledergerber (Citation2012).

In sum, in this example, we can generate misleading conclusions I to III, based on a single tri-variate, stable, first-order process with real eigenvalues.

Other types of conclusions counter-example

Bear in mind that we have dealt with specific types of conclusions that are commonly of interest. One may, for example, also be interested in whether the lagged effect of a variable on itself is greater or smaller than the lagged effect of this variable on another variable. Such a conclusion is time-interval dependent, even in the bivariate case with real, negative eigenvalues. For example, in one can see that at first, ϕ11 > ϕ12, but after the time-interval where the black lines cross the ordering reverses (i.e., ϕ11 < ϕ12). Thus, some conclusions might be misleading in a bivariate, stable, nonoscillating, first-order system.

Moreover, the conclusions we focus on address comparisons of (cross-)lagged effects with each other (inspecting the ordering of dominance) or with zero (inspecting the sign) and do not address significance testing. Testing whether an autoregressive or cross-lagged parameter is different from zero depends on the chosen time-interval, but not on the time-scale. Hence, when the goal is significance testing, one should definitely use the CTM. Notably, the significance testing result depends on a number of other factors, like sample size. It is possible that in a realistic scenario, further misleading conclusions will occur due to insufficiently precise parameter estimates, or high uncertainty regarding these estimates.

SUMMARY AND RECOMMENDATIONS

When studying time-varying phenomena, progress in any substantive area of interest is dependent on researchers being able to compare the effects estimated in different studies. We showed, in the case of the common CLPM – a DT, first-order model – how results based on other time-intervals, —scales, or from different types of models (i.e., DT vs. CT) can be made comparable and facilitated this with existing R functions.

Additionally, we have shown the exact circumstances under which drawing three common types of conclusions regrading ordering and sign of lagged effects from a CLPM for a particular uniform time-interval is unproblematic. We have shown that this only holds for bivariate, stable, nonoscillating, first-order systems. That is, it is not possible to: (i) find a different dominant autoregression effect; (ii) find a different dominant cross-lagged effect; or (iii) obtain a different sign in cross-lagged effects for such processes. Furthermore, we showed that under these constraints, researchers will draw the same conclusions (regarding types I to III) when they interpret the first-order continuous-time model effect matrix directly rather than the first-order DT effect matrix for a particular interval.

It should be stressed that the conditions for the three ‘common conclusions’ to be not misleading are strict, namely within the first-order systems only the bivariate, stable processes with real eigenvalues. Consequently, in the general case, it might be best advised to inspect the CTM parameter, that is, the drift matrix; either by transforming the results of a DT model or estimating the CTM directly. Namely, the drift matrix A allows for straightforward comparisons between the results of studies which use different time-intervals of measurement, because it represents a time-interval invariant way of expressing the sign and strength of influence between two processes. When comparing two or more studies via the CTM parameter estimate A, one only needs to ensure that you correct for the use of different time-scales (e.g., months vs. weeks), as outlined in Appendix B.

DISCUSSION

A restriction needed for the comparison of two CLPM effect matrices is that within each study all the time-intervals are equally spaced. Note that this is to ensure that the DT model is not a (biased) mixture of parameter estimates for different time-intervals. In certain contexts (e.g., in large-scale panel surveys) this is a more realistic assumption than in others (e.g., in Experience Sampling Method studies). One practical solution to the problem of unequally spaced time-intervals in the estimation of the CLPM is to insert missing data such that the data have (approximately) equal time-intervals, that is, are (approximately) equally-spaced in time. Such a practical solution could thus allow researchers to use the CLPM and compare CLPM results from studies in which time-intervals are (made) approximately equal. We note, however, that this equally-spaced data assumption of the CLPM does not imply that all researchers should collect equally-spaced data. To the contrary, it might be advantageous to use unequal time-intervals to ensure that we have a chance to observe or better estimate oscillating patterns (Voelkle & Oud, Citation2013, Section 1.2). Additionally, one should then apply the CTM to the unequally spaced data and draw conclusions from the resulting drift matrix.

In this article, we restrict ourselves to first-order processes which display ‘positive autoregression’, that is, systems which can be modelled with first-order CTMs, as specified in this article. That is, first-order DT processes with real, negative eigenvalues cannot be represented by a first-order CTM like specified in Equation 2.

We showed that, only in bivariate, stable, first-order systems in which the eigenvalues of the effect matrix are real, researchers using different uniform time-intervals will not in general come to contradictory conclusions regarding the sign and relative ordering of effects. This can be interesting when, for example, studying one phenomenon in dyads with an observational study such that the intervals are (approximately) equal. However, this finding should not be taken as an argument for examining only bivariate (stable, first-order) relationships. Namely, when a system is truly trivariate and when omitting a relevant variable, the results will be affected accordingly (i.e., the resulting bivariate autoregression matrix will not be a subset of the one obtained by the trivariate model, see Kuiper & Hamaker, Citationn.d.). Thus, to make reliable conclusions, it is necessary to include all relevant variables in your model.

From our results, one can conclude that in the general case, researchers should draw their substantive conclusions on the basis of the drift matrix, rather than the DT effect matrix at a particular time-interval. This can be done in two ways; by indirect or direct estimation of the CTM. The indirect method of CTM estimation consists of first estimating the CLPM and then solving for the underlying drift matrix. However, this method can only be applied under the constraint that the true drift matrix has real eigenvalues (cf. Hamerle et al., Citation1991; Yue et al., Citation2016). Notably, in case the drift matrix (estimate) has complex eigenvalues, the drift matrix cannot always be uniquely solved for by the DT effect matrix for a particular interval. Additionally, transformations of Φ(Δt) for different time-intervals then are only uniquely defined for multiples of the chosen time-interval Δt. Thus, if there is cyclic behavior, direct estimation of the CTM may be more appropriate. One may also wish then to estimate a second-order rather than a first-order differential equation model. Direct estimation of the CTM has numerous other advantages; for example, multilevel CTMs can account for having different time-intervals between measurement both within- and between- participants, as is common in Experience Sampling Method data.Footnote5

To conclude, we have shown the exact circumstances under which drawing three common types of conclusions regarding the ordering and sign of lagged effects from a CLPM for a particular uniform time-interval is unproblematic. We have shown that this only holds for bivariate, stable, first-order systems in which the eigenvalues of the effect matrix are real. In the more general case, we have shown that taking a CT modeling approach leads to more clarity and describes more accurately (in a time-interval invariant way) how dynamic processes relate to one another.

Additional information

Funding

The work of each of these authors was supported by grants from the Netherlands Organisation for Scientific Research (NWO; Ryan, Onderzoekstalent Grant 406-15-128; Kuiper, VENI Grant 451-16-019).

Notes

1 Various assumptions about the underlying causal process and model are needed for reliable interpretation of these effects as representing causal quantities. These will not be detailed or critically evaluated in the current article.

2 When using another specification of the CTM, it can model first-order DT processes with negative eigenvalues: Fisher (Citation2001) demonstrates this with the use of two continuous-time Ito processes.

3 Often the eigenvectors are chosen such that the sum of the squared elements are 1; e.g., v112+v212=1.

4 Oscillating patterns in our first-order processes of interest occur in the case of complex eigenvalues for two or more variables, and can also occur in case of real eigenvalues when inspecting three or more variables, like in our example. When looking at a vector field (see Strogatz, Citation1994), one can see the differences between these two scenarios; in case of complex eigenvalues, the trajectory is like a spiral.

5 One can find specifications of multilevel CTMs in, for instance, Oravecz and Tuerlinckx (Citation2011) and Voelkle and Oud (Citation2013). Various software packages exist to estimate CTMs, for example, the Bayesian Hierarchical Ornstein-Uhlenbeck Model (BHOUM) toolbox package (Oravecz, Citation2014); CT-SEM (Driver, Voelkle, & Oud, Citation2017), an R package based on OpenMx and STAN, for modeling continuous-time appropriately; and generalized local linear approximation (GLLA) via OpenMx (Boker, Deboeck, Edler, & Keel, Citation2010).

6 There are multiple methods which can be used to calculate the matrix exponential, also when using the expm package. A more detailed description on these types of methods can be found in Moler and Van Loan (Citation2013).

References

  • Aalen, O., Røysland, K., Gran, J., & Ledergerber, B. (2012). Causality, mediation and time: A dynamic viewpoint. Journal of the Royal Statistical Society: Series A (Statistics in Society), 175(4), 831–861. doi: 10.1111/rssa.2012.175.issue-4
  • Bentler, P. M., & Speckart, G. (1981). Attitudes “cause” behaviors: A structrual equation analysis. Journal of Personality and Social Psychology, 40, 226–238. doi: 10.1037/0022-3514.40.2.226
  • Boker, S. M. (2002). Consequences of continuity: The hunt for intrinsic properties within parameters of dynamics in psychological processes. Multivariate Behavioral Research, 37(3), 405–422. doi: 10.1207/S15327906MBR3703_5
  • Boker, S. M., Deboeck, P., Edler, C., & Keel, P. (2010). Generalized local linear approximation of derivatives from time series. In S. Chow, & E. Ferrar (Eds.), Statistical methods for modeling human dynamics: An interdisciplinary dialogue (pp. 179–212). Boca Raton, FL: Taylor & Francis.
  • Boker, S. M., Neale, M., & Rausch, J. (2004). Latent differential equation modeling with multivariate multi-occasion indicators. In O. H. Van Montfort, & A. Satorra (Eds.), Recent developments on structural equation models (pp. 151–174). Dordrecht, the Netherlands: Kluwer Academic.
  • Boker, S. M., & Nesselroade, J. R. (2002). A method for modeling the intrinsic dynamics of intraindividual variability: Recovering parameters of simulated oscillators in multi-wave panel data. Multivariate Behavioral Research, 37, 127–160. doi: 10.1207/S15327906MBR3701_06
  • Bolger, N., & Laurenceau, J.-P. (2013). Intensive longitudinal methods: An introduction to diary and experience sampling research. New York, NY: The Guilford Press.
  • Bringmann, L. F., Vissers, N., Wichers, M., Geschwind, N., Kuppens, P., Peeters, F., … Tuerlinckx, F. (2013). A network approach to psychopathology: New insights into clinical longitudinal data. PLoS ONE, 8, e60188, 1–13. doi: 10.1371/journal.pone.0060188
  • Chow, S., Ferrer, E., & Hsieh, F. (2011). Statistical methods for modeling human dynamics: An interdisciplinary dialogue. New York, NY: Routledge.
  • Chow, S., Ram, N., Boker, S., Fujita, F., Clore, G., & Nesselroade, J. (2005). Capturing weekly fluctuation in emotion using a latent differential structural approach. Emotion, 5(2), 208–225. doi: 10.1037/1528-3542.5.2.208
  • Deary, I. J., Allerhand, M., & Der, G. (2009). Smarter in middle age, faster in old age: A cross-lagged panel analysis of reaction time and cognitive ability over 13 years in the West of Scotland Twenty-07 Study. Psychology and Aging, 24, 40–47. doi: 10.1037/a0014442
  • Deboeck, P. R., & Preacher, K. J. (2016). No need to be discrete: A method for continuous time mediation analysis. Structural Equation Modeling: A Multidisciplinary Journal, 23(1), 61–75. doi: 10.1080/10705511.2014.973960
  • Driver, C., Voelkle, M., & Oud, H. (2017). ctsem: Continuous time structural equation modelling [Computer software manual]. (R package version 2.4.0). Retrieved from https://cran.r-project.org/web/packages/ctsem/vignettes/hierarchical.pdf
  • Finkel, S. E. (1995). Causal analysis with panel data. Thousand Oaks, CA: Sage.
  • Fisher, M. (2001). Modeling negative autoregression in continuous time. (http://www.markfisher.net/mefisher/papers/continuousar.pdf)
  • Gollob, H. F., & Reichardt, C. S. (1987). Taking account of time lags in causal models. Child Development, 58(1), 80–92. doi: 10.2307/1130293
  • Granger, C. W. J. (1969). Investigating causal relations by econometric models and cross-spectral methods. Econometrica, 37, 424–438. doi: 10.2307/1912791
  • Hamaker, E. L., Dolan, C. V., & Molenaar, P. C. M. (2005). Statistical modeling of the individual: Rationale and application of multivariate time series analysis. Multivariate Behavioral Research, 40(2), 207–233. doi: 10.1207/s15327906mbr4002_3
  • Hamaker, E. L., Kuiper, R. M., & Grasman, R. P. P. P. (2015). A critique of the cross-lagged panel model. Psychological Methods, 20(1), 102–116. doi: 10.1037/a0038889
  • Hamerle, A., Nagl, W., & Singer, H. (1991). Problems with the estimation of stochastic differential equations using structural equations model. Journal of Mathematical Sociology, 16(3), 201–220. doi: 10.1080/0022250X.1991.9990088
  • Hamilton, J. D. (1994). Time series analysis. Princeton, NJ: Princeton University Press.
  • Koval, P., & Kuppens, P. (2012). Changing emotion dynamics: Individual differences in the effect of anticipatory social stress on emotional inertia. Emotion, 12, 256–267. doi: 10.1037/a0024756
  • Kuiper, R. M., & Hamaker, E. L. (n.d.). Effect on residuals in time-lagged relationships due to omitted variables: It is time to go continuously.
  • Maxwell, S. E., Cole, D. A., & Mitchell, M. A. (2011). Cross-sectional analyses of longitudinal mediation: Partial and complete mediation under an autoregressive model. Multivariate Behavioral Research, 46(5), 816–841. doi: 10.1080/00273171.2011.606716
  • Moler, C., & Van Loan, C. (2013). Nineteen dubious ways to compute the exponential of a matrix, twenty-five years later. SIAM Review, 45(1), 3–49. doi: 10.1137/S00361445024180
  • Nohe, C., Meier, L. L., Sonntag, K., & Michel, A. (2015). The chicken or the egg? a meta-analysis of panel studies of the relationship between work–Family conflict and strain. Journal of Applied Psychology, 100(2), 522–536. doi: 10.1037/a0038012
  • Oravecz, Z. (2014). Bayesian ornstein-uhlenbeck model (boum) toolbox package. Retrieved from https://sites.psu.edu/zitaoravecz/bayesian-ornstein-uhlenbeck-model/
  • Oravecz, Z., & Tuerlinckx, F. (2011). The linear mixed model and the hierarchical ornsteinuhlenbeck model: Some equivalences and differences. British Journal of Mathematical and Statistical Psychology, 64, 134–160. doi: 10.1348/000711010X498621
  • Oravecz, Z., Tuerlinckx, F., & Vandekerckhove, J. (2009). A hierarchical OrnsteinUhlenbeck model for continuous repeated measurement data. Psychometrika, 74(3), 395–418. doi: 10.1007/S11336-008-9106-8
  • Oravecz, Z., Tuerlinckx, F., & Vandekerckhove, J. (2011). A hierarchical latent stochastic differential equation model for affective dynamics. Psychological Methods, 16(4), 468–490. doi: 10.1037/a0024375
  • Oud, J. (2002). Continuous time modeling of the cross-lagged panel design. Kwantitatieve Methoden, 69, 1–27.
  • Oud, J., & Delsing, M. (2010b). Continuous time modeling of panel data by means of SEM. In K. van Montfort, J. H. L. Oud, A. Satorra (Eds.), Longitudinal research with latent variables. (pp. 201–244). Berlin, Heidelberg: Springer-Verlag.
  • Oud, J., & Delsing, M. J. M. H. (2010a). Continuous time modeling of panel data by means of SEM. In K. Van Montefort, J. Oud, & A. Satorra (Eds.), Longitudinal research with latent variables (pp. 201–244). New York, NY: Springer.
  • Pelz, D., & Lew, R. (1970). Sociological methodology. In E. Borgatta (Ed.), Heise’s causal model applied. San Francisco, CA: Jossey-Bass.
  • R Core Team. (2013). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Retrieved from http://www.R-project.org/
  • Reichardt, C. S. (2011). Commentary: Are three waves of data sufficient for assessing mediation? Multivariate Behavioral Research, 46, 842–851. doi: 10.1080/00273171.2011.606740
  • Rogosa, D. R. (1980). A critique of cross-lagged correlation. Psychological Bulletin, 88, 245–258. doi: 10.1037/0033-2909.88.2.245
  • Ryan, O., Kuiper, R. M., & Hamaker, E. L. (in press). Continuous time modeling in the behavioral and related sciences. In M. C. V. K. V. Montfort, & J. H. L. Oud (Ed.), A continuous time approach to intensive longitudinal data: What, why and how? New York, NY: Springer.
  • Schuurman, N. K., Ferrer, E., de Boer-Sonnenschein, M., & Hamaker, E. L. (2016). How to compare cross-lagged associations in a multilevel autoregressive model. Psychological Methods, 21(2), 206–221. doi: 10.1037/met0000062
  • Strogatz, S. H. (1994). Nonlinear dynamics and chaos: with applications to physics, biology, chemistry, and engineering. Reading, MA: Addison-Wesley.
  • Voelkle, M. C., & Oud, J. H. L. (2013). Continuous time modelling with individually varying time intervals for oscillating and non-oscillating processes. British Journal of Mathematical and Statistical Psychology, 66, 103–126. doi: 10.1111/j.2044-8317.2012.02043.x
  • Voelkle, M. C., Oud, J. H. L., Davidov, E., & Schmidt, P. (2012). An SEM approach to continuous time modeling of panel data: Relating authoritarianism and anomia. Psychological Methods, 17(2), 176–192. doi: 10.1037/a0027543
  • Yue, Z., Thunberg, J., & Goncalves, J. (2016). Inverse problems for matrix exponential in system identification: system aliasing. Arxiv Preprint arXiv:1605.06973.

APPENDIX A

Eigenvalue Decompositions

Based on the eigenvalue decomposition, a (diagonalizable) square matrix A can be written as:

(8) A=VDV1,(8)

where V is a (nonsingular) matrix containing the eigenvectors of A, and D is a diagonal matrix containing the (distinct) eigenvalues of A.

For A2, it holds true that:

(9) A2=(VDV1)(VDV1)=V(D)2V1,(9)

By induction, this yields:

(10) An=V(D)nV1,(10)

for positive reals n and for all reals n when none of the eigenvalues of A are zero. For the interested reader: in case of complex eigenvalues, there does not exist a unique solution when n is not an integer.

Using Equations 8 and 10, it holds true that:

(11) eAΔt=l=0(Δt)lAll!     =l=0(Δt)lV(D)lV1l!     =Vl=0(ΔtD)ll!V1     =VeDΔtV1.(11)

Notably, there are multiple methods which can be used to calculate the matrix exponential, a more detailed description on these types of methods can be found in Moler and Van Loan (Citation2013).

Note that the eigenvectors for A and eAΔt (= Φ(Δt)) are the same and that their eigenvalues are related by D versus eDΔt (= DΦ(Dt)), where the latter depends on the time-interval. Consequently, it holds that D=log(DΦ(Dt))/Δt, which is independent of the time-interval.

APPENDIX B

Details on Relating DT and CT

There is only one Φ belonging to one specific A, which can be calculated in R, when using the expm package,Footnote6 by:

Phi <- expm(A * delta_t)

A DT effect matrix using a time-interval of d1 can be transformed into one as if another time-interval, namely d2, was used, by employing:

Φ(d1)=eAd1,Φ(d2)=eAd2       =eAd1d2d1,whichyields:Φ(d2)=(Φ(d1))d2d1.

Thus, one needs to raise the effect to the power d2d1. In case d2d1 is an integer one can calculate this in R, when using the expm package, by:

Phi_d2 <- Phi_d1

In case the ratio is not integer, one should use the eigenvalue decomposition and take the power of the eigenvalues as stated in Appendix A; R code is available upon request.

An alternative method of comparing DT effects is to convert all of the relevant effects matrices to their underlying continuous-time drift matrix:

(13) A=log(Φ(Δt))/Δt =V(log(DΦ(Dt))/Δt)V1.(13)

In R, when using the expm package, this can established by:

A <- logm(Phi)/delta_t

Be aware that, also in case of complex eigenvalues, R will give you only one solution, although there are then multiple; more details are given in Appendix E. Therefore, one should always check whether the eigenvalues (of Φ or A) are real-valued or complex; for example, in R by:

eigen(A, sym = FALSE)$values

If one compares CT effect estimates (i.e., As), one should also make an adjustment. Bear in mind that the DT model parameters are time-interval dependent but time-scale independent, this implies that the values of the time-interval invariant A are time-scale dependent. For example, suppose a researcher uses years as the time-scale and that he obtained Φ(Δtyears=1)=eAyears1. When he would have set the time-scale to months, he would have obta-ined Φ(Δtmonths = 12) = eAmonths12. Since Φ(Δtyears=1) =Φ(Δtmonths=12), it holds that eAmonths12=eAyears1, yielding Ayears=12Amonths. Thus, although the CTM parameter A is time-interval invariant that makes comparisons of studies using different time-intervals easy and straightforward, its values are dependent on the time-scale. Hence, for good comparability of the values of multiple drift matrices one should do this easy, but necessary, transformation:

                     Atimescale2=TAtimescale1,withT=timescale2intermsoftimescale1timescale1 > 0,

with, in the example, T=1year1month=12months1month=12. Because of this linear relationship with positive constant (T), no misleading conclusions are possible for all uni- and multivariate processes when comparing the results of multiple CTMs (assuming the same underlying process). Stated otherwise, the drift matrix obtained for using some time-scale, Atimescale1, has no different order of magnitude and sign than the one obtained for another one, Atimescale2=TAtimescale1, with T > 0.

APPENDIX C

Examine the Drift Matrix A

Using the notation from the section starting on page 11 and following Equation 4, we can write:

A=1/d1|V|v11log(λ1)v22v12v21log(λ2)(log(λ1)log(λ2))v21v22(log(λ1)log(λ2))v11v12v12v21log(λ1)+v11v22log(λ2),

where log(λ1)/d1=:η1 and log(λ2)/d1=:η2 are the eigenvalues of A. Notably, time-interval d1 appears in the time-interval independent A and its eigenvalues, since the eigenvalues of Φ(d1) (i.e., λ1 and λ2) are functions of this: λ1,λ2=eη1,2d1 and λ1n,λ2n=eλ1,2d1n=eη1,2d2.

One can deduce, by looking at the elements (p,s) of both A and Φ(d1), that all three types of misleading results are obtained when the sign of (λ1λ2) differs from that of (η1η2)=(log(λ1)log(λ2))/d1 or better, (log(λ1)log(λ2)), since d1 > 0. Thus, to assess whether the three types of misleading conclusions can occur in bivariate, first-order systems based on the use of a different models (DT vs. CT), it suffices to assess whether there exists an λ1, λ2 such that, say, λ1 > λ2 and log(λ1) < log(λ2). Recall that λ1 and λ2 are the eigenvalues of Φ(d1). In case of stable, ‘positive autoregression’, first-order systems with real eigenvalues, these lie between 0 and 1. In that case, the sign of (λ1λ2) never differs from that of (log(λ1)log(λ2)). Therefore, misleading conclusions I to III cannot occur in bivariate, stable, first-order systems with real eigenvalues due to the use of a different model (i.e., DT vs. CT). Additionally, this is also true for some bivariate, stable, first-order systems with complex eigenvalues, as is shown in Appendix D.

APPENDIX D

Symmetric Relationships

The symmetric case refers to the case where reciprocal/cross-lagged effects are constrained to be equal, that is, when the upper-diagonal part of the effect matrix ‘equals’ the lower-diagonal part of the effect matrix, that is, when ϕij=ϕji for all i,j=1,,q. For instance, depicts a symmetric bivariate relationship if ϕ12=ϕ21.

When a matrix M is symmetric, it holds that M=M and thus that V1=V (which means that V is orthogonal):

(14) M=(VDMV1)=(VDMV)=(V)DMV=VDMV=VDMV1=M,(14)

using the properties (M1M2M3)=M3M2M1, and D=D for symmetric or better diagonal matrices D. Since Φ(d1), (Φ(d1))n, and A share the same eigenvectors (and only differ in eigenvalues: DΦ(d1), (DΦ(d1))n, and D=log(DΦ(d1))/d1, respectively), if one of these is symmetric the other two will be as well.

Thus, in case of a symmetric relationship, one will find equal cross-lagged counterparts (i.e., ϕij=ϕji or αij=αji for all i,j=1,,q) irrespective of the chosen time-interval and model. Consequently, misleading conclusion IIa (i.e., the one in terms of cross-lagged counterparts) does not occur. Note that cross-lagged effects that are not each others’ counterparts can cross each other, that is, misleading conclusion IIb can occur. Moreover, misleading conclusions I and III can still occur.

APPENDIX E

Complex Eigenvalues

In case of a 2×2 matrix, complex eigenvalues can be written as:

η1=1/2tr(A)+ι1/2(4det(A)(tr(A))2)=:τ+ιω,                           η2=τιω,

with tr(A)=α11+α22, det(A)=α11α22α12α21, ι2=1 and, by definition, ω  0 (i.e., (tr(A))24det(A) < 0). When the eigenvalues are complex and the real part is negative, then the system is oscillating but stable, that is, the system will spiral around the equilibrium along some axes to the equilibrium. Notably, since complex eigenvalues always come in conjugate pairs, a one-variable first-order continuous-time model never shows oscillations. In case of oscillating processes, one may also wish to estimate a second-order rather than a first-order differential equation model.

The eigenvalues for a bivariate Φ(Δt)=eAΔt, that is, the eigenvalues for a bivariate, ‘positive autoregression’, first-order system are:

λ1=eη1Δt=e(τ+ιω)Δt=eτΔteιωΔt=eτΔt(cos(ωΔt)+ιsin(ωΔt))=eτΔtcos(ωΔt)+ιeτΔtsin(ωΔt):=γ(Δt)+ιθ(Δt),λ2=eη2Δt:=γ(Δt)ιθ(Δt).

Hence, if A has complex eigenvalues and thus eigenvectors (i.e., ω  0), the eigenvectors and thus eigenvalues of Φ(Δt) are complex as well (i.e, θ(Δt)  0). Since e2πιn=1 for n=0,±1,±2,, a specific A leads to only one Φ(Δt). On the other hand, since the cosine and sine functions are periodic with 2π/Δt, multiple As, namely ones with complex eigenvalues and their periodic equivalents, the so-called aliasing matrices (for more details see Hamerle et al., Citation1991), can lead to the same Φ(Δt). Stated otherwise, in case of complex eigenvalues, there is no one-to-one correspondence between Φ(Δt) and A if you do not specify additional conditions. Yue et al. (Citation2016) state that there is a one-to-one correspondence between Φ and A in case the imaginary parts of all the eigenvalues of A (i.e., ω) lie into (π/Δt,π/Δt). If the used sampling frequency is higher than the maximum sampling frequency, other a-priori information about A is required to reduce the number of multiple solutions and/or one needs to search over a collection of matrix logarithms to find the sparsest one, see Yue et al. (Citation2016).

For a DT process to be stable, we need the modulus (i.e., the absolute value) of the eigenvalues of Φ(Δt) to be smaller than 1: γ(Δt)2+θ(Δt)2 < 1. Note that if θ(Δt)=0 and thus if the eigenvalue is real and not complex, this reduces to γ(Δt)2=γ(Δt) < 1, which implies real eigenvalues between −1 and 1. From the equations above, we can deduce that:

γ(Δt)2+θ(Δt)2=(eτΔtcos(ωΔt))2+(eτΔtsin(ωΔt))2
=e2τΔt(cos2(ωΔt))2+sin2(ωΔt))2=e2τΔt=eτΔt.

Consequently, if the modulus of an eigenvalue of Φ(Δt) (belonging to a ‘positive autoregression’, first-order system) is smaller than 1, the real part of the corresponding eigenvalue of A (i.e., τ) is negative, and vice versa. Note that there exist first-order DT processes for which the modulus of the eigenvalues is smaller than one, but for which there is no first-order CT equivalent as specified in Equation 2; namely, stable DT processes that do not exhibit ‘positive autoregression’.

When comparing the order of the complex eigenvalues of Φ(Δt) versus A, we have to satisfy twice two conditions, depending on the ordering of the imaginary parts of the two eigenvalues. The first set of two conditions is:

  1. τ+ιω > τιω if ω > ω;

  2. γ(Δt)+ιθ(Δt) > γ(Δt)ιθ(Δt)sin(ωΔt) >  sin(ωΔt) >  if θ(Δt) > θ(Δt).

These two conditions coincide if ωΔt lies between 0 and π (with a period of 2π), that is, if ω lies between 0 and π/Δt (with a period of 2π/Δt). The second set of two conditions is:

  1. τ+ιω < τιω if ω < ω;

  2. γ(Δt)+ιθ(Δt) < γ(Δt)ιθ(Δt)sin(ωΔt) <  sin(ωΔt) if θ(Δt) < θ(Δt).

These two conditions coincide if ω lies between π/Δt and 0 (with a period of 2π/Δt). Combined, this leads to the restriction that ω lies between πΔt and π/Δt (with a period of 2π/Δt), which is actually also the condition for which there is a unique correspondence between Φ(Δt) and A. Stated otherwise, for ranges of time-intervals Δt for which ω lies between π/Δt and π/Δt (with a period of 2π/Δt), conclusions I to III based on Φ(Δt) resemble that of A; however, for the ranges between those, conclusions I to III are misleading.

When comparing the order of the complex eigenvalues of Φ(d1) versus Φ(d2), the conditions for obtaining the same conclusions (of types I to III) are less straightforward. Hence, in some systems with complex eigenvalues, you will not find misleading conclusions I to III and in some you will. However, we do already know from the previous result that if for both Δt=d1 and Δt=d2 it holds that ω lies between π/Δt and π/Δt (with a period of 2π/Δt), that there are no misleading conclusions I to III based on Φ(d1) versus Φ(d2). Importantly, the substantive conclusions I to III then also resemble that based on the underlying A. An example is given in the Misleading Results section in the main text. Notable, there, ω=0.5π lies between π/Δt and π/Δt for Δt between 0 and 2 with a period of 2.