2,063
Views
9
CrossRef citations to date
0
Altmetric
Research Article

Exploring the Unfolding of Dynamic Effects with Continuous-Time Models: Recommendations Concerning Statistical Power to Detect Peak Cross-Lagged Effects

ABSTRACT

Cross-lagged panel models have been commonly applied to investigate the dynamic interplay of variables. In such discrete-time models, the size of the cross-lagged effects depends on the length of the time interval between the measurement occasions. Continuous-time modeling allows to explore this interval dependence of cross-lagged effects and thus to identify the maximal “peak” cross-lagged effects. To detect these peak effects, sufficient statistical power is needed. Based on results from a simulation study, we employed machine learning algorithms to identify a highly accurate prediction model. Results are incorporated into a Shiny App (available at https://psychtools.shinyapps.io/ContinuousTimePowerCalculation) for easy power calculations. Although limitations apply, our results might be helpful for study planning.

A common research question in the social sciences is about the reciprocal (or dynamic) interplay of variables over time. Models suitable for investigating such dynamics come by different names, for instance, linear panel models (Greenberg & Kessler, Citation1982), causal models (Bentler, Citation1980), autoregressive cross-lagged models (Bollen & Curran, Citation2006), and cross-lagged panel models (Mayer, Citation1986). The key components of these models are the autoregressive effects and the cross-lagged effects. An autoregressive effect indicates the strength of the impact of a variable on itself from time point to time point, whereas the cross-lagged effects describe the influences between the variables, that is, how a variable at a time point is impacted by the values of other variables at a previous time point.

One characteristic of cross-lagged effects from cross-lagged panel models is that they depend on the time interval between measurement occasions. For different interval lengths, the cross-lagged effects differ in size. This is because the impact of a variable on another first needs to unfold before it reaches its maximum and then declines with the passing of more time. For instance, a high dose of physical activity stimulates biochemical processes in the body which results in strong muscle ache some time later with a peak effect after a certain time span (e.g., Hecht & Voelkle, Citation2021, determined such a time span to be around half a day). Then, after a couple of days, the impact of high physical activity from some days ago has dissipated. Thus, there is a roughly inverse-U-shaped curve that describes the size of a cross-lagged effect depending on the length of the time interval. (adapted from Hecht & Voelkle, Citation2021) illustrates such a curve. The values of the cross-lagged effects (y-axis) are plotted against the time interval length (x-axis). For a time interval of zero, the cross-lagged effect is, of course, zero because there cannot be an impact from one variable to another if no time has passed. When more time passes, the effect from physical activity on pain severity unfolds (the curve rises) and then reaches the maximum effect for a time interval of around half a day. For longer time intervals, the size of the cross-lagged effect (and the curve) declines toward zero again.

Figure 1. Discrete-time cross-lagged effects plot. PA = Physical activity; SS = Symptom severity. Adapted from Hecht and Voelkle (Citation2021)

Figure 1. Discrete-time cross-lagged effects plot. PA = Physical activity; SS = Symptom severity. Adapted from Hecht and Voelkle (Citation2021)

When researchers plan their longitudinal studies, they need to choose a specific length of the time interval between measurement occasions. This means that they obtain cross-lagged effects from cross-lagged panel analysis for this specific time interval. However, where these effects are located in the effect distribution across interval lengths and whether these effects are the peak effects is usually unknown and hardly explorable with cross-lagged panel models. A solution to this problem is continuous-time modeling which offers exactly this feature: the unfolding and dissipation of cross-lagged relations can be investigated, independently from the spacing of measurement occasions in the study design. This is made possible by conceptualizing humans as continuously existing entities and differential calculus.

Among psychologists, there is a rather broad consensus that the human system continuously exists at all points in time (within the time span of life). This understanding is, however, not fully utilized in cross-lagged panel models and other discrete-time models. In contrast, continuous-time modeling explicitly builds upon the continuous human nature. When we observe the continuously existing and evolving human system at certain discrete time points, we obtain “snapshots” that can be used to estimate the parameters that describe the continuous system dynamics which are represented by a stochastic differential equation (see, e.g., Oud & Delsing, Citation2010).

Unfortunately, the mathematics needed for continuous-time modeling are somewhat intricate (see, e.g., Voelkle et al., Citation2012, for a didactic introduction and Hecht et al., Citation2019, for examples and explanations on how to understand the mathematical model notation), but fortunately accessible software solutions like ctsem (Driver et al., Citation2017) exist for model estimation. Analogous to discrete-time models, the parameters that describe the continuous dynamics within the human system in continuous-time models are again effects that temporally relate one variable to itself (auto-effect) and to the other variables (cross-effects). To not confuse discrete-time cross-lagged modeling and continuous-time modeling, it is advisable to carefully distinguish parameter labels (an overview is given in the work of Hecht & Voelkle, Citation2021).

The mathematical relations between a continuous-time model and its corresponding time interval dependent discrete-time counterpart models are given by a set of equations (see, e.g., EquationEquations 5–16 in the Appendix of the work of Hecht & Voelkle, Citation2021, or EquationEquations 6Equation8 in this article further below). Hereby, a key feature is that the parameters of the one model that describes the continuously evolving human system (i.e., the continuous-time parameters) can be transformed into parameters of many discrete-time cross-lagged models that describe the dynamics for arbitrary time intervals between discrete points in time. This feature implies some of the advantages of continuous-time modeling: (1) The possibility to compute discrete-time parameters for any time interval allows for comparing results from studies which used different time intervals for the assessment. (2) Each person might be assessed at different points in time; data from such flexible longitudinal designs (Hecht et al., Citation2019) can be used to estimate the continuous-time parameters, because no matter when the assessments of persons were conducted, they all are snapshots of the same continuous phenomenon. (3) The unfolding and dissipation of effects can be studied and the time interval for which the interrelation between variables (i.e., the cross-lagged effects) is strongest can be determined.

Oftentimes, these peak effects are of central substantive interest and researchers want to conduct inferential statistical tests for them. Thus, the question of statistical power immediately arises. Especially in psychology with usually underpowered studies (Maxwell, Citation2004), statistical power has been and is a pivotal and much discussed topic, and researchers might welcome recommendations to achieve sufficient power.

Purpose and scope

In the present work, we identify a prediction model for statistical power of peak cross-lagged effects with continuous-time models and provide an easy-to-use Shiny App with which the statistical power can be roughly approximated. We assume that the statistical power of the peak cross-lagged effects in continuous-time models depends on the number of persons, the number of time points, and the size of the peak cross-lagged effect, because sample size and effect size are classic factors related to statistical power.

The article is organized into the following sections. First, we briefly present a popular continuous-time model which is suitable to explore peak cross-lagged effects, compare notations and parameterizations for this model, and cite some empirical applications of this model. Second, we report results from a simulation study in which we varied the number of persons, the number of time points, the effect size, and the size of the auto-effect, and then calculated the power for a null hypothesis test of the peak cross-lagged effect against zero (α=.05). With the help of machine learning algorithms, we derive a prediction formula which could be used to roughly approximate power for study planning. Finally, we conclude with a discussion of our work (including limitations).

Multivariate continuous-time models

Model formulations

Continuous-time modeling is a popular approach in many disciplines, for instance, in the natural sciences, finance, and econometrics (e.g., Capasso & Bakstein, Citation2012; Dana & Jeanblanc, Citation2003; Gandolfo, Citation1993; Sinha & Rao, Citation1991). Recently, continuous-time modeling has found its way into the behavioral sciences (e.g., Van Montfort et al., Citation2018). Although continuous-time models come in various flavors, a common continuous-time model in the behavioral sciences and psychology is the linear stochastic differential equation model (Oud & Delsing, Citation2010), which may also be called continuous-time first-order vector autoregressive [CT-VAR(1)] model (Ryan et al., Citation2018), multivariate continuous-time model (Hecht & Voelkle, Citation2021), or multivariate Ornstein-Uhlenbeck process (e.g., Oravecz et al., Citation2018). The core of these models is a stochastic differential equation which can be conceptually written as:

(1) derivative=deterministic part+stochasticpart.(1)

The derivative is the result of performing a differentiation process upon a function which describes the state of the (human) system depending on time. Time is usually denoted with symbol t, whereas the notation of the vector which contains a value per variable (“state vector”) differs, for example, x (Oud & Delsing, Citation2010; Voelkle et al., Citation2012), y (Ryan et al., Citation2018), η (Driver et al., Citation2017), Θ (Oravecz et al., Citation2018), and θ (Hecht & Voelkle, Citation2021). The derivative is often given in Leibniz’s notation: dx(t)dt (e.g., Oud & Delsing, Citation2010; Ryan et al., Citation2018; Voelkle et al., Citation2012). Some authors (e.g., Driver et al., Citation2017; Hecht & Voelkle, Citation2021; Oravecz et al., Citation2018) place the differential dt on the right-hand side of the equation: dx(t)=[]dt, presumably because the Wiener process is not differentiable (Singer, Citation2012).

The deterministic part contains the product of a matrix, which is usually referred to as the drift matrix, and the state vector. Many authors denote the drift matrix with the symbol A, one exception being Oravecz et al.’s model where drift matrix B=A. Further, a term relating to the mean levels of the processes is usually included. Often, either process intercepts, b (e.g., Driver et al., Citation2017; Oud & Delsing, Citation2010; Voelkle et al., Citation2012), or process means, μ (e.g., Oravecz et al., Citation2018; Ryan et al., Citation2018), are used. As the relation between process means and continuous-time intercepts is μ=A1b, these expressions are interchangeable, for example, A(xμ)=Ax+b.

The stochastic part comprises scaled derivatives or differentials of Wiener processes W(t) (i.e., random walks in continuous time) representing stochastic noise. The derivatives dW(t)dt or differentials dW(t) are scaled by the Cholesky factor of the covariance matrix of the noise. Oud et al. (Citation2018) call this covariance matrix diffusion covariance matrix and its Cholesky factor diffusion matrix, whereas other authors (e.g., Oud & Delsing, Citation2010; Voelkle et al., Citation2012) call the covariance matrix “diffusion matrix” and impose no specific name on its Cholesky factor. Often, the covariance matrix is denoted with symbol Q and the Cholesky factor with G, one exception being Oravecz et al.’s notation of Σ instead of G.

Furthermore, continuous-time models frequently include a measurement part, for example, for continuous response data (e.g., Arminger, Citation1986; Chow et al., Citation2016; Deboeck & Boulton, Citation2016; Driver et al., Citation2017; Oud & Delsing, Citation2010; Oud & Jansen, Citation2000; Singer, Citation2012) or for dichotomous response data (Hecht et al., Citation2019). Also, various model extensions were proposed, for example, with individually varying continuous-time intercepts (e.g., Hecht & Voelkle, Citation2021; Oud & Delsing, Citation2010) and with time-independent and time-dependent predictors (e.g., Driver et al., Citation2017).

In summary, multivariate continuous-time models have been introduced to the behavioral and social sciences as exemplarily illustrated by the—not exhaustive—list of cited works above. The mathematical symbols, notations, parameterizations, names, and extensions might differ, but all the multivariate continuous-time models have the same stochastic differential equation core. Also, the popular continuous-time modeling software ctsem (Driver et al., Citation2017, Citation2020) is based on the multivariate continuous-time model and allows users to estimate this model.

Empirical applications

Multivariate continuous-time models (or their variants/extensions) have been applied in many areas of behavioral and social sciences research and other disciplines. Often, the number of processes under investigation was two, that is, a bivariate continuous-time model was employed. For example, Voelkle et al. (Citation2012) illustrated the dynamic interplay of authoritarianism and anomia; Ryan et al. (Citation2018) investigated the temporal relations between feeling down and being tired; Oravecz et al. (Citation2018) showed how two core affect ratings, valence and arousal, are related across time; Tómasson (Citation2018) illustrated the associations of temperature and CO2 over 800,000 years; Crewther et al. (Citation2020) studied the temporal relation of testosterone and motivation; Crewther et al. (Citation2021) investigated the within-person coupling of testosterone and cortisol; Hecht and Voelkle (Citation2021) illustrated the temporal associations between physical activity and pain; Mueller et al. (Citation2018) estimated multiple bivariate continuous-time models to explore the bivariate dynamic relations between extraversion, neuroticism, physical functioning, and vision; Volz et al. (Citation2019) investigated the dynamic interplay of general self-efficacy and post-stroke depression; Dormann et al. (Citation2018) studied psychosocial safety climate and depression with the bivariate continuous-time model; and, to complete our exemplary non-exhaustive list of empirical applications, Ehm et al. (Citation2019) examined the relation between academic self-concept and achievement.

Overall, continuous-time modeling has been applied in research practice. As the multivariate continuous-time model is a more general variant of the popular (random intercept) cross-lagged panel model [(RI)-CLPM; e.g., Hamaker et al., Citation2015; Kearney, Citation2017; Selig & Little, Citation2012] we are optimistic that the usage of multivariate continuous-time models will accelerate.

Model formulation in the current study

The formulation of the multivariate continuous-time model in the current study is largely based on the work of Oud and Delsing (Citation2010) with slight notational adaptations. The core of our model is the standard stochastic differential equation as described above. In our notation it reads:

(2) dyjtdt=Ayjtμj+GdWjtdt,(2)

where yjtRF is the state vector of person j’s F-dimensional dynamical system at continuous time tR (with F being the total number of variables or “processes”). dyjtdt is the derivative, which may be interpreted as the instantaneous rate of change (or velocity) of the system at time t. The F×F drift matrix A contains the auto-effects (on the main diagonal) and cross-effects (on the off-diagonals) and thus characterizes the temporal dynamics of the variables. In the present work, we limit our delineations to continuous-time models in which the real-parts of the eigenvalues of A are negative. The vector μjRF contains the individual means. Noise is represented by a multivariate Wiener process WjtRF and dWjtdt is its formal derivative. The F×F diffusion covariance matrix Q=GG is the covariance matrix of this noise, and G is the Cholesky factor of Q representing the effect of the noise.

The solution of the stochastic differential equation (EquationEquation 2) is given as:

(3) forp2,yjp=AΔp1yj(p1)+IFAΔp1μj+ωj(p1),(3)
(4) μjNFμ,Σ,(4)
(5) ωj(p1)NF0,QΔp1,(5)
(6) AΔp1=exp(AΔp1),(6)
(7) QΔp1=irowexp(A#Δp1)IF2row Q,(7)
(8) withA#=AIF+IFAandQ=irowA#1row Q(8)
(9) and forp=1,yj1NFμj+μdev,Σfw,(9)

where IF and IF2 are identity matrices of size F and F2, respectively. denotes the Kronecker product. The row operator puts the elements of a matrix row-wise in a column vector. The irow operator puts the elements from the column vector back into a matrix.

The interval dependent discrete-time part of the model is given by EquationEquation 3. Unequal-interval longitudinal designs usually involve responses of j=1,,N persons at several points in time, tp, with p=1,,T being a running index denoting the discrete time point and T being the number of time points. Time interval lengths Δp1Footnote1 between time points are given by Δp1=tptp1 for all p2. The values, yjpf, of person j at time point p on variable f=1,,F are stacked into the column vector yjp. This vector of each person at each time point is predicted by the matrix product of the autoregressive matrixFootnote2 AΔp1 and the vector yj(p1) of the person at the previous time point (which was Δp1 time units ago) plus an intercept term (which is calculated from the individual mean) plus an error term ωj(p1). The individual means are assumed to be normally distributed with mean vector μ and covariance matrix Σ (EquationEquation 4). The symbol as subscript denotes “long-range” (or “asymptotic”, Driver et al., Citation2017) parameters that describe the system for Δ , that is, a system with no temporal relations (AΔ=0). The error term has a normal distribution with a zero vector as location and covariance matrix QΔp1 (EquationEquation 5). The interval-dependent discrete-time parameter matrices, AΔp1 and QΔp1, are connected to the continuous-time drift matrix A via EquationEquations 6Equation8. The interval-dependent error covariance matrix QΔp1 additionally depends on the within-person covariance matrix Q (EquationEquation 7), which is assumed equal across persons. For the first time point, the vector yj1 is normally distributed with the sum of the individual mean, μj, and a deviation, μdev, as location and a within-person covariance matrix Σfw (subscript “fw” stands for “first time point, within-person”). The to be estimated model parameters of this continuous-time model are the drift matrix A, the covariance matrices Σ, Q, and Σfw, and the mean vectors μ and μdev.

The time interval dependent autoregressive matrix AΔp1 (EquationEquation 6) contains the autoregressive effects (on the main diagonal) and the cross-lagged effects (on the off-diagonals):

AΔp1=aΔp1,Y1aΔp1,YFY1aΔp1,Y1YFaΔp1,YF.

The autoregressive effects characterize the stability of the variables, that is, how predictive a value at one time point is for the value at the next time point. A high autoregressive effect indicates high stability/predictiveness, whereas a low autoregressive effect indicates low stability. A cross-lagged effect describes the prediction of one variable at a time point on another variable at the next time point. Cross-lagged effects may be interpreted like partial regression coefficients: A one unit change in the predictor variable is associated with an expected change of the dependent variable in the amount of the regression coefficient, holding the other predictors constant. For instance, a one unit change in Y1 would be associated with an expected change in YF in the amount of aΔp1,Y1YF Δ time units later when all other predictors are held constant.

For better comparability, the cross-lagged effects may be standardized with respect to the within-person variances, for example:

(10) aΔp1,std,Y1YF=aΔp1,Y1YFσwY12σwYF2,(10)

where σwY12 and σwYF2 are main diagonal elements from the within-person covariance matrix Q. The interpretation of standardized cross-lagged effects is similar to (non-standardized) cross-lagged effects, except that the units of both variables are now standard deviations. Thus, a standardized cross-lagged effect indicates the expected change of a variable YF at time t+Δ in SDYF units associated with a one SDY1 change of Y1 at time t holding the other predictor variables constant.

illustrates a bivariate continuous-time model for three time points. For more explanations, examples, and illustrations of this (and other) continuous-time models see, for instance, Voelkle et al. (Citation2012), Driver et al. (Citation2017), Hecht and Voelkle (Citation2021), Hecht et al. (Citation2019), Hecht and Zitzmann (Citation2020), and Hecht and Zitzmann (Citation2021).

Figure 2. The bivariate continuous-time model with three time points (schematic). Model parameters that are estimated are set in light text color on dark background. Dotted lines indicate parameter relations according to EquationEquations 6 and Equation7. The dagger symbol indicates path coefficients derived from IFAΔp1 in EquationEquation 3

Figure 2. The bivariate continuous-time model with three time points (schematic). Model parameters that are estimated are set in light text color on dark background. Dotted lines indicate parameter relations according to EquationEquations 6(6) AΔp−1∗=exp(AΔp−1),(6) and Equation7(7) QΔp−1∗=irow−exp(A#Δp−1)−IF2row Q∞,(7) . The dagger symbol † indicates path coefficients derived from IF−AΔp−1∗ in EquationEquation 3(3) forp≥2,yjp=AΔp−1∗yj(p−1)+IF−AΔp−1∗μj+ωj(p−1),(3)

Simulation study

Data generation

The data-generating model was the continuous-time model described in EquationEquations 2Equation9 with F=2 processes (i.e., a bivariate continuous-time model). The non-varied true model parameters were:

Σ=Q=Σfw=1001and μ=μdev=00.

The drift matrix A contains the auto-effects a11 and a22, which were varied in the simulation design, on the main diagonal and the cross-effects a21 and a12 as off-diagonal elements, which were determined from the varied peak standardized cross-lagged effects, apeak,std,Y1Y2 and apeak,std,Y2Y1, and the other model parameters. Inserting the true parameter values into the model equations (EquationEquations 3Equation9) gives the data-generating model:

μjN200,1001;
for p=1,yj1N2μj+00,1001;
forp2,Δp1U{0.20,0.40,0.60,0.80},
AΔp1=expa11a12a21a22Δp1,
QΔp1=irowexp2a11a12a120a21a22+a110a12a210a22+a11a120a21a212a22Δp1I41001,
ωj(p1)N200,QΔp1,
yjp=AΔp1yj(p1)+I2AΔp1μj+ωj(p1),

where N2 denotes a bivariate normal and U a uniform distribution, and I2 and I4 are identity matrices of size 2 and 4, respectively. Please note that Δp1 is constant across persons which is a characteristic of an “unequal-interval nonindividualized design” (Hecht et al., Citation2019) and that the intra-class correlation is 0.50.

Simulation design

We varied the following factors in our simulation study: number of persons (N = 5, 10, 20, 30, 40, 50, or 60), number of time points (T = 5, 10, 20, 30, 40, 50, or 60), size of the peak standardized cross-lagged effect from process 1 to process 2 (apeak,std,Y1Y2 = 0.15, 0.10, 0.05, 0.05, 0.10, 0.15), size of the peak standardized cross-lagged effect from process 2 to process 1 (apeak,std,Y2Y1 = 0.15, 0.10, 0.05, 0.05, 0.10, 0.15), auto-effect of process 1 (a11 = 1.05, 0.70, 0.45), and auto-effect of process 2 (a22 = 1.05, 0.70, 0.45). Throughout this work, cross-lagged parameters were standardized using the within-person variances (see EquationEquation 10). To keep the simulation design within manageable and sensible boundaries, we did not fully cross all factors. The auto-effects of process 1 and 2 were paired: 1.05/1.05, 0.70/0.70, and 0.45/0.45. The peak effect sizes were completely crossed and these pairs were completely crossed with the auto-effect pairs. To avoid uninformative power estimates of (or very close to) zero or one, combinations of smaller effects (all combinations that include absolute effect sizes of 0.05) were combined with the fully crossed subsets of N = 30, …, 60 and T = 30, …, 60, whereas combinations of higher effects (all combinations that include absolute effect sizes of 0.10 and 0.15) were combined with the fully crossed subsets of N = 5, …, 30 and T = 5, …, 30. All other effect size combinations (absolute effect combinations of 0.05/0.10 and 0.05/0.15) were combined with the fully crossed N and T. This procedure resulted in a total of 3,312 design cells. A detailed overview of the simulation design is provided in the supplementary material.

Analysis

We randomly drew cells from the design, generated data sets, and ran models until the available CPU time of 1,650 days was exploited. The mean number of replications per design cell was MNrepl=574 (min = 378, max = 802), the mean convergence rate was Mconvrate=0.81 (min = 0.30, max = 1.00), and the mean number of converged models was MNconverged=462 (min = 153, max = 652). All models were estimated using the maximum likelihood estimator of the R package ctsem (R Core Team, Citation2020; Driver et al., Citation2020) which interfaces to OpenMx (Neale et al., Citation2016) and each model ran on one Intel Xeon Gold 5120 (2.20 GHz) CPU of a 64-bit Linux Debian 10 “Buster” computer. A model was considered as converged if the OpenMx exit code was 0 and the standard errors of all parameters were unflawed (i.e., if they were non-missing and below 1,000). The analysis model was identical with the data-generating model. For each replication, the peak standardized cross-lagged effect was deemed insignificant (coded as 0) when its 95% confidence interval (CI) covered the value 0 and significant (coded as 1) when 0 was not covered by the confidence interval. The 95% CI for an undirected alternative hypothesis (population value 0) was calculated as: parameter ± 1.96 SE (Voelkle et al., Citation2012), with SE being the standard error of the parameter.

We used machine learning techniques to derive a prediction model for the power of the peak standardized cross-lagged effects. Specifically, we combined k-fold cross validation (e.g., Kuhn & Johnson, Citation2013) with stepwise logistic regression. The number of folds was k=10 and thus, there were ten training sets (each comprised of nine folds) and ten corresponding test sets (one fold). The folds were created with function createMultiFolds() from R package caret (Kuhn et al., Citation2020). Within each replicate of the cross-validation, a stepwise logistic regression was conducted with function stepAIC() from the R package MASS (Ripley et al., Citation2021). The dependent variable was the dichotomous significance of the peak standardized cross-lagged effect. The stepwise regression algorithm started with the intercept-only model; the potentially maximal model included the intercept, the number of persons N, the number of time points T, the estimated peak standardized cross-lagged effect, the estimated absolute peak standardized cross-lagged effect, two-way interactions of these terms, and quadratic and cubic polynomials of these terms. The direction of the selection algorithm was both forward and backward and the criterion was AIC. As results of this machine learning approach, we report the differences concerning the selected predictors and statistics of AIC between training sets and prediction accuracy statistics (calculated with function confusionMatrix() from R package caret) when using the trained prediction model for the test set data. The final model is then estimated from the entire data with the identified relevant predictors.

Results

All ten stepwise logistic regression models resulted in the same selected parameters. AICs were very comparable with practically no variation between training sets (MAIC=14,581,609, minAIC=14,580,228, maxAIC=14,582,725, SDAIC=808). The prediction accuracies based on the test sets were very comparable and high with Macc=0.8854, minacc=0.8853, maxacc=0.8855, SDacc<0.0001. This may be interpreted as an indication that the data might not be overfitted. The estimates of the final model are given in .

Table 1. Parameter estimates from the logistic regression model for the prediction of statistical power of peak cross-lagged effects

Recommendations and Shiny App

Our general recommendation to determine power for a specific study design and analysis approach is, of course, to either calculate the power with analytically derived equations or – if no analytical solutions exist – to conduct tailored simulation studies. Such simulations, however, can become complex and cumbersome. Then instead, our prediction model might be used to obtain a rough power estimate (although we acknowledge that generalizing our findings beyond the studied conditions is arguable and limitations apply, see Discussion).

Based on our results (final model estimates in ), we programed a Shiny App (available at https://psychtools.shinyapps.io/ContinuousTimePowerCalculation) for power calculations of peak standardized cross-lagged effects. Inputs are the number of persons N, the number of time points T, and the targeted value of the peak standardized cross-lagged effect. The output is the power of the peak standardized effect for α=0.05 and an undirected alternative hypothesis that the effect is unequal zero.

Discussion

One of the strengths of continuous-time (vs. discrete-time) modeling is the facilitation of exploring the unfolding and dissipation of dynamic effects by deriving discrete-time cross-lagged effects for a variety of time interval lengths from the estimated continuous-time model parameters and then determining the peak effect. Plotting the values of the cross-lagged effects against their time interval lengths can provide visual support for exploring the shape of the cross-lagged effect distribution. This feature of continuous-time models might be advantageous for study planning as well. The interval length between measurement occasions can be flexibilized (to some degree), because the cross-lagged relations can be explored independently from the spacing of measurement occasions in the study design.

Researchers might be interested in the peak cross-lagged effects as these characterize the maximal impact from one variable on another. Thus, for study planning, the statistical power to detect those peak cross-lagged effects might be of interest. Analytically derived formulas or tailored simulation studies could be used for power calculations. However, to our best knowledge, such analytical solutions do not exist or are not publicly available. This leaves researchers with the option to conduct simulation studies, which are, however, challenging. To give some rough guidance, we conducted a simulation study and derived a prediction formula using machine learning techniques.

Generalizability of the results is limited to the conditions that were studied. Due to limited computational resources, it was not possible to investigate all potentially relevant factors and the extent of variation of the included factors, the degree of factor crossings, and the number of replications was limited. We rather heavily varied and crossed number of persons, number of time points, and effect size, as these factors were expected to be very predictive. However, the auto-effect entered the simulation study with only three different values, and both auto-effects had the same value in each design cell. We varied only the auto-effect, but not the other model parameters, and the intra-class correlation was constant as well. We used a bivariate continuous-time model (two variables) with individual means for which a normal distribution was assumed. Significance was determined with confidence intervals based on the assumption of normal parameter distributions. The assessment design was unequally spaced (different interval lengths between measurement occasions) but the same for each person. Only one software/estimation method was employed. Thus, our results might not generalize to other conditions, particularly not to settings where auto-effects are much larger, where the auto-effects of the processes differ in size, where the other model parameters exhibit other values, to continuous-time models with more than two variables and other assumptions about individual parameters, to other intra-class correlations, to different procedures for determining significance, to different assessment designs (e.g., individualized designs), to other software and estimation method. The employed model was a bivariate continuous-time model with random intercepts (which is a more general variant of the bivariate random intercepts cross-lagged panel model, RI-CLPM). Generalizability to other models is limited.

One limitation of the bivariate continuous-time model is that both discrete-time cross-lagged effects have their maxima for the same interval length. Researchers should be aware of this limitation and avoid the use of bivariate continuous-time models when this restriction is not theoretically supported.

The dependent variable in the present study was statistical power. For study planning, other criteria might be of interest as well. For instance, Hecht and Zitzmann (Citation2021) illustrated the dependence of model performance (an aggregated measure consisting of convergence, bias, and coverage) on number of persons N and time points T. So even if a sufficient statistical power is achieved with some N and T, model performance might still be suboptimal.

Further limitations are that our results are based on a rather small amount of data (due to restricted computational resources), that we conducted our power analyses only for the very common significance level α=.05 and an undirected alternative hypothesis, that the regression model might not contain all relevant factors and does not perfectly predict power, and that our analysis model was identical to the data-generating model. Negative effects of model misspecifications on estimation performance in autoregressive modeling contexts have been shown, for example, by Tanaka and Maekawa (Citation1984) and Kunitomo and Yamamoto (Citation1985); thus, negative effects of model misspecifications on power are imaginable as well. Further, the peak cross-lagged effects might be located in regions with no or sparse data and the quality of such interpolations might also depend on design characteristics.

In sum, this leaves enough material for future research on statistical power in continuous-time modeling which we consider quite worthwhile because we believe that continuous-time modeling will become even more prominent fueled by the rise of intensive longitudinal methods like the experience sampling method (ESM), ecological momentary assessment (EMA), and ambulatory assessment (AA). In future research, it would be interesting to derive analytical solutions to avoid the discussed limitations concerning generalizability. Until then, our prediction model may be used as a preliminary bridging approach. In this regard, we see our article as one step forward toward a full understanding of statistical power in continuous-time modeling.

In light of the limitations, we caution to use our presented results unreflectingly. Still, we think that we made a modest contribution and hope that researchers who are interested in exploring the unfolding of dynamic effects will find the results and recommendations useful.

Additional information

Funding

We acknowledge support by the Open Access Publication Fund of Humboldt-Universität zu Berlin.

Notes

1 As Δp1 is constant across persons, the design may be called “unequal-interval nonindividualized” (Hecht et al., Citation2019).

2 In line with Oud and Delsing (Citation2010) we use the asterisk symbol * to denote discrete-time parameters that can be calculated from continuous-time parameters.

References