6,274
Views
12
CrossRef citations to date
0
Altmetric
Research Articles

A Comparison of Different Approaches for Estimating Cross-Lagged Effects from a Causal Inference Perspective

&
Pages 888-907 | Received 12 Aug 2021, Accepted 08 Apr 2022, Published online: 10 Jun 2022

Abstract

This article compares different approaches for estimating cross-lagged effects with a cross-lagged panel design under a causal inference perspective. We distinguish between models that rely on no unmeasured confounding (i.e., observed covariates are sufficient to remove confounding) and latent variable-type models (e.g., random intercept cross-lagged panel model) that use parametric assumptions to adjust for unmeasured time-invariant confounding by including additional latent variables. Simulation studies confirm that the cross-lagged panel model provides biased estimates of the cross-lagged effect in the presence of unmeasured confounding. However, the simulations also show that the latent variable-type approaches strongly depend on the specific parametric assumptions, and produce biased estimates under different data-generating scenarios. Finally, we discuss the role of the longitudinal design and the limitations of assessing model fit for estimating cross-lagged effects.

In many areas of psychological research, longitudinal cross-lagged panel designs are used to investigate whether changes in one construct X are related to changes in another construct Y (Little, Citation2013; Marsh et al., Citation2005; Orth et al., Citation2021). In the basic setting of two measurement waves, the key idea of the cross-lagged panel model (CLPM) is that the effect of a predictor at T1 on an outcome at T2 (i.e., cross-lagged effect) is estimated, controlling for the outcome at T1. Thus, the CLPM is based on a conditioning approach in which the posttest (Y2) is conditioned on the pretest (Y1) by regressing the posttest on the pretest and a potential exposure variable (X1; Maxwell & Delaney, Citation2004; Newsom, Citation2015; Plewis, Citation1985).

However, in a very influential paper (1402 citations listed on Google Scholar as of March 28, 2022), Hamaker et al. (Citation2015) criticized that the CLPM does not appropriately account for the trait-like, time-invariant stability of many psychological constructs and, therefore, results in distorted estimates of cross-lagged effects. Hamaker et al. (Citation2015) proposed the random intercept cross-lagged panel model (RI-CLPM) as an extension of the traditional CLPM that allows controlling for stable trait factors when at least three measurement waves are available (Usami, Murayama, et al., Citation2019). The RI-CLPM has been interpreted as a residual-level approach (Andersen, Citation2021; Asparouhov & Muthén, Citation2021) in which the longitudinal associations between two constructs are decomposed into stable between-person associations (i.e., the correlation between time-invariant between-person parts) and temporal within-person dynamics (i.e., within-person effects for deviations from between-person parts). This decomposition allows estimating within-person cross-lagged effects that are adjusted for the effects of stable trait factors. The RI-CLPM has received considerable attention in the methodological literature. Many scholars argue that the RI-CLPM should be preferred over the CLPM for estimating cross-lagged effects, particularly in the presence of stable trait factors (e.g., Berry & Willoughby, Citation2017; Curran & Hancock, Citation2021; Grimm et al., Citation2021; Mulder & Hamaker, Citation2021; Mund & Nestler, Citation2019; Usami, Citation2021; Zyphur et al., Citation2020). Furthermore, empirical comparisons of the CLPM and the RI-CLPM have shown that the decision between the CLPM and RI-CLPM is crucial because the two approaches can yield results that substantially differ concerning the magnitude, sign, and statistical significance of the estimated cross-lagged effect (e.g., Bailey et al., Citation2020; Ehm et al., Citation2019; Littlefield et al., Citation2021; Núñez-Regueiro et al., Citation2021; Oh et al., Citation2020; Orth et al., Citation2021; Ruzek & Schenke, Citation2019; Zhou et al., Citation2020).

One frequently made argument in favor of the RI-CLPM is that it controls for unobserved confounding variables that are stable across time (Usami, Murayama, et al., Citation2019; see also Bailey et al., Citation2020). Given that one of the main challenges in estimating causal effects with non-experimental data is to control for all relevant covariates (Reichardt, Citation2019), this seems to be a significant advantage of the RI-CLPM. In the present article, we compare the CLPM and the RI-CLPM from a causal perspective and define the causal estimand (i.e., the cross-lagged effect) using potential outcome notation (Imbens & Rubin, Citation2015). We also consider two alternative latent variable-type approaches that have been proposed for estimating a cross-lagged effect under unmeasured confounding: observation-level models that inlude the stable trait factors at the level of the observed scores (Dishop & Deshon, Citation2021; Zyphur et al., Citation2020; see also Bollen & Brand, Citation2010), and a fixed effects dynamic panel model (Allison et al., Citation2017) that only models the process of the outcome Y but is agnostic about the process of the exposure X.

In four simulation scenarios, we confirm that the CLPM provides biased estimates of the cross-lagged effect in the presence of unmeasured confounding variables. However, the simulations also show that the potential of the different latent variable-type approaches to control for unmeasured confounding strongly depends on the specific parametric assumptions that are used to identify the effects of the latent variables. Finally, we argue that it is often advisable to include lag-2 effects (i.e., effects of variables across two units of time) in addition to lag-1 effects in the CLPM in order to control for delayed effects of X and Y when estimating cross-lagged effects (VanderWeele et al., Citation2020). Overall, the goal of this paper is to provide a more balanced discussion of different approaches for analyzing cross-lagged panel designs, and we would like to emphasize that—despite recent methodological recommendations—there are still good reasons to use the CLPM and rely on the assumption of no unmeasured confounding.

Before we start, we would like to point out that at a more descriptive level, the CLPM has been criticized by methodologists and developmental psychologists with the argument that it provides an uninterpretable blend of within-person effects and between-person effects (e.g., Berry & Willoughby, Citation2017; Hamaker et al., Citation2015). In the present article, we focus on the question under which conditions cross-lagged effects in the RI-CLPM or the other latent-variable type models can be given a causal interpretation. The question of whether the decomposition into within-person and between-person effects (in contrast to the undecomposed effects in the CLPM) provides a more appropriate description of longitudinal processes will be not further discussed. In our view, the goal of modeling developmental processes should be kept separate from the goal of causal inference.

1. Causal Perspective on Estimating Cross-Lagged Effects

In the following, we consider a multivariate process in which two variables Xt and Yt, and a vector of time-varying covariates Lt are related across time, that is, (Xt,Yt,Lt), tR. The covariates Lt are confounders for the association between Xt and Yt. Furthermore, we consider covariates that do not vary across time (e.g., gender, social status). In the context of our study, it is instructive to decompose the time-invariant covariates into an observed part C and a potentially unobserved part U. Later, we discuss methods that try to control for the effects of U, even though the covariates in U were not measured. In our discussion, we focus on the cross-lagged effect of Xs on Yt, where s<t. We start with a definition of the causal cross-lagged effect. The important role of the longitudinal design (e.g., selection of the time points s and t) for defining the causal cross-lagged effect will be discussed in the section 5 “The Role of Time for Estimating the Causal Cross-Lagged Effect”. It should be emphasized that the main goal is to estimate the (causal) cross-lagged effect and not to model the process (Xt,Yt,Lt).

1.1. Definition of a Causal Cross-Lagged Effect

The causal inference literature heavily draws on the potential outcome framework (Hernán & Robins, Citation2020; Imbens & Rubin, Citation2015; VanderWeele, Citation2015) to define a causal effect (i.e., the effect of Xs on Yt). We assume a continuous exposure variable Xs taking values from a set XXs (Hirano & Imbens, Citation2004; Vegetabile et al., Citation2021), and assume that for each individual, there exists a potential outcome Yt(x) for all xXXs. The potential outcome Yt(x) can be interpreted as the outcome that would have resulted for an individual if the exposure Xs had been set to x (e.g., by an intervention). We further assume that, for each individual, the observed outcome equals the potential outcome under the observed exposure level, that is, Yt= Yt(x) if Xs=x. This assumption is also known as the consistency assumption and connects the potential outcomes to the observed data (Hernán & Robins, Citation2020; VanderWeele, Citation2015). Note that all other potential outcomes for an individual, i.e., Yt(x) for all other xXXs, are unobserved.

The crucial assumption for defining the causal effect of Xs on Yt is the ignorability assumption (1) Yt(x) Xs | YTY,XTX,LTL,C,U    for all xXXs,(1)

This assumption states that the potential outcomes are conditionally independent of the exposure Xs given the previous history of outcome and treatment values as well as time-varying and time-invariant covariates. YTY, XTX, and LTL denote vectors of measures of the outcome, the exposure, and the time-varying covariate, where TY={t1(Y),,tk(Y)}, TX={t1(X),,tk(X)}, and TL={t1(L),,tk(L)} are sets of ordered time indices. Note that the choice of covariates and the respective time points in the ignorability condition in EquationEquation (1) is essential for defining the causal effect of interest. Many scholars point out that this choice needs to reflect subject-matter knowledge and cannot be resolved with statistical modeling techniques (Hernán & Robins, Citation2020).

This ignorability assumption is also labeled the no unmeasured confounding, conditional independence, or selection on observables assumption in the literature (Hernán & Robins, Citation2020; Imbens, Citation2004; Morgan & Winship, Citation2015). It should be emphasized that the time-varying covariates LTL, and the previous measures of the outcome YTY are treated in the same way as the time-invariant covariates C and U in EquationEquation (1). The crucial issue is that they are not affected by the exposure Xs, and act as a mediator on the pathway from Xs to Yt. In research practice, this is often achieved by measuring the time-varying covariates and the prior values of the outcome not after the exposure, that is tk(L)s and tk(Y)s. The relationship between the covariates, the exposure, and the outcome is depicted in , where τTX,Y,L,ACE denotes the average causal effect (ACE) of Xs on Yt.

Figure 1. Causal diagram that shows the relationship between the covariates, the exposure Xs, and the outcome Yt. τTX,Y,L,ACE denotes the average causal effect (ACE) of Xs on Yt.

Figure 1. Causal diagram that shows the relationship between the covariates, the exposure Xs, and the outcome Yt. τTX,Y,L,ACE denotes the average causal effect (ACE) of Xs on Yt.

We now consider the causal effect function τTX,Y,L(x)=E(Yt(x)), the expectation of the potential outcomes Yt(x) if Xs would be fixed to x. The goal is to estimate the causal effect function τTX,Y,L(x) from the data (Yt, Xs, YTY,XTX,LTL,C,U). To simplify notation, we set W=(YTY,XTX,LTL,C,U). If the ignorability assumption holds, the conditional expectation function can be identified as follows: (2) τTX,Y,L(x)=E[Yt(x)]=E(Yt|Xs=x,W=w)fW(w)dw,(2) where fW denotes the joint density of W. In other words, the expected values of the potential outcomes can be determined by averaging the conditional expectation of the outcome given the covariates and the exposure level across the covariate distribution.

In general, the causal effect function τTX,Y,L can be nonlinear. In order to quantify an average causal effect by a weighted average slope, we assume that the conditional expectation function τTX,Y,L is modeled as a linear function of the exposure values x at time s, where the approximation is weighted according to the density of Xs: (3) τTX,Y,L(x)α+τTX,Y,L,ACEx,(3) where the parameter τTX,Y,L,ACE denotes the average causal effect (ACE). The ACE can be interpreted as a weighted average slope by increasing the exposure level by one unit from x to x+1. The linear best approximation in EquationEquation (3) is provided by the least-squares estimate τTX,Y,L,ACE (4) τTX,Y,L,ACE=(τTX,Y,L(x)τt)(xμXs)fXs(x)dxVar(Xs),(4) where τt=τTX,Y,L(x)fXs(x)dx is the expected value of the potential outcomes, and fXs denotes the density function of Xs. The parameter τTX,Y,L,ACE can be interpreted as a weighted average slope across all slopes and is the linear best approximation to the true regression function (Angrist & Pischke, Citation2009; Berk et al., Citation2014). EquationEquation (3) is also known as a marginal structural model (MSM), which specifies a model for the marginal mean of the potential outcomes as a function of the exposure, in which the effects of confounding variables have been removed (Daniel et al., Citation2013; Hernán & Robins, Citation2020).

We now show that a conventional linear regression model can be used to obtain an unbiased estimate of the causal effect τTX,Y,L,ACE when all relevant covariates of the ignorability assumption are included (Keogh et al., Citation2018). To this end, we assume that all covariates are linearly related to the outcome. The outcome Yt is then given as follows (5) Yt=γYt+ γYtXsXs+γW+εYt.(5)

It can now be shown that the coefficient γYtXs provides the average causal effect τTX,Y,L,ACE. Using the relationship in EquationEquation (5), we derive with EquationEquation (2) (6) τTX,Y,L(x)=[γYt+γYtXsx+γw]fW(w)dw=α+γYtXsx,(6) where α=γYt+γE(W). The conditional expectation function strictly follows a linear form, and we, therefore, obtain the identity τTX,Y,L,ACE=γYtXs by comparing (6) with the specification (3). Thus, the regression coefficient γYtXs of a standard regression analysis provides an unbiased estimate of the causal effect τTX,Y,L,ACE when all relevant covariates (i.e., W=(YTY,XTX,LTL,C,U)) are included.

It should be emphasized that ignorability is a strong assumption that should not be taken lightly.Footnote1 In practical applications, the ignorability assumption implies that all relevant covariates (needed to fulfill the ignorability assumption in EquationEquation (1)) are observed. It is vital that this aspect of the ignorability assumption cannot be empirically tested and needs to be justified by substantive knowledge (Aronow & Miller, Citation2019). However, in the presence of unmeasured confounders (e.g., U is not included in EquationEquation (5)), the coefficient γYtXs from EquationEquation (5) provides a biased estimate of τTX,Y,L,ACE.

1.2. Specification Issues

Even if all relevant covariates are measured and included in the regression, it is crucial that the relationship between the covariates W and the outcome Yt is correctly specified in a given application. More specifically, this requires that the functional form of the relationship is correctly specified in the analysis models (e.g., squared terms of predictors are included if quadratic effects exist). Powerful machine learning methods allow for very flexible estimation of functional relationships (Athey & Imbens, Citation2019; van der Laan & Rose, Citation2018). It should also be emphasized that no assumptions about the distribution of covariates W are included in the definition of the causal effect. In particular, there are no assumptions about the process (Xt,Lt,Yt), and the fact that the cross-lagged effect can be obtained from a conventional regression model makes evident that there is no need for modeling the process in Xt, Lt, or Yt. However, modeling the process might be advantageous for identifying the effects of time-invariant unobserved confounders U when estimating the cross-lagged effect. To sum up, it is vital to distinguish the definition of a causal effect from modeling a process (Xt,Lt,Yt). The structural parameter in a well-fitting model for the process (Xt,Lt,Yt) can be wholly unrelated to the causal effect of interest.

2. Estimating a Cross-Lagged Effect: Controlling for Measured Confounding

In the following, we discuss different models that have been proposed for estimating cross-lagged effects. We start with models that assume no unmeasured confounding and assume a longitudinal panel design in which two variables Yt and Xt and a p × 1 vector of time-varying covariates Lt are repeatedly measured at equally spaced intervals across time t (t=1,,T). The time-varying variables are combined in the (p + 2)×1 vector Zt=(Xt, Yt, Lt). In addition, we consider a k × 1 vector C of covariates that are constant across the investigated time period. For simplicity, we further assume that the variables are mean-centered at each wave. The main focus is on estimating the causal effect of Xt1 on Yt.

2.1. Cross-Lagged Panel Model with Lag-1 Effects (CL1)

The CLPM (Finkel, Citation1995; Kessler & Greenberg, Citation1981; Little, Citation2013) with lag-1 effects and covariates is given by (7) Zt=Bt,t1Zt1+ΓtC+et    for t2,(7) where Bt,t1 is a (p+2)×(p+2) matrix of regression coefficients, including the autoregressive and cross-lagged parameters. The (p+2)×k matrix Γt represents the (potentially) time-varying effects of the time-invariant covariates. The (p+2)×1 vector of residuals et denotes correlated random components that are unrelated to previous time points and normally distributed with zero means.

The CLPM can be estimated with only two waves of data. With more than two waves, the autoregressive and cross-lagged parameters can be constrained to be invariant across waves, that is BBt,t1 for t2. This can be reasonable if the time intervals between the waves have a similar length (Little, Citation2013). Moreover, a constrained estimate can be interpreted as an average of the two causal effects from t2 to t1 and from t1 to t.

If the main interest is in estimating the effect of Xt1 on Yt, the cross-lagged panel model with freely estimated parameters is equivalent to estimating a single regression with Yt as the outcome variable. In the bivariate case (i.e., Zt=(Xt,Yt)) without covariates, the regression is given by (8) Yt=βYtYt1Yt1+βYtXt1Xt1+eYt(8)

Thus, the CLPM controls for preexisting differences in the outcome when assessing the effect of the exposure and is a special case of a more general class of conditioning methods in which an estimate of the causal effect is obtained by conditioning on the pretest and other observed covariates (see VanderWeele et al., Citation2016). It should be emphasized that the regression model in EquationEquation (8) includes one of the most basic and frequently used analysis strategies in psychology (Orth et al., Citation2021).

As the traditional CLPM only considers the effects of the previous measurement wave (i.e., lag-1 effects), we will refer to the model as CL1. The CL1 for T = 3 is depicted as a path diagram in the left panel of . It needs to be emphasized that recent discussion of the CLPM (e.g., Bailey et al., Citation2020; Dietvorst et al., Citation2018; Littlefield et al., Citation2021; Lucas, Citation2022; Usami, Murayama, et al., Citation2019) mainly focused on the CL1 and did not consider the role of higher-order lags (e.g., lag-2 effects).

Figure 2. Path diagrams of the cross-lagged panel models with lag-1 (CL1) and lag-2 (CL2) effects for three measurement waves.

Figure 2. Path diagrams of the cross-lagged panel models with lag-1 (CL1) and lag-2 (CL2) effects for three measurement waves.

2.2. Cross-Lagged Panel Model with Lag-2 Effects (CL2)

The CL1 in EquationEquation (7) can be extended to a CL2 that includes the effects of variables from two previous time points (i.e., lag-2 effects): (9) Zt=Bt,t1Zt1+Bt,t2Zt2+ΓtC+et   for t2,(9) where Bt,t2 denotes a (p+2)×(p+2) matrix of regression coefficients that includes the stability effects (e.g., the extent to which Xt depends on Xt2 over and above the effect of Xt1; second-order autoregression), and the lag-2 cross-lagged effects. The CL1 is a CL2 with only lag-1 effects and is thus nested within the CL2.

There are different perspectives on the importance of including these lag-2 effects in the CL2 (Little, Citation2013). One view is that they consider delayed effects that are not captured by the lag-1 effects (e.g., Asendorpf, Citation2021; Marsh et al., Citation2018). However, it has been argued that it is often difficult to justify which psychological mechanism could be responsible for these delayed effects and that direct effects (i.e., lag-1 effects) are often more plausible (e.g., Ehm et al., Citation2019). From a causal inference perspective, the main motivation for including lag-2 effects is a more comprehensive control for confounding. VanderWeele (Citation2021, p. 607) argues that if prior exposure Xt2 affects subsequent exposure Xt1, and also independently affects the outcome Yt not through Xt1 then prior exposure itself confounds the cross-lagged of Xt1 on Yt. Thus, prior values of the exposure and outcome measures (Xt2 and Yt2) can be considered covariates that allow for stronger control of confounding. In some applications, it may even be necessary to control for lag-3 effects of lagged exposures (Xt3) and outcomes (Yt3). VanderWeele et al. (Citation2020) provide a detailed discussion of the benefits of controlling for the history of previous exposure and outcome variables.

In the bivariate case, estimating a cross-lagged effect with a CL2 is equivalent to a single regression equation in which the outcome is regressed on the exposure Xt1 and the other variables (10) Yt=βYtYt1Yt1+βYtYt2Yt2+ βYtXt1Xt1+ βYtXt2Xt2+ eYt(10) where βYtXt1 denotes the cross-lagged effect of interest. However, like the CL1, the CL2 will produce biased estimates of the cross-lagged effect if unmeasured confounders U exist that are part of the ignorability assumption (see EquationEquation (1)) but not included in EquationEquation (10).

3. Estimating a Cross-Lagged Effect: Controlling for Unmeasured Confounding

One advantage of longitudinal data is that they offer the potential to control for the effects of unmeasured confounder variables when estimating causal effects. In the following, we discuss three different strategies that have been suggested for estimating cross-lagged effects in the presence of unmeasured time-invariant confounders U. The basic idea of these approaches is to use parametric modeling assumptions to identify additional latent variables that adjust for the effects of the unmeasured confounders.

3.1. Observation-Level Approaches

In the observation-level approach, the effects of unmeasured confounders are taken into account by including additional latent variables in the CL1 model (see EquationEquation (7)): (11) Zt=Bt,t1Zt1+ΓtC+ΛtU+et   for t2,(11) where Zt, and C  are defined as above, and U is a q×1 vector of latent variables with a (p+2)×q loading matrix Λt (Bollen & Brand, Citation2010). The latent variables U are referred to as unit effects, unobserved effects, or individual effects in the literature (Andersen, Citation2021; Hsiao, Citation2014; Wooldridge, Citation2010; Zyphur et al., Citation2020) and are supposed to capture the effects of unmeasured time-invariant variables. The latent variables can be identified by setting the variances of the latent variables to one. The loadings Λt represent the—potentially time-varying—effects of U on the observed variables. In many applications, Λt is assumed to be diagonal (i.e., Zt and U have the same dimension), and the loadings are fixed to represent individual differences in level and growth (e.g., linear trends) in the observed variables Zt (Bollen & Curran, Citation2004; Usami, Murayama, et al., Citation2019). However, more exploratory approaches have been proposed, particularly in the econometrics literature, in which Λt is not diagonal and the dimension of U is not assumed to be known (e.g., Bai & Li, Citation2014; De Vos & Everaert, Citation2021). have Furthermore, the time-invariant covariates C are assumed to be uncorrelated with the latent factors U, that is Cov(C,U)=0. This restriction is necessary for identifying the model (e.g., Allison et al., Citation2017; Bollen & Brand, Citation2010). The latent variables in U have also been characterized as accumulating factors (Usami, Citation2021; Usami, Murayama et al., Citation2019).

At the first wave (t = 1), the vector of variables Z1 are treated as exogeneous (i.e., predetermined; Bollen & Brand, Citation2010) and are correlated with the time-invariant covariates C, and the latent variables U (12) Cov(Z1,C)=ΨZ1C and Cov(Z1,U)=ΨZ1U (12) where ΨZ1C and ΨZ1U denote (p+2)×k and (p+2)×q covariance matrices in which all unique elements are freely estimated. An alternative representation of the covariances at t=1 was recently suggested by Zyphur et al. (Citation2020) (13) Z1=Γ1C+Λ1U+e1(13)

In this specification, the loadings in Λ1 are freely estimated to represent the covariance of Z1 and U.

One crucial aspect of the model in EquationEquations (11) and Equation(12) (or (11) and (13)) is the identification of all model parameters, which has to be established on a model-by-model basis (e.g., Bollen & Curran, Citation2004). It should be mentioned that with a large number of waves, the model can also include higher-order lags (e.g., lag-2 effects) for the time-varying variables. However, most applications are restricted to AR(1) processes (i.e., lag-1 effects). In the following, we discuss four different observation-level models that have been proposed for estimating cross-lagged effects in the presence of unmeasured confounders. We restrict our discussion to the bivariate case without additional covariates.

3.1.1. Unidimensional Latent Factor Model (OL1)

Finkel (Citation1995) proposed an observation-level model with a single latent factor. This model is identified if at least three measurement waves are available, and the regression coefficients are assumed to be invariant across time. This model is given as follows: (14) (XtYt)Zt=(βXtXt1βYtXt1βXtYt1βYtYt1)Bt,t1(Xt1Yt1)Zt1 +(λXt,U1λYt,U1)ΛtU1U+(eXteYt)et    for t2,(14)

where the regression coefficients are set equal across time, that is BBt,t1 for t2. At t=1, X1 and Y1 are allowed to covary with U1. The path diagram of the model is depicted for T = 3 in . The latent variable U1 can be interpreted as an unmeasured confounder that affects the variables Xt and Yt, and distorts the estimation of the cross-lagged effect in a CL1. Note that U1 is assumed to be time-invariant, but the effects in Λt are allowed to vary across time. We refer to the model in EquationEquation (14) as OL1. It should be emphasized that U1 is not actually measured but that the effects of the confounder U1 are identified by parametric modelling assumptions. Finkel (Citation1995, p. 83) refers to U1 as a “phantom variable” that can be used to test for spurious associations.

Figure 3. Path diagram of an observation-level model with a single latent variable (OL1; Finkel, Citation1995) for three measurement waves.

Figure 3. Path diagram of an observation-level model with a single latent variable (OL1; Finkel, Citation1995) for three measurement waves.

3.1.2. Twodimensional Latent Factor Model (OL2)

Another variant of the observational-level approach has been proposed by Dishop and DeShon (Citation2021; see also Bollen & Brand, Citation2010). In this specification, the number of additional latent variables in U corresponds to the number of time-varying variables in Zt

(15) (XtYt)=(βXtXt1βXtYt1βYtXt1βYtYt1)(Xt1Yt1)+(λXt,U100λYt,U2)(U1U2)+(eXteYt)    for t2,(15) where the diagonal loading matrices ΛtΛ, and regression coefficients BBt,t1 are set equal across time for t2. The latent variables U1 and U2 are assumed to be correlated. At t=1, the two observed variables X1 and Y1 are allowed to be freely correlated with the latent variables (see EquationEquation (12)) Again, this guarantees that X1 and Y1 are treated as exogeneous (i.e., predetermined). The path diagram of this model is shown for T=3 in . Note that the restriction of equal effects of the latent variables in U can be removed by freely estimating the loadings in Λt (see Dishop & DeShon, Citation2021, p. 17). Furthermore, it would be possible to specify Λt to be nondiagonal (i.e., cross loadings), and allow for effects of U1 on Yt, and U2 on Xt. In the following, we refer to the model in EquationEquation (15) as OL2.

Figure 4. Path diagram of an observation-level model with two latent variables (OL2; Dishop & DeShon, Citation2021; see also Bollen & Brand, Citation2010) for three measurement waves.

Figure 4. Path diagram of an observation-level model with two latent variables (OL2; Dishop & DeShon, Citation2021; see also Bollen & Brand, Citation2010) for three measurement waves.

3.1.3. Twodimensional Latent Factor Model with Loadings at Time 1 (OL3 and OL4)

A slightly different version of an observation-level model with two latent variables has been discussed by Zyphur et al. (Citation2020; see also Shamsollahi et al., Citation2021). They used the specification in EquationEquation (13) to represent the associations between Z1 and U (16) (X1Y1)=(λX1,U100λY1,U2)(U1U2)+(eX1eY1),(16) where Λ1 is assumed to be diagonal, and only λX1,U1 and λY1,U2 are estimated (the cross-loadings are set to zero, i.e., λX1,U2=λY1,U1=0). For t2, the model discussed in Zyphur et al. (Citation2020) is given by EquationEquation (15). In the following, we refer to this model as OL3.

Zyphur et al. (Citation2020) also discuss the possibility to freely estimate the loadings in the diagonal matrices Λt (for t2). As mentioned before, this would allow the latent variables U1 and U2 to have time-varying effects (see also Bollen & Brand, Citation2010, Allison et al., Citation2017). We refer to this model as OL4. The four observation-level models, which are also included in the simulation studies, are summarized in .

Table 1. Overview of different latent variable-type models that adjust for effects of unmeasured variables U.

3.2. Residual-Level Approaches

In the residual-level approach, the longitudinal association among the time-varying Zt are decomposed into a between-person part ZtB and occasion-specific within-person deviations ZtW: (17)   Zt=ZtB+ZtW    for t=1,,T(17) (18)  ZtB=ΓtC+ΛtU   for t=1,,T(18) (19)  Z1W=e1    for t=1(19) (20)  ZtW=Bt,t1Zt1W+et    for t2       (20)

The occasion-specific deviations ZtW can be interpreted as residuals that are orthogonal to the between-person part, that is ZtB=ΓtC+ΛtU (see Asparouhov & Muthén, Citation2021). Thus, the autoregressive and cross-lagged effects (i.e., Bt,t1) in EquationEquation (20) are estimated on a residual structure that is purified from the effects of the time-invariant covariates C and the latent variables U. At t=1, the occasion-specific deviations are equated with the residuals (see EquationEquation (19)). In general, the latent variables U are allowed to have time-varying effects Λt. The identification of the latent factor model involving the part ΛtU needs to be established for each specific application. A particular strength of the residual-level approach is its ability to disaggregate between-person and within-person effects in the analysis of longitudinal data (Curran & Hancock, Citation2021; Hamaker et al., Citation2015; Usami et al., Citation2019).

The main difference between the residual-level and the observation-level approaches is that in the residual-level approach the coefficients are estimated for the decomposed scores ZtW instead of the observed scores Zt. To better compare the two approaches, it is instructive to write EquationEquations (17) to Equation(20) for the observed scores (21) Zt=Bt,t1(Zt1Γt1CΛt1U)+ΓtC+ΛtU+et =Bt,t1Zt1+(ΓtBt,t1Γt1)C+(ΛtBt,t1Λt1)U+et   for t2(21)

The conditions under which the residual-level approach is a re-expression of the observation-level approach (and vice versa) will be further investigated in the section 3.4 “Relationship between the Residual-Level and Observation-Level Approaches”.

3.2.1. Twodimensional Latent Factor Model with Time-Invariant Loadings and Regression Coefficients (RL1)

The most popular residual-level approach is the RI-CLPM (Hamaker et al., Citation2015) which was introduced to extend the CLPM with lag-1 effects (CL1). In the bivariate case, the RI-CLPM is given by (22) (XtYt)=(λXt,U100λYt,U2)(U1U2)+(XtWYtW) for t=1,,T(22) (23) (XtWYtW)=(βXtXt1βXtYt1βYtXt1βYtYt1)(Xt1WYt1W)+(eXteYt) for t2(23)

The loading matrices Λt are assumed to be diagonal (e.g., no effects of U1 on Yt), and the loadings λXt,U1 and λYt,U2 are assumed to be invariant across time (Hamaker et al., Citation2015). The latent variables U1 and U2 are interpreted as stable trait factors that represent the parts of Xt and Yt that are completely stable across time. The coefficients βXtXt1 and βYtYt1 represent the autoregressive effects and the coefficients βXtYt1 and βYtXt1 represent the cross-lagged effects. The cross-lagged effects are interpreted as within-person effects. For example, βYtXt1 indicates whether a temporal deviation (i.e., Xt1W) from the stable trait level (i.e., U1) of one construct affects subsequent within-person deviations (i.e., YtW) from the stable trait level (i.e., U2) of the other construct. The RI-CLPM is depicted as a path diagram in .

Figure 5. Path diagram of the random intercept cross-lagged panel model (RI-CLPM; Hamaker et al., Citation2015) for three measurement waves.

Figure 5. Path diagram of the random intercept cross-lagged panel model (RI-CLPM; Hamaker et al., Citation2015) for three measurement waves.

We also write the RI-CLPM for the observed scores Zt (see EquationEquation (21)) (24) (XtYt)=(βXtXt1βXtYt1βYtXt1βYtYt1)(Xt1 Yt1 )+(λXt,U1βXtXt1λXt1,U100λYt,U2βYtYt1λYt1,U2)(U1U2)+(eXteYt)(24)

We refer to the model in EquationEquation (24) with invariant regression coefficients (i.e., BBt,t1) and loadings (i.e., ΛΛt) across time as RL1.

3.2.2. Twodimensional Latent Factor Model with Time-Varying Loadings or Regression Coefficients (RL2, RL3)

It is also possible to freely estimate the loadings in the diagonal matrices Λt. This would allow the time-invariant U1 and U2 in EquationEquation (24) to have time-varying effects. In the following, we refer to this model with freely estimated loadings as RL2. We also consider a model with freely estimated regression coefficients and time-invariant loadings. This model is labeled RL3 and is the original formulation of the RI-CLPM (Hamaker et al., Citation2015). The three different residual-level models are summarized in .

3.3. Fixed Effects Dynamic Panel Model

A further variant of the observation-level approach was proposed by Allison, Williams, and Moral-Benito (Citation2017; see also Moral-Benito, Citation2013; Williams et al., Citation2018). In their dynamic panel model, only the dependent variable Yt is explicitly modeled: (25) Yt=βYtYt1Yt1+βYtXt1Xt1+βt,t1Lt1+γtC+λtU+et   for t2,(25) where βYtYt1 is the effect of the lagged depended variable; βYtXt1 represents the cross-lagged effect; βt,t1 is a 1 × p row vector of regression coefficients; the 1 × k row vector γt contains the time-varying effects of the time-invariant covariates; λt is a 1 × q row vector that contains the time-varying effects of the unit effects U. As in traditional fixed effects models (Wooldridge, Citation2010), the unit effects U are allowed to covary with the time-varying variables Xt and Lt, but need to be uncorrelated with the time-invariant covariates C. Furthermore, the residuals et are allowed to covary with future and concurrent values of Xt, and Lt, that is Cov(eth,Xt)=ϑth,t, and Cov(eth,Lt)= φth,t for h0. Thus, no further assumptions are made for modeling how Xt and the time-varying covariates are related to prior values of the outcome. At t=1, the outcome Y1 is assumed to covary with the unit effects U. Again, this ensures that Y1 is treated as predetermined. In most applications, the λt are assumed to be invariant for t2, and the dimension of U is 1.

3.3.1. Fixed Effects Models with a Unidimensional Latent Factor (FED)

In the bivariate case, the dynamic panel model is given by (26) Yt=βYtYt1Yt1+βYtXt1Xt1+λYt,U1U1+et   for t2,(26) where the cross-lagged effect βYtXt1 is adjusted for the lagged dependent variable Yt1  and the unit effect U1. At the first measurement wave (i.e., t=1), Y1, X1, and U1 are allowed to freely covary with each other. We refer to the model in EquationEquation (26) as FED. The path diagram of the FED for T = 3 is depicted in . In the following, we assume that the regression coefficients βYtYt1 and βYtXt1, as well as the loadings λYt,U1 of the unit effect, are invariant across time (see also ).

Figure 6. Path diagram of the fixed effects dynamic panel model (FED; Allison et al., Citation2017) for three measurement waves.

Figure 6. Path diagram of the fixed effects dynamic panel model (FED; Allison et al., Citation2017) for three measurement waves.

It should be emphasized that the FED does not provide estimates for the effects of Y on X. This has the advantage that no assumptions are made regarding the dependence structure of X on Y.Footnote2 However, if researchers are interested in reciprocal effects, they need to specify a separate model for X that is analogous to EquationEquation (25).

3.4. Relationship between the Residual-Level and Observation-Level Approaches

In this section, we further investigate the relationship between the observation-level and residual-level approaches (see also Andersen, Citation2021; Bollen & Curran, Citation2004; Hamaker, Citation2005; Hsiao, Citation2014; Usami, Citation2021; Usami, Murayama, et al., Citation2019). More specifically, we clarify the conditions under which the two approaches are equivalent. We consider a general residual-level model with occasion-specific, freely estimated loading matrices: (27) Z1=Λ1U+e1(27) (28) Zt=Bt,t1Zt1W+ΛtU+et    for t2(28)

Note that we use Z1W=e1, and that the dimension of U can be different from the dimension of Zt. The general observation-level model is given by (29) Cov(Z1,U˜)=Ψ˜Z1U˜(29) (30) Zt=B˜t,t1Zt1+Λ˜tU˜+e˜t    for t2(30) where B˜t,t1, Λ˜t, Λ˜1, U˜, and e˜t denote the parameters in the observation-level model. An important special case is the observation-level model in which the covariance of Z1 and U˜ is represented as a loading matrix Λ˜1 (see Zyphur et al., Citation2020) (31) Z1=Λ˜1U˜+e˜1(31)

In order to show that in the general case the two models are equivalent, we show that the parameters in EquationEquations (27) and Equation(28) can be represented as parameters in EquationEquations (30) and Equation(31), and vice versa.

3.4.1. Equivalence of General Residual-Level and Observation-Level Models

First, we start with the residual-level model and write EquationEquation (28) as follows (32) Zt=Bt,t1Zt1+(ΛtBt,t1Λt1)U+et   for t2.(32)

By setting B˜t,t1Bt,t1, Λ˜tΛtBt,t1Λt1, U˜U, and e˜tet, we obtain EquationEquation (30) of the observation-level approach. At t=1, we can obviously set Λ˜1Λ1, and e˜1e1 (see EquationEquation (31)). Thus, the model parameters of the general residual-level model can be represented as parameters of the observation-level model.

Next, we start with an observation-level model (see EquationEquations (30) and Equation(31)). We now define Bt,t1B˜t,t1, Λ1Λ˜1, UU˜, and ete˜t. We also set (33) Λt  Λ˜t+B˜t,t1Λt1   for t2(33)

For t=2, we have Λt1=Λ˜1. For t>2, Λt1 is a function of model parameters defined in EquationEquations (30) and Equation(31), that is Λ˜t and B˜t,t1. This shows that the paramaters of the general observation-level model can be re-expressed as parameters of a residual-level model, and the two approaches are equivalent. Note that this would also hold true for the special case that the regression coefficients are invariant across time, that is Bt,t1=B˜t,t1B in EquationEquations (28) and Equation(30).

3.4.2. Representation of a Simple Structure Residual-Level Model as an Observation-Level Model

Next, we consider a residual-level model with time-invariant coefficients B, and a diagonal, time-invariant loading matrix. This is a special version of the RI-CLPM (RL1) and can be written as (34) Z1=ΛU + e1(34) (35) Zt=BZt1+(IB)ΛU+et    for t2.(35)

We show that this model can be represented as an observation-level model with a simple structure loading matrix for t2. To this end, we define a diagonal matrix Λ˜ and latent variables U˜ by (36) Λ˜U˜(IB)ΛU(36)

Note that the only purpose of the diagonal matrix Λ˜ is to standardize the components in U˜. Moreover, we can define for t=1 (37) Λ˜1Λ˜1(IB)1Λ.(37)

As B is typically nondiagonal, the loading matrix Λ˜1 at t=1 in the observation-level model is nondiagonal. Thus, a RI-CLPM with time-invariant regression coefficients (RL1) can be re-expressed as an observation-level model with a nondiagonal loading matrix Λ˜1 (OL2 in EquationEquation (15); see Andersen, Citation2021). However, a RI-CLPM cannot, in general, be written as an observation-level model with a diagonal Λ˜1 (OL3 in EquationEquation (16)).

3.4.3. Representation of a Simple Structure Observation-Level Model as a Residual-Level Model

We now start with an observation-level model (38) Z1=Λ˜1U˜+e˜1(38) (39) Zt=B˜Zt1+Λ˜U˜+e˜t   for t2,(39) where the matrix Λ˜ is assumed to be diagonal (i.e., simple structure model). As pointed out before, the matrix Λ˜1 can be diagonal (OL3 in EquationEquation (16)) or nondiagonal (OL2 in EquationEquation (15)). In order to represent EquationEquations (38) and Equation(39) as a residual-level model, we get defining equations (40) Λ1U=Λ˜1U˜(40) (41) (ΛtBt,t1Λt1)U=Λ˜U˜   for t2(41)

We now set Bt,t1B˜, and obtain by iterating (42) ΛtU=Λ˜U˜+B˜Λt1U=(I+B˜+B˜2++B˜t2)Λ˜U˜+B˜t1Λ1U(42)

We use the matrix Taylor series (IB˜)1I+B˜+B˜2++B˜k if k is large enough. This finding is based on the assumption that B˜k0 for k large enough which holds for stationary processes. If we assume that B˜k0 for all k2, we can use the approximation ΛtU(IB˜)1Λ˜U˜ for t3. Thus, we can assume a simple structure Λt=Λ for t3 with (43) ΛU=(IB˜)1Λ˜U˜(43)

We now set Λ(IB˜)1Λ˜ for t3. For the first two time points, we get (44) Λ1U=Λ˜1U˜=(Λ˜1Λ˜1(IB˜)Λ)U(44) (45) Λ2U=Λ˜U˜+B˜Λ1U=((IB˜)Λ+B˜Λ1)U(45)

We define Λ1(Λ˜1Λ˜1(IB˜)Λ) and Λ2((IB˜)Λ+B˜Λ1). This shows that the observation-level model in EquationEquations (38) and Equation(39) can be approximately represented as a residual-level model with a simple structure loading matrix. However, the loading matrices at the first two time points (Λ1 and Λ2) in this approximation are, in general, not diagonal. Thus, an observation-level model with a simple structure loading matrix cannot be re-expressed as a simple structure residual-level model.

3.4.4. Representation of a Residual-Level Model as a Fixed Effects Dynamic Panel Model

We show that a residual-level model can be represented as a fixed effects dynamic panel model (FED; Allison et al., Citation2017). We start with the bivariate residual-level model in EquationEquation (24), and assume time-invariant regression coefficients (βXX, βXY, βYY, and βYX), and time-invariant loadings (λX,U1 and λY,U2).

The corresponding FED is given by (for t2) (46) Yt=λ˜Y,U˜1U˜1+β˜YYYt1+β˜YXXt1+e˜Yt(46)

We now define λ˜Y,U˜1U˜1(1βYY)λY,U2U2βYXλX,U1U1, where U˜1 is a standardized variable. Furthermore, we set β˜YYβYY, β˜YXβYX, and e˜YteYt for t2. Thus, the RI-CLPM can be represented as a FED. However, as the FED does not specify restrictions regarding the dependence of X on Y, the FED cannot, in general, be represented as an observation-level model (Allison et al., Citation2017), which implies that it can also not, in general, be represented as a residual-level model.

3.4.5. Summary

Overall, the main results of this section can be summarized as follows. First, the general observation-level and residual-level approaches with freely estimated loadings are equivalent if the same number of latent variables is utilized (i.e., dimension of U is the same). Second, residual-level models with a simple structure time-invariant loading matrix and time-invariant regression coefficients can be represented as a simple structure observation-level model with time-invariant regression coefficients. However, this is only the case if the loading matrix at t=1 in the observation-level model is nondiagonal (see OL2), and not if it is diagonal (see OL3). Third, the simple structure observation-level model with time-invariant regression coefficients cannot be re-expressed as a simple structure residual-level model (see EquationEquations (44) and Equation(45)), indicating that in the case of simple structure loading matrices, the observation-level approach is the more general approach (see Andersen, Citation2021). Fourth, both observation-level and residual-level models can be re-expressed as a fixed effects dynamic panel. However, in general, this is not possible the other way round (see Allison et al., Citation2017).

4. Estimation of Cross-Lagged Effects under Different Data-Generating Models

We now use simulated data to illustrate the conditions under which the different modeling approaches produce unbiased estimates of the cross-lagged effect. We distinguish four different scenarios for a cross-lagged panel design with three measurement waves and two variables (i.e., X and Y; see ). In Scenario A, we assumed that the true data-generating model was a CL2. In Scenario B, we assumed that the true model was an observation-level model with one latent variable (OL1). In Scenario C, the data were generated by residual-level models (RL1, RL2, and RL3). Finally, in Scenario D, we assumed that the true model was a fixed effects dynamic panel model (FED). For each scenario, we present the results for different data conditions generated under different parameters of the data-generating models. As we were only interested in the (large-sample) bias of the parameter estimates, we simulated only one large data set (N = 10000) for each data condition. In all scenarios, the variables (i.e., X1, X2, X3, Y1, Y2, Y3) were standardized with zero means and variances of one. Our discussion focuses on the estimation of the (lag-1) cross-lagged effect of X2 on Y3 (i.e., βY3X2). The R and lavaan (Rosseel, Citation2012) code for the data-generating models and the different analysis models is provided at https://bit.ly/3IN1FCi.

Figure 7. Four different simulation scenarios for a cross-lagged panel design with three measurement waves. Scenario A: true model is a cross-lagged panel model with lag-2 effects (CL2). Scenario B: true model is an observation-level model with a single latent variable (OL1). Scenario C: true model is a residual-level model. Scenario D: true model is a fixed effects dynamic panel model (FED).

Figure 7. Four different simulation scenarios for a cross-lagged panel design with three measurement waves. Scenario A: true model is a cross-lagged panel model with lag-2 effects (CL2). Scenario B: true model is an observation-level model with a single latent variable (OL1). Scenario C: true model is a residual-level model. Scenario D: true model is a fixed effects dynamic panel model (FED).

4.1. Scenario A: CL2 as the Data-Generating Model

In Scenario A, we assumed that the data were generated by a CL2 (see EquationEquation (9)). More specifically, we assumed a stationary process that fulfills a cross-lagged panel model with lag-2 effects (i.e., the lag-1 covariances were assumed to be constant for each data condition). We manipulated the lag-2 covariances by specifying different values for the lag-2 cross-lagged effects (i.e., βY3X1), and the lag-2 autoregressive effects (i.e., βY3Y1). This resulted in six different data conditions in which the synchronous and lag-1 correlations were constant, but the lag-2 correlations differed. We analyzed the six data sets with the cross-lagged panel model (CL1 and CL2), the four observation-level models (OL1, OL2, OL3, and OL4), the three residual-level models (RL1, RL2, and RL3), and the fixed effects dynamic panel model FED.

As expected, the CL2 that includes lag-2 effects provided unbiased estimates of the cross-lagged effect under all data conditions in this scenario (see ). The CL1 produced positively biased estimates and overestimated the size of the true cross-lagged effect. For example, in condition A2 with small lag-2 cross-lagged effects (i.e., βY3X1=.01), and a substantial lag-2 autoregressive effect (i.e., βY3Y1=.32), the true cross-lagged effect is substantially overestimated in the CL1 (.20 vs. .10). This illustrates that the estimates of the CL1 can be strongly distorted by the presence of delayed effects that are not adequately captured by the lag-1 effects (VanderWeele et al., Citation2020; see also Marsh et al., Citation2018). Note that the estimated cross-lagged effects in the CL1 did not change across the data conditions because the lag-1 correlations were fixed, and only the lag-2 correlations were manipulated.

Table 2. Results for Scenario A: True model is the cross-lagged panel model with lag-2 effects (CL2). Estimates of the cross-lagged effect for the different approaches.

Most importantly, the models that include additional latent variables to adjust for unmeasured confounding produced biased estimates that were sometimes too large and sometimes too small. For example, the estimates of the residual-level models strongly depended on the size of the lag-2 effects, and the RL3 tended to underestimate the magnitude of the true cross-lagged effect. In condition A2 (i.e., βY3X1=.01, and βY3Y1=.32), the estimate provided by the RL3 even had a different sign than the true cross-lagged effect (–.04 vs. .10). The bias of the residual-level models (RL1, RL2, and RL3) completely vanished when no lag-2 cross-lagged effect was present in the condition A6 (i.e., βY3X1=0).

Overall, scenario A confirms findings from the methodological literature that the CL1 can positively bias estimates of cross-lagged effects because it provides insufficient control for confounding due to the lag-2 effects. It also shows that the additional latent variables in the observation-level and residual-level models capture these lag-2 effects. However, in general, they do not appropriately control for the confounding due to previous measures of the exposure and the outcome, resulting in biased estimates of cross-lagged effects.

4.2. Scenario B: OL1 as the Data-Generating Model

In Scenario B, we assumed that an observation-level model with a single latent variable is the data-generating model (OL1; see EquationEquation (14)). We fixed the true value of the cross-lagged effect (βY3X2=.20) and manipulated the effect of the latent variable U1 on the observed variables by assuming that the correlation between U1 and Y decreased by a factor of .50 (ρY1U1=.50, ρY2U1=.25, and ρY3U1=.13), was constant across time (ρY1U1=ρY2U1=ρY3U1=.50), or increased by a factor of 2 (ρY1U1=.05, ρY2U1=.10, and ρY3U1=.20) or 3 (ρY1U1=.05, ρY2U1=.15, and ρY3U1=.45). In addition, we assumed that the correlation between U1 and X decreased by a factor of .5 (ρX1U1=.30, ρX2U1=.15, and ρX3U1=.08) or was constant across time (ρX1U1= ρX2U1= ρX3U1=.30). Overall, this resulted in six data conditions.

shows the results for the six data conditions. As expected, the observation-level model with a single latent variable (OL1) produced unbiased estimates of the cross-lagged effect. By contrast, the models that assume no unmeasured confounding (CL1 and CL2) and the other latent variable-type models provided positively or negatively biased estimates of the cross-lagged effect, depending on the effect of the latent variable U1 in the true data-generating model. For example, in condition B6 in which U1 had an increasing effect across time on Y, the estimates of CL1 and CL2 were positively biased (CL1: .29, CL2: .27), while many of the estimates provided by the latent variable-type models were negatively biased (OL2: .16, OL3: .17, RL2: .05, FED: −.01). The estimate of the RL1 was unbiased in this condition but was too small in condition B4, in which U1 had a decreasing effect across time on Y. Interestingly, the estimates produced by the observation-level model OL4 were almost unbiased. This indicates that, at least in the investigated conditions, the freely estimated loadings in OL4 can capture the time-varying (linear) effect of a single unmeasured variable.

Table 3. Results for Scenario B: True model is the observation-level model with single latent variable (OL1). Estimates of the cross-lagged effect for the different approaches.

Table 4. Results for Scenario C: True Model is the residual-level (RL) Model. Estimates for the cross-lagged effect for the different approaches.

4.3. Scenario C: Residual-Level Model as the Data-Generating Model

In Scenario C, we assumed that a residual-level model is a data-generating model with a true cross-lagged effect of βY3X2=.20. The correlation of the latent variables was set to .6 (i.e., ρU1U2=.60), We manipulated the variance of the between-person part of X (i.e., ϕU1=.30, .50, and .70), the cross-lagged effect of X1 on Y2 (βY2X1=.20 vs. βY2X1=.05), and the loadings of Y on U2 (λY1,U2=λY2,U2=λY3,U2=1.00 vs. λY1,U2=1.00, λY2,U2=.80, and λY3,U2=.64). This resulted in nine different data conditions in which the regression coefficients and loadings were invariant across time (C1, C2, and C3), the regression coefficients were different, but the loadings were the same across time (C4, C5, and C6), and the regression coefficients were invariant, but the loadings differ across time (C7, C8, and C9).

As expected, the residual-level models could produce unbiased estimates of the cross-lagged effect in all nine conditions (). However, the estimates of RL1 and RL2 were biased when the true regression coefficients were not invariant across time (conditions C4, C5, C6), and only RL3 with freely estimated loadings provided unbiased estimates when the true loadings were allowed to vary across time in the data-generating model (conditions C7, C8, C9). As shown in the section “Relationship between the Residual-Level and Observation-Level Approaches”, the OL2 and FED are re-expressions of the residual-level models and produced identical estimates. However, this is no longer the case if the OL2 and FED are misspecified in conditions C4 to C9. Furthermore, the observation-level models with a simple structure loading matrix at time 1 (OL3 and OL4) produced negatively biased estimates of the cross-lagged effect across all conditions.

The estimates of the CL1 were consistently too small and underestimated the true magnitude of the cross-lagged effect, particularly when the variance of the between-person parts was large (i.e., ϕU1=.7). These findings replicated the results from Hamaker et al. (Citation2015) and other simulation studies (Usami, Todo, et al., Citation2019) that the CL1 can produce distorted estimates of cross-lagged effects when the true data-generating model is an RL3. In addition, the CL2 that includes the lag-2 effects also produced estimates that underestimated the true magnitude of the cross-lagged effect. The bias for the CL2 was smaller but followed a similar pattern as the bias of the CL1.

4.4. Scenario D: FED as the Data-Generating Model

In Scenario D, we assumed that the fixed effects dynamic panel model FED is the true model. The data were generated from an observation-level model with two latent variables that were assumed to be correlated (ρU1U2=.60). We manipulated the variance of U1 (i.e., ϕU1=.50 and .70), the loading of X3 on U1 (i.e., λX3,U1=0.15 and .50), and the lag-2 effect of Y1 on X3 (βX3Y1=.0 and .07). This resulted in eight different conditions in which we varied the complexity in the X part of the data by allowing for different loadings of X on U1 across time (conditions D3 and D4), additional lag-2 effects of Y on X (conditions D5 and D6), or both (conditions D7 and D8).

shows that the FED produced unbiased estimates of the cross-lagged effect in all conditions. However, the estimates of the observation-level models (OL2, OL3, and OL4) were only unbiased if the loadings were invariant and no lag-2 effects were present in the data-generating model (conditions D1 and D2). This clearly illustrates that the FED can be interpreted as a more robust version of the observation-level models, and that the FED is to be prefered if researchers are only interested in estimating the cross-lagged effect but not in modeling the joint process of X and Y. The CL1 and CL2 overestimated the true magnitude of the cross-lagged effect and produced positively biased estimates across all conditions. Furthermore, the estimates of the residual-level models were strongly biased, even in the conditions with invariant loadings and no lag-2 effects (D1 and D2).

Table 5. Results for Scenario D: True model is the fixed effects dynamic panel model (FED). Estimates for the cross-lagged effect for the different approaches.

4.5. Summary

The main findings of the simulations can be summarized as follows. First, when the CL2 was the true model (Scenario A), the estimates of the different latent variable-type models were biased in many conditions, indicating that the additional latent variables did not appropriately control for the lag-2 effects in the data-generating model. This shows that the potential to adjust for the effects of unmeasured variables comes with the price that the estimates of the latent variable methods are very sensitive to the parametric modeling assumptions needed for identifying the effects of the latent variables. Second, CL1 and CL2 produced biased estimates in the three scenarios in which unmeasured latent variables were included in the data-generating models (Scenarios B, C, and D). This was expected for CL1 and CL2 because these models rely on the assumption of no unmeasured confounding (i.e., all relevant covariates are measured). However, the results in these scenarios also show that the latent variable methods (OL1, OL2, OL3, OL4, RL1, RL2, RL3, and FED) strongly depended on the specific modeling assumptions made for estimating the effects of U. If these assumptions did not correspond with the data-generating model, they produced, in general, biased estimates of the cross-lagged effect. In practical applications with real data, the decision between the different approaches cannot be guided by model fit because different model specifications (with different estimates of the cross-lagged effect) will often provide an adequate description of the data (see Section “Limitations of Model Fit for Estimating Cross-Lagged Effects”). Third, the FED that is agnostic about the process of the exposure variable X was less sensitive to specific modeling assumptions (see Allison et al., Citation2017) when estimating cross-lagged effects in the presence of unmeasured confounding. The estimates of the FED were biased in scenarios (B5, B6, and C4 to C9), in which specific parameter restrictions were needed for identifying the model (i.e., equal regression coefficients and loadings across time). With more measurement waves, these restrictions can be relaxed, and the FED would provide unbiased estimates in Scenarios B, C, and D. However, this would not resolve the central issue of scenario A because the FED still needs to make assumptions about the process of Y for identifying the unit effects of U1.

5. The Role of Time for Estimating the Causal Cross-Lagged Effect

In this section, we discuss the crucial role of selecting time points in defining the cross-lagged effect of interest (e.g., Gollob & Reichardt, Citation1987). We assume that the target of inference is the causal cross-lagged effect of X2 on Y3, and we adjust for X1, Y1, and Y2 in defining τTX,Y,ACE, that is TX={1} and TY={1,2}. In the following, we assume that there are no unmeasured confounders. The bivariate process Zt=(Xt,Yt) (for tR) is allowed to have an arbitrary dependency structure. We assume that Zt follows a Gaussian process. Hence, the vector Z=(Zt1,,ZtH) for H discrete time points t1,,tH is multivariate normally distributed. The only requirement is that the values of Zt only depend on the past; that is, they depend on Zs with s<t. It is important to emphasize that a continuous-time process Zt should be considered as the underlying data-generating model. However, a longitudinal design of discrete time points is chosen to define model parameters (i.e., including the causal effect) of interest. Notably, a finite-dimensional parameter of a discrete-time process in a longitudinal design is a summary of the continuous-time process Zt that can even depend on an infinite-dimensional parameter. In the following, we discuss the implications of choosing a particular longitudinal design by demonstrating that the model parameters in this design are functions of model parameters in a model with a refined grid of time points. By doing so, we clarify that the chosen time lags for defining the causal effect determine which effects (i.e., pathways) are controlled and which are part of the definition of the causal effect.

It is essential to understand that τTX,Y,ACE includes all effects of X2 on Y3 that are transmitted via intermediated pathways between X2 and Y3. This is illustrated in (upper panel), in which we consider the process at a refined grid of time points 1, 2, 2.5, and 3. As can be seen, the total effect of X2 on Y3 is transmitted via pathways X2Y2.5Y3, X2X2.5Y3 and X2Y3 (Keogh et al., Citation2018). More formally, the causal cross-lagged effect contains a direct effect γY3X2 and two indirect effects: (47) τTX,Y,ACE=γY3X2+γY3X2.5γX2.5X2+γY3Y2.5γY2.5X2.(47)

Figure 8. Process of Xt and Yt considered at time points 1, 2, 2.5, and 3 (upper panel), and time points 1, 2, 2.33, 2.67, and 3 (lower panel). Thick paths are included in the causal cross-lagged effect of X2 on Y3 (with adjustment for X1, Y1, and Y2).

Figure 8. Process of Xt and Yt considered at time points 1, 2, 2.5, and 3 (upper panel), and time points 1, 2, 2.33, 2.67, and 3 (lower panel). Thick paths are included in the causal cross-lagged effect of X2 on Y3 (with adjustment for X1, Y1, and Y2).

In most applications, the total effect τTX,Y,ACE will differ from the direct effect γY3X2 as well as the coefficient γY3X2.5, which is based on a shorter time lag. This highlights the fact that the causal cross-lagged effect is independently defined from a particular process model, and that the causal effect of interest will not necessarily correspond to a regression coefficient in a well-fitting process model. To put it more clearly, it is not a question of model fit whether a researcher should report the total effect τTX,Y,ACE from EquationEquation (47) (referring to X2Y3 with a time lag of 1) or γY3X2.5 (referring to X2.5Y3 with a time lag of 0.5).

The cross-lagged effect τTX,Y,ACE can also be transmitted through pathways that include effects of Y on X. We also consider the process at time points 1, 2, 2.33, 2.67, and 3 (see lower panel in ). As can be seen, the indirect effect via X2Y2.33X2.67Y3 would be part of the total effect τTX,Y,ACE. Thus, the cross-lagged effect at a single point in time targets the total effect that is transmitted through all (indirect and direct) paths between X2 and Y3 (Keogh et al., Citation2018).

The longitudinal design also plays a crucial role when adjusting for previous exposure and outcome measures. We now consider the process at time points 1, 1.5, 2, and 3 (see ). As can be seen, with adjustment for X1, Y1, and Y2, the cross-lagged effect τTX,Y,ACE includes effects of X1.5 (i.e., X2X1.5Y3) and Y1.5 (i.e., X2Y1.5Y3) if they exist. The causal effect can be calculated as (48) τTX,Y,ACE=γY3X2+γY3X1.5γX2X1.5+γY3Y1.5γX2Y1.5(48)

Figure 9. Process of Xt and Yt considered at time points 1, 1.5, 2, and 3. Thick paths are included in the causal cross-lagged effect of X2 on Y3 (with adjustment for X1, Y1, and Y2).

Figure 9. Process of Xt and Yt considered at time points 1, 1.5, 2, and 3. Thick paths are included in the causal cross-lagged effect of X2 on Y3 (with adjustment for X1, Y1, and Y2).

Given this reasoning, it might be tempting to adjust for additional covariates X1.5 and Y1.5; that is, one would utilize TX={1,1.5} and TY={1,1.5,2} for defining an alternative causal effect. In this case, the prior levels of the exposure (i.e., X1.5) would be treated as a confounder, and the causal effect would be γY3X2, which typically differs from τTX,Y,ACE computed in EquationEquation (48). As a consequence, the decision of a researcher to choosing TX={1} instead of TX={1,1.5} implies that in the former case, intermediate paths (i.e., the path X2X1.5Y3) are also part of the causal effect of interest. Only controlling for X1 does not rule out the possibility that intermediate exposures Xu with 1<u<2 also contribute to the causal effect, where the contribution strongly depends on the correlation of Xu with X2. This tradeoff in deciding whether to adjust for a previous exposure variable cannot be resolved by statistical modeling techniques (see VanderWeele et al., Citation2020, for a discussion of reasons to control for prior measures of X). For example, even in the case that a previous measure of the exposure received a statistically significant coefficient in a particular (process) model, it is still a conceptual question whether this covariate should serve as a control variable in the definition of the causal effect τTX,Y,ACE.

One interesting feature of and is that the time grid can be arbitrarily refined. One may argue that this is a perfect showcase for continuous-time structural equation modeling (CTSEM; van Montfort et al., Citation2018), and that the challenge of selecting appropriate time points can be circumvented by using CTSEM. However, we are less convinced about the benefits of CTSEM because most of the applications discussed in the methodological literature only consider CTSEM relying on the Markov property (i.e., CTSEM without memory), which boils down to a specification of a lag-1 model with a tiny time lag. Thus, we believe that CTSEM with the Markov property cannot adequately model the dependence structure of time points and will typically be misspecified. Given these limitations, whether CTSEM offers significant advantages over discrete-time models in assessing causal cross-lagged effects can be questioned.

6. Limitations of Model Fit for Estimating Cross-Lagged Effects

In this section, we discuss whether the assessment of model fit could help to choose between the different modeling approaches for estimating cross-lagged effects. Our main argument is that the assessment of model fit is only of limited usefulness because the different modeling approaches provide almost equivalent representations of the observed covariance structure. As the residual-level and observation-level models are very similar (see section 3.4 “Relationship Between the Residual-Level and Observation-Level Approaches”), we restrict our discussion to the residual-level approach.

Under a multivariate normal distribution, only the fit of an observed covariance matrix S to a model-implied covariance matrix Σ is investigated. In the residual-level approach, the covariance structure of the observations at all time points, that is Σ = Var(Z) for Z=(Z1,,ZT) is decomposed as follows (49) Σ=ΣB+ΣW,(49) where ΣB=Var(ZB) is the covariance matrix of the between-person part, that is ZB = (Z1B,,ZTB) in EquationEquation (18), and ΣW=Var(ZW) of the within-person part, that is ZW = (Z1W,,ZTW) in EquationEquations (19) and Equation(20). The causal cross-lagged effect is determined from the within-person part and, hence, is a parameter that affects the model-implied constrained covariance matrix ΣW. For evaluating the assessment of global model fit, it is crucial that the model specification of the between-person part of the model is typically not independent of the specification of the within-person part. More specifically, a similar overall model fit for the total covariance Σ is obtained if the model for the between-person part is made more complex (e.g., by including time-varying loadings or random slopes), while the model for the within-person part is less complex (e.g., by specifying invariant lag-1 effects). If distinct freely estimated model parameters refer to ΣB and ΣW, then a total number of degrees of freedom df0 associated with the population covariance Σ can be used for modeling the between-person part (i.e., using pB parameters) and the within-person part (i.e., using pW parameters). The resulting degrees of freedom df for the fitted residual-level model is given by (50) df=df0pBpW.(50)

Thus, if the number of parameters pB for the between-person part is increased, the number of within-person part parameters pW needs to be reduced to obtain the same number of degrees of freedom df.Footnote3 This illustrates that the modeled complexity of one part of the model typically affects the required complexity of the other part of the model. However, the regression parameters for the within-person part will have different interpretations, depending on the specification of the model of the between-person and within-person parts, even though the overall model fit is the same for the different specifications (see Usami, Murayama, et al., Citation2019).

The limited usefulness of model fit was also evident in Scenario A of the simulation in which the CL2 was the true data-generating model. In this scenario, the residual-level model RL3 included stable traits with time-invariant loadings for the between-person part and assumed an AR(1) process for the within-person part. The CL2 did not model the between-person structure but assumed an AR(2) process. Both models showed a very similar fit, but the parameter estimates were very different. In such a constellation, the discussion of bias depends on how the true data-generating process is defined, and researchers cannot rely on model fit to choose between the CL2 or RL3. Consequently, the bias of a causal effect estimate from a particular model is entirely unrelated to its model fit.

It should be noted that the residual-level models contain some degrees of freedom and can be rejected based on model fit. However, in an application, researchers would modify their particular specification of the residual-level model to obtain a better-fitting model (e.g., freely estimated loadings). Successively using this model modification strategy, one will get a saturated model with a perfect fit or a non-saturated model with a model fit close to the CL2 model. Then, the modified residual-level model has (approximately) the same model fit as the CL2, but the cross-lagged effects in both models substantially differ. Hence, statistical techniques (and model fit in particular) cannot resolve how to define the causal effect of interest.

Furthermore, this reasoning does not change with the availability of (many) more measurement points. For example, the RL1 with an AR(2) process for the within-person part could be equivalent to a cross-lagged panel model with an AR(3) process. Alternatively, random slopes (e.g., growth factors) could be included in the between-person part for the RL1, while an AR(1) process is specified in the within-person part. Overall, this illustrates how different parametric assumptions in latent variable models for unmeasured confounding can result in equivalent representations of the observed covariances while producing cross-lagged effects with different interpretations.

Finally, it should be added that the limited value of model fit also applies to models that rely on the assumption of no unmeasured confounding. For example, a researcher can be interested in assessing the causal effect with a CL1 model, even in the case that the CL2 model provides a better model fit than the CL1 for modeling the process (Xt,Yt) for t2. If the researcher believes—based on subject-matter knowledge—that adjusting for Xt2 would result in overadjustment, the better model fit of the CL2 can be ignored, and the cross-lagged effect from the CL1 should be utilized for assessing the causal effect of interest. Similarly, the existence of stable trait factors in a process model for X and Y does not imply that they need to be treated as confounders in defining the causal cross-lagged effect. We believe these decisions need to rely on subject-matter knowledge and should not be grounded on model fit.

7. Concluding Remarks

In the present article, we discussed different approaches for estimating cross-lagged effects with a cross-lagged panel design. We applied a causal inference perspective and distinguished between models that assume all relevant covariates are measured (CL1 and CL2) and latent variable-type models that used parametric assumptions to adjust for the effects of unmeasured time-invariant confounding variables. This final section highlights issues that need consideration when choosing between the different approaches in a specific application.

We confirmed with simulated data the well-known fact (e.g., Finkel, Citation1995; Little, Citation2013) that the CLPM provides biased estimates of cross-lagged effects if the assumption of no unmeasured confounding is violated (i.e., included covariates are not sufficient to remove confounding). However, it is less emphasized in the methodological literature that it is often beneficial to include lag-2 effects (i.e., CL2) when estimating cross-lagged effects. The main advantage of including lag-2 effects is that allowing for these additional effects of prior measures of X and Y can provide a stronger control for the presence of confounding. However, if researchers believe that the prior levels of the exposure (i.e., X1) would not act as a confounder but can be considered an essential part of X2, controlling for prior levels of the exposure X would result in estimates of cross-lagged effects that may be overadjusted by the prior levels of the exposure and difficult to interpret (see VanderWeele et al., Citation2020, for a discussion of reasons to control for prior measures of X).

The simulations also showed that the different latent variable-type models have the potential to adjust for the effects of time-invariant unmeasured confounding by including additional latent variables in the model. However, the possibility to adjust for unmeasured confounding comes with the price of additional restrictive parametric assumptions for modeling the process of Zt (i.e., linearity and restrictions on the dependence structure). The simulation results revealed that the performance of the different latent variable-type methods strongly depended on the parametric assumptions that were used for identifying the effects of the additional latent variables. By contrast, no modeling assumptions about the process of Zt need to be made in the approach that relies on the assumption of no unmeasured confounding because only the treatment-effect function (i.e., the conditional expectation of the outcome given the covariates; see EquationEquation (2)) needs to be modeled. However, as pointed out before, no unmeasured confounding is a strong assumption that is often hard to justify. On the other hand, claiming that a particular latent variable-type model controls for unmeasured confounding is frequently an equally strong assumption. Finally, the approaches that we discussed can also be applied when a causal interpretation of the estimated effect is not warranted. In this case, the cross-lagged effects can be interpreted as “adjusted associations” (Brumback, Citation2021), which are still valuable for many applications. We consider it a major advantage of the causal inference perspective that it forces researchers to think about the main target of inference and the role of potential confounding variables.

We need to mention that our discussion of the cross-lagged panel design has several limitations. First, as pointed out before, we focused on the causal effect of a variable at a single point in time (i.e., the effect of X2 on Y3). In the typical cross-lagged panel design, the variable X varies across time, and it would also be possible to estimate the joint or cumulative effect of X1 and X2 on Y3. These cumulative effects are rarely investigated in psychological research. One of the few exceptions is VanderWeele et al. (Citation2011), who investigated the cumulative effects of loneliness on depression using a cross-lagged panel design with five measurement waves (see also Silvey et al., Citation2021). However, it should be added that the assumptions for identifying the cumulative effect of a treatment trajectory over a series of time points are more demanding than for the effect at a single point in time (Hernán & Robins, Citation2020; Robins et al., Citation2000; see also Daniel et al., Citation2013). The main challenge is to adequately control for time-varying confounders (e.g., prior measures of the outcome Y2 that may also be affected by prior measures of the exposure X1). This is an active area of methodological research in longitudinal causal inference (e.g., Keogh et al., Citation2018; Newsome et al., Citation2018; Wodtke, Citation2020).

Second, we assumed that the variables were measured without error. In most practical applications, both X and Y will be affected by measurement error, resulting in biased estimates of regression coefficients. Researchers often use multiple indicators to control for measurement error (i.e., internal consistency error) if psychological constructs are measured by multiple items. Multiple indicators can be easily included in the models discussed in the present paper (e.g., Little, Citation2013; Mulder & Hamaker, Citation2021). However, internal consistency measures of reliability only focus on measurement error caused by the finite number of items on the test. It would be possible to extend the models to account for other sources of error (e.g., short-term fluctuations of the measure or transient error; McCrae, Citation2015) when estimating cross-lagged effects with error-prone variables (Heise, Citation1969; Kenny & Zautra, Citation1995). However, the main arguments for choosing between the different modeling approaches are independent of the issue of handling measurement error.

Finally, in practical applications, the selection of covariates can be challenging (VanderWeele, Citation2019). Even experts with subject-matter knowledge will often disagree on whether a specific variable should be treated as a confounder or would result in overadjustment (if included in the adjustment set). One reasonable strategy would be to juxtapose the different approaches and use their estimates (or the lower and upper bounds of the respective confidence intervals) as bounds for the true cross-lagged effect (Leamer, Citation1985). This would reveal how sensitive the conclusions are to different assumptions about confounding variables. Estimating cross-lagged effects under model uncertainty is an interesting topic for future research (see also Athey & Imbens, Citation2015; Young & Holsteen, Citation2017).

Notes

1 Usually, a second assumption is added (positivity assumption) which states that in the population, the density of observing any exposure level given the covariates is positive. This assumption implies that there exists sufficient overlap in the covariate distributions between the different exposure levels under consideration.

2 Allison et al. (Citation2017, p. 3) write: “And not having to specifiy the functional form of the dependence of x on y both simplifies the estimation problem and reduces the danger of misspecification. If you are interested in the dependence of x on y, you can always specify a second dynamic panel model for y and estimate that separately.”

3 Note that this is different from multilevel structural equation modelling where the level-1 units are treated as exchangeable, and separate saturated covariance structures exist at level 1 and level 2. In this case, the number of estimated parameters at one level (e.g., level 2) does not affect the number of estimable parameters at the other level (e.g., level 1), and level-specific evaluations for ΣB and ΣW are possible and recommended (Ryu & West, Citation2009).

References

  • Allison, P. D., Williams, R., & Moral-Benito, E. (2017). Maximum likelihood for cross-lagged panel models with fixed effects. Socius, 3, 237802311771057. https://doi.org/10.1177/2378023117710578
  • Andersen, H. K. (2021). Equivalent approaches to dealing with unobserved heterogeneity in cross-lagged panel models? Investigating the benefits and drawbacks of the latent curve model with structured residuals and the random intercept cross-lagged panel model. Psychological Methods, Advance online publication. https://doi.org/10.1037/met0000285
  • Angrist, J. D., & Pischke, J.-S. (2009). Mostly harmless econometrics: An empiricist’s companion. Princeton University Press. https://doi.org/10.1515/9781400829828
  • Aronow, P. M., & Miller, B. T. (2019). Foundation of agnostic statistics. Cambridge University Press. http://dx.doi.org/10.1017/9781316831762
  • Asendorpf, J. B. (2021). Modeling developmental processes. In J. R. Rauthmann (Ed.), Handbook of personality dynamics and processes (pp. 815–835). Elsevier. https://doi.org/10.1016/B978-0-12-813995-0.00031-5
  • Asparouhov, T., & Muthén, B. (2021). Residual structural equation models. Retrieved from https://www.statmodel.com/download/Asparouhov_Muthen_2021a.pdf
  • Athey, S., & Imbens, G. W. (2015). A measure of robustness to misspecification. American Economic Review, 105, 476–480. https://doi.org/10.1257/aer.p20151020
  • Athey, S., & Imbens, G. W. (2019). Machine learning methods that economists should know about. Annual Review of Economics, 11, 685–725. https://doi.org/10.1146/annurev-economics-080217-053433
  • Bai, J., & Li, K. (2014). Theory and methods of panel data models with interactive effects. The Annals of Statistics, 42, 142–170. https://doi.org/10.1214/13-AOS1183
  • Bailey, D. H., Oh, Y., Farkas, G., Morgan, P., & Hillemeier, M. (2020). Reciprocal effects of reading and mathematics? Beyond the cross-lagged panel model. Developmental Psychology, 56, 912–921. https://doi.org/10.1037/dev0000902
  • Berk, R. A., Brown, L., Buja, A., George, E., Pitkin, E., Zhang, K., & Zhao, L. (2014). Misspecified mean function regression: Making good use of regression models that are wrong. Sociological Methods and Research, 43, 433–451. http://dx.doi.org/10.1177/0049124114526375
  • Berry, D., & Willoughby, M. T. (2017). On the practical interpretability of cross-lagged panel models: Rethinking a developmental workshose. Child Development, 88, 1186–1206. https://doi.org/10.1111/cdev.12660
  • Bollen, K. A., & Brand, J. E. (2010). A general panel model with random and fixed effects: A structural equations approach. Social Forces, 89, 1–34. https://doi.org/10.1353/sof.2010.0072
  • Bollen, K. A., & Curran, P. J. (2004). Autoregressive latent trajectory (ALT) models a synthesis of two traditions. Sociological Methods & Research, 32, 336–383. https://doi.org/10.1177/0049124103260222
  • Brumback, B. A. (2021). Fundamentals of causal inference with R. Chapman and Hall. http://dx.doi.org/10.1201/9781003146674
  • Curran, P. J., & Hancock, G. R. (2021). The challenge of modeling co-developmental processes over time. Child Development Perspectives, 15, 67–75. http://dx.doi.org/10.1111/cdep.12401
  • Daniel, R. M., Cousens, S. N., De Stavola, B. L., Kenward, M. G., & Sterne, J. A. C. (2013). Methods for dealing with time-dependent confounding. Statistics in Medicine, 32, 1584–1618. https://doi.org/10.1002/sim.5686
  • De Vos, I., & Everaert, G. (2021). Bias-corrected common correlated effects pooled estimation in dynamic panels. Journal of Business & Economic Statistics, 39, 294–306. https://doi.org/10.1080/07350015.2019.1654879
  • Dietvorst, E., Hiemstra, M., Hillegers, M. H. J., & Keijsers, L. (2018). Adolescent perceptions of parental privacy invasion and adolescent secrecy: An illustration of Simpson’s paradox. Child Development, 89, 2081–2090. https://doi.org/10.1111/cdev.13002
  • Dishop, C. R., & DeShon, R. P. (2021). A tutorial on Bollen and Brand’s approach to modeling dynamics while attending to dynamic panel bias. Psychological Methods, Advance online publication. https://doi.org/10.1037/met0000333
  • Ehm, J.-H., Hasselhorn, M., & Schmiedek, F. (2019). Analyzing the developmental relation of academic self-concept and achievement in elementary school children: Alternative models point to different results. Developmental Psychology, 55, 2336–2351. https://doi.org/10.1037/dev0000796
  • Finkel, S. (1995). Causal analysis with panel data. Sage. http://dx.doi.org/10.4135/9781412983594
  • Gollob, H. F., & Reichardt, C. S. (1987). Taking account of time lags in causal models. Child Development, 58, 80–92. http://dx.doi.org/10.2307/1130293
  • Grimm, K. J., Helm, J., Rodgers, D., & O′Rourke, H. (2021). Analyzing cross-lag effects: A comparison of different cross-lag modeling approaches. New Directions for Child and Adolescent Development, 2021, 11–33. http://dx.doi.org/10.1002/cad.20401
  • Hamaker, E. L. (2005). Conditions for the equivalence of the autoregressive latent trajectory model and a latent growth curve model with autoregressive disturbances. Sociological Methods & Research, 33, 404–416. http://dx.doi.org/10.1177/0049124104270220
  • Hamaker, E. L., Kuiper, R. M., & Grasman, R. P. P. P. (2015). A critique of the cross-lagged panel model. Psychological Methods, 20, 102–116. http://dx.doi.org/10.1037/a0038889
  • Heise, D. R. (1969). Separating reliability and stability in test-retest correlation. American Sociological Review, 34, 93–101. http://dx.doi.org/10.2307/2092790
  • Hernán, M. A., & Robins, J. M. (2020). Causal inference: What if. Chapman & Hall/CRC.
  • Hirano, K., & Imbens, G. W. (2004). The propensity score with continuous treatments. In A. Gelman & X.-L. Meng (Eds.), Applied Bayesian modeling and causal inference from incomplete-data perspectives (pp. 73–84). Wiley. http://dx.doi.org/10.1002/0470090456.ch7
  • Hsiao, C. (2014). Analysis of panel data (3rd ed.). Cambridge University Press. http://dx.doi.org/10.1017/CBO9781139839327
  • Imbens, G. W. (2004). Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and Statistics, 86, 4–29. http://dx.doi.org/10.1162/003465304323023651
  • Imbens, G. W., & Rubin, D. B. (2015). Causal inference for statistics, social, and biomedical sciences. Cambridge University Press. https://doi.org/10.1017/CBO9781139025751
  • Kenny, D. A., & Zautra, A. (1995). The trait-state-error model for multiwave data. Journal of Consulting and Clinical Psychology, 63, 52–59. http://dx.doi.org/10.1037/0022-006X.63.1.52
  • Keogh, R. H., Daniel, R. M., VanderWeele, T. J., & Vansteelandt, S. (2018). Analysis of longitudinal studies with repeated outcome measures: Adjusting for time-dependent confounding using conventional methods. American Journal of Epidemiology, 187, 1085–1092. http://dx.doi.org/10.1093/aje/kwx311
  • Kessler, R. C., & Greenberg, D. F. (1981). Linear panel analysis: Models of quantitative change. Academic Press.
  • Leamer, E. E. (1985). Sensitivity analysis would help. The American Economic Review, 75, 308–313.
  • Little, T. D. (2013). Longitudinal structural equation modeling. Guilford Press.
  • Littlefield, A. K., King, K. M., Acuff, S. F., Foster, K. T., Murphy, J. G., & Witkiewitz, K. (2021). Limitations of cross-lagged panel models in addiction research and alternative models: An empirical example using project MATCH. Psychology of Addictive Behaviors, Advance online publication. https://doi.org/10.1037/adb0000750
  • Lucas, R. E. (2022). It’s time to abandon the cross-lagged panel model. PsyArXiv Preprints, https://doi.org/10.31234/osf.io/pkec7
  • Marsh, H. W., Pekrun, R., Murayama, K., Arens, A. K., Parker, P. D., Guo, J., & Dicke, T. (2018). An integrated model of academic self-concept development: Academic self-concept, grades, test scores, and tracking over 6 years. Developmental Psychology, 54, 263–280. http://dx.doi.org/10.1037/dev0000393
  • Marsh, H. W., Trautwein, U., Lüdtke, O., Köller, O., & Baumert, J. (2005). Academic self-concept, interest, grades, and standardized test scores: Reciprocal effects models of causal ordering. Child Development, 76, 397–416. http://dx.doi.org/10.1111/j.1467-8624.2005.00853.x
  • Maxwell, S. E., & Delaney, H. D. (2004). Designing experiments and analyzing data: A model comparison perspective. Erlbaum. http://dx.doi.org/10.4324/9781410609243
  • McCrae, R. R. (2015). A more nuanced review of reliability: Specificity in the trait hierarchy. Personality and Social Psychology Review, 19, 97–112. http://dx.doi.org/10.1177/1088868314541857
  • Moral-Benito, E. (2013). Likelihood-based estimation of dynamic panels with predetermined regressors. Journal of Business & Economic Statistics, 31, 451–472. http://dx.doi.org/10.1080/07350015.2013.818003
  • Morgan, S. L., & Winship, C. (2015). Counterfactuals and causal inference: Methods and principles for social research (2nd ed.). University Press. https://doi.org/10.1017/CBO9781107587991
  • Mulder, J. D., & Hamaker, E. L. (2021). Three extensions of the random intercept cross-lagged panel model. Structural Equation Modeling, 28, 638–648. http://dx.doi.org/10.1080/10705511.2020.1784738
  • Mund, M., & Nestler, S. (2019). Beyond the cross-lagged panel model: Next-generation tools for analyzing interdependencies across the life course. Advances in Life Course Research, 41, 100249. http://dx.doi.org/10.1016/j.alcr.2018.10.002
  • Newsom, J. T. (2015). Longitudinal structural equation modeling: A comprehensive introduction. Routledge.
  • Newsome, S. J., Keogh, R. H., & Daniel, R. M. (2018). Estimating long-term treatment effects in observational data: A comparison of the performance of different methods under real-world uncertainty. Statistics in Medicine, 37, 2367–2390. http://dx.doi.org/10.1002/sim.7664
  • Núñez-Regueiro, F., Juhel, J., Bressoux, P., & Nurra, C. (2021). Identifying reciprocities in school motivation research: A review of issues and solutions associated with cross-lagged effects models. Journal of Educational Psychology, Advance online publication. http://dx.doi.org/10.1037/edu0000700
  • Oh, Y., Greenberg, M. T., & Willoughby, M. T. (2020). Examining longitudinal associations between externalizing and internalizing behavior problems at within- and between-child levels. Journal of Abnormal Child Psychology, 48, 467–480. http://dx.doi.org/10.1007/s10802-019-00614-6.
  • Orth, U., Clark, D. A., Donnellan, M. B., & Robins, R. W. (2021). Testing prospective effects in longitudinal research: Comparing seven competing cross-lagged models. Journal of Personality and Social Psychology, 120, 1013–1034. http://dx.doi.org/10.1037/pspp0000358
  • Plewis, I. (1985). Analysing change: Methods for the measurement and explanation of change in the social sciences. Wiley.
  • Reichardt, C. S. (2019). Quasi-Experimentation: A guide to design and analysis. Guilford Press.
  • Robins, J. M., Hernán, M. A., & Brumback, B. (2000). Marginal structural models and causal inference in epidemiology. Epidemiology, 11, 550–560. http://dx.doi.org/10.1097/00001648-200009000-00011.
  • Rosseel, Y. (2012). Lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48, 1–36. Retrieved from http://www.jstatsoft.org/v48/i02/ https://doi.org/10.18637/jss.v048.i02
  • Ruzek, E. A., & Schenke, K. (2019). The tenuous link between classroom perceptions and motivation: A within-person longitudinal study. Journal of Educational Psychology, 111, 903–917. http://dx.doi.org/10.1037/edu0000323
  • Ryu, E., & West, S. G. (2009). Level-specific evaluation of model fit in multilevel structural equation modeling. Structural Equation Modeling, 16, 583–601. http://dx.doi.org/10.1080/10705510903203466
  • Shamsollahi, A., Zyphur, M. J., & Ozkok, O. (2021). Long-run effects in dynamic systems: New tools for cross-lagged panel models. Organizational Research Methods, 2021, 109442812199322. https://doi.org/10.1177/1094428121993228
  • Silvey, C., Demir-Lira, Ö. E., Goldin-Meadow, S., & Raudenbush, S. W. (2021). Effects of time-varying parent input on children’s language outcomes differ for vocabulary and syntax. Psychological Science, 32, 536–548. http://dx.doi.org/10.1177/0956797620970559
  • Usami, S. (2021). On the differences between general cross-lagged panel model and random-intercept cross-lagged panel model: Interpretation of cross-lagged parameters and model choice. Structural Equation Modeling, 28, 331–344. http://dx.doi.org/10.1080/10705511.2020.1821690
  • Usami, S., Murayama, K., & Hamaker, E. L. (2019). A unified framework of longitudinal models to examine reciprocal relations. Psychological Methods, 24, 637–657. http://dx.doi.org/10.1037/met0000210
  • Usami, S., Todo, N., & Murayama, K. (2019). Modeling reciprocal effects in medical research: Critical discussion on the current practices and potential alternative models. PLoS One, 14, e0209133. http://dx.doi.org/10.1371/journal.pone.0209133
  • Van der Laan, M. J., & Rose, S. (2018). Targeted learning in data science: Causal inference for complex longitudinal studies. Springer. https://doi.org/10.1007/978-3-319-65304-4
  • van Montfort, K., Oud, J. H. L., & Voelkle, M. C. (Eds.). (2018). Continuous time modeling in the behavioral and related sciences. Springer Nature. http://dx.doi.org/10.1007/978-3-319-77219-6
  • VanderWeele, T. J. (2015). Explanation in causal inference: Methods for mediation and interaction. Oxford University Press.
  • VanderWeele, T. J. (2019). Principles of confounder selection. European Journal of Epidemiology, 34, 211–219. http://dx.doi.org/10.1007/s10654-019-00494-6.
  • VanderWeele, T. J. (2021). Causal inference with time-varying exposures. In T. L. Lash, T. J. VanderWeele, S. Haneuse, & K. J. Rothman (Eds.), Modern epidemiology (pp. 605–618). Wolters Kluwer.
  • VanderWeele, T. J., Hawkley, L. C., Thisted, R. A., & Cacioppo, J. T. (2011). A marginal structural model analysis for loneliness: Implications for intervention trials and clinical practice. Journal of Consulting and Clinical Psychology, 79, 225–235. http://dx.doi.org/10.1037/a0022610
  • VanderWeele, T. J., Jackson, J. W., & Li, S. (2016). Causal inference and longitudinal data: A case study of religion and mental health. Social Psychiatry and Psychiatric Epidemiology, 51, 1457–1466. http://dx.doi.org/10.1007/s00127-016-1281-9
  • VanderWeele, T. J., Mathur, M. B., & Chen, Y. (2020). Outcome-wide longitudinal designs for causal inference: A new template for empirical studies. Statistical Science, 35, 437–466. http://dx.doi.org/10.1214/19-STS728
  • Vegetabile, B. G., Griffin, B. A., Coffman, D. L., Cefalu, M., Robbins, M. W., & McCaffrey, D. F. (2021). Nonparametric estimation of population average dose-response curves using entropy balancing weights for continuous exposures. Health Services & Outcomes Research Methodology, 21, 69–110. http://dx.doi.org/10.1007/s10742-020-00236-2.
  • Williams, R., Allison, P. D., & Moral-Benito, E. (2018). Linear dynamic panel-data estimation using maximum likelihood and structural equation modeling. The Stata Journal, 18, 293–326. http://dx.doi.org/10.1177/1536867X1801800201
  • Wodtke, G. T. (2020). Regression-based adjustment for time-varying confounders. Sociological Methods & Research, 49, 906–946. http://dx.doi.org/10.1177/0049124118769087
  • Wooldridge, J. M. (2010). Econometric analysis of cross section and panel data. MIT Press.
  • Young, C., & Holsteen, K. (2017). Model uncertainty and robustness: A computational framework for multimodel analysis. Sociological Methods & Research, 46, 3–40. http://dx.doi.org/10.1177/0049124115610347
  • Zhou, N., Cao, H., Liu, F., Wu, L., Liang, Y., Xu, J., Meng, H., Zang, N., Hao, R., An, Y., Ma, S., Fang, X., & Zhang, J. (2020). A four-wave, cross-lagged model of problematic internet use and mental health among Chinese college students: Disaggregation of within-person and between-person effects. Developmental Psychology, 56, 1009–1021. http://dx.doi.org/10.1037/dev0000907.
  • Zyphur, M. J., Allison, P. D., Tay, L., Voelkle, M. C., Preacher, K. J., Zhang, Z., Hamaker, E. L., Shamsollahi, A., Pierides, D. C., Koval, P., & Diener, E. (2020). From data to causes I: Building a general cross-lagged panel model (GCLM). Organizational Research Methods, 23, 651–687. http://dx.doi.org/10.1177/1094428119847278