A Comparison of Different Approaches for Estimating Cross-Lagged Effects from a Causal Inference Perspective

Abstract

This article compares different approaches for estimating cross-lagged effects with a cross-lagged panel design under a causal inference perspective. We distinguish between models that rely on no unmeasured confounding (i.e., observed covariates are sufficient to remove confounding) and latent variable-type models (e.g., random intercept cross-lagged panel model) that use parametric assumptions to adjust for unmeasured time-invariant confounding by including additional latent variables. Simulation studies confirm that the cross-lagged panel model provides biased estimates of the cross-lagged effect in the presence of unmeasured confounding. However, the simulations also show that the latent variable-type approaches strongly depend on the specific parametric assumptions, and produce biased estimates under different data-generating scenarios. Finally, we discuss the role of the longitudinal design and the limitations of assessing model fit for estimating cross-lagged effects.

In many areas of psychological research, longitudinal cross-lagged panel designs are used to investigate whether changes in one construct $X$ are related to changes in another construct $Y$ (Little, 2013; Marsh et al., 2005; Orth et al., 2021). In the basic setting of two measurement waves, the key idea of the cross-lagged panel model (CLPM) is that the effect of a predictor at T1 on an outcome at T2 (i.e., the cross-lagged effect) is estimated while controlling for the outcome at T1. Thus, the CLPM is based on a conditioning approach in which the posttest ($Y_{2}$) is conditioned on the pretest ($Y_{1}$) by regressing the posttest on the pretest and a potential exposure variable ($X_{1}$; Maxwell & Delaney, 2004; Newsom, 2015; Plewis, 1985).

However, in a very influential paper (1402 citations listed on Google Scholar as of March 28, 2022), Hamaker et al. (2015) argued that the CLPM does not appropriately account for the trait-like, time-invariant stability of many psychological constructs and, therefore, results in distorted estimates of cross-lagged effects. Hamaker et al. (2015) proposed the random intercept cross-lagged panel model (RI-CLPM) as an extension of the traditional CLPM that allows controlling for stable trait factors when at least three measurement waves are available (Usami, Murayama, et al., 2019). The RI-CLPM has been interpreted as a residual-level approach (Andersen, 2021; Asparouhov & Muthén, 2021) in which the longitudinal associations between two constructs are decomposed into stable between-person associations (i.e., the correlation between time-invariant between-person parts) and temporal within-person dynamics (i.e., within-person effects for deviations from between-person parts). This decomposition allows estimating within-person cross-lagged effects that are adjusted for the effects of stable trait factors. The RI-CLPM has received considerable attention in the methodological literature. Many scholars argue that the RI-CLPM should be preferred over the CLPM for estimating cross-lagged effects, particularly in the presence of stable trait factors (e.g., Berry & Willoughby, 2017; Curran & Hancock, 2021; Grimm et al., 2021; Mulder & Hamaker, 2021; Mund & Nestler, 2019; Usami, 2021; Zyphur et al., 2020).
Furthermore, empirical comparisons of the CLPM and the RI-CLPM have shown that the choice between the two models is crucial because they can yield results that differ substantially in the magnitude, sign, and statistical significance of the estimated cross-lagged effect (e.g., Bailey et al., 2020; Ehm et al., 2019; Littlefield et al., 2021; Núñez-Regueiro et al., 2021; Oh et al., 2020; Orth et al., 2021; Ruzek & Schenke, 2019; Zhou et al., 2020).

One frequently made argument in favor of the RI-CLPM is that it controls for unobserved confounding variables that are stable across time (Usami, Murayama, et al., 2019; see also Bailey et al., 2020). Given that one of the main challenges in estimating causal effects with non-experimental data is to control for all relevant covariates (Reichardt, 2019), this seems to be a significant advantage of the RI-CLPM. In the present article, we compare the CLPM and the RI-CLPM from a causal perspective and define the causal estimand (i.e., the cross-lagged effect) using potential outcome notation (Imbens & Rubin, 2015). We also consider two alternative latent variable-type approaches that have been proposed for estimating a cross-lagged effect under unmeasured confounding: observation-level models that include the stable trait factors at the level of the observed scores (Dishop & DeShon, 2021; Zyphur et al., 2020; see also Bollen & Brand, 2010), and a fixed effects dynamic panel model (Allison et al., 2017) that only models the process of the outcome $Y$ but is agnostic about the process of the exposure $X$.

In four simulation scenarios, we confirm that the CLPM provides biased estimates of the cross-lagged effect in the presence of unmeasured confounding variables. However, the simulations also show that the potential of the different latent variable-type approaches to control for unmeasured confounding strongly depends on the specific parametric assumptions that are used to identify the effects of the latent variables. Finally, we argue that it is often advisable to include lag-2 effects (i.e., effects of variables across two units of time) in addition to lag-1 effects in the CLPM in order to control for delayed effects of $X$ and $Y$ when estimating cross-lagged effects (VanderWeele et al., 2020). Overall, the goal of this paper is to provide a more balanced discussion of different approaches for analyzing cross-lagged panel designs, and we would like to emphasize that, despite recent methodological recommendations, there are still good reasons to use the CLPM and rely on the assumption of no unmeasured confounding.

Before we start, we would like to point out that, at a more descriptive level, the CLPM has been criticized by methodologists and developmental psychologists with the argument that it provides an uninterpretable blend of within-person effects and between-person effects (e.g., Berry & Willoughby, 2017; Hamaker et al., 2015). In the present article, we focus on the question of under which conditions cross-lagged effects in the RI-CLPM or the other latent variable-type models can be given a causal interpretation. The question of whether the decomposition into within-person and between-person effects (in contrast to the undecomposed effects in the CLPM) provides a more appropriate description of longitudinal processes will not be discussed further. In our view, the goal of modeling developmental processes should be kept separate from the goal of causal inference.

1. Causal Perspective on Estimating Cross-Lagged Effects

In the following, we consider a multivariate process in which two variables $X_{t}$ and $Y_{t}$ and a vector of time-varying covariates $L_{t}$ are related across time, that is, $(X_{t}, Y_{t}, L_{t})$, $t \in \mathbb{R}$. The covariates $L_{t}$ are confounders for the association between $X_{t}$ and $Y_{t}$. Furthermore, we consider covariates that do not vary across time (e.g., gender, social status). In the context of our study, it is instructive to decompose the time-invariant covariates into an observed part $C$ and a potentially unobserved part $U$. Later, we discuss methods that try to control for the effects of $U$, even though the covariates in $U$ were not measured. In our discussion, we focus on the cross-lagged effect of $X_{s}$ on $Y_{t}$, where $s < t$. We start with a definition of the causal cross-lagged effect. The important role of the longitudinal design (e.g., selection of the time points $s$ and $t$) for defining the causal cross-lagged effect will be discussed in Section 5, “The Role of Time for Estimating the Causal Cross-Lagged Effect”. It should be emphasized that the main goal is to estimate the (causal) cross-lagged effect and not to model the process $(X_{t}, Y_{t}, L_{t})$.

1.1. Definition of a Causal Cross-Lagged Effect

The causal inference literature draws heavily on the potential outcome framework (Hernán & Robins, 2020; Imbens & Rubin, 2015; VanderWeele, 2015) to define a causal effect (i.e., the effect of $X_{s}$ on $Y_{t}$). We assume a continuous exposure variable $X_{s}$ taking values from a set $\mathcal{X}_{X_{s}}$ (Hirano & Imbens, 2004; Vegetabile et al., 2021), and assume that for each individual, there exists a potential outcome $Y_{t}(x)$ for all $x \in \mathcal{X}_{X_{s}}$. The potential outcome $Y_{t}(x)$ can be interpreted as the outcome that would have resulted for an individual if the exposure $X_{s}$ had been set to $x$ (e.g., by an intervention). We further assume that, for each individual, the observed outcome equals the potential outcome under the observed exposure level, that is, $Y_{t} = Y_{t}(x)$ if $X_{s} = x$. This assumption is also known as the consistency assumption and connects the potential outcomes to the observed data (Hernán & Robins, 2020; VanderWeele, 2015). Note that all other potential outcomes for an individual, i.e., $Y_{t}(x')$ for all other $x' \in \mathcal{X}_{X_{s}}$, are unobserved.

The crucial assumption for defining the causal effect of $X_{s}$ on $Y_{t}$ is the ignorability assumption

$$Y_{t}(x) \perp X_{s} \mid Y_{T_{Y}}, X_{T_{X}}, L_{T_{L}}, C, U \quad \text{for all } x \in \mathcal{X}_{X_{s}}. \tag{1}$$

This assumption states that the potential outcomes are conditionally independent of the exposure $X_{s}$ given the previous history of outcome and treatment values as well as time-varying and time-invariant covariates. $Y_{T_{Y}}$, $X_{T_{X}}$, and $L_{T_{L}}$ denote vectors of measures of the outcome, the exposure, and the time-varying covariate, where $T_{Y} = \{t_{1}^{(Y)}, \dots, t_{k}^{(Y)}\}$, $T_{X} = \{t_{1}^{(X)}, \dots, t_{k}^{(X)}\}$, and $T_{L} = \{t_{1}^{(L)}, \dots, t_{k}^{(L)}\}$ are sets of ordered time indices. Note that the choice of covariates and the respective time points in the ignorability condition in Equation (1) is essential for defining the causal effect of interest. Many scholars point out that this choice needs to reflect subject-matter knowledge and cannot be resolved with statistical modeling techniques (Hernán & Robins, 2020).

This ignorability assumption is also labeled the no unmeasured confounding, conditional independence, or selection on observables assumption in the literature (Hernán & Robins, 2020; Imbens, 2004; Morgan & Winship, 2015). It should be emphasized that the time-varying covariates $L_{T_{L}}$ and the previous measures of the outcome $Y_{T_{Y}}$ are treated in the same way as the time-invariant covariates $C$ and $U$ in Equation (1). The crucial issue is that they are not affected by the exposure $X_{s}$ and therefore do not act as mediators on the pathway from $X_{s}$ to $Y_{t}$. In research practice, this is often achieved by measuring the time-varying covariates and the prior values of the outcome no later than the exposure, that is, $t_{k}^{(L)} \leq s$ and $t_{k}^{(Y)} \leq s$. The relationship between the covariates, the exposure, and the outcome is depicted in Figure 1, where $\tau_{T_{X, Y, L}, \mathrm{ACE}}$ denotes the average causal effect (ACE) of $X_{s}$ on $Y_{t}$.

Figure 1. Causal diagram that shows the relationship between the covariates, the exposure $X_{s}$, and the outcome $Y_{t}$. $\tau_{T_{X, Y, L}, \mathrm{ACE}}$ denotes the average causal effect (ACE) of $X_{s}$ on $Y_{t}$.

We now consider the causal effect function $\tau_{T_{X, Y, L}}(x) = E(Y_{t}(x))$, the expectation of the potential outcomes $Y_{t}(x)$ if $X_{s}$ were fixed to $x$. The goal is to estimate the causal effect function $\tau_{T_{X, Y, L}}(x)$ from the data ($Y_{t}$, $X_{s}$, $Y_{T_{Y}}, X_{T_{X}}, L_{T_{L}}, C, U$). To simplify notation, we set $W = (Y_{T_{Y}}, X_{T_{X}}, L_{T_{L}}, C, U)$. If the ignorability assumption holds, the causal effect function can be identified as follows:

$$\tau_{T_{X, Y, L}}(x) = E[Y_{t}(x)] = \int E(Y_{t} \mid X_{s} = x, W = w) \, f_{W}(w) \, dw, \tag{2}$$

where $f_{W}$ denotes the joint density of $W$. In other words, the expected values of the potential outcomes can be determined by averaging the conditional expectation of the outcome given the covariates and the exposure level across the covariate distribution.

In general, the causal effect function $\tau_{T_{X, Y, L}}$ can be nonlinear. To quantify an average causal effect by a weighted average slope, we approximate the causal effect function $\tau_{T_{X, Y, L}}$ by a linear function of the exposure values $x$ at time $s$, where the approximation is weighted according to the density of $X_{s}$:

$$\tau_{T_{X, Y, L}}(x) \simeq \alpha + \tau_{T_{X, Y, L}, \mathrm{ACE}} \, x, \tag{3}$$

where the parameter $\tau_{T_{X, Y, L}, \mathrm{ACE}}$ denotes the average causal effect (ACE). The ACE can be interpreted as the weighted average change in the expected potential outcome when the exposure level increases by one unit from $x$ to $x + 1$. The best linear approximation in Equation (3) is provided by the least-squares solution

$$\tau_{T_{X, Y, L}, \mathrm{ACE}} = \frac{\int \left(\tau_{T_{X, Y, L}}(x) - \tau_{t}\right) \left(x - \mu_{X_{s}}\right) f_{X_{s}}(x) \, dx}{\mathrm{Var}(X_{s})}, \tag{4}$$

where $\tau_{t} = \int \tau_{T_{X, Y, L}}(x) f_{X_{s}}(x) \, dx$ is the expected value of the potential outcomes, and $f_{X_{s}}$ denotes the density function of $X_{s}$. The parameter $\tau_{T_{X, Y, L}, \mathrm{ACE}}$ can be interpreted as a weighted average of the local slopes and is the best linear approximation to the true regression function (Angrist & Pischke, 2009; Berk et al., 2014). Equation (3) is also known as a marginal structural model (MSM), which specifies a model for the marginal mean of the potential outcomes as a function of the exposure, in which the effects of confounding variables have been removed (Daniel et al., 2013; Hernán & Robins, 2020).
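The weighted average slope in Equation (4) can be checked numerically. The following is a minimal sketch, not part of the original analysis: the quadratic causal effect function and the normal exposure distribution are hypothetical choices made only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)


def causal_effect_function(x):
    # Hypothetical nonlinear tau(x); chosen only for illustration.
    return x ** 2


# Draws from an assumed exposure distribution, X_s ~ N(1, 1).
x = rng.normal(loc=1.0, scale=1.0, size=1_000_000)
tau_x = causal_effect_function(x)

# Monte Carlo analogue of Equation (4): Cov(tau(X_s), X_s) / Var(X_s).
ace = np.mean((tau_x - tau_x.mean()) * (x - x.mean())) / np.var(x)

# For a normal exposure, this ratio equals the mean derivative
# E[2 * X_s] = 2 in this toy setup, up to simulation error.
print(f"weighted average slope (ACE): {ace:.3f}")
```

Although the slope of this hypothetical function differs at every exposure level, the ACE summarizes it by a single number that weights each local slope by how common the corresponding exposure values are.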

We now show that a conventional linear regression model can be used to obtain an unbiased estimate of the causal effect $\tau_{T_{X, Y, L}, \mathrm{ACE}}$ when all relevant covariates of the ignorability assumption are included (Keogh et al., 2018). To this end, we assume that all covariates are linearly related to the outcome. The outcome $Y_{t}$ is then given as follows:

$$Y_{t} = \gamma_{Y_{t}} + \gamma_{Y_{t} X_{s}} X_{s} + \gamma^{\prime} W + \varepsilon_{Y_{t}}. \tag{5}$$

It can now be shown that the coefficient $\gamma_{Y_{t} X_{s}}$ provides the average causal effect $\tau_{T_{X, Y, L}, \mathrm{ACE}}$. Inserting Equation (5) into Equation (2) yields

$$\tau_{T_{X, Y, L}}(x) = \int \left[\gamma_{Y_{t}} + \gamma_{Y_{t} X_{s}} x + \gamma^{\prime} w\right] f_{W}(w) \, dw = \alpha + \gamma_{Y_{t} X_{s}} x, \tag{6}$$

where $\alpha = \gamma_{Y_{t}} + \gamma^{\prime} E(W)$. The causal effect function strictly follows a linear form, and we therefore obtain the identity $\tau_{T_{X, Y, L}, \mathrm{ACE}} = \gamma_{Y_{t} X_{s}}$ by comparing Equation (6) with the specification in Equation (3). Thus, the regression coefficient $\gamma_{Y_{t} X_{s}}$ of a standard regression analysis provides an unbiased estimate of the causal effect $\tau_{T_{X, Y, L}, \mathrm{ACE}}$ when all relevant covariates (i.e., $W = (Y_{T_{Y}}, X_{T_{X}}, L_{T_{L}}, C, U)$) are included.

It should be emphasized that ignorability is a strong assumption that should not be taken lightly.¹ In practical applications, the ignorability assumption implies that all relevant covariates (needed to fulfill the ignorability assumption in Equation (1)) are observed. It is important to note that this aspect of the ignorability assumption cannot be empirically tested and needs to be justified by substantive knowledge (Aronow & Miller, 2019). In the presence of unmeasured confounders (e.g., when $U$ is not included in Equation (5)), the coefficient $\gamma_{Y_{t} X_{s}}$ from Equation (5) provides a biased estimate of $\tau_{T_{X, Y, L}, \mathrm{ACE}}$.
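The role of the unmeasured confounder in Equation (5) can be illustrated with a small simulation. This is a sketch under stated assumptions: the data-generating model, all coefficient values, and the helper `exposure_slope` are hypothetical and not taken from the article.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200_000

# Hypothetical linear data-generating model; all coefficients are illustrative.
u = rng.normal(size=n)                               # time-invariant confounder U
y_prev = u + rng.normal(size=n)                      # pretest of the outcome
x_s = 0.5 * u + 0.3 * y_prev + rng.normal(size=n)    # exposure X_s
true_effect = 0.2
y_t = true_effect * x_s + 0.4 * y_prev + 0.6 * u + rng.normal(size=n)


def exposure_slope(outcome, exposure, *covariates):
    """OLS coefficient of the exposure, adjusting for the given covariates."""
    design = np.column_stack([np.ones_like(exposure), exposure, *covariates])
    coef, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coef[1]


with_u = exposure_slope(y_t, x_s, y_prev, u)    # all of W is included
without_u = exposure_slope(y_t, x_s, y_prev)    # U is omitted

print(f"true effect: {true_effect}")
print(f"adjusting for pretest and U: {with_u:.3f}")
print(f"adjusting for pretest only:  {without_u:.3f}")
```

In this toy setup, the regression that includes $U$ recovers the simulated effect up to sampling error, whereas the regression that adjusts only for the pretest overestimates it, because $U$ raises both the exposure and the outcome.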

1.2. Specification Issues

Even if all relevant covariates are measured and included in the regression, it is crucial that the relationship between the covariates $W$ and the outcome $Y_{t}$ is correctly specified in a given application. More specifically, this requires that the functional form of the relationship is correctly specified in the analysis models (e.g., squared terms of predictors are included if quadratic effects exist). Powerful machine learning methods allow for very flexible estimation of functional relationships (Athey & Imbens, 2019; van der Laan & Rose, 2018). It should also be emphasized that no assumptions about the distribution of covariates $W$ are included in the definition of the causal effect. In particular, there are no assumptions about the process $(X_{t}, L_{t}, Y_{t})$, and the fact that the cross-lagged effect can be obtained from a conventional regression model makes evident that there is no need for modeling the process in $X_{t}$, $L_{t}$, or $Y_{t}$. However, modeling the process might be advantageous for identifying the effects of time-invariant unobserved confounders $U$ when estimating the cross-lagged effect. To sum up, it is vital to distinguish the definition of a causal effect from modeling a process $(X_{t}, L_{t}, Y_{t})$. The structural parameter in a well-fitting model for the process $(X_{t}, L_{t}, Y_{t})$ can be wholly unrelated to the causal effect of interest.
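The point about functional form can be made concrete with a short simulation. The sketch below assumes a hypothetical scenario in which a single measured covariate affects both exposure and outcome through its square; the coefficients and the helper `exposure_slope` are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 200_000

# Hypothetical scenario: the measured covariate W acts through W**2.
w = rng.normal(size=n)
x_s = 0.5 * w**2 + rng.normal(size=n)
true_effect = 0.2
y_t = true_effect * x_s + w**2 + rng.normal(size=n)


def exposure_slope(outcome, exposure, *covariates):
    """OLS coefficient of the exposure, adjusting for the given covariates."""
    design = np.column_stack([np.ones_like(exposure), exposure, *covariates])
    coef, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coef[1]


linear_only = exposure_slope(y_t, x_s, w)           # misspecified: W enters linearly
with_square = exposure_slope(y_t, x_s, w, w**2)     # correctly specified

print(f"true effect: {true_effect}")
print(f"linear adjustment for W:   {linear_only:.3f}")
print(f"adjustment for W and W^2:  {with_square:.3f}")
```

Here the covariate is fully observed in both analyses, yet the linear adjustment leaves the confounding through $W^{2}$ untouched and yields a clearly biased exposure coefficient.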

2. Estimating a Cross-Lagged Effect: Controlling for Measured Confounding

In the following, we discuss different models that have been proposed for estimating cross-lagged effects. We start with models that assume no unmeasured confounding and assume a longitudinal panel design in which two variables $Y_{t}$ and $X_{t}$ and a $p \times 1$ vector of time-varying covariates $L_{t}$ are repeatedly measured at equally spaced intervals across time $t$ ($t = 1, \dots, T$). The time-varying variables are combined in the $(p + 2) \times 1$ vector $Z_{t} = (X_{t}, Y_{t}, L_{t})$. In addition, we consider a $k \times 1$ vector $C$ of covariates that are constant across the investigated time period. For simplicity, we further assume that the variables are mean-centered at each wave. The main focus is on estimating the causal effect of $X_{t - 1}$ on $Y_{t}$.

2.1. Cross-Lagged Panel Model with Lag-1 Effects (CL1)

The CLPM (Finkel, 1995; Kessler & Greenberg, 1981; Little, 2013) with lag-1 effects and covariates is given by

$$Z_{t} = B_{t, t - 1} Z_{t - 1} + \Gamma_{t} C + e_{t} \quad \text{for } t \geq 2, \tag{7}$$

where $B_{t, t - 1}$ is a $(p + 2) \times (p + 2)$ matrix of regression coefficients, including the autoregressive and cross-lagged parameters. The $(p + 2) \times k$ matrix $\Gamma_{t}$ represents the (potentially) time-varying effects of the time-invariant covariates. The $(p + 2) \times 1$ vector of residuals $e_{t}$ denotes correlated random components that are unrelated to previous time points and normally distributed with zero means.

The CLPM can be estimated with only two waves of data. With more than two waves, the autoregressive and cross-lagged parameters can be constrained to be invariant across waves, that is, $B \equiv B_{t, t - 1}$ for $t \geq 2$. This can be reasonable if the time intervals between the waves have a similar length (Little, 2013). Moreover, a constrained estimate can be interpreted as an average of the two causal effects from $t - 2$ to $t - 1$ and from $t - 1$ to $t$.

If the main interest is in estimating the effect of $X_{t - 1}$ on $Y_{t}$, the cross-lagged panel model with freely estimated parameters is equivalent to estimating a single regression with $Y_{t}$ as the outcome variable. In the bivariate case (i.e., $Z_{t} = (X_{t}, Y_{t})$) without covariates, the regression is given by

$$Y_{t} = \beta_{Y_{t} Y_{t - 1}} Y_{t - 1} + \beta_{Y_{t} X_{t - 1}} X_{t - 1} + e_{Y_{t}}. \tag{8}$$

Thus, the CLPM controls for preexisting differences in the outcome when assessing the effect of the exposure and is a special case of a more general class of conditioning methods in which an estimate of the causal effect is obtained by conditioning on the pretest and other observed covariates (see VanderWeele et al., 2016). It should be emphasized that the regression model in Equation (8) represents one of the most basic and frequently used analysis strategies in psychology (Orth et al., 2021).
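The equivalence between the freely estimated two-wave CL1 and the single regression in Equation (8) can be sketched numerically. The example below uses least squares in place of the maximum likelihood estimation typically used for such models (an assumption of this sketch), and the coefficient matrix `B` and covariance values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 100_000

# Hypothetical lag-1 coefficient matrix B for Z_t = (X_t, Y_t).
B = np.array([[0.5, 0.1],    # X_2 regressed on X_1, Y_1
              [0.2, 0.4]])   # Y_2 regressed on X_1, Y_1

z1 = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.3], [0.3, 1.0]], size=n)
e2 = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.2], [0.2, 1.0]], size=n)
z2 = z1 @ B.T + e2

# Mean-center each wave, as assumed in the text.
z1 = z1 - z1.mean(axis=0)
z2 = z2 - z2.mean(axis=0)

# System estimate: both CL1 equations at once (rows are ordered as in B).
B_hat = np.linalg.lstsq(z1, z2, rcond=None)[0].T

# Single regression of Y_2 on (X_1, Y_1), as in Equation (8).
single = np.linalg.lstsq(z1, z2[:, 1], rcond=None)[0]

print("Y-row of the system estimate:", np.round(B_hat[1], 3))
print("single-equation estimate:    ", np.round(single, 3))
```

Because both equations share the same predictors, the row of the system estimate that belongs to $Y_{2}$ coincides with the single-equation estimate, so the cross-lagged effect of interest does not depend on how the exposure equation is estimated.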

As the traditional CLPM only considers the effects of the previous measurement wave (i.e., lag-1 effects), we will refer to the model as CL1. The CL1 for $T = 3$ is depicted as a path diagram in the left panel of Figure 2. It needs to be emphasized that recent discussions of the CLPM (e.g., Bailey et al., 2020; Dietvorst et al., 2018; Littlefield et al., 2021; Lucas, 2022; Usami, Murayama, et al., 2019) mainly focused on the CL1 and did not consider the role of higher-order lags (e.g., lag-2 effects).

Figure 2. Path diagrams of the cross-lagged panel models with lag-1 (CL1) and lag-2 (CL2) effects for three measurement waves.

2.2. Cross-Lagged Panel Model with Lag-2 Effects (CL2)

The CL1 in Equation (7) can be extended to a CL2 that includes the effects of variables from two previous time points (i.e., lag-2 effects):

$$Z_{t} = B_{t, t - 1} Z_{t - 1} + B_{t, t - 2} Z_{t - 2} + \Gamma_{t} C + e_{t} \quad \text{for } t \geq 2, \tag{9}$$

where $B_{t, t - 2}$ denotes a $(p + 2) \times (p + 2)$ matrix of regression coefficients that includes the stability effects (e.g., the extent to which $X_{t}$ depends on $X_{t - 2}$ over and above the effect of $X_{t - 1}$; second-order autoregression) and the lag-2 cross-lagged effects. The CL1 is a CL2 with only lag-1 effects and is thus nested within the CL2.

There are different perspectives on the importance of including these lag-2 effects in the CL2 (Little, 2013). One view is that they capture delayed effects that are not captured by the lag-1 effects (e.g., Asendorpf, 2021; Marsh et al., 2018). However, it has been argued that it is often difficult to justify which psychological mechanism could be responsible for these delayed effects and that direct effects (i.e., lag-1 effects) are often more plausible (e.g., Ehm et al., 2019). From a causal inference perspective, the main motivation for including lag-2 effects is a more comprehensive control for confounding. VanderWeele (2021, p. 607) argues that if prior exposure $X_{t - 2}$ affects subsequent exposure $X_{t - 1}$ and also independently affects the outcome $Y_{t}$ not through $X_{t - 1}$, then prior exposure itself confounds the cross-lagged effect of $X_{t - 1}$ on $Y_{t}$. Thus, prior values of the exposure and outcome measures ($X_{t - 2}$ and $Y_{t - 2}$) can be considered covariates that allow for stronger control of confounding. In some applications, it may even be necessary to control for lag-3 effects of lagged exposures ($X_{t - 3}$) and outcomes ($Y_{t - 3}$). VanderWeele et al. (2020) provide a detailed discussion of the benefits of controlling for the history of previous exposure and outcome variables.

In the bivariate case, estimating a cross-lagged effect with a CL2 is equivalent to a single regression equation in which the outcome is regressed on the exposure $X_{t - 1}$ and the other variables:

$$Y_{t} = \beta_{Y_{t} Y_{t - 1}} Y_{t - 1} + \beta_{Y_{t} Y_{t - 2}} Y_{t - 2} + \beta_{Y_{t} X_{t - 1}} X_{t - 1} + \beta_{Y_{t} X_{t - 2}} X_{t - 2} + e_{Y_{t}}, \tag{10}$$

where $\beta_{Y_{t} X_{t - 1}}$ denotes the cross-lagged effect of interest. However, like the CL1, the CL2 will produce biased estimates of the cross-lagged effect if unmeasured confounders $U$ exist that are part of the ignorability assumption (see Equation (1)) but are not included in Equation (10).
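The confounding role of prior exposure can be sketched with three simulated waves. The data-generating values below are hypothetical and chosen only so that $X_{1}$ affects both $X_{2}$ and, directly, $Y_{3}$; the helper `slope_of_first` is likewise illustrative.

```python
import numpy as np

rng = np.random.default_rng(4)
n = 200_000

# Hypothetical three-wave process with a delayed (lag-2) effect of X_1 on Y_3.
x1 = rng.normal(size=n)
y1 = rng.normal(size=n)
x2 = 0.6 * x1 + rng.normal(size=n)
y2 = 0.5 * y1 + 0.2 * x1 + rng.normal(size=n)
true_effect = 0.2                      # cross-lagged effect of X_2 on Y_3
y3 = 0.5 * y2 + true_effect * x2 + 0.3 * x1 + rng.normal(size=n)


def slope_of_first(outcome, *predictors):
    """OLS coefficient of the first predictor, adjusting for the others."""
    design = np.column_stack([np.ones_like(outcome), *predictors])
    coef, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coef[1]


cl1 = slope_of_first(y3, x2, y2)             # Equation (8): lag-1 terms only
cl2 = slope_of_first(y3, x2, y2, x1, y1)     # Equation (10): adds lag-2 terms

print(f"true cross-lagged effect: {true_effect}")
print(f"CL1 estimate: {cl1:.3f}")
print(f"CL2 estimate: {cl2:.3f}")
```

In this toy process, the lag-1 regression attributes part of the delayed effect of $X_{1}$ to $X_{2}$, whereas adding the lag-2 terms recovers the simulated cross-lagged effect up to sampling error.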

3. Estimating a Cross-Lagged Effect: Controlling for Unmeasured Confounding

One advantage of longitudinal data is that they offer the potential to control for the effects of unmeasured confounder variables when estimating causal effects. In the following, we discuss three different strategies that have been suggested for estimating cross-lagged effects in the presence of unmeasured time-invariant confounders $U$. The basic idea of these approaches is to use parametric modeling assumptions to identify additional latent variables that adjust for the effects of the unmeasured confounders.

3.1. Observation-Level Approaches

In the observation-level approach, the effects of unmeasured confounders are taken into account by including additional latent variables in the CL1 model (see Equation (7)):

$$Z_{t} = B_{t, t - 1} Z_{t - 1} + \Gamma_{t} C + \Lambda_{t} U + e_{t} \quad \text{for } t \geq 2, \tag{11}$$

where $Z_{t}$ and $C$ are defined as above, and $U$ is a $q \times 1$ vector of latent variables with a $(p + 2) \times q$ loading matrix $\Lambda_{t}$ (Bollen & Brand, 2010). The latent variables $U$ are referred to as unit effects, unobserved effects, or individual effects in the literature (Andersen, 2021; Hsiao, 2014; Wooldridge, 2010; Zyphur et al., 2020) and are supposed to capture the effects of unmeasured time-invariant variables. The latent variables can be identified by fixing their variances to one. The loadings $\Lambda_{t}$ represent the (potentially time-varying) effects of $U$ on the observed variables. In many applications, $\Lambda_{t}$ is assumed to be diagonal (i.e., $Z_{t}$ and $U$ have the same dimension), and the loadings are fixed to represent individual differences in level and growth (e.g., linear trends) in the observed variables $Z_{t}$ (Bollen & Curran, 2004; Usami, Murayama, et al., 2019). However, more exploratory approaches have been proposed, particularly in the econometrics literature, in which $\Lambda_{t}$ is not diagonal and the dimension of $U$ is not assumed to be known (e.g., Bai & Li, 2014; De Vos & Everaert, 2021). Furthermore, the time-invariant covariates $C$ are assumed to be uncorrelated with the latent factors $U$, that is, $\mathrm{Cov}(C, U) = 0$. This restriction is necessary for identifying the model (e.g., Allison et al., 2017; Bollen & Brand, 2010). The latent variables in $U$ have also been characterized as accumulating factors (Usami, 2021; Usami, Murayama, et al., 2019).

At the first wave ($t = 1$), the variables in $Z_{1}$ are treated as exogenous (i.e., predetermined; Bollen & Brand, 2010) and are correlated with the time-invariant covariates $C$ and the latent variables $U$:

$$\mathrm{Cov}(Z_{1}, C) = \Psi_{Z_{1} C} \quad \text{and} \quad \mathrm{Cov}(Z_{1}, U) = \Psi_{Z_{1} U}, \tag{12}$$

where $\Psi_{Z_{1} C}$ and $\Psi_{Z_{1} U}$ denote $(p + 2) \times k$ and $(p + 2) \times q$ covariance matrices in which all unique elements are freely estimated. An alternative representation of the covariances at $t = 1$ was recently suggested by Zyphur et al. (2020):

$$Z_{1} = \Gamma_{1} C + \Lambda_{1} U + e_{1}. \tag{13}$$

In this specification, the loadings in $\Lambda_{1}$ are freely estimated to represent the covariance of $Z_{1}$ and $U$.

One crucial aspect of the model in Equations (11) and (12) (or (11) and (13)) is the identification of all model parameters, which has to be established on a model-by-model basis (e.g., Bollen & Curran, 2004). It should be mentioned that with a large number of waves, the model can also include higher-order lags (e.g., lag-2 effects) for the time-varying variables. However, most applications are restricted to AR(1) processes (i.e., lag-1 effects). In the following, we discuss four different observation-level models that have been proposed for estimating cross-lagged effects in the presence of unmeasured confounders. We restrict our discussion to the bivariate case without additional covariates.

3.1.1. Unidimensional Latent Factor Model (OL1)

Finkel (1995) proposed an observation-level model with a single latent factor. This model is identified if at least three measurement waves are available and the regression coefficients are assumed to be invariant across time. This model is given as follows:

$$\underbrace{\begin{pmatrix} X_{t} \\ Y_{t} \end{pmatrix}}_{Z_{t}} = \underbrace{\begin{pmatrix} \beta_{X_{t} X_{t - 1}} & \beta_{X_{t} Y_{t - 1}} \\ \beta_{Y_{t} X_{t - 1}} & \beta_{Y_{t} Y_{t - 1}} \end{pmatrix}}_{B_{t, t - 1}} \underbrace{\begin{pmatrix} X_{t - 1} \\ Y_{t - 1} \end{pmatrix}}_{Z_{t - 1}} + \underbrace{\begin{pmatrix} \lambda_{X_{t}, U_{1}} \\ \lambda_{Y_{t}, U_{1}} \end{pmatrix}}_{\Lambda_{t}} \underbrace{U_{1}}_{U} + \underbrace{\begin{pmatrix} e_{X_{t}} \\ e_{Y_{t}} \end{pmatrix}}_{e_{t}} \quad \text{for } t \geq 2, \tag{14}$$

where the regression coefficients are set equal across time, that is, $B \equiv B_{t, t - 1}$ for $t \geq 2$. At $t = 1$, $X_{1}$ and $Y_{1}$ are allowed to covary with $U_{1}$. The path diagram of the model is depicted for $T = 3$ in Figure 3. The latent variable $U_{1}$ can be interpreted as an unmeasured confounder that affects the variables $X_{t}$ and $Y_{t}$ and distorts the estimation of the cross-lagged effect in a CL1. Note that $U_{1}$ is assumed to be time-invariant, but the effects in $\Lambda_{t}$ are allowed to vary across time. We refer to the model in Equation (14) as OL1. It should be emphasized that $U_{1}$ is not actually measured but that the effects of the confounder $U_{1}$ are identified by parametric modeling assumptions. Finkel (1995, p. 83) refers to $U_{1}$ as a “phantom variable” that can be used to test for spurious associations.

Figure 3. Path diagram of an observation-level model with a single latent variable (OL1; Finkel, 1995) for three measurement waves.

3.1.2. Two-Dimensional Latent Factor Model (OL2)

Another variant of the observation-level approach has been proposed by Dishop and DeShon (2021; see also Bollen & Brand, 2010). In this specification, the number of additional latent variables in $U$ corresponds to the number of time-varying variables in $Z_{t}$:

$\begin{pmatrix} X_{t} \\ Y_{t} \end{pmatrix} = \begin{pmatrix} β_{X_{t} X_{t - 1}} & β_{X_{t} Y_{t - 1}} \\ β_{Y_{t} X_{t - 1}} & β_{Y_{t} Y_{t - 1}} \end{pmatrix} \begin{pmatrix} X_{t - 1} \\ Y_{t - 1} \end{pmatrix} + \begin{pmatrix} λ_{X_{t}, U_{1}} & 0 \\ 0 & λ_{Y_{t}, U_{2}} \end{pmatrix} \begin{pmatrix} U_{1} \\ U_{2} \end{pmatrix} + \begin{pmatrix} e_{X_{t}} \\ e_{Y_{t}} \end{pmatrix} \quad \text{for } t \geq 2,$ (15)

where the diagonal loading matrices $Λ_{t} \equiv Λ$ and the regression coefficients $B \equiv B_{t, t - 1}$ are set equal across time for $t \geq 2.$ The latent variables $U_{1}$ and $U_{2}$ are assumed to be correlated. At $t = 1,$ the two observed variables $X_{1}$ and $Y_{1}$ are allowed to be freely correlated with the latent variables (see Equation (12)). Again, this guarantees that $X_{1}$ and $Y_{1}$ are treated as exogenous (i.e., predetermined). The path diagram of this model is shown for $T = 3$ in Figure 4. Note that the restriction of equal effects of the latent variables in $U$ can be removed by freely estimating the loadings in $Λ_{t}$ (see Dishop & DeShon, 2021, p. 17). Furthermore, it would be possible to specify $Λ_{t}$ to be nondiagonal (i.e., with cross-loadings) and thereby allow for effects of $U_{1}$ on $Y_{t}$ and of $U_{2}$ on $X_{t}.$ In the following, we refer to the model in Equation (15) as OL2.

Figure 4. Path diagram of an observation-level model with two latent variables (OL2; Dishop & DeShon, 2021; see also Bollen & Brand, 2010) for three measurement waves.

3.1.3. Two-Dimensional Latent Factor Model with Loadings at Time 1 (OL3 and OL4)

A slightly different version of an observation-level model with two latent variables has been discussed by Zyphur et al. (2020; see also Shamsollahi et al., 2021). They used the specification in Equation (13) to represent the associations between $Z_{1}$ and $U$:

$\begin{pmatrix} X_{1} \\ Y_{1} \end{pmatrix} = \begin{pmatrix} λ_{X_{1}, U_{1}} & 0 \\ 0 & λ_{Y_{1}, U_{2}} \end{pmatrix} \begin{pmatrix} U_{1} \\ U_{2} \end{pmatrix} + \begin{pmatrix} e_{X_{1}} \\ e_{Y_{1}} \end{pmatrix},$ (16)

where $Λ_{1}$ is assumed to be diagonal, and only $λ_{X_{1}, U_{1}}$ and $λ_{Y_{1}, U_{2}}$ are estimated (the cross-loadings are set to zero, i.e., $λ_{X_{1}, U_{2}} = λ_{Y_{1}, U_{1}} = 0$). For $t \geq 2,$ the model discussed in Zyphur et al. (2020) is given by Equation (15). In the following, we refer to this model as OL3.

Zyphur et al. (2020) also discuss the possibility of freely estimating the loadings in the diagonal matrices $Λ_{t}$ (for $t \geq 2$). As mentioned before, this would allow the latent variables $U_{1}$ and $U_{2}$ to have time-varying effects (see also Allison et al., 2017; Bollen & Brand, 2010). We refer to this model as OL4. The four observation-level models, which are also included in the simulation studies, are summarized in Table 1.

Table 1. Overview of different latent variable-type models that adjust for effects of unmeasured variables $U .$


3.2. Residual-Level Approaches

In the residual-level approach, the longitudinal associations among the time-varying variables $Z_{t}$ are decomposed into a between-person part $Z_{t}^{B}$ and occasion-specific within-person deviations $Z_{t}^{W}$:

$Z_{t} = Z_{t}^{B} + Z_{t}^{W} \quad \text{for } t = 1, \dots, T$ (17)

$Z_{t}^{B} = Γ_{t} C + Λ_{t} U \quad \text{for } t = 1, \dots, T$ (18)

$Z_{1}^{W} = e_{1} \quad \text{for } t = 1$ (19)

$Z_{t}^{W} = B_{t, t - 1} Z_{t - 1}^{W} + e_{t} \quad \text{for } t \geq 2$ (20)

The occasion-specific deviations $Z_{t}^{W}$ can be interpreted as residuals that are orthogonal to the between-person part, that is, $Z_{t}^{B} = Γ_{t} C + Λ_{t} U$ (see Asparouhov & Muthén, 2021). Thus, the autoregressive and cross-lagged effects (i.e., $B_{t, t - 1}$) in Equation (20) are estimated on a residual structure that is purified from the effects of the time-invariant covariates $C$ and the latent variables $U.$ At $t = 1,$ the occasion-specific deviations are equated with the residuals (see Equation (19)). In general, the latent variables $U$ are allowed to have time-varying effects $Λ_{t}.$ The identification of the latent factor model involving the part $Λ_{t} U$ needs to be established for each specific application. A particular strength of the residual-level approach is its ability to disaggregate between-person and within-person effects in the analysis of longitudinal data (Curran & Hancock, 2021; Hamaker et al., 2015; Usami et al., 2019).

The main difference between the residual-level and the observation-level approaches is that in the residual-level approach the coefficients are estimated for the decomposed scores $Z_{t}^{W}$ instead of the observed scores $Z_{t}.$ To better compare the two approaches, it is instructive to write Equations (17) to (20) for the observed scores:

$Z_{t} = B_{t, t - 1} (Z_{t - 1} - Γ_{t - 1} C - Λ_{t - 1} U) + Γ_{t} C + Λ_{t} U + e_{t}$
$\phantom{Z_{t}} = B_{t, t - 1} Z_{t - 1} + (Γ_{t} - B_{t, t - 1} Γ_{t - 1}) C + (Λ_{t} - B_{t, t - 1} Λ_{t - 1}) U + e_{t} \quad \text{for } t \geq 2$ (21)
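The algebra behind Equation (21) can be checked numerically. The following Python sketch generates scores for a single unit once through the residual-level recursion in Equations (17) to (20) and once through the observed-score form in Equation (21), and then compares the two. All numerical values (the matrix `B`, the loadings in `Lam`, the number of waves `T`) are hypothetical and chosen only for illustration; the covariates $C$ are omitted for brevity.

```python
import random

def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def matsub(a, b):
    """Entrywise difference of two matrices."""
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def vecadd(*vs):
    """Entrywise sum of any number of vectors."""
    return [sum(parts) for parts in zip(*vs)]

rng = random.Random(0)
T = 4
B = [[0.5, 0.1], [0.2, 0.4]]  # hypothetical lagged effects, constant over time
# Hypothetical time-varying diagonal loadings Lam[t] for t = 1..T.
Lam = {t: [[1.0 + 0.1 * t, 0.0], [0.0, 0.8 - 0.05 * t]] for t in range(1, T + 1)}
U = [rng.gauss(0, 1), rng.gauss(0, 1)]
e = {t: [rng.gauss(0, 1), rng.gauss(0, 1)] for t in range(1, T + 1)}

# Residual-level form: Z_t = Lam_t U + Z_t^W, with Z_1^W = e_1
# and Z_t^W = B Z_{t-1}^W + e_t for t >= 2.
W = {1: e[1]}
for t in range(2, T + 1):
    W[t] = vecadd(matvec(B, W[t - 1]), e[t])
Z_res = {t: vecadd(matvec(Lam[t], U), W[t]) for t in range(1, T + 1)}

# Observed-score form: Z_t = B Z_{t-1} + (Lam_t - B Lam_{t-1}) U + e_t.
Z_obs = {1: Z_res[1]}
for t in range(2, T + 1):
    coef_U = matsub(Lam[t], matmul(B, Lam[t - 1]))
    Z_obs[t] = vecadd(matvec(B, Z_obs[t - 1]), matvec(coef_U, U), e[t])

max_gap = max(abs(a - b)
              for t in range(1, T + 1)
              for a, b in zip(Z_res[t], Z_obs[t]))
print(max_gap)  # the two forms should agree up to floating-point error
```

The check only confirms the re-expression of the scores; it says nothing about whether the loadings or effects are identified from data.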

The conditions under which the residual-level approach is a re-expression of the observation-level approach (and vice versa) are investigated further in Section 3.4, “Relationship between the Residual-Level and Observation-Level Approaches”.

3.2.1. Two-Dimensional Latent Factor Model with Time-Invariant Loadings and Regression Coefficients (RL1)

The most popular residual-level approach is the RI-CLPM (Hamaker et al., 2015), which was introduced to extend the CLPM with lag-1 effects (CL1). In the bivariate case, the RI-CLPM is given by

$\begin{pmatrix} X_{t} \\ Y_{t} \end{pmatrix} = \begin{pmatrix} λ_{X_{t}, U_{1}} & 0 \\ 0 & λ_{Y_{t}, U_{2}} \end{pmatrix} \begin{pmatrix} U_{1} \\ U_{2} \end{pmatrix} + \begin{pmatrix} X_{t}^{W} \\ Y_{t}^{W} \end{pmatrix} \quad \text{for } t = 1, \dots, T$ (22)

$\begin{pmatrix} X_{t}^{W} \\ Y_{t}^{W} \end{pmatrix} = \begin{pmatrix} β_{X_{t} X_{t - 1}} & β_{X_{t} Y_{t - 1}} \\ β_{Y_{t} X_{t - 1}} & β_{Y_{t} Y_{t - 1}} \end{pmatrix} \begin{pmatrix} X_{t - 1}^{W} \\ Y_{t - 1}^{W} \end{pmatrix} + \begin{pmatrix} e_{X_{t}} \\ e_{Y_{t}} \end{pmatrix} \quad \text{for } t \geq 2$ (23)

The loading matrices $Λ_{t}$ are assumed to be diagonal (e.g., no effects of $U_{1}$ on $Y_{t}$), and the loadings $λ_{X_{t}, U_{1}}$ and $λ_{Y_{t}, U_{2}}$ are assumed to be invariant across time (Hamaker et al., 2015). The latent variables $U_{1}$ and $U_{2}$ are interpreted as stable trait factors that represent the parts of $X_{t}$ and $Y_{t}$ that are completely stable across time. The coefficients $β_{X_{t} X_{t - 1}}$ and $β_{Y_{t} Y_{t - 1}}$ represent the autoregressive effects, and the coefficients $β_{X_{t} Y_{t - 1}}$ and $β_{Y_{t} X_{t - 1}}$ represent the cross-lagged effects. The cross-lagged effects are interpreted as within-person effects. For example, $β_{Y_{t} X_{t - 1}}$ indicates whether a temporary deviation (i.e., $X_{t - 1}^{W}$) from the stable trait level (i.e., $U_{1}$) of one construct affects subsequent within-person deviations (i.e., $Y_{t}^{W}$) from the stable trait level (i.e., $U_{2}$) of the other construct. The RI-CLPM is depicted as a path diagram in Figure 5.

Figure 5. Path diagram of the random intercept cross-lagged panel model (RI-CLPM; Hamaker et al., 2015) for three measurement waves.

We also write the RI-CLPM for the observed scores $Z_{t}$ (see Equation (21)):

$\begin{pmatrix} X_{t} \\ Y_{t} \end{pmatrix} = \begin{pmatrix} β_{X_{t} X_{t - 1}} & β_{X_{t} Y_{t - 1}} \\ β_{Y_{t} X_{t - 1}} & β_{Y_{t} Y_{t - 1}} \end{pmatrix} \begin{pmatrix} X_{t - 1} \\ Y_{t - 1} \end{pmatrix} + \begin{pmatrix} λ_{X_{t}, U_{1}} - β_{X_{t} X_{t - 1}} λ_{X_{t - 1}, U_{1}} & - β_{X_{t} Y_{t - 1}} λ_{Y_{t - 1}, U_{2}} \\ - β_{Y_{t} X_{t - 1}} λ_{X_{t - 1}, U_{1}} & λ_{Y_{t}, U_{2}} - β_{Y_{t} Y_{t - 1}} λ_{Y_{t - 1}, U_{2}} \end{pmatrix} \begin{pmatrix} U_{1} \\ U_{2} \end{pmatrix} + \begin{pmatrix} e_{X_{t}} \\ e_{Y_{t}} \end{pmatrix} \quad \text{for } t \geq 2$ (24)

The off-diagonal entries of the matrix multiplying $U$ follow from the term $Λ_{t} - B_{t, t - 1} Λ_{t - 1}$ in Equation (21): although $Λ_{t}$ is diagonal, $B_{t, t - 1}$ is not, so each observed variable also depends on the trait factor of the other variable through the cross-lagged effects.

We refer to the model in Equation (24) with invariant regression coefficients (i.e., $B \equiv B_{t, t - 1}$) and loadings (i.e., $Λ \equiv Λ_{t}$) across time as RL1.

3.2.2. Two-Dimensional Latent Factor Model with Time-Varying Loadings or Regression Coefficients (RL2, RL3)

It is also possible to freely estimate the loadings in the diagonal matrices $Λ_{t}.$ This would allow the time-invariant $U_{1}$ and $U_{2}$ in Equation (24) to have time-varying effects. In the following, we refer to this model with freely estimated loadings as RL2. We also consider a model with freely estimated regression coefficients and time-invariant loadings. This model is labeled RL3 and is the original formulation of the RI-CLPM (Hamaker et al., 2015). The three different residual-level models are summarized in Table 1.

3.3. Fixed Effects Dynamic Panel Model

A further variant of the observation-level approach was proposed by Allison, Williams, and Moral-Benito (2017; see also Moral-Benito, 2013; Williams et al., 2018). In their dynamic panel model, only the dependent variable $Y_{t}$ is explicitly modeled:

$Y_{t} = β_{Y_{t} Y_{t - 1}} Y_{t - 1} + β_{Y_{t} X_{t - 1}} X_{t - 1} + β_{t, t - 1} L_{t - 1} + γ_{t} C + λ_{t} U + e_{t} \quad \text{for } t \geq 2,$ (25)

where $β_{Y_{t} Y_{t - 1}}$ is the effect of the lagged dependent variable; $β_{Y_{t} X_{t - 1}}$ represents the cross-lagged effect; $β_{t, t - 1}$ is a 1 × p row vector of regression coefficients; the 1 × k row vector $γ_{t}$ contains the time-varying effects of the time-invariant covariates; and $λ_{t}$ is a 1 × q row vector that contains the time-varying effects of the unit effects $U.$ As in traditional fixed effects models (Wooldridge, 2010), the unit effects $U$ are allowed to covary with the time-varying variables $X_{t}$ and $L_{t}$ but need to be uncorrelated with the time-invariant covariates $C.$ Furthermore, the residuals $e_{t}$ are allowed to covary with future and concurrent values of $X_{t}$ and $L_{t},$ that is, $\text{Cov}(e_{t - h}, X_{t}) = ϑ_{t - h, t}$ and $\text{Cov}(e_{t - h}, L_{t}) = φ_{t - h, t}$ for $h \geq 0.$ Thus, no further assumptions are made for modeling how $X_{t}$ and the time-varying covariates are related to prior values of the outcome. At $t = 1,$ the outcome $Y_{1}$ is assumed to covary with the unit effects $U.$ Again, this ensures that $Y_{1}$ is treated as predetermined. In most applications, the $λ_{t}$ are assumed to be invariant for $t \geq 2,$ and the dimension of $U$ is 1.

3.3.1. Fixed Effects Models with a Unidimensional Latent Factor (FED)

In the bivariate case, the dynamic panel model is given by

$Y_{t} = β_{Y_{t} Y_{t - 1}} Y_{t - 1} + β_{Y_{t} X_{t - 1}} X_{t - 1} + λ_{Y_{t}, U_{1}} U_{1} + e_{t} \quad \text{for } t \geq 2,$ (26)

where the cross-lagged effect $β_{Y_{t} X_{t - 1}}$ is adjusted for the lagged dependent variable $Y_{t - 1}$ and the unit effect $U_{1}.$ At the first measurement wave (i.e., $t = 1$), $Y_{1},$ $X_{1},$ and $U_{1}$ are allowed to freely covary with each other. We refer to the model in Equation (26) as FED. The path diagram of the FED for $T = 3$ is depicted in Figure 6. In the following, we assume that the regression coefficients $β_{Y_{t} Y_{t - 1}}$ and $β_{Y_{t} X_{t - 1}},$ as well as the loadings $λ_{Y_{t}, U_{1}}$ of the unit effect, are invariant across time (see also Table 1).

Figure 6. Path diagram of the fixed effects dynamic panel model (FED; Allison et al., 2017) for three measurement waves.

It should be emphasized that the FED does not provide estimates for the effects of $Y$ on $X.$ This has the advantage that no assumptions are made regarding the dependence structure of $X$ on $Y.$² However, if researchers are interested in reciprocal effects, they need to specify a separate model for $X$ that is analogous to Equation (25).

3.4. Relationship between the Residual-Level and Observation-Level Approaches

In this section, we further investigate the relationship between the observation-level and residual-level approaches (see also Andersen, 2021; Bollen & Curran, 2004; Hamaker, 2005; Hsiao, 2014; Usami, 2021; Usami, Murayama, et al., 2019). More specifically, we clarify the conditions under which the two approaches are equivalent. We consider a general residual-level model with occasion-specific, freely estimated loading matrices:

$Z_{1} = Λ_{1} U + e_{1}$ (27)

$Z_{t} = B_{t, t - 1} Z_{t - 1}^{W} + Λ_{t} U + e_{t} \quad \text{for } t \geq 2$ (28)

Note that we use $Z_{1}^{W} = e_{1}$ and that the dimension of $U$ can be different from the dimension of $Z_{t}.$ The general observation-level model is given by

$\text{Cov}(Z_{1}, \tilde{U}) = \tilde{Ψ}_{Z_{1} \tilde{U}}$ (29)

$Z_{t} = \tilde{B}_{t, t - 1} Z_{t - 1} + \tilde{Λ}_{t} \tilde{U} + \tilde{e}_{t} \quad \text{for } t \geq 2$ (30)

where $\tilde{B}_{t, t - 1},$ $\tilde{Λ}_{t},$ $\tilde{Λ}_{1},$ $\tilde{U},$ and $\tilde{e}_{t}$ denote the parameters in the observation-level model. An important special case is the observation-level model in which the covariance of $Z_{1}$ and $\tilde{U}$ is represented as a loading matrix $\tilde{Λ}_{1}$ (see Zyphur et al., 2020):

$Z_{1} = \tilde{Λ}_{1} \tilde{U} + \tilde{e}_{1}$ (31)

To show that the two models are equivalent in the general case, we show that the parameters in Equations (27) and (28) can be represented as parameters in Equations (30) and (31), and vice versa.

3.4.1. Equivalence of General Residual-Level and Observation-Level Models

First, we start with the residual-level model and write Equation (28) as follows:

$Z_{t} = B_{t, t - 1} Z_{t - 1} + (Λ_{t} - B_{t, t - 1} Λ_{t - 1}) U + e_{t} \quad \text{for } t \geq 2.$ (32)

By setting $\tilde{B}_{t, t - 1} \equiv B_{t, t - 1},$ $\tilde{Λ}_{t} \equiv Λ_{t} - B_{t, t - 1} Λ_{t - 1},$ $\tilde{U} \equiv U,$ and $\tilde{e}_{t} \equiv e_{t},$ we obtain Equation (30) of the observation-level approach. At $t = 1,$ we can set $\tilde{Λ}_{1} \equiv Λ_{1}$ and $\tilde{e}_{1} \equiv e_{1}$ (see Equation (31)). Thus, the model parameters of the general residual-level model can be represented as parameters of the observation-level model.

Next, we start with an observation-level model (see Equations (30) and (31)). We now define $B_{t, t - 1} \equiv \tilde{B}_{t, t - 1},$ $Λ_{1} \equiv \tilde{Λ}_{1},$ $U \equiv \tilde{U},$ and $e_{t} \equiv \tilde{e}_{t}.$ We also set

$Λ_{t} \equiv \tilde{Λ}_{t} + \tilde{B}_{t, t - 1} Λ_{t - 1} \quad \text{for } t \geq 2$ (33)

For $t = 2,$ we have $Λ_{t - 1} = \tilde{Λ}_{1}.$ For $t > 2,$ $Λ_{t - 1}$ is a function of model parameters defined in Equations (30) and (31), that is, $\tilde{Λ}_{t}$ and $\tilde{B}_{t, t - 1}.$ This shows that the parameters of the general observation-level model can be re-expressed as parameters of a residual-level model, and the two approaches are equivalent. Note that this would also hold true for the special case in which the regression coefficients are invariant across time, that is, $B_{t, t - 1} = \tilde{B}_{t, t - 1} \equiv B$ in Equations (28) and (30).
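The two-way mapping between the loading matrices can be illustrated with a short round trip. The Python sketch below starts from freely chosen residual-level loadings, converts them to observation-level loadings using the definitions that lead to Equation (30), and then recovers the original loadings with the recursion in Equation (33). The matrices are hypothetical, and a time-invariant $B$ is assumed only to keep the example short.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def matadd(a, b):
    """Entrywise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def matsub(a, b):
    """Entrywise difference of two matrices."""
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

B = [[0.5, 0.1], [0.2, 0.4]]  # hypothetical, time-invariant lagged effects
# Hypothetical residual-level loadings Lam_1, Lam_2, Lam_3 (freely estimated).
Lam = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[1.2, 0.1], [0.0, 0.9]],
    [[0.8, 0.0], [0.3, 1.1]],
]

# Residual-level -> observation-level:
# Lam~_1 = Lam_1 and Lam~_t = Lam_t - B Lam_{t-1} for t >= 2.
Lam_tilde = [Lam[0]] + [matsub(Lam[t], matmul(B, Lam[t - 1]))
                        for t in range(1, len(Lam))]

# Observation-level -> residual-level, Equation (33):
# Lam_t = Lam~_t + B Lam_{t-1}, started at Lam_1 = Lam~_1.
recovered = [Lam_tilde[0]]
for t in range(1, len(Lam_tilde)):
    recovered.append(matadd(Lam_tilde[t], matmul(B, recovered[t - 1])))

max_gap = max(abs(Lam[t][i][j] - recovered[t][i][j])
              for t in range(len(Lam)) for i in range(2) for j in range(2))
print(max_gap)  # the round trip should return the original loadings
```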

3.4.2. Representation of a Simple Structure Residual-Level Model as an Observation-Level Model

Next, we consider a residual-level model with time-invariant coefficients $B$ and a diagonal, time-invariant loading matrix. This is a special version of the RI-CLPM (RL1) and can be written as

$Z_{1} = Λ U + e_{1}$ (34)

$Z_{t} = B Z_{t - 1} + (I - B) Λ U + e_{t} \quad \text{for } t \geq 2.$ (35)

We show that this model can be represented as an observation-level model with a simple structure loading matrix for $t \geq 2.$ To this end, we define a diagonal matrix $\tilde{Λ}$ and latent variables $\tilde{U}$ by

$\tilde{Λ} \tilde{U} \equiv (I - B) Λ U$ (36)

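Note that the only purpose of the diagonal matrix $\tilde{Λ}$ is to standardize the components in $\tilde{U}.$ Moreover, because Equation (36) implies $Λ U = (I - B)^{- 1} \tilde{Λ} \tilde{U},$ we can define for $t = 1$

$\tilde{Λ}_{1} \equiv (I - B)^{- 1} \tilde{Λ},$ (37)

so that $Z_{1} = \tilde{Λ}_{1} \tilde{U} + e_{1}$ reproduces Equation (34).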

As $B$ is typically nondiagonal, the loading matrix $\tilde{Λ}_{1}$ at $t = 1$ in the observation-level model is nondiagonal. Thus, an RI-CLPM with time-invariant regression coefficients (RL1) can be re-expressed as an observation-level model with a nondiagonal loading matrix $\tilde{Λ}_{1}$ (OL2 in Equation (15); see Andersen, 2021). However, an RI-CLPM cannot, in general, be written as an observation-level model with a diagonal $\tilde{Λ}_{1}$ (OL3 in Equation (16)).
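A small numerical illustration may help to see why the loading matrix at $t = 1$ becomes nondiagonal. The Python sketch below takes Equation (36) as the definition of $\tilde{U}$ and solves for the matrix $\tilde{Λ}_{1}$ that satisfies $\tilde{Λ}_{1} \tilde{U} = Λ U.$ The values of `B`, `Lam`, and the covariance matrix `Psi_U` of the trait factors are hypothetical, and unit loadings are assumed for simplicity.

```python
import math

def matmul(a, b):
    """Multiply two 2 x 2 matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(m):
    return [[m[j][i] for j in range(2)] for i in range(2)]

def inv2(m):
    """Inverse of a 2 x 2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

B = [[0.5, 0.1], [0.2, 0.4]]      # hypothetical lagged effects
Lam = [[1.0, 0.0], [0.0, 1.0]]    # hypothetical diagonal RL1 loadings
Psi_U = [[1.0, 0.3], [0.3, 1.0]]  # hypothetical covariance matrix of U
I_minus_B = [[1.0 - B[0][0], -B[0][1]], [-B[1][0], 1.0 - B[1][1]]]

# Right-hand side of Equation (36): (I - B) Lam U.
M = matmul(I_minus_B, Lam)
cov_MU = matmul(matmul(M, Psi_U), transpose(M))

# Diagonal Lam~ that standardizes the components of U~.
Lam_tilde = [[math.sqrt(cov_MU[0][0]), 0.0], [0.0, math.sqrt(cov_MU[1][1])]]

# Loading matrix at t = 1 that satisfies Lam~_1 U~ = Lam U.
Lam_tilde_1 = matmul(inv2(I_minus_B), Lam_tilde)

# Check: Lam~_1 Lam~^{-1} (I - B) Lam should return Lam.
check = matmul(matmul(Lam_tilde_1, inv2(Lam_tilde)), M)
print(Lam_tilde_1)
print(check)
```

With a nondiagonal `B`, the off-diagonal entries of `Lam_tilde_1` are nonzero, which corresponds to the unrestricted association between $Z_{1}$ and the latent variables in OL2 rather than the diagonal specification in OL3.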

3.4.3. Representation of a Simple Structure Observation-Level Model as a Residual-Level Model

We now start with an observation-level model

$Z_{1} = \tilde{Λ}_{1} \tilde{U} + \tilde{e}_{1}$ (38)

$Z_{t} = \tilde{B} Z_{t - 1} + \tilde{Λ} \tilde{U} + \tilde{e}_{t} \quad \text{for } t \geq 2,$ (39)

where the matrix $\tilde{Λ}$ is assumed to be diagonal (i.e., simple structure model). As pointed out before, the matrix $\tilde{Λ}_{1}$ can be diagonal (OL3 in Equation (16)) or nondiagonal (OL2 in Equation (15)). To represent Equations (38) and (39) as a residual-level model, we obtain the defining equations

$Λ_{1} U = \tilde{Λ}_{1} \tilde{U}$ (40)

$(Λ_{t} - B_{t, t - 1} Λ_{t - 1}) U = \tilde{Λ} \tilde{U} \quad \text{for } t \geq 2$ (41)

We now set $B_{t, t - 1} \equiv \tilde{B}$ and obtain by iterating

$Λ_{t} U = \tilde{Λ} \tilde{U} + \tilde{B} Λ_{t - 1} U = (I + \tilde{B} + \tilde{B}^{2} + \dots + \tilde{B}^{t - 2}) \tilde{Λ} \tilde{U} + \tilde{B}^{t - 1} Λ_{1} U$ (42)

We use the matrix Taylor series $(I - \tilde{B})^{- 1} \approx I + \tilde{B} + \tilde{B}^{2} + \dots + \tilde{B}^{k},$ which holds if $k$ is large enough. This approximation rests on the assumption that $\tilde{B}^{k} \to 0$ as $k$ increases, which holds for stationary processes. If we assume that $\tilde{B}^{k} \approx 0$ for all $k \geq 2,$ we can use the approximation $Λ_{t} U \approx (I - \tilde{B})^{- 1} \tilde{Λ} \tilde{U}$ for $t \geq 3.$ Thus, we can assume a simple structure $Λ_{t} = Λ$ for $t \geq 3$ with

$Λ U = (I - \tilde{B})^{- 1} \tilde{Λ} \tilde{U}$ (43)

We now set $Λ \equiv (I - \tilde{B})^{- 1} \tilde{Λ}$ for $t \geq 3.$ For the first two time points, we get

$Λ_{1} U = \tilde{Λ}_{1} \tilde{U} = (\tilde{Λ}_{1} \tilde{Λ}^{- 1} (I - \tilde{B}) Λ) U$ (44)

$Λ_{2} U = \tilde{Λ} \tilde{U} + \tilde{B} Λ_{1} U = ((I - \tilde{B}) Λ + \tilde{B} Λ_{1}) U$ (45)

We define $Λ_{1} \equiv \tilde{Λ}_{1} \tilde{Λ}^{- 1} (I - \tilde{B}) Λ$ and $Λ_{2} \equiv (I - \tilde{B}) Λ + \tilde{B} Λ_{1}.$ This shows that the observation-level model in Equations (38) and (39) can be approximately represented as a residual-level model with a simple structure loading matrix. However, the loading matrices at the first two time points ($Λ_{1}$ and $Λ_{2}$) in this approximation are, in general, not diagonal. Thus, an observation-level model with a simple structure loading matrix cannot be re-expressed as a simple structure residual-level model.
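The quality of the series approximation used in this derivation depends on how quickly the powers of $\tilde{B}$ vanish. The following Python sketch compares the truncated sum $I + \tilde{B} + \dots + \tilde{B}^{k}$ with the exact inverse $(I - \tilde{B})^{- 1}$ for several values of $k.$ The matrix `B_tilde` is a hypothetical example with moderate autoregressive and small cross-lagged effects; it is not taken from the simulation conditions.

```python
def matmul(a, b):
    """Multiply two 2 x 2 matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def matadd(a, b):
    """Entrywise sum of two 2 x 2 matrices."""
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def inv2(m):
    """Inverse of a 2 x 2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

IDENTITY = [[1.0, 0.0], [0.0, 1.0]]

def truncation_error(B, k):
    """Largest absolute entry of (I - B)^{-1} - (I + B + ... + B^k)."""
    exact = inv2([[IDENTITY[i][j] - B[i][j] for j in range(2)]
                  for i in range(2)])
    partial, power = IDENTITY, IDENTITY
    for _ in range(k):
        power = matmul(power, B)
        partial = matadd(partial, power)
    return max(abs(exact[i][j] - partial[i][j])
               for i in range(2) for j in range(2))

B_tilde = [[0.5, 0.1], [0.2, 0.4]]  # hypothetical lagged effects
errors = {k: truncation_error(B_tilde, k) for k in (1, 2, 5, 20)}
for k, err in errors.items():
    print(k, err)
```

For this hypothetical matrix the error shrinks steadily as $k$ grows, but it is not negligible when the sum is cut off after the linear term, so the representation for $t \geq 3$ should be read as an approximation whose accuracy depends on the size of the lagged effects.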

3.4.4. Representation of a Residual-Level Model as a Fixed Effects Dynamic Panel Model

We show that a residual-level model can be represented as a fixed effects dynamic panel model (FED; Allison et al., 2017). We start with the bivariate residual-level model in Equation (24) and assume time-invariant regression coefficients ($β_{X X},$ $β_{X Y},$ $β_{Y Y},$ and $β_{Y X}$) and time-invariant loadings ($λ_{X, U_{1}}$ and $λ_{Y, U_{2}}$).

The corresponding FED is given by (for $t \geq 2$)

$Y_{t} = \tilde{λ}_{Y, \tilde{U}_{1}} \tilde{U}_{1} + \tilde{β}_{Y Y} Y_{t - 1} + \tilde{β}_{Y X} X_{t - 1} + \tilde{e}_{Y_{t}}$ (46)

We now define $\tilde{λ}_{Y, \tilde{U}_{1}} \tilde{U}_{1} \equiv (1 - β_{Y Y}) λ_{Y, U_{2}} U_{2} - β_{Y X} λ_{X, U_{1}} U_{1},$ where $\tilde{U}_{1}$ is a standardized variable. Furthermore, we set $\tilde{β}_{Y Y} \equiv β_{Y Y},$ $\tilde{β}_{Y X} \equiv β_{Y X},$ and $\tilde{e}_{Y_{t}} \equiv e_{Y_{t}}$ for $t \geq 2.$ Thus, the RI-CLPM can be represented as a FED. However, as the FED does not specify restrictions regarding the dependence of $X$ on $Y,$ the FED cannot, in general, be represented as an observation-level model (Allison et al., 2017), which implies that it also cannot, in general, be represented as a residual-level model.
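The definition of the composite unit effect can be made concrete with a few lines of arithmetic. The Python sketch below computes the loading $\tilde{λ}_{Y, \tilde{U}_{1}}$ that makes $\tilde{U}_{1}$ a standardized variable. It assumes, purely for illustration, that $U_{1}$ and $U_{2}$ have unit variances and a correlation `rho`; all numerical values are hypothetical.

```python
import math

beta_YY, beta_YX = 0.4, 0.2  # hypothetical RI-CLPM regression coefficients
lam_X, lam_Y = 1.0, 1.0      # hypothetical time-invariant loadings
rho = 0.3                    # assumed Corr(U_1, U_2); Var(U_1) = Var(U_2) = 1

# Composite unit effect: (1 - beta_YY) lam_Y U_2 - beta_YX lam_X U_1.
a = (1.0 - beta_YY) * lam_Y  # weight on U_2
b = -beta_YX * lam_X         # weight on U_1

# Variance of the composite; its square root is the FED loading.
var_composite = a * a + b * b + 2.0 * a * b * rho
lam_tilde = math.sqrt(var_composite)

# Weights that define the standardized unit effect U~_1.
w2, w1 = a / lam_tilde, b / lam_tilde
var_U_tilde = w2 * w2 + w1 * w1 + 2.0 * w1 * w2 * rho

print(lam_tilde, var_U_tilde)
```

The single unit effect of the FED thus absorbs both trait factors of the RI-CLPM, which is why only one latent variable is needed in Equation (46).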

3.4.5. Summary

Overall, the main results of this section can be summarized as follows. First, the general observation-level and residual-level approaches with freely estimated loadings are equivalent if the same number of latent variables is utilized (i.e., the dimension of $U$ is the same). Second, residual-level models with a simple structure time-invariant loading matrix and time-invariant regression coefficients can be represented as a simple structure observation-level model with time-invariant regression coefficients. However, this is only the case if the loading matrix at $t = 1$ in the observation-level model is nondiagonal (see OL2), and not if it is diagonal (see OL3). Third, the simple structure observation-level model with time-invariant regression coefficients cannot be re-expressed as a simple structure residual-level model (see Equations (44) and (45)), indicating that in the case of simple structure loading matrices, the observation-level approach is the more general approach (see Andersen, 2021). Fourth, both observation-level and residual-level models can be re-expressed as a fixed effects dynamic panel model. However, in general, this is not possible the other way round (see Allison et al., 2017).

4. Estimation of Cross-Lagged Effects under Different Data-Generating Models

We now use simulated data to illustrate the conditions under which the different modeling approaches produce unbiased estimates of the cross-lagged effect. We distinguish four different scenarios for a cross-lagged panel design with three measurement waves and two variables (i.e., $X$ and $Y$; see Figure 7). In Scenario A, we assumed that the true data-generating model was a CL2. In Scenario B, we assumed that the true model was an observation-level model with one latent variable (OL1). In Scenario C, the data were generated by residual-level models (RL1, RL2, and RL3). Finally, in Scenario D, we assumed that the true model was a fixed effects dynamic panel model (FED). For each scenario, we present the results for different data conditions generated under different parameters of the data-generating models. As we were only interested in the (large-sample) bias of the parameter estimates, we simulated only one large data set (N = 10,000) for each data condition. In all scenarios, the variables (i.e., $X_{1},$ $X_{2},$ $X_{3},$ $Y_{1},$ $Y_{2},$ $Y_{3}$) were standardized with zero means and variances of one. Our discussion focuses on the estimation of the (lag-1) cross-lagged effect of $X_{2}$ on $Y_{3}$ (i.e., $β_{Y_{3} X_{2}}$). The R and lavaan (Rosseel, 2012) code for the data-generating models and the different analysis models is provided at https://bit.ly/3IN1FCi.

Figure 7. Four different simulation scenarios for a cross-lagged panel design with three measurement waves. Scenario A: true model is a cross-lagged panel model with lag-2 effects (CL2). Scenario B: true model is an observation-level model with a single latent variable (OL1). Scenario C: true model is a residual-level model. Scenario D: true model is a fixed effects dynamic panel model (FED).

4.1. Scenario A: CL2 as the Data-Generating Model

In Scenario A, we assumed that the data were generated by a CL2 (see Equation (9)). More specifically, we assumed a stationary process that fulfills a cross-lagged panel model with lag-2 effects (i.e., the lag-1 covariances were assumed to be constant for each data condition). We manipulated the lag-2 covariances by specifying different values for the lag-2 cross-lagged effects (i.e., $β_{Y_{3} X_{1}}$) and the lag-2 autoregressive effects (i.e., $β_{Y_{3} Y_{1}}$). This resulted in six different data conditions in which the synchronous and lag-1 correlations were constant, but the lag-2 correlations differed. We analyzed the six data sets with the cross-lagged panel models (CL1 and CL2), the four observation-level models (OL1, OL2, OL3, and OL4), the three residual-level models (RL1, RL2, and RL3), and the fixed effects dynamic panel model (FED).

As expected, the CL2 that includes lag-2 effects provided unbiased estimates of the cross-lagged effect under all data conditions in this scenario (see Table 2). The CL1 produced positively biased estimates and overestimated the size of the true cross-lagged effect. For example, in condition A2 with small lag-2 cross-lagged effects (i.e., $β_{Y_{3} X_{1}} = .01$) and a substantial lag-2 autoregressive effect (i.e., $β_{Y_{3} Y_{1}} = .32$), the true cross-lagged effect is substantially overestimated in the CL1 (.20 vs. .10). This illustrates that the estimates of the CL1 can be strongly distorted by the presence of delayed effects that are not adequately captured by the lag-1 effects (VanderWeele et al., 2020; see also Marsh et al., 2018). Note that the estimated cross-lagged effects in the CL1 did not change across the data conditions because the lag-1 correlations were fixed, and only the lag-2 correlations were manipulated.

Table 2. Results for Scenario A: True model is the cross-lagged panel model with lag-2 effects (CL2). Estimates of the cross-lagged effect for the different approaches.


Most importantly, the models that include additional latent variables to adjust for unmeasured confounding produced biased estimates that were sometimes too large and sometimes too small. For example, the estimates of the residual-level models strongly depended on the size of the lag-2 effects, and the RL3 tended to underestimate the magnitude of the true cross-lagged effect. In condition A2 (i.e., $β_{Y_{3} X_{1}} = .01$ and $β_{Y_{3} Y_{1}} = .32$), the estimate provided by the RL3 even had a different sign than the true cross-lagged effect (–.04 vs. .10). The bias of the residual-level models (RL1, RL2, and RL3) completely vanished when no lag-2 cross-lagged effect was present in condition A6 (i.e., $β_{Y_{3} X_{1}} = 0$).

Overall, Scenario A confirms findings from the methodological literature that the CL1 can positively bias estimates of cross-lagged effects because it provides insufficient control for confounding due to the lag-2 effects. It also shows that the additional latent variables in the observation-level and residual-level models capture these lag-2 effects. However, in general, they do not appropriately control for the confounding due to previous measures of the exposure and the outcome, resulting in biased estimates of cross-lagged effects.
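The omitted-lag mechanism behind the CL1 bias in Scenario A can be illustrated with a minimal simulation sketch. All coefficients below are hypothetical illustration values and are not the population values used in the simulation study; the helper names `simulate_lag2_panel` and `ols_slopes` are likewise assumptions introduced here. The data follow a three-wave model in which $Y_{3}$ depends on lag-1 and lag-2 predictors, and the cross-lagged effect of $X_{2}$ on $Y_{3}$ is estimated once with lag-1 predictors only (a CL1-type regression) and once with lag-1 and lag-2 predictors (a CL2-type regression).

```python
import numpy as np


def simulate_lag2_panel(n, beta_y3_x1=0.05, beta_y3_y1=0.30, seed=0):
    """Simulate a three-wave panel with lag-2 effects on Y3.

    All path coefficients are hypothetical illustration values.
    The true lag-1 cross-lagged effect of X2 on Y3 is fixed at 0.10.
    """
    rng = np.random.default_rng(seed)
    x1 = rng.normal(size=n)
    y1 = 0.3 * x1 + rng.normal(size=n)
    x2 = 0.5 * x1 + 0.1 * y1 + rng.normal(size=n)
    y2 = 0.1 * x1 + 0.5 * y1 + rng.normal(size=n)
    y3 = (0.10 * x2 + 0.5 * y2            # lag-1 effects
          + beta_y3_x1 * x1               # lag-2 cross-lagged effect
          + beta_y3_y1 * y1               # lag-2 autoregressive effect
          + rng.normal(size=n))
    return x1, y1, x2, y2, y3


def ols_slopes(outcome, *predictors):
    """Return the OLS slopes (intercept dropped) in the order of the predictors."""
    design = np.column_stack([np.ones_like(outcome), *predictors])
    coef, *_ = np.linalg.lstsq(design, outcome, rcond=None)
    return coef[1:]


if __name__ == "__main__":
    x1, y1, x2, y2, y3 = simulate_lag2_panel(200_000, seed=1)
    cl1 = ols_slopes(y3, x2, y2)[0]           # adjusts for Y2 only
    cl2 = ols_slopes(y3, x2, y2, x1, y1)[0]   # additionally adjusts for X1 and Y1
    print(f"CL1-type estimate: {cl1:.3f}")
    print(f"CL2-type estimate: {cl2:.3f} (true value: 0.100)")
```

Under these assumed values, the CL1-type regression absorbs part of the lag-2 effects of $X_{1}$ and $Y_{1}$ into the coefficient of $X_{2}$, whereas the CL2-type regression should recover a value close to the true effect. The sketch uses ordinary least squares on observed variables only and does not reproduce the latent variable-type models.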

4.2. Scenario B: OL1 as the Data-Generating Model

In Scenario B, we assumed that an observation-level model with a single latent variable is the data-generating model (OL1; see Equation (14)). We fixed the true value of the cross-lagged effect ($β_{Y_{3} X_{2}}$ = .20) and manipulated the effect of the latent variable $U_{1}$ on the observed variables by assuming that the correlation between $U_{1}$ and $Y$ decreased by a factor of .50 ($ρ_{Y_{1} U_{1}}$ = .50, $ρ_{Y_{2} U_{1}}$ = .25, and $ρ_{Y_{3} U_{1}}$ = .13), was constant across time ($ρ_{Y_{1} U_{1}}$ = $ρ_{Y_{2} U_{1}}$ = $ρ_{Y_{3} U_{1}}$ = .50), or increased by a factor of 2 ($ρ_{Y_{1} U_{1}}$ = .05, $ρ_{Y_{2} U_{1}}$ = .10, and $ρ_{Y_{3} U_{1}}$ = .20) or 3 ($ρ_{Y_{1} U_{1}}$ = .05, $ρ_{Y_{2} U_{1}}$ = .15, and $ρ_{Y_{3} U_{1}}$ = .45). In addition, we assumed that the correlation between $U_{1}$ and $X$ decreased by a factor of .50 ($ρ_{X_{1} U_{1}}$ = .30, $ρ_{X_{2} U_{1}}$ = .15, and $ρ_{X_{3} U_{1}}$ = .08) or was constant across time ($ρ_{X_{1} U_{1}}$ = $ρ_{X_{2} U_{1}}$ = $ρ_{X_{3} U_{1}}$ = .30). Overall, this resulted in six data conditions.

Table 3 shows the results for the six data conditions. As expected, the observation-level model with a single latent variable (OL1) produced unbiased estimates of the cross-lagged effect. By contrast, the models that assume no unmeasured confounding (CL1 and CL2) and the other latent variable-type models provided positively or negatively biased estimates of the cross-lagged effect, depending on the effect of the latent variable $U_{1}$ in the true data-generating model. For example, in condition B6, in which $U_{1}$ had an increasing effect across time on $Y$, the estimates of CL1 and CL2 were positively biased (CL1: .29, CL2: .27), while many of the estimates provided by the latent variable-type models were negatively biased (OL2: .16, OL3: .17, RL2: .05, FED: −.01). The estimate of the RL1 was unbiased in this condition but was too small in condition B4, in which $U_{1}$ had a decreasing effect across time on $Y$. Interestingly, the estimates produced by the observation-level model OL4 were almost unbiased. This indicates that, at least in the investigated conditions, the freely estimated loadings in OL4 can capture the time-varying (linear) effect of a single unmeasured variable.

Table 3. Results for Scenario B: True model is the observation-level model with single latent variable (OL1). Estimates of the cross-lagged effect for the different approaches.

Table 4. Results for Scenario C: True model is the residual-level (RL) model. Estimates for the cross-lagged effect for the different approaches.

4.3. Scenario C: Residual-Level Model as the Data-Generating Model

In Scenario C, we assumed that a residual-level model is the data-generating model with a true cross-lagged effect of $β_{Y_{3} X_{2}}$ = .20. The correlation of the latent variables was set to .60 (i.e., $ρ_{U_{1} U_{2}}$ = .60). We manipulated the variance of the between-person part of $X$ (i.e., $ϕ_{U_{1}}$ = .30, .50, and .70), the cross-lagged effect of $X_{1}$ on $Y_{2}$ ($β_{Y_{2} X_{1}}$ = .20 vs. $β_{Y_{2} X_{1}}$ = .05), and the loadings of $Y$ on $U_{2}$ ($λ_{Y_{1}, U_{2}}$ = $λ_{Y_{2}, U_{2}}$ = $λ_{Y_{3}, U_{2}}$ = 1.00 vs. $λ_{Y_{1}, U_{2}}$ = 1.00, $λ_{Y_{2}, U_{2}}$ = .80, and $λ_{Y_{3}, U_{2}}$ = .64). This resulted in nine different data conditions in which the regression coefficients and loadings were invariant across time (C1, C2, and C3), the regression coefficients differed but the loadings were the same across time (C4, C5, and C6), or the regression coefficients were invariant but the loadings differed across time (C7, C8, and C9).

As expected, the residual-level models could produce unbiased estimates of the cross-lagged effect in all nine conditions (Table 4). However, the estimates of RL1 and RL2 were biased when the true regression coefficients were not invariant across time (conditions C4, C5, and C6), and only RL3 with freely estimated loadings provided unbiased estimates when the true loadings were allowed to vary across time in the data-generating model (conditions C7, C8, and C9). As shown in the section “Relationship between the Residual-Level and Observation-Level Approaches”, the OL2 and FED are re-expressions of the residual-level models and produced identical estimates. However, this is no longer the case if the OL2 and FED are misspecified in conditions C4 to C9. Furthermore, the observation-level models with a simple structure loading matrix at time 1 (OL3 and OL4) produced negatively biased estimates of the cross-lagged effect across all conditions.

The estimates of the CL1 were consistently too small and underestimated the true magnitude of the cross-lagged effect, particularly when the variance of the between-person parts was large (i.e., $ϕ_{U_{1}}$ = .70). These findings replicated the results from Hamaker et al. (2015) and other simulation studies (Usami, Todo, et al., 2019) that the CL1 can produce distorted estimates of cross-lagged effects when the true data-generating model is an RL3. In addition, the CL2 that includes the lag-2 effects also produced estimates that underestimated the true magnitude of the cross-lagged effect. The bias for the CL2 was smaller but followed a similar pattern as the bias of the CL1.

4.4. Scenario D: FED as the Data-Generating Model

In Scenario D, we assumed that the fixed effects dynamic panel model (FED) is the true model. The data were generated from an observation-level model with two latent variables that were assumed to be correlated ($ρ_{U_{1} U_{2}}$ = .60). We manipulated the variance of $U_{1}$ (i.e., $ϕ_{U_{1}}$ = .50 and .70), the loading of $X_{3}$ on $U_{1}$ (i.e., $λ_{X_{3}, U_{1}}$ = .15 and .50), and the lag-2 effect of $Y_{1}$ on $X_{3}$ ($β_{X_{3} Y_{1}}$ = .00 and .07). This resulted in eight different conditions in which we varied the complexity in the $X$ part of the data by allowing for different loadings of $X$ on $U_{1}$ across time (conditions D3 and D4), additional lag-2 effects of $Y$ on $X$ (conditions D5 and D6), or both (conditions D7 and D8).

Table 5 shows that the FED produced unbiased estimates of the cross-lagged effect in all conditions. However, the estimates of the observation-level models (OL2, OL3, and OL4) were only unbiased if the loadings were invariant and no lag-2 effects were present in the data-generating model (conditions D1 and D2). This clearly illustrates that the FED can be interpreted as a more robust version of the observation-level models, and that the FED is to be preferred if researchers are only interested in estimating the cross-lagged effect but not in modeling the joint process of $X$ and $Y$. The CL1 and CL2 overestimated the true magnitude of the cross-lagged effect and produced positively biased estimates across all conditions. Furthermore, the estimates of the residual-level models were strongly biased, even in the conditions with invariant loadings and no lag-2 effects (D1 and D2).

Table 5. Results for Scenario D: True model is the fixed effects dynamic panel model (FED). Estimates for the cross-lagged effect for the different approaches.

4.5. Summary

The main findings of the simulations can be summarized as follows. First, when the CL2 was the true model (Scenario A), the estimates of the different latent variable-type models were biased in many conditions, indicating that the additional latent variables did not appropriately control for the lag-2 effects in the data-generating model. This shows that the potential to adjust for the effects of unmeasured variables comes at the price that the estimates of the latent variable methods are very sensitive to the parametric modeling assumptions needed for identifying the effects of the latent variables. Second, CL1 and CL2 produced biased estimates in the three scenarios in which unmeasured latent variables were included in the data-generating models (Scenarios B, C, and D). This was expected for CL1 and CL2 because these models rely on the assumption of no unmeasured confounding (i.e., all relevant covariates are measured). However, the results in these scenarios also show that the latent variable methods (OL1, OL2, OL3, OL4, RL1, RL2, RL3, and FED) strongly depended on the specific modeling assumptions made for estimating the effects of $U$. If these assumptions did not correspond with the data-generating model, they produced, in general, biased estimates of the cross-lagged effect. In practical applications with real data, the decision between the different approaches cannot be guided by model fit because different model specifications (with different estimates of the cross-lagged effect) will often provide an adequate description of the data (see Section “Limitations of Model Fit for Estimating Cross-Lagged Effects”). Third, the FED that is agnostic about the process of the exposure variable $X$ was less sensitive to specific modeling assumptions (see Allison et al., 2017) when estimating cross-lagged effects in the presence of unmeasured confounding. The estimates of the FED were biased in the conditions (B5, B6, and C4 to C9) in which specific parameter restrictions were needed for identifying the model (i.e., equal regression coefficients and loadings across time). With more measurement waves, these restrictions can be relaxed, and the FED would provide unbiased estimates in Scenarios B, C, and D. However, this would not resolve the central issue of Scenario A because the FED still needs to make assumptions about the process of $Y$ for identifying the unit effects of $U_{1}$.

5. The Role of Time for Estimating the Causal Cross-Lagged Effect

In this section, we discuss the crucial role of selecting time points in defining the cross-lagged effect of interest (e.g., Gollob & Reichardt, 1987). We assume that the target of inference is the causal cross-lagged effect of $X_{2}$ on $Y_{3}$, and we adjust for $X_{1}$, $Y_{1}$, and $Y_{2}$ in defining $τ_{T_{X, Y}, ACE}$, that is, $T_{X} = \{1\}$ and $T_{Y} = \{1, 2\}$. In the following, we assume that there are no unmeasured confounders. The bivariate process $Z_{t} = (X_{t}, Y_{t})$ (for $t \in \mathbb{R}$) is allowed to have an arbitrary dependency structure. We assume that $Z_{t}$ follows a Gaussian process. Hence, the vector $Z = (Z_{t_{1}}, \dots, Z_{t_{H}})$ for $H$ discrete time points $t_{1}, \dots, t_{H}$ is multivariate normally distributed. The only requirement is that the values of $Z_{t}$ only depend on the past; that is, they depend on $Z_{s}$ with $s < t$. It is important to emphasize that a continuous-time process $Z_{t}$ should be considered as the underlying data-generating model. However, a longitudinal design of discrete time points is chosen to define model parameters (i.e., including the causal effect) of interest. Notably, a finite-dimensional parameter of a discrete-time process in a longitudinal design is a summary of the continuous-time process $Z_{t}$ that can even depend on an infinite-dimensional parameter. In the following, we discuss the implications of choosing a particular longitudinal design by demonstrating that the model parameters in this design are functions of model parameters in a model with a refined grid of time points. By doing so, we clarify that the chosen time lags for defining the causal effect determine which effects (i.e., pathways) are controlled and which are part of the definition of the causal effect.

It is essential to understand that $τ_{T_{X, Y}, ACE}$ includes all effects of $X_{2}$ on $Y_{3}$ that are transmitted via intermediate pathways between $X_{2}$ and $Y_{3}$. This is illustrated in Figure 8 (upper panel), in which we consider the process at a refined grid of time points 1, 2, 2.5, and 3. As can be seen, the total effect of $X_{2}$ on $Y_{3}$ is transmitted via the pathways $X_{2} \to Y_{2.5} \to Y_{3}$, $X_{2} \to X_{2.5} \to Y_{3}$, and $X_{2} \to Y_{3}$ (Keogh et al., 2018). More formally, the causal cross-lagged effect contains a direct effect $γ_{Y_{3} X_{2}}$ and two indirect effects: $τ_{T_{X, Y}, ACE} = γ_{Y_{3} X_{2}} + γ_{Y_{3} X_{2.5}} γ_{X_{2.5} X_{2}} + γ_{Y_{3} Y_{2.5}} γ_{Y_{2.5} X_{2}}.$ (47)
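The path-tracing logic of Equation (47) can be checked with a small simulation sketch. The path coefficients are hypothetical illustration values, the function names `total_effect` and `simulate_refined_grid` are assumptions introduced here, and wave-1 variables are omitted for brevity, so that only $Y_{2}$ is adjusted for. The intermediate variables $X_{2.5}$ and $Y_{2.5}$ are generated but deliberately not used as predictors, which mimics a design that observes the process only at the integer time points.

```python
import numpy as np


def total_effect(g_y3_x2, g_y3_x25, g_x25_x2, g_y3_y25, g_y25_x2):
    """Direct effect plus the two indirect effects, as in Equation (47)."""
    return g_y3_x2 + g_y3_x25 * g_x25_x2 + g_y3_y25 * g_y25_x2


def simulate_refined_grid(n, seed=0):
    """Linear process on the grid 2, 2.5, 3 with hypothetical coefficients."""
    rng = np.random.default_rng(seed)
    x2 = rng.normal(size=n)
    y2 = 0.40 * x2 + rng.normal(size=n)
    x25 = 0.70 * x2 + 0.10 * y2 + rng.normal(size=n)   # gamma_{X2.5 X2} = 0.70
    y25 = 0.15 * x2 + 0.70 * y2 + rng.normal(size=n)   # gamma_{Y2.5 X2} = 0.15
    # gamma_{Y3 X2} = 0.05, gamma_{Y3 X2.5} = 0.20, gamma_{Y3 Y2.5} = 0.60
    y3 = 0.05 * x2 + 0.20 * x25 + 0.60 * y25 + rng.normal(size=n)
    return x2, y2, y3


def coef_of_x2(x2, y2, y3):
    """OLS coefficient of X2 when Y3 is regressed on X2 and Y2 only."""
    design = np.column_stack([np.ones_like(x2), x2, y2])
    coef, *_ = np.linalg.lstsq(design, y3, rcond=None)
    return coef[1]


if __name__ == "__main__":
    expected = total_effect(0.05, 0.20, 0.70, 0.60, 0.15)
    x2, y2, y3 = simulate_refined_grid(200_000, seed=2)
    print(f"Equation (47): {expected:.3f}")
    print(f"Regression on the coarse grid: {coef_of_x2(x2, y2, y3):.3f}")
```

In this sketch, the regression on the coarse grid should approximate the sum of the direct and indirect paths rather than the direct coefficient alone, because the intermediate variables are mediators that are left unadjusted.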

Figure 8. Process of X_t and Y_t considered at time points 1, 2, 2.5, and 3 (upper panel), and time points 1, 2, 2.33, 2.67, and 3 (lower panel). Thick paths are included in the causal cross-lagged effect of X₂ on Y₃ (with adjustment for X₁, Y₁, and Y₂).

In most applications, the total effect $τ_{T_{X, Y}, ACE}$ will differ from the direct effect $γ_{Y_{3} X_{2}}$ as well as the coefficient $γ_{Y_{3} X_{2.5}}$, which is based on a shorter time lag. This highlights the fact that the causal cross-lagged effect is defined independently of a particular process model, and that the causal effect of interest will not necessarily correspond to a regression coefficient in a well-fitting process model. To put it more clearly, it is not a question of model fit whether a researcher should report the total effect $τ_{T_{X, Y}, ACE}$ from Equation (47) (referring to $X_{2} \to Y_{3}$ with a time lag of 1) or $γ_{Y_{3} X_{2.5}}$ (referring to $X_{2.5} \to Y_{3}$ with a time lag of 0.5).

The cross-lagged effect $τ_{T_{X, Y}, ACE}$ can also be transmitted through pathways that include effects of $Y$ on $X$. We also consider the process at time points 1, 2, 2.33, 2.67, and 3 (see lower panel in Figure 8). As can be seen, the indirect effect via $X_{2} \to Y_{2.33} \to X_{2.67} \to Y_{3}$ would be part of the total effect $τ_{T_{X, Y}, ACE}$. Thus, the cross-lagged effect at a single point in time targets the total effect that is transmitted through all (indirect and direct) paths between $X_{2}$ and $Y_{3}$ (Keogh et al., 2018).

The longitudinal design also plays a crucial role when adjusting for previous exposure and outcome measures. We now consider the process at time points 1, 1.5, 2, and 3 (see Figure 9). As can be seen, with adjustment for $X_{1}$, $Y_{1}$, and $Y_{2}$, the cross-lagged effect $τ_{T_{X, Y}, ACE}$ includes effects of $X_{1.5}$ (i.e., $X_{2} \leftarrow X_{1.5} \to Y_{3}$) and $Y_{1.5}$ (i.e., $X_{2} \leftarrow Y_{1.5} \to Y_{3}$) if they exist. The causal effect can be calculated as $τ_{T_{X, Y}, ACE} = γ_{Y_{3} X_{2}} + γ_{Y_{3} X_{1.5}} γ_{X_{2} X_{1.5}} + γ_{Y_{3} Y_{1.5}} γ_{X_{2} Y_{1.5}}.$ (48)

Figure 9. Process of X_t and Y_t considered at time points 1, 1.5, 2, and 3. Thick paths are included in the causal cross-lagged effect of X₂ on Y₃ (with adjustment for X₁, Y₁, and Y₂).

Given this reasoning, it might be tempting to adjust for additional covariates $X_{1.5}$ and $Y_{1.5}$; that is, one would utilize $T_{X} = \{1, 1.5\}$ and $T_{Y} = \{1, 1.5, 2\}$ for defining an alternative causal effect. In this case, the prior levels of the exposure (i.e., $X_{1.5}$) would be treated as a confounder, and the causal effect would be $γ_{Y_{3} X_{2}}$, which typically differs from $τ_{T_{X, Y}, ACE}$ computed in Equation (48). As a consequence, the decision of a researcher to choose $T_{X} = \{1\}$ instead of $T_{X} = \{1, 1.5\}$ implies that in the former case, intermediate paths (i.e., the path $X_{2} \leftarrow X_{1.5} \to Y_{3}$) are also part of the causal effect of interest. Only controlling for $X_{1}$ does not rule out the possibility that intermediate exposures $X_{u}$ with $1 < u < 2$ also contribute to the causal effect, where the contribution strongly depends on the correlation of $X_{u}$ with $X_{2}$. This tradeoff in deciding whether to adjust for a previous exposure variable cannot be resolved by statistical modeling techniques (see VanderWeele et al., 2020, for a discussion of reasons to control for prior measures of $X$). For example, even in the case that a previous measure of the exposure received a statistically significant coefficient in a particular (process) model, it is still a conceptual question whether this covariate should serve as a control variable in the definition of the causal effect $τ_{T_{X, Y}, ACE}$.

One interesting feature of Figures 8 and 9 is that the time grid can be arbitrarily refined. One may argue that this is a perfect showcase for continuous-time structural equation modeling (CTSEM; van Montfort et al., 2018), and that the challenge of selecting appropriate time points can be circumvented by using CTSEM. However, we are less convinced about the benefits of CTSEM because most of the applications discussed in the methodological literature only consider CTSEM relying on the Markov property (i.e., CTSEM without memory), which boils down to a specification of a lag-1 model with a tiny time lag. Thus, we believe that CTSEM with the Markov property cannot adequately model the dependence structure of time points and will typically be misspecified. Given these limitations, whether CTSEM offers significant advantages over discrete-time models in assessing causal cross-lagged effects can be questioned.

6. Limitations of Model Fit for Estimating Cross-Lagged Effects

In this section, we discuss whether the assessment of model fit could help to choose between the different modeling approaches for estimating cross-lagged effects. Our main argument is that the assessment of model fit is only of limited usefulness because the different modeling approaches provide almost equivalent representations of the observed covariance structure. As the residual-level and observation-level models are very similar (see section 3.4 “Relationship Between the Residual-Level and Observation-Level Approaches”), we restrict our discussion to the residual-level approach.

Under a multivariate normal distribution, only the fit of an observed covariance matrix $S$ to a model-implied covariance matrix $Σ$ is investigated. In the residual-level approach, the covariance structure of the observations at all time points, that is, $Σ = Var (Z)$ for $Z = (Z_{1}, \dots, Z_{T})$, is decomposed as follows: $Σ = Σ_{B} + Σ_{W},$ (49) where $Σ_{B} = Var (Z^{B})$ is the covariance matrix of the between-person part, that is, $Z^{B} = (Z_{1}^{B}, \dots, Z_{T}^{B})$ in Equation (18), and $Σ_{W} = Var (Z^{W})$ is the covariance matrix of the within-person part, that is, $Z^{W} = (Z_{1}^{W}, \dots, Z_{T}^{W})$ in Equations (19) and (20). The causal cross-lagged effect is determined from the within-person part and, hence, is a parameter that affects the model-implied constrained covariance matrix $Σ_{W}$. For evaluating the assessment of global model fit, it is crucial that the model specification of the between-person part of the model is typically not independent of the specification of the within-person part. More specifically, a similar overall model fit for the total covariance $Σ$ is obtained if the model for the between-person part is made more complex (e.g., by including time-varying loadings or random slopes), while the model for the within-person part is less complex (e.g., by specifying invariant lag-1 effects). If distinct freely estimated model parameters refer to $Σ_{B}$ and $Σ_{W}$, then a total number of degrees of freedom $df_{0}$ associated with the population covariance $Σ$ can be used for modeling the between-person part (i.e., using $p_{B}$ parameters) and the within-person part (i.e., using $p_{W}$ parameters). The resulting degrees of freedom $df$ for the fitted residual-level model are given by $df = df_{0} - p_{B} - p_{W}.$ (50)
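The decomposition in Equation (49) can be made concrete with a short numerical sketch. It assumes a between-person part $Z_{t}^{B} = Λ_{t} U$ without covariates $C$ and a within-person part with $Z_{1}^{W} = e_{1}$ and $Z_{t}^{W} = B Z_{t - 1}^{W} + e_{t}$ for $t \geq 2$ with a time-invariant lag-1 matrix $B$. The function names and all numerical values are hypothetical and serve only to show how the two parts add up to the model-implied covariance matrix.

```python
import numpy as np


def between_cov(loadings, phi):
    """Sigma_B = Lambda Phi Lambda' with the wave-specific loadings stacked row-wise."""
    lam = np.vstack(loadings)
    return lam @ phi @ lam.T


def within_cov(B, psi_first, psi, n_waves):
    """Sigma_W for a lag-1 within-person process with time-invariant B."""
    k = B.shape[0]
    # Variances of the within-person part at each wave
    variances = [psi_first]
    for _ in range(1, n_waves):
        variances.append(B @ variances[-1] @ B.T + psi)
    sigma_w = np.zeros((k * n_waves, k * n_waves))
    for s in range(n_waves):
        block = variances[s]                     # Cov(Z_s^W, Z_s^W)
        for t in range(s, n_waves):
            sigma_w[t * k:(t + 1) * k, s * k:(s + 1) * k] = block
            sigma_w[s * k:(s + 1) * k, t * k:(t + 1) * k] = block.T
            block = B @ block                    # Cov(Z_{t+1}^W, Z_s^W)
    return sigma_w


if __name__ == "__main__":
    # Rows: X_t, Y_t; columns: X_{t-1}, Y_{t-1}; B[1, 0] is the cross-lagged effect
    B = np.array([[0.5, 0.0],
                  [0.2, 0.5]])
    phi = np.array([[0.5, 0.3],
                    [0.3, 0.5]])                 # covariance of the stable traits
    loadings = [np.eye(2)] * 3                   # time-invariant unit loadings
    sigma_b = between_cov(loadings, phi)
    sigma_w = within_cov(B, np.eye(2), np.eye(2), n_waves=3)
    sigma = sigma_b + sigma_w                    # Equation (49)
    print(np.round(sigma, 3))
```

The cross-lagged parameter enters only through `within_cov`, whereas the trait variances and loadings enter only through `between_cov`. Only the sum is compared with the observed covariance matrix, which is why different allocations of complexity between the two parts can fit similarly well.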

Thus, if the number of parameters $p_{B}$ for the between-person part is increased, the number of within-person part parameters $p_{W}$ needs to be reduced to obtain the same number of degrees of freedom $df$.³ This illustrates that the modeled complexity of one part of the model typically affects the required complexity of the other part of the model. However, the regression parameters for the within-person part will have different interpretations, depending on the specification of the model of the between-person and within-person parts, even though the overall model fit is the same for the different specifications (see Usami, Murayama, et al., 2019).
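The bookkeeping in Equation (50) can be sketched as follows. The sketch assumes that $df_{0}$ counts the non-redundant elements of the covariance matrix only (means are ignored), and the parameter counts for the two specifications are hypothetical values chosen for illustration; they do not correspond to specific models from the simulation study.

```python
def saturated_df(n_variables):
    """Number of non-redundant elements of a covariance matrix (assumed df_0)."""
    return n_variables * (n_variables + 1) // 2


def model_df(df0, p_between, p_within):
    """Degrees of freedom of the fitted model, as in Equation (50)."""
    return df0 - p_between - p_within


if __name__ == "__main__":
    df0 = saturated_df(6)  # two variables at three waves
    # Hypothetical allocation 1: simple between part, richer within part
    print(model_df(df0, p_between=3, p_within=12))
    # Hypothetical allocation 2: richer between part, simpler within part
    print(model_df(df0, p_between=5, p_within=10))
```

Both hypothetical allocations leave the same number of degrees of freedom, although the within-person regression parameters would be interpreted differently in the two specifications.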

The limited usefulness of model fit was also evident in Scenario A of the simulation in which the CL2 was the true data-generating model. In this scenario, the residual-level model RL3 included stable traits with time-invariant loadings for the between-person part and assumed an AR(1) process for the within-person part. The CL2 did not model the between-person structure but assumed an AR(2) process. Both models showed a very similar fit, but the parameter estimates were very different. In such a constellation, the discussion of bias depends on how the true data-generating process is defined, and researchers cannot rely on model fit to choose between the CL2 or RL3. Consequently, the bias of a causal effect estimate from a particular model is entirely unrelated to its model fit.

It should be noted that the residual-level models contain some degrees of freedom and can be rejected based on model fit. However, in an application, researchers would modify their particular specification of the residual-level model to obtain a better-fitting model (e.g., freely estimated loadings). Successively using this model modification strategy, one will get a saturated model with a perfect fit or a non-saturated model with a model fit close to the CL2 model. Then, the modified residual-level model has (approximately) the same model fit as the CL2, but the cross-lagged effects in both models substantially differ. Hence, statistical techniques (and model fit in particular) cannot resolve how to define the causal effect of interest.

Furthermore, this reasoning does not change with the availability of (many) more measurement points. For example, the RL1 with an AR(2) process for the within-person part could be equivalent to a cross-lagged panel model with an AR(3) process. Alternatively, random slopes (e.g., growth factors) could be included in the between-person part for the RL1, while an AR(1) process is specified in the within-person part. Overall, this illustrates how different parametric assumptions in latent variable models for unmeasured confounding can result in equivalent representations of the observed covariances while producing cross-lagged effects with different interpretations.

Finally, it should be added that the limited value of model fit also applies to models that rely on the assumption of no unmeasured confounding. For example, a researcher can be interested in assessing the causal effect with a CL1 model, even in the case that the CL2 model provides a better model fit than the CL1 for modeling the process $(X_{t}, Y_{t})$ for $t \geq 2$. If the researcher believes, based on subject-matter knowledge, that adjusting for $X_{t - 2}$ would result in overadjustment, the better model fit of the CL2 can be ignored, and the cross-lagged effect from the CL1 should be utilized for assessing the causal effect of interest. Similarly, the existence of stable trait factors in a process model for $X$ and $Y$ does not imply that they need to be treated as confounders in defining the causal cross-lagged effect. We believe these decisions need to rely on subject-matter knowledge and should not be grounded on model fit.

7. Concluding Remarks

In the present article, we discussed different approaches for estimating cross-lagged effects with a cross-lagged panel design. We applied a causal inference perspective and distinguished between models that assume all relevant covariates are measured (CL1 and CL2) and latent variable-type models that used parametric assumptions to adjust for the effects of unmeasured time-invariant confounding variables. This final section highlights issues that need consideration when choosing between the different approaches in a specific application.

We confirmed with simulated data the well-known fact (e.g., Finkel, 1995; Little, 2013) that the CLPM provides biased estimates of cross-lagged effects if the assumption of no unmeasured confounding is violated (i.e., included covariates are not sufficient to remove confounding). However, it is less emphasized in the methodological literature that it is often beneficial to include lag-2 effects (i.e., CL2) when estimating cross-lagged effects. The main advantage of including lag-2 effects is that allowing for these additional effects of prior measures of $X$ and $Y$ can provide a stronger control for the presence of confounding. However, if researchers believe that the prior levels of the exposure (i.e., $X_{1}$) would not act as a confounder but can be considered an essential part of $X_{2}$, controlling for prior levels of the exposure $X$ would result in estimates of cross-lagged effects that may be overadjusted by the prior levels of the exposure and difficult to interpret (see VanderWeele et al., 2020, for a discussion of reasons to control for prior measures of $X$).

The simulations also showed that the different latent variable-type models have the potential to adjust for the effects of time-invariant unmeasured confounding by including additional latent variables in the model. However, the possibility to adjust for unmeasured confounding comes at the price of additional restrictive parametric assumptions for modeling the process of $Z_{t}$ (i.e., linearity and restrictions on the dependence structure). The simulation results revealed that the performance of the different latent variable-type methods strongly depended on the parametric assumptions that were used for identifying the effects of the additional latent variables. By contrast, no modeling assumptions about the process of $Z_{t}$ need to be made in the approach that relies on the assumption of no unmeasured confounding because only the treatment-effect function (i.e., the conditional expectation of the outcome given the covariates; see Equation (2)) needs to be modeled. However, as pointed out before, no unmeasured confounding is a strong assumption that is often hard to justify. On the other hand, claiming that a particular latent variable-type model controls for unmeasured confounding is frequently an equally strong assumption. Finally, the approaches that we discussed can also be applied when a causal interpretation of the estimated effect is not warranted. In this case, the cross-lagged effects can be interpreted as “adjusted associations” (Brumback, 2021), which are still valuable for many applications. We consider it a major advantage of the causal inference perspective that it forces researchers to think about the main target of inference and the role of potential confounding variables.

We need to mention that our discussion of the cross-lagged panel design has several limitations. First, as pointed out before, we focused on the causal effect of a variable at a single point in time (i.e., the effect of $X_{2}$ on $Y_{3}$). In the typical cross-lagged panel design, the variable $X$ varies across time, and it would also be possible to estimate the joint or cumulative effect of $X_{1}$ and $X_{2}$ on $Y_{3}$. These cumulative effects are rarely investigated in psychological research. One of the few exceptions is VanderWeele et al. (2011), who investigated the cumulative effects of loneliness on depression using a cross-lagged panel design with five measurement waves (see also Silvey et al., 2021). However, it should be added that the assumptions for identifying the cumulative effect of a treatment trajectory over a series of time points are more demanding than for the effect at a single point in time (Hernán & Robins, 2020; Robins et al., 2000; see also Daniel et al., 2013). The main challenge is to adequately control for time-varying confounders (e.g., prior measures of the outcome $Y_{2}$ that may also be affected by prior measures of the exposure $X_{1}$). This is an active area of methodological research in longitudinal causal inference (e.g., Keogh et al., 2018; Newsome et al., 2018; Wodtke, 2020).

Second, we assumed that the variables were measured without error. In most practical applications, both $X$ and $Y$ will be affected by measurement error, resulting in biased estimates of regression coefficients. Researchers often use multiple indicators to control for measurement error (i.e., internal consistency error) if psychological constructs are measured by multiple items. Multiple indicators can be easily included in the models discussed in the present paper (e.g., Little, 2013; Mulder & Hamaker, 2021). However, internal consistency measures of reliability only focus on measurement error caused by the finite number of items on the test. It would be possible to extend the models to account for other sources of error (e.g., short-term fluctuations of the measure or transient error; McCrae, 2015) when estimating cross-lagged effects with error-prone variables (Heise, 1969; Kenny & Zautra, 1995). However, the main arguments for choosing between the different modeling approaches are independent of the issue of handling measurement error.

Finally, in practical applications, the selection of covariates can be challenging (VanderWeele, 2019). Even experts with subject-matter knowledge will often disagree on whether a specific variable should be treated as a confounder or would result in overadjustment (if included in the adjustment set). One reasonable strategy would be to juxtapose the different approaches and use their estimates (or the lower and upper bounds of the respective confidence intervals) as bounds for the true cross-lagged effect (Leamer, 1985). This would reveal how sensitive the conclusions are to different assumptions about confounding variables. Estimating cross-lagged effects under model uncertainty is an interesting topic for future research (see also Athey & Imbens, 2015; Young & Holsteen, 2017).
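The bounding strategy can be sketched in a few lines. The estimates below are the values reported above for condition B6 of Scenario B, in which the true cross-lagged effect was .20; the function name `extreme_bounds` is an assumption introduced here, and the sketch uses point estimates only rather than confidence limits.

```python
def extreme_bounds(estimates):
    """Return the smallest and largest estimate across model specifications."""
    values = list(estimates.values())
    return min(values), max(values)


if __name__ == "__main__":
    # Cross-lagged effect estimates reported for condition B6
    b6_estimates = {
        "CL1": 0.29,
        "CL2": 0.27,
        "OL2": 0.16,
        "OL3": 0.17,
        "RL2": 0.05,
        "FED": -0.01,
    }
    lower, upper = extreme_bounds(b6_estimates)
    print(f"Range across specifications: [{lower:.2f}, {upper:.2f}]")
```

In this condition, the range across specifications contains the true value, but it is wide and includes zero, which illustrates how strongly the conclusions can depend on the assumptions about confounding.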

Notes

1 Usually, a second assumption is added (positivity assumption) which states that in the population, the density of observing any exposure level given the covariates is positive. This assumption implies that there exists sufficient overlap in the covariate distributions between the different exposure levels under consideration.

2 Allison et al. (Citation2017, p. 3) write: “And not having to specify the functional form of the dependence of x on y both simplifies the estimation problem and reduces the danger of misspecification. If you are interested in the dependence of x on y, you can always specify a second dynamic panel model for y and estimate that separately.”

3 Note that this is different from multilevel structural equation modelling where the level-1 units are treated as exchangeable, and separate saturated covariance structures exist at level 1 and level 2. In this case, the number of estimated parameters at one level (e.g., level 2) does not affect the number of estimable parameters at the other level (e.g., level 1), and level-specific evaluations for $\Sigma_{B}$ and $\Sigma_{W}$ are possible and recommended (Ryu & West, Citation2009).

References

Allison, P. D., Williams, R., & Moral-Benito, E. (2017). Maximum likelihood for cross-lagged panel models with fixed effects. Socius, 3, 237802311771057. https://doi.org/10.1177/2378023117710578
Andersen, H. K. (2021). Equivalent approaches to dealing with unobserved heterogeneity in cross-lagged panel models? Investigating the benefits and drawbacks of the latent curve model with structured residuals and the random intercept cross-lagged panel model. Psychological Methods, Advance online publication. https://doi.org/10.1037/met0000285
Angrist, J. D., & Pischke, J.-S. (2009). Mostly harmless econometrics: An empiricist’s companion. Princeton University Press. https://doi.org/10.1515/9781400829828
Aronow, P. M., & Miller, B. T. (2019). Foundations of agnostic statistics. Cambridge University Press. http://dx.doi.org/10.1017/9781316831762
Asendorpf, J. B. (2021). Modeling developmental processes. In J. R. Rauthmann (Ed.), Handbook of personality dynamics and processes (pp. 815–835). Elsevier. https://doi.org/10.1016/B978-0-12-813995-0.00031-5
Asparouhov, T., & Muthén, B. (2021). Residual structural equation models. Retrieved from https://www.statmodel.com/download/Asparouhov_Muthen_2021a.pdf
Athey, S., & Imbens, G. W. (2015). A measure of robustness to misspecification. American Economic Review, 105, 476–480. https://doi.org/10.1257/aer.p20151020
Athey, S., & Imbens, G. W. (2019). Machine learning methods that economists should know about. Annual Review of Economics, 11, 685–725. https://doi.org/10.1146/annurev-economics-080217-053433
Bai, J., & Li, K. (2014). Theory and methods of panel data models with interactive effects. The Annals of Statistics, 42, 142–170. https://doi.org/10.1214/13-AOS1183
Bailey, D. H., Oh, Y., Farkas, G., Morgan, P., & Hillemeier, M. (2020). Reciprocal effects of reading and mathematics? Beyond the cross-lagged panel model. Developmental Psychology, 56, 912–921. https://doi.org/10.1037/dev0000902
Berk, R. A., Brown, L., Buja, A., George, E., Pitkin, E., Zhang, K., & Zhao, L. (2014). Misspecified mean function regression: Making good use of regression models that are wrong. Sociological Methods and Research, 43, 433–451. http://dx.doi.org/10.1177/0049124114526375
Berry, D., & Willoughby, M. T. (2017). On the practical interpretability of cross-lagged panel models: Rethinking a developmental workhorse. Child Development, 88, 1186–1206. https://doi.org/10.1111/cdev.12660
Bollen, K. A., & Brand, J. E. (2010). A general panel model with random and fixed effects: A structural equations approach. Social Forces, 89, 1–34. https://doi.org/10.1353/sof.2010.0072
Bollen, K. A., & Curran, P. J. (2004). Autoregressive latent trajectory (ALT) models: A synthesis of two traditions. Sociological Methods & Research, 32, 336–383. https://doi.org/10.1177/0049124103260222
Brumback, B. A. (2021). Fundamentals of causal inference with R. Chapman and Hall. http://dx.doi.org/10.1201/9781003146674
Curran, P. J., & Hancock, G. R. (2021). The challenge of modeling co-developmental processes over time. Child Development Perspectives, 15, 67–75. http://dx.doi.org/10.1111/cdep.12401
Daniel, R. M., Cousens, S. N., De Stavola, B. L., Kenward, M. G., & Sterne, J. A. C. (2013). Methods for dealing with time-dependent confounding. Statistics in Medicine, 32, 1584–1618. https://doi.org/10.1002/sim.5686
De Vos, I., & Everaert, G. (2021). Bias-corrected common correlated effects pooled estimation in dynamic panels. Journal of Business & Economic Statistics, 39, 294–306. https://doi.org/10.1080/07350015.2019.1654879
Dietvorst, E., Hiemstra, M., Hillegers, M. H. J., & Keijsers, L. (2018). Adolescent perceptions of parental privacy invasion and adolescent secrecy: An illustration of Simpson’s paradox. Child Development, 89, 2081–2090. https://doi.org/10.1111/cdev.13002
Dishop, C. R., & DeShon, R. P. (2021). A tutorial on Bollen and Brand’s approach to modeling dynamics while attending to dynamic panel bias. Psychological Methods, Advance online publication. https://doi.org/10.1037/met0000333
Ehm, J.-H., Hasselhorn, M., & Schmiedek, F. (2019). Analyzing the developmental relation of academic self-concept and achievement in elementary school children: Alternative models point to different results. Developmental Psychology, 55, 2336–2351. https://doi.org/10.1037/dev0000796
Finkel, S. (1995). Causal analysis with panel data. Sage. http://dx.doi.org/10.4135/9781412983594
Gollob, H. F., & Reichardt, C. S. (1987). Taking account of time lags in causal models. Child Development, 58, 80–92. http://dx.doi.org/10.2307/1130293
Grimm, K. J., Helm, J., Rodgers, D., & O'Rourke, H. (2021). Analyzing cross-lag effects: A comparison of different cross-lag modeling approaches. New Directions for Child and Adolescent Development, 2021, 11–33. http://dx.doi.org/10.1002/cad.20401
Hamaker, E. L. (2005). Conditions for the equivalence of the autoregressive latent trajectory model and a latent growth curve model with autoregressive disturbances. Sociological Methods & Research, 33, 404–416. http://dx.doi.org/10.1177/0049124104270220
Hamaker, E. L., Kuiper, R. M., & Grasman, R. P. P. P. (2015). A critique of the cross-lagged panel model. Psychological Methods, 20, 102–116. http://dx.doi.org/10.1037/a0038889
Heise, D. R. (1969). Separating reliability and stability in test-retest correlation. American Sociological Review, 34, 93–101. http://dx.doi.org/10.2307/2092790
Hernán, M. A., & Robins, J. M. (2020). Causal inference: What if. Chapman & Hall/CRC.
Hirano, K., & Imbens, G. W. (2004). The propensity score with continuous treatments. In A. Gelman & X.-L. Meng (Eds.), Applied Bayesian modeling and causal inference from incomplete-data perspectives (pp. 73–84). Wiley. http://dx.doi.org/10.1002/0470090456.ch7
Hsiao, C. (2014). Analysis of panel data (3rd ed.). Cambridge University Press. http://dx.doi.org/10.1017/CBO9781139839327
Imbens, G. W. (2004). Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and Statistics, 86, 4–29. http://dx.doi.org/10.1162/003465304323023651
Imbens, G. W., & Rubin, D. B. (2015). Causal inference for statistics, social, and biomedical sciences. Cambridge University Press. https://doi.org/10.1017/CBO9781139025751
Kenny, D. A., & Zautra, A. (1995). The trait-state-error model for multiwave data. Journal of Consulting and Clinical Psychology, 63, 52–59. http://dx.doi.org/10.1037/0022-006X.63.1.52
Keogh, R. H., Daniel, R. M., VanderWeele, T. J., & Vansteelandt, S. (2018). Analysis of longitudinal studies with repeated outcome measures: Adjusting for time-dependent confounding using conventional methods. American Journal of Epidemiology, 187, 1085–1092. http://dx.doi.org/10.1093/aje/kwx311
Kessler, R. C., & Greenberg, D. F. (1981). Linear panel analysis: Models of quantitative change. Academic Press.
Leamer, E. E. (1985). Sensitivity analysis would help. The American Economic Review, 75, 308–313.
Little, T. D. (2013). Longitudinal structural equation modeling. Guilford Press.
Littlefield, A. K., King, K. M., Acuff, S. F., Foster, K. T., Murphy, J. G., & Witkiewitz, K. (2021). Limitations of cross-lagged panel models in addiction research and alternative models: An empirical example using project MATCH. Psychology of Addictive Behaviors, Advance online publication. https://doi.org/10.1037/adb0000750
Lucas, R. E. (2022). It’s time to abandon the cross-lagged panel model. PsyArXiv Preprints, https://doi.org/10.31234/osf.io/pkec7
Marsh, H. W., Pekrun, R., Murayama, K., Arens, A. K., Parker, P. D., Guo, J., & Dicke, T. (2018). An integrated model of academic self-concept development: Academic self-concept, grades, test scores, and tracking over 6 years. Developmental Psychology, 54, 263–280. http://dx.doi.org/10.1037/dev0000393
Marsh, H. W., Trautwein, U., Lüdtke, O., Köller, O., & Baumert, J. (2005). Academic self-concept, interest, grades, and standardized test scores: Reciprocal effects models of causal ordering. Child Development, 76, 397–416. http://dx.doi.org/10.1111/j.1467-8624.2005.00853.x
Maxwell, S. E., & Delaney, H. D. (2004). Designing experiments and analyzing data: A model comparison perspective. Erlbaum. http://dx.doi.org/10.4324/9781410609243
McCrae, R. R. (2015). A more nuanced review of reliability: Specificity in the trait hierarchy. Personality and Social Psychology Review, 19, 97–112. http://dx.doi.org/10.1177/1088868314541857
Moral-Benito, E. (2013). Likelihood-based estimation of dynamic panels with predetermined regressors. Journal of Business & Economic Statistics, 31, 451–472. http://dx.doi.org/10.1080/07350015.2013.818003
Morgan, S. L., & Winship, C. (2015). Counterfactuals and causal inference: Methods and principles for social research (2nd ed.). Cambridge University Press. https://doi.org/10.1017/CBO9781107587991
Mulder, J. D., & Hamaker, E. L. (2021). Three extensions of the random intercept cross-lagged panel model. Structural Equation Modeling, 28, 638–648. http://dx.doi.org/10.1080/10705511.2020.1784738
Mund, M., & Nestler, S. (2019). Beyond the cross-lagged panel model: Next-generation tools for analyzing interdependencies across the life course. Advances in Life Course Research, 41, 100249. http://dx.doi.org/10.1016/j.alcr.2018.10.002
Newsom, J. T. (2015). Longitudinal structural equation modeling: A comprehensive introduction. Routledge.
Newsome, S. J., Keogh, R. H., & Daniel, R. M. (2018). Estimating long-term treatment effects in observational data: A comparison of the performance of different methods under real-world uncertainty. Statistics in Medicine, 37, 2367–2390. http://dx.doi.org/10.1002/sim.7664
Núñez-Regueiro, F., Juhel, J., Bressoux, P., & Nurra, C. (2021). Identifying reciprocities in school motivation research: A review of issues and solutions associated with cross-lagged effects models. Journal of Educational Psychology, Advance online publication. http://dx.doi.org/10.1037/edu0000700
Oh, Y., Greenberg, M. T., & Willoughby, M. T. (2020). Examining longitudinal associations between externalizing and internalizing behavior problems at within- and between-child levels. Journal of Abnormal Child Psychology, 48, 467–480. http://dx.doi.org/10.1007/s10802-019-00614-6.
Orth, U., Clark, D. A., Donnellan, M. B., & Robins, R. W. (2021). Testing prospective effects in longitudinal research: Comparing seven competing cross-lagged models. Journal of Personality and Social Psychology, 120, 1013–1034. http://dx.doi.org/10.1037/pspp0000358
Plewis, I. (1985). Analysing change: Methods for the measurement and explanation of change in the social sciences. Wiley.
Reichardt, C. S. (2019). Quasi-Experimentation: A guide to design and analysis. Guilford Press.
Robins, J. M., Hernán, M. A., & Brumback, B. (2000). Marginal structural models and causal inference in epidemiology. Epidemiology, 11, 550–560. http://dx.doi.org/10.1097/00001648-200009000-00011.
Rosseel, Y. (2012). lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48, 1–36. https://doi.org/10.18637/jss.v048.i02
Ruzek, E. A., & Schenke, K. (2019). The tenuous link between classroom perceptions and motivation: A within-person longitudinal study. Journal of Educational Psychology, 111, 903–917. http://dx.doi.org/10.1037/edu0000323
Ryu, E., & West, S. G. (2009). Level-specific evaluation of model fit in multilevel structural equation modeling. Structural Equation Modeling, 16, 583–601. http://dx.doi.org/10.1080/10705510903203466
Shamsollahi, A., Zyphur, M. J., & Ozkok, O. (2021). Long-run effects in dynamic systems: New tools for cross-lagged panel models. Organizational Research Methods, 2021, 109442812199322. https://doi.org/10.1177/1094428121993228
Silvey, C., Demir-Lira, Ö. E., Goldin-Meadow, S., & Raudenbush, S. W. (2021). Effects of time-varying parent input on children’s language outcomes differ for vocabulary and syntax. Psychological Science, 32, 536–548. http://dx.doi.org/10.1177/0956797620970559
Usami, S. (2021). On the differences between general cross-lagged panel model and random-intercept cross-lagged panel model: Interpretation of cross-lagged parameters and model choice. Structural Equation Modeling, 28, 331–344. http://dx.doi.org/10.1080/10705511.2020.1821690
Usami, S., Murayama, K., & Hamaker, E. L. (2019). A unified framework of longitudinal models to examine reciprocal relations. Psychological Methods, 24, 637–657. http://dx.doi.org/10.1037/met0000210
Usami, S., Todo, N., & Murayama, K. (2019). Modeling reciprocal effects in medical research: Critical discussion on the current practices and potential alternative models. PLoS One, 14, e0209133. http://dx.doi.org/10.1371/journal.pone.0209133
Van der Laan, M. J., & Rose, S. (2018). Targeted learning in data science: Causal inference for complex longitudinal studies. Springer. https://doi.org/10.1007/978-3-319-65304-4
van Montfort, K., Oud, J. H. L., & Voelkle, M. C. (Eds.). (2018). Continuous time modeling in the behavioral and related sciences. Springer Nature. http://dx.doi.org/10.1007/978-3-319-77219-6
VanderWeele, T. J. (2015). Explanation in causal inference: Methods for mediation and interaction. Oxford University Press.
VanderWeele, T. J. (2019). Principles of confounder selection. European Journal of Epidemiology, 34, 211–219. http://dx.doi.org/10.1007/s10654-019-00494-6.
VanderWeele, T. J. (2021). Causal inference with time-varying exposures. In T. L. Lash, T. J. VanderWeele, S. Haneuse, & K. J. Rothman (Eds.), Modern epidemiology (pp. 605–618). Wolters Kluwer.
VanderWeele, T. J., Hawkley, L. C., Thisted, R. A., & Cacioppo, J. T. (2011). A marginal structural model analysis for loneliness: Implications for intervention trials and clinical practice. Journal of Consulting and Clinical Psychology, 79, 225–235. http://dx.doi.org/10.1037/a0022610
VanderWeele, T. J., Jackson, J. W., & Li, S. (2016). Causal inference and longitudinal data: A case study of religion and mental health. Social Psychiatry and Psychiatric Epidemiology, 51, 1457–1466. http://dx.doi.org/10.1007/s00127-016-1281-9
VanderWeele, T. J., Mathur, M. B., & Chen, Y. (2020). Outcome-wide longitudinal designs for causal inference: A new template for empirical studies. Statistical Science, 35, 437–466. http://dx.doi.org/10.1214/19-STS728
Vegetabile, B. G., Griffin, B. A., Coffman, D. L., Cefalu, M., Robbins, M. W., & McCaffrey, D. F. (2021). Nonparametric estimation of population average dose-response curves using entropy balancing weights for continuous exposures. Health Services & Outcomes Research Methodology, 21, 69–110. http://dx.doi.org/10.1007/s10742-020-00236-2.
Williams, R., Allison, P. D., & Moral-Benito, E. (2018). Linear dynamic panel-data estimation using maximum likelihood and structural equation modeling. The Stata Journal, 18, 293–326. http://dx.doi.org/10.1177/1536867X1801800201
Wodtke, G. T. (2020). Regression-based adjustment for time-varying confounders. Sociological Methods & Research, 49, 906–946. http://dx.doi.org/10.1177/0049124118769087
Wooldridge, J. M. (2010). Econometric analysis of cross section and panel data. MIT Press.
Young, C., & Holsteen, K. (2017). Model uncertainty and robustness: A computational framework for multimodel analysis. Sociological Methods & Research, 46, 3–40. http://dx.doi.org/10.1177/0049124115610347
Zhou, N., Cao, H., Liu, F., Wu, L., Liang, Y., Xu, J., Meng, H., Zang, N., Hao, R., An, Y., Ma, S., Fang, X., & Zhang, J. (2020). A four-wave, cross-lagged model of problematic internet use and mental health among Chinese college students: Disaggregation of within-person and between-person effects. Developmental Psychology, 56, 1009–1021. http://dx.doi.org/10.1037/dev0000907.
Zyphur, M. J., Allison, P. D., Tay, L., Voelkle, M. C., Preacher, K. J., Zhang, Z., Hamaker, E. L., Shamsollahi, A., Pierides, D. C., Koval, P., & Diener, E. (2020). From data to causes I: Building a general cross-lagged panel model (GCLM). Organizational Research Methods, 23, 651–687. http://dx.doi.org/10.1177/1094428119847278

