Views

CrossRef citations to date

Altmetric

Teacher’s Corner

A Bayesian Vector Autoregressive Model with Nonignorable Missingness in Dependent Variables and Covariates: Development, Evaluation, and Application to Family Processes

Linying JiThe Pennsylvania State UniversityCorrespondence[email protected]
View further author information

Meng ChenThe Pennsylvania State UniversityView further author information

Zita OraveczThe Pennsylvania State UniversityView further author information

E. Mark CummingsUniversity of Notre DameView further author information

Zhao-Hua LuSt. Jude Children’s Research HospitalView further author information

Sy-Miin ChowThe Pennsylvania State UniversityView further author information

Abstract

Intensive longitudinal designs involving repeated assessments of constructs often face the problems of nonignorable attrition and selected omission of responses on particular occasions. However, time series models, such as vector autoregressive (VAR) models, are often fit to these data without consideration of nonignorable missingness. We introduce a Bayesian model that simultaneously represents the over-time dependencies in multivariate, multiple-subject time series data via a VAR model, and possible ignorable and nonignorable missingness in the data. We provide software code for implementing this model with application to an empirical data set. Moreover, simulation results comparing the joint approach with two-step multiple imputation procedures are included to shed light on the relative strengths and weaknesses of these approaches in practical data analytic scenarios.

Keywords:

Notes

¹ To start the procedure, all missing observations are filled in using random draws with replacement from all observed values. Then, for the first iteration, $ϕ_{1}^{1}$ is drawn from the distribution $P (ϕ_{1}^{1} | y_{1}^{o b s}, Y_{- 1}^{0}, X^{0}, R)$ . Missing values in $y_{1}$ , $y_{1}^{m i s s}$ , are then filled in by drawing values from $P (y_{1} | y_{1}^{o b s}, Y_{- 1}^{0}, X^{0}, R, ϕ_{1}^{1})$ , the posterior predictive distribution of $y_{1}$ conditioned on $y_{1}^{o b s}, Y_{- 1}^{0}, X^{0}, R$ , and $ϕ_{1}^{1}$ . Here, we use superscript $k$ to denote data sets and parameter estimates from the $k^{t h}$ iteration, with $k = 0$ denoting the original data sets or initial starting values of the parameters. Similar procedure is subsequently performed to generate predicted values for $y_{2}^{m i s s}$ , only that imputed values for $y_{1}$ from the previous step are used in the prediction process. The first iteration ends when missing observations are filled in for all variables. The procedure is repeated for $K$ iterations to result in one set of data with imputed values. The whole procedure is repeated multiple times to generate multiple imputed data sets and correspondingly, multiple sets of parameter and standard error estimates for subsequent pooling (van Buuren, Citation2012).

² The mixture models factor the full-data model as:

P (Y, R | X, ω) = P (Y | R, X, ω) P (R | X, ω),

With this model, the relations between

Y

and

X

are conditioned on different missing data patterns. In contrast, the shared parameter approach assumes a multilevel structure and models random effects

b

jointly with

Y

and

R

with following general model:

P (Y, R | X, ω) = \int P (Y, R, b | X, ω) d b .

³ The version of the R package dynr we used in this study contained a small error in handling the uncertainty of the missing data, which resulted in slight overestimation of the process noise variances, and higher corresponding biases for these particular parameters under the Partial MI approach. Without this error, biases for all process noise-related parameters would likely be even lower under the Partial MI, but conclusions involving other parameters should remain the same.

⁴ The standard deviation of all point estimates on a parameter across MC runs.

⁵ Since neither the LD method nor the two-step partial MI method explicitly specify and estimate a missingness model, no missing data parameter estimates were available from these methods.

Additional information

Funding

This work was supported by the National Center for Advancing Translational Sciences [UL TR000127]; The Intensive Longitudinal Health Behavior Cooperative Agreement Program funded by the National Institutes of Health under Award Number U24AA027684; National Institutes of Health [R01GM105004]; National Science Foundation [IGE-1806874]; and Penn State Quantitative Social Sciences Initiative.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

A Bayesian Vector Autoregressive Model with Nonignorable Missingness in Dependent Variables and Covariates: Development, Evaluation, and Application to Family Processes

Information for

Open access

Opportunities

Help and information

A Bayesian Vector Autoregressive Model with Nonignorable Missingness in Dependent Variables and Covariates: Development, Evaluation, and Application to Family Processes

Abstract

Notes

Additional information

Funding

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature