2,682
Views
39
CrossRef citations to date
0
Altmetric
Research Article

Performance of Estimators for Confirmatory Factor Analysis of Ordinal Variables with Missing Data

ORCID Icon & ORCID Icon
Pages 584-601 | Published online: 06 Nov 2019
 

Abstract

Missing data and ordinal indicators are common in applied research involving latent constructs. Unfortunately, ordinal indicators violate the linearity assumption for conventional CFA that is routinely used to provide structural validity evidence for measurement instruments. Although robust maximum likelihood estimator (MLR) can deal with both missing data and nonnormality, it is generally inappropriate for ordinal indicators. Categorical estimation methods such as weighted least square mean and variance adjusted (WLSMV) method, or MLR or maximum likelihood (ML) that justly treats ordinal indicators as categorical (MLR-CAT or ML-CAT, respectively) have been recommended for ordinal dependent variables. However, performances of these categorical estimators in the presence of missing data have not been empirically examined. The current study systematically investigates the relative performances of WLSMV, MLR, MLR-CAT, and ML-CAT under different conditions of missing data amount and mechanism, sample size, level of indicator distribution, and number of indicator categories. Results generally favor MLR-CAT so long as the sample size is not too small (>200) to result in convergence problems.

Notes

1 We focus on CFA models in this article for convenience though many of the issues discussed here are relevant to other SEMs.

2 WLSMV is the default estimator in Mplus when categorical variables are declared; lavaan (Rosseel, Citation2012) in R (R Core Team, Citation2018) also automatically switches to WLSMV when it detects categorical variables.

3 Extremely large standard error (SE) estimates that fall outside of reasonable range suggest model convergence with numerical estimation problems. In practice, these improper solutions cannot be trusted and were therefore removed from subsequent analyses. We chose the proper range for SE to be 0–1 because SE cannot be negative, the true SEs (≤.48) are all well within this range, and the value of 1 would make at least one limit of all approximate 68% probable intervals for our parameters (standardized loadings and factor correlations) out of bound.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 412.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.