2,682
Views
39
CrossRef citations to date
0
Altmetric
Research Article

Performance of Estimators for Confirmatory Factor Analysis of Ordinal Variables with Missing Data

ORCID Icon & ORCID Icon
 

Abstract

Missing data and ordinal indicators are common in applied research involving latent constructs. Unfortunately, ordinal indicators violate the linearity assumption for conventional CFA that is routinely used to provide structural validity evidence for measurement instruments. Although robust maximum likelihood estimator (MLR) can deal with both missing data and nonnormality, it is generally inappropriate for ordinal indicators. Categorical estimation methods such as weighted least square mean and variance adjusted (WLSMV) method, or MLR or maximum likelihood (ML) that justly treats ordinal indicators as categorical (MLR-CAT or ML-CAT, respectively) have been recommended for ordinal dependent variables. However, performances of these categorical estimators in the presence of missing data have not been empirically examined. The current study systematically investigates the relative performances of WLSMV, MLR, MLR-CAT, and ML-CAT under different conditions of missing data amount and mechanism, sample size, level of indicator distribution, and number of indicator categories. Results generally favor MLR-CAT so long as the sample size is not too small (>200) to result in convergence problems.

Notes

1 We focus on CFA models in this article for convenience though many of the issues discussed here are relevant to other SEMs.

2 WLSMV is the default estimator in Mplus when categorical variables are declared; lavaan (Rosseel, Citation2012) in R (R Core Team, Citation2018) also automatically switches to WLSMV when it detects categorical variables.

3 Extremely large standard error (SE) estimates that fall outside of reasonable range suggest model convergence with numerical estimation problems. In practice, these improper solutions cannot be trusted and were therefore removed from subsequent analyses. We chose the proper range for SE to be 0–1 because SE cannot be negative, the true SEs (≤.48) are all well within this range, and the value of 1 would make at least one limit of all approximate 68% probable intervals for our parameters (standardized loadings and factor correlations) out of bound.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.