Abstract
We describe and evaluate a random permutation test of measurement invariance with ordered-categorical data. To calculate a p-value for the observed (∆)χ2, an empirical reference distribution is built by repeatedly shuffling the grouping variable, then saving the χ2 from a configural model, or the ∆χ2 between configural and scalar-invariance models, fitted to each permuted dataset. The current gold standard in this context is a robust mean- and variance-adjusted ∆χ2 test proposed by Satorra (2000), which yields inflated Type I errors, particularly when thresholds are asymmetric, unless sample sizes are quite large (Bandalos, 2014; Sass et al., 2014). In a Monte Carlo simulation, we compare permutation to three implementations of Satorra’s robust χ2 across a variety of conditions evaluating configural and scalar invariance. Results suggest permutation can better control Type I error rates while providing comparable power under conditions in which the standard robust test yields inflated errors.
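The permutation logic described in the abstract can be sketched as follows. This is an illustrative stand-in, not the authors' implementation: in practice the statistic returned by `stat_fn` would be the χ2 of a configural model (or the ∆χ2 between configural and scalar models) refit to each permuted dataset; here a simple two-group mean-difference statistic, and the names `permutation_pvalue` and `mean_diff`, are assumptions introduced only for demonstration.

```python
import numpy as np

def permutation_pvalue(stat_fn, data, groups, n_perm=1000, seed=1):
    """Build an empirical reference distribution for a fit statistic by
    repeatedly shuffling the grouping variable, then return the proportion
    of permuted statistics at least as large as the observed one."""
    rng = np.random.default_rng(seed)
    observed = stat_fn(data, groups)
    perm_stats = np.empty(n_perm)
    for i in range(n_perm):
        shuffled = rng.permutation(groups)   # breaks any true group effect
        perm_stats[i] = stat_fn(data, shuffled)
    # add-one correction keeps the empirical p-value away from exactly zero
    return (1 + np.sum(perm_stats >= observed)) / (1 + n_perm)

def mean_diff(x, g):
    """Toy statistic standing in for a model (Delta-)chi-square."""
    return abs(x[g == 0].mean() - x[g == 1].mean())
```

Because the null hypothesis of invariance implies the grouping labels are exchangeable, the shuffled datasets yield the distribution of the statistic when no group differences exist, regardless of the statistic's (unknown) sampling distribution.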
Acknowledgment
We would like to thank Yves Rosseel for his helpful technical discussions while investigating different implementations of the mean- and variance-adjusted test statistic, and Paul Johnson for his computational assistance while comparing software packages. We thank the Center for Research Methods and Data Analysis and the College of Liberal Sciences at the University of Kansas for access to their high-performance computing cluster, on which our Monte Carlo simulations were conducted.
Notes
1 Throughout the manuscript, we will restrict our discussion to the case of polychoric correlations for models fit only to ordered-categorical items, but this WLS estimator can also be applied to a mixture of discrete and continuous indicators. When continuous indicators are included, their observed (co)variances are included in the estimated polychoric correlation matrix, and polyserial correlations are estimated between the discrete and continuous indicators.
2 Mean- and variance-adjusted statistics can also be calculated for other estimators, such as maximum likelihood.
3 Note that it is not appropriate to calculate the difference between two adjusted statistics because that difference will not be approximately χ2 distributed. Instead, the difference between the unadjusted statistics must be calculated, then that difference must be adjusted.
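The caution in note 3 can be made concrete with the widely used mean-scaled version of the difference test; the formula below is that mean-adjusted analogue (the mean- and variance-adjusted version evaluated in this study additionally adjusts the degrees of freedom, so this is an illustration rather than the exact statistic used here):

```latex
\bar{T}_d = \frac{T_0 - T_1}{\bar{c}_d},
\qquad
\bar{c}_d = \frac{d_0\,c_0 - d_1\,c_1}{d_0 - d_1},
```

where $T_0$ and $T_1$ are the *unadjusted* statistics of the more and less restrictive models, $d_0$ and $d_1$ their degrees of freedom, and $c_0$ and $c_1$ their scaling correction factors; $\bar{T}_d$ is referred to a $\chi^2$ distribution with $d_0 - d_1$ degrees of freedom. Simply subtracting the two already-scaled statistics, $\bar{T}_0 - \bar{T}_1$, does not yield this quantity and is not approximately $\chi^2$ distributed.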
4 Details about how to use the DIFFTEST command can be found in Web Note 4 at http://www.statmodel.com/.
5 Jorgensen et al. (2017a) showed that the test of overall model fit tests an overly restrictive null hypothesis because model configurations could be equivalent across populations even if the hypothesized model is not a perfectly accurate representation of either population. This issue is discussed in greater detail elsewhere (Jorgensen, 2017; Jorgensen et al., 2017), but it is beyond the scope of the current study, which focuses on situations in which the test fails even in the ideal circumstance that the hypothesized model is a perfect representation of the population(s).
6 Jorgensen et al. (2017a) showed that permuting alternative fit indices also provides valid tests of hypotheses about measurement invariance.
7 Wu and Estabrook (2016) recently showed that it is not possible to test the equality of thresholds independently of every other type of measurement parameter. Equality of thresholds can only be tested conditional on the equality of at least one other type of measurement parameter (for items with four or more categories), at least two other types (for items with three categories), or at least three other types (for binary items). This finding has implications for how measurement invariance should be tested with ordered-categorical indicators, but such a paradigm shift is beyond the scope of the current article.
8 Appendix A also discusses the issue of sparse data, when not all levels of a variable are observed in each group.
9 The application of the permutation method to incomplete data is a topic for future research that is beyond the scope of the current investigation.
10 Jorgensen (2017) discussed modifying configurally invariant models with inadequate fit.
11 If we had fixed the factor means and variances in both groups even in the scalar model, as Sass et al. (2014) did, these differences would have been df = 16 and 40, respectively, as Sass et al. (2014) reported. We discuss the implications of this difference in the Discussion section.
12 If software is flexible enough (e.g., general Bayesian modeling software, or more flexible SEM software like OpenMx), it is possible to fit a model to each group that estimates only the thresholds between categories that were observed within each group. Equality constraints could still be imposed on loadings and the thresholds the researcher knows correspond to categories on the same response scale used in each group.