197
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

Revisiting Savalei’s (2011) Research on Remediating Zero-Frequency Cells in Estimating Polychoric Correlations: A Data Distribution Perspective

Pages 81-96 | Received 17 Oct 2022, Accepted 29 May 2023, Published online: 14 Jul 2023
 

Abstract

In Savalei’s (Citation2011) simulation that evaluated the performance of polychoric correlation estimates in small samples, two methods for treating zero-frequency cells, adding 0.5 (ADD) and doing nothing (NONE), were compared. Savalei tentatively suggested using ADD for binary data and NONE for data with three or more categories. Yet, Savalei’s suggestion could be explained by the skewness of the data distribution being severe for binary data and slight for three-category data. To rule out this alternative explanation, we extended Savalei’s design by incorporating the degree of skewness into our simulation. With slightly skewed data, NONE is recommended due to its high-quality estimates. With severely skewed data, only ADD is recommended for binary data when the skewness of two variables is the same-signed and the underlying correlation is expected to be strong. Methods for improving the polychoric correlation estimates with severely skewed data merit further study.

Notes

1 Our review revealed that Savalei’s (Citation2011) research has been widely referenced in one of the two contexts: (1) to support the use of ADD (e.g., Disabato et al., Citation2019; Jenkins et al., Citation2018; Lacko et al., Citation2022; Lubbe, Citation2019; Recio-Román et al., Citation2021; Schuberth et al., Citation2018) or (2) to substantiate the argument that tables with zero-frequency cells can result in biased polychoric correlation estimates (e.g., Bainter & Forster, Citation2019; DiStefano et al., Citation2018; DiStefano et al., Citation2021; Flora et al., Citation2012; Olvera Astivia, Citation2013; Pendergast et al., Citation2017; Yang & Xia, Citation2019).

2 We sincerely thank Dr. Savalei for her constructive comments on expanding the scope of our work to include additional methods and options beyond NONE and ADD that performed poorly under severely skewed data in our simulations.

3 We also examined the performance of the three additional estimators for slightly skewed data and found that neither of them outperformed NONE. Therefore, in slight skewness, NONE is still recommended to be used.

Additional information

Funding

The research was supported in part by grant MOST 110-2410-H-002-123-MY3 from the Ministry of Science and Technology in Taiwan.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 412.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.