406
Views
2
CrossRef citations to date
0
Altmetric
Research Article

A Comparison of Limited-Information Test Statistics for a Response Style MIRT Model

, ORCID Icon, ORCID Icon &
Pages 687-702 | Published online: 26 Oct 2020
 

Abstract

An increased use of models for measuring response styles is apparent in recent years with the multidimensional nominal response model (MNRM) as one prominent example. Inclusion of latent constructs representing extreme (ERS) or midpoint response style (MRS) often improves model fit according to information criteria. However, a test of absolute model fit is often not reported even though it could comprise an important piece of validity evidence. Limited information test statistics are candidates for this task, including the full (M2), ordinal (M2*), and mixed (C2) statistics, which differ in whether additional collapsing of univariate or bivariate contingency tables is conducted. Such collapsing makes sense when item categories are ordinal, which may not hold under the MNRM. More generally, limited information test statistics have gone unevaluated under nominal data and non-ordinal latent trait models. We present a simulation study evaluating the performance of M2, M2*, and C2 with the MNRM. Manipulated conditions included sample size, presence and type of response style, and strength of item slopes on substantive and style dimensions. We found that M2 sometimes had inflated Type I error rates, M2* always had little power, and C2 lacked power under some conditions. M2 and C2 may provide complementary and valuable information regarding model fit.

Article information

Conflict of interest disclosures: Each author signed a form for disclosure of potential conflicts of interest. No authors reported any financial or other conflicts of interest in relation to the work described.

Ethical principles: The authors affirm having followed professional ethical guidelines in preparing this work. These guidelines include obtaining informed consent from human participants, maintaining ethical treatment and respect for the rights of human or animal participants, and ensuring the privacy of participants and their data, such as ensuring that individual participants cannot be identified in reported results or from publicly available original or archival data.

Funding: This work was supported by Grant RGPIN-2018-05357 and DGECR-2018-00083 from the National Science and Engineering Research Council of Canada (NSERC).

Role of the funders/sponsors: None of the funders or sponsors of this research had any role in the design and conduct of the study; collection, management, analysis, and interpretation of data; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication.

Acknowledgements: The authors would like thank Li Cai for their comments on prior versions of this manuscript. The ideas and opinions expressed herein are those of the authors alone, and endorsement by the authors’ institutions is not intended and should not be inferred.

Correction Statement

This article has been republished with minor changes. These changes do not impact the academic content of the article.

Notes

1 In addition to sparsity, all such statistics may require numerical integration, which is feasible for models with few latent traits, but still problematic for models with many latent traits.

2 We have retained all participants regardless of their standing on quality control checks. Our use of the subsequent models is mainly for illustrative purposes, and we do not make any firm substantive conclusions from the results of these analyses.

3 If items differ greatly in overall difficulty, these high frequencies may not cluster along the diagonal, but will still have a similar pattern where category pairs further toward at least one of the off-diagonals will have the lowest frequencies.

4 To establish the asymptotic distribution for the family, it is necessary that: 1) Lη has full row rank, s, and 1C is not in its row span; and 2) Δη has full column rank, v (Joe & Maydeu-Olivares, Citation2010, p. 396).

5 Despite what appear to be slightly small standardized loadings for these items (e.g., Table 5; computed using flexMIRT®), the strong and weak conditions for the unidimensional model and a standard normal latent trait correspond to marginal reliabilities of .91 and .84, respectively. The information provided for style factors is expected to be less since the effective number of categories per item is fewer (e.g., see also Falk & Ju, Citation2020).

6 In support of this claim, we computed model-implied probabilities for all 510 response patterns for a 10-item (5 categories per item), two-factor (substantive + ERS) model with a .3 correlation among latent dimensions. As an example, fitting alternative misspecified models (a single substantive dimension) to these model-implied proportions typically yielded near zero test statistics for M2*, even if a wildly inflated sample size was specified for the fitted model (e.g., close to 1 million). This strategy was deemed not appropriate for the 12-item measures in our empirical example and simulation study, as doing computations on 512 patterns was too RAM and processor intensive.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 352.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.