References
- Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443–459. doi:https://doi.org/10.1007/BF02293801
- Cao, Y., Lu, R., & Tao, W. (2014). Effect of item response theory (IRT) model selection on testlet-based test equating. ETS Research Report Series, 2014, 1–13. doi:https://doi.org/10.1002/ets2.12017
- Cohen, J. (1973). Eta-squared and partial eta-squared in fixed factor ANOVA designs. Educational and Psychological Measurement, 33(1), 107–112. doi:https://doi.org/10.1177/001316447303300111
- Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Lawrence Earlbaum Associates.
- Cole, K. M., & Paek, I. (2017). PROC IRT: A SAS procedure for item response theory. Applied Psychological Measurement, 41, 311–320. doi:https://doi.org/10.1177/0146621616685062
- de Ayala, R. J. (2009). Methodology in the social sciences. The theory and practice of item response theory. New York, NY: Guilford Press.
- Drasgow, F. (1989). An evaluation of marginal maximum likelihood estimation for the two-parameter logistic model. Applied Psychological Measurement, 13, 77–90. doi:https://doi.org/10.1177/014662168901300108
- Feinberg, R. R., & Rubright, J. D. (2016). Conducting simulation studies in psychometrics. Educational Measurement: Issues and Practice, 35(2), 36–49. doi:https://doi.org/10.1111/emip.12111
- Gagne, P., & Furlow, C. F. (2009). Automating multiple software packages in simulation research for structural equation modeling and hierarchical linear modeling. Structural Equation Modeling, 16, 179–185. doi:https://doi.org/10.1080/10705510802561543
- Harwell, M., Stone, C. A., Hsu, T., & Kirisci, L. (1996). Monte Carlo studies in item response theory. Applied Psychological Measurement, 20(2), 101–125. doi:https://doi.org/10.1177/014662169602000201
- Kang, T., & Chen, T. T. (2008). Performance of the generalized S-X2 item fit index for polytomous IRT models. Journal of Educational Measurement, 45(4), 391–406. doi:https://doi.org/10.1111/jedm.2008.45.issue-4
- Liu, T., Sun, Y., Li, Z., & Xin, T. (2019). The impact of aberrant response on reliability and validity. Measurement: Interdisciplinary Research and Perspectives, 17(3), 133–142.
- Luecht, R., & Ackerman, T. A. (2018). A technical note on IRT simulation studies: Dealing with truth, estimates, observed data, and residuals. Educational Measurement: Issues and Practice, 37(3), 65–76. doi:https://doi.org/10.1111/emip.2018.37.issue-3
- McKinley, R. L., & Mills, C. N. (1985). A comparison of several goodness of fit statistics. Applied Psychological Measurement, 9, 49–57. doi:https://doi.org/10.1177/014662168500900105
- Orlando, M., & Thissen, D. (2000). Likelihood-based item-fit indices for dichotomous item response theory models. Applied Psychological Measurement, 24(1), 50–64. doi:https://doi.org/10.1177/01466216000241003
- Orlando, M., & Thissen, D. (2003). Further investigation of the performance of S-X2: An item fit index for use with dichotomous item response theory models. Applied Psychological Measurement, 27, 289–298. doi:https://doi.org/10.1177/0146621603027004004
- SAS Institute Inc. (2015). SAS/STAT® 14.1 user’s guide. Cary, NC: Author.
- Woods, C. M. (2006). Ramsay-curve item response theory (RC-IRT) to detect and correct for nonnormal latent variables. Psychological Methods, 11(3), 253. doi:https://doi.org/10.1037/1082-989X.11.3.253
- Woods, C. M., & Thissen, D. (2006). Item response theory with estimation of the latent population distribution using spLine-based densities. Psychometrika, 71(2), 281. doi:https://doi.org/10.1007/s11336-004-1175-8
- Yen, W. M. (1981). Using simulation results to choose a latent trait model. Applied Psychological Measurement, 5, 245–262. doi:https://doi.org/10.1177/014662168100500212