References
- Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723. https://doi.org/10.1109/TAC.1974.1100705
- Alexeev, N., Templin, J., & Cohen, A. S. (2011). Spurious latent classes in the mixture Rasch model. Journal of Educational Measurement, 48(3), 313–332. https://doi.org/10.1111/jedm.2011.48.issue-3
- Anscombe, F. J., & Glynn, W. J. (1983). Distribution of the kurtosis statistic b2 for normal samples. Biometrika, 70(1), 227–234. https://www.jstor.org/stable/2335960
- Arellano-Valle, R. B., & Genton, M. G. (2005). On fundamental skew distributions. Journal of Multivariate Analysis, 96(1), 93–116. https://doi.org/10.1016/j.jmva.2004.10.002
- Asparouhov, T., & Muthén, B. (2015). Structural equation models and mixture models with continuous nonnormal skewed distributions. Structural Equation Modeling: A Multidisciplinary Journal, 23(1), 1–19. https://doi.org/10.1080/10705511.2014.947375
- Azzalini, A., & Capitanio, A. (2003). Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t‐distribution. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 65(2), 367–389. https://doi.org/10.1111/rssb.2003.65.issue-2
- Azzalini, A., & Dalla-Valle, A. (1996). The multivariate skew-normal distribution. Biometrika, 83(4), 715–726. https://doi.org/10.1093/biomet/83.4.715
- Bauer, D. J., & Curran, P. J. (2003). Distributional assumptions of growth mixture models: Implications for over-extraction of latent trajectory classes. Psychological Methods, 8(3), 338–363. https://doi.org/10.1037/1082-989X.8.3.338
- Bauer, D. J., & Curran, P. J. (2004). The integration of continuous and discrete latent variable models: Potential problems and promising opportunities. Psychological Methods, 9(1), 3–29. https://doi.org/10.1037/1082-989X.9.1.3
- Bolt, D. M., Cohen, A. S., & Wollack, J. A. (2002). Item parameter estimation under conditions of test speededness: Application of a mixture Rasch model with ordinal constraints. Journal of Educational Measurement, 39(4), 331–348. https://doi.org/10.1111/j.1745-3984.2002.tb01146.x
- Branco, M. D., & Dey, D. K. (2001). A general class of multivariate skew-elliptical distributions. Journal of Multivariate Analysis, 79(1), 99–113. https://doi.org/10.1006/jmva.2000.1960
- Brooks, S. P., & Gelman, A. (1998). General methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics, 7(4), 434–455. https://doi.org/10.1080/10618600.1998.10474787
- Cabral, C. R. B., Bolfarine, H., & Pereira, J. R. G. (2008). Bayesian density estimation using skew student-t-normal mixtures. Computational Statistics & Data Analysis, 52(12), 5075–5090. https://doi.org/10.1016/j.csda.2008.05.003
- Chalmers, P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1–29. https://doi.org/10.18637/jss.v048.i06
- Cho, S.-J., Cohen, A. S., & Kim, S.-H. (2013). Markov chain Monte Carlo estimation of a mixture item response theory model. Journal of Statistical Computation and Simulation, 83(2), 278–306. https://doi.org/10.1080/00949655.2011.603090
- Cho, S.-J., Suh, Y., & Lee, W.-Y. (2016). An NCME instructional module on latent DIF analysis using mixture item response models. Educational Measurement: Issues and Practice, 35(1), 48–61. https://doi.org/10.1111/emip.12093
- Choi, Y.-J., Alexeev, N., & Cohen, A. S. (2015). Differential item functioning analysis using a mixture 3-parameter logistic model with a covariate on the TIMSS 2007 mathematics test. International Journal of Testing, 15(3), 239–253. https://doi.org/10.1080/15305058.2015.1007241
- Cohen, A. S., & Bolt, D. M. (2005). A mixture model analysis of differential item functioning. Journal of Educational Measurement, 42(2), 133–148. https://doi.org/10.1111/j.1745-3984.2005.00007
- Congdon, P. (2003). Applied Bayesian modelling. John Wiley.
- Cook, D. L. (1959). A replication of Lord’s study on skewness and kurtosis of observed test-score distributions. Educational and Psychological Measurement, 19(1), 81–87. https://doi.org/10.1177/001316445901900109
- Cowles, M. K., & Carlin, B. P. (1996). Markov chain Monte Carlo convergence diagnostics: A comparative review. Journal of the American Statistical Association, 91(434), 883–904. https://doi.org/10.1080/01621459.1996.10476956
- D’Agostino, R. B. (1970). Transformation to normality of the null distribution of g1. Biometrika, 57(3), 679–681. https://doi.org/10.1093/biomet/57.3.679
- Dagne, G. A. (2013). Bayesian inference for skew-normal mixture models with left-censoring. Journal of Biopharmaceutical Statistics, 23(5), 1023–1041. https://doi.org/10.1080/10543406.2013.813517
- de Ayala, R. J. (2009). The theory and practice of item response theory. The Guilford Press.
- de Ayala, R. J., & Sava-Bolesta, M. (1999). Item parameter recovery for the nominal response model. Applied Psychological Measurement, 23(1), 3–19. https://doi.org/10.1177/01466219922031130
- Drasgow, F., & Parsons, C. K. (1983). Application of unidimensional item response theory model to multidimensional data. Applied Psychological Measurement, 7(2), 189–199. https://doi.org/10.1177/014662168300700207
- Du Toit, M. (2003). IRT from SSI: BILOG-MG, MULTILOG, PARSCALE, TEST-FACT. Scientific Software International.
- Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Psychology Press.
- Fang, K. T., Kotz, S., & Ng, K. W. (1990). Symmetric multivariate and related distributions. Chapman & Hall.
- Fleishman, A. I. (1978). A method for simulating non-normal distributions. Psychometrika, 43(4), 521–532. https://doi.org/10.1007/BF02293811
- Frühwirth-Schnatter, S., & Pyne, S. (2010). Bayesian inference for finite mixtures of univariate and multivariate skew-normal and skew-t distributions. Biostatistics, 11(2), 317–336. https://doi.org/10.1093/biostatistics/kxp062
- Habing, B. (2006). IRFs and simulating IRT data. Retrieved, 2016, from http://people.stat.sc.edu/habing/courses/778rS06.html#simplot
- Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and applications. Kluwer-Nijhoff.
- Heidelberger, P., & Welch, P. (1983). Simulation run length control in the presence of an initial transient. Operations Research, 31(6), 1109–1144. https://doi.org/10.1287/opre.31.6.1109
- Ho, A. D., & Yu, C. C. (2015). Descriptive statistics for modern test score distributions skewness, kurtosis, discreteness, and ceiling effects. Educational and Psychological Measurement, 75(3), 365–388. https://doi.org/10.1177/0013164414548576
- Huang, Y., Hu, X. J., & Dagne, G. A. (2014). Jointly modeling time-to-event and longitudinal data: A Bayesian approach. Statistical Methods & Applications, 23(1), 95–121. https://doi.org/10.1007/s10260-013-0242-7
- Jackman, S. (2000). Estimation and inference via Bayesian simulation: An introduction to Markov chain Monte Carlo. American Journal of Political Science, 44(2), 375–404. https://doi.org/10.2307/2669318
- Jasra, A., Stephens, D. A., Gallagher, K., & Holmes, C. C. (2006). Bayesian mixture modelling in geochronology via Markov chain Monte Carlo. Mathematical Geology, 38(3), 269–300. https://doi.org/10.1007/s11004-005-9019-3
- Kirisci, L., Hsu, T., & Yu, L. (2001). Robustness of item parameter estimation programs to assumptions of unidimensionality and normality. Applied Psychological Measurement, 25(2), 146–162. https://doi.org/10.1177/01466210122031975
- Komsta, L., & Novomestky, F. (2015). Moments: Moments, cumulants, skewness, kurtosis and related tests. R package version 0.14. Retrieved from http://CRAN.R-project.org/package=moments
- Lee, S., & McLachlan, G. J. (2014). Finite mixtures of multivariate skew t-distributions: Some recent and new results. Journal of Statistics and Computing, 24(2), l81–202. https://doi.org/10.1007/s11222-012-9362-4
- Lesaffre, E., & Lawson, A. B. (2012). Bayesian biostatistics. John Wiley & Sons.
- Li, F., Cohen, A. S., Kim, S.-H., & Cho, S.-J. (2009). Model selection methods for mixture dichotomous IRT models. Applied Psychological Measurement, 33(5), 353–373. https://doi.org/10.1177/0146621608326422
- Linacre, J. M. (1994). Sample size and item calibration stability. Rasch Measurement Transactions, 7(4), 328.
- Lord, F. M. (1955). A survey of observed test-score distributions with respect to skewness and kurtosis. Educational and Psychological Measurement, 15(4), 383–389. https://doi.org/10.1177/001316445501500406
- Lunn, D., Spiegelhalter, D., Thomas, A., & Best, N. (2009). The BUGS project: Evolution, critique and future directions. Statistics in Medicine, 28(25), 3049–3082. https://doi.org/10.1002/sim.3680
- Marco, G. L. (1977). Item characteristic curve solutions to three intractable testing problems. Journal of Educational Measurement, 14(2), 139–160. https://doi.org/10.1111/j.1745-3984.1977.tb00033.x
- McLachlan, G., & Peel, D. (2000). Finite mixture models. Wiley.
- Micerri, T. (1989). The unicorn, the normal curve, and other improbable creatures. Psychological Bulletin, 105(1), 156–166. https://doi.org/10.1037/0033-2909.105.1.156
- Mislevy, R. J. (1984). Estimating latent distributions. Psychometrika, 49(3), 359–381. https://doi.org/10.1007/BF02306026
- Mislevy, R. J., & Bock, R. D. (1990). BILOG 3: Item analysis and test scoring with binary logistic models [Computer program]. Scientific Software.
- Mislevy, R. J., & Verhelst, N. (1990). Modeling item responses when different subjects employ different solution strategies. Psychometrika, 55(2), 195–215. https://doi.org/10.1007/BF02295283
- Muraki, E., & Bock, R. D. (2003). PARSCALE (Version 4.1) [Computer program]. Scientific Software.
- OECD. (2014). PISA 2012 Technical Report. Retrieved April 20, 2016, from https://www.oecd.org/pisa/pisaproducts/PISA-2012-technical-report-final.pdf
- Pearson, K. (1895). Contributions to the mathematical theory of evolution. II. Skew variation in homogeneous material. Philosophical Transactions of the Royal Society of London A, 186, 343–414. https://doi.org/10.1098/rsta.1895.0010
- Raftery, A. L., & Lewis, S. (1992). How many iterations in the Gibbs sampler? In J. O. Berger, J. M. Bernardo, A. P. Dawid, & A. F. M. Smit (Eds.), Bayesian statistics (Vol. 4, pp. 763–773). Oxford University Press.
- Reckase, M. D. (1979). Unifactor latent trait models applied to multifactor tests: Results and implications. Journal of Educational Statistics, 4(3), 207–230. https://doi.org/10.3102/10769986004003207
- Reise, S. P., & Yu, J. (1990). Parameter recovery in the graded response model using MULTILOG. Journal of Educational Measurement, 27(2), 133–144. https://doi.org/10.1111/j.1745-3984.1990.tb00738.x
- Roberts, J. S., Donoghue, J. R., & Laughlin, J. E. (2002). Characteristics of MML/EAP parameter estimates in the generalized graded unfolding model. Applied Psychological Measurement, 26(2), 192–207. https://doi.org/10.1177/01421602026002006
- Robitzsch, A. (2014). sirt: Supplementary item response theory models R package version 0. 43-70 [Computer Software]. Retrieved from https://cran.r-project.org/package=sirt
- Robitzsch, A. (2020). sirt: Supplementary Item response theory Models. R package version 3.9-4 [Computer Software]. Retrieved from https://CRAN.R-project.org/package=sirt
- Rost, J. (1990). Rasch models in latent classes: An integration of two approaches to item analysis. Applied Psychological Measurement, 14(3), 271–282. https://doi.org/10.1177/014662169001400305
- Rost, J. (1991). A logistic mixture distribution model for polytomous item response. British Journal of Mathematical and Statistical Psychology, 44(1), 75–92. https://doi.org/10.1111/bmsp.1991.44.issue-1
- Sahu, S., Dey, D. K., & Branco, M. D. (2003). A new class of multivariate skew distribution with application to Bayesian regression models. The Canadian Journal of Statistics., 31(2), 129–150. https://doi.org/10.2307/3316064
- Sass, D. A., Schmitt, T. A., & Walker, C. M. (2008). Estimating non-normal latent trait distributions within item response theory using true and estimated item parameters. Applied Measurement in Education, 21(1), 65–88. https://doi.org/10.1080/08957340701796415
- Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6(2), 461–464. https://doi.org/10.1214/aos/1176344136
- Sen, S., Cohen, A. S., & Kim, S.-H. (2016). The impact of non-normality on extraction of spurious latent classes in mixture IRT models. Applied Psychological Measurement, 40(2), 98–113. https://doi.org/10.1177/0146621615605080
- Seong, T. (1990). Sensitivity of marginal maximum likelihood estimation of item and ability parameters to the characteristics of the prior ability distributions. Applied Psychological Measurement, 14(3), 299–311. https://doi.org/10.1177/014662169001400307
- Smith, B. J. (2007). boa: An R package for MCMC output convergence assessment and posterior inference. Journal of Statistical Software, 21(11), 1–37. https://doi.org/10.18637/jss.v021.i11
- Stone, C. A. (1992). Recovery of marginal maximum likelihood estimates in the two-parameter logistic response model: An evaluation of MULTILOG. Applied Psychological Measurement, 16(1), 1–16. https://doi.org/10.1177/014662169201600101
- Tate, R. L. (1995). Robustness of the School‐Level IRT Model. Journal of Educational Measurement, 32(2), 145–162. https://doi.org/10.1111/j.1745-3984.1995.tb00460.x
- Tsutakawa, R. K., & Johnson, J. C. (1990). The effect of uncertainty of item parameter estimation on ability estimates. Psychometrika, 55(2), 371–390. https://doi.org/10.1007/BF02295293
- von Davier, M. (2001). WINMIRA 2001 [Computer software]. Assessment Systems Corporation.
- Wright, B. D. (1977). Misunderstanding the Rasch model. Journal of Educational Measurement, 14(3), 219–225. https://doi.org/10.1111/j.1745-3984.1977.tb00039.x
- Xu, X., & Jia, Y. (2011). The sensitivity of parameter estimates to the latent ability distribution (Research Report No. RR-11-41). Educational Testing Service.
- Xu, X., & von Davier, M. (2008). Fitting the structured general diagnostic model to NAEP data (Research Report No. RR-08-27). Educational Testing Service.
- Yen, W. M. (1987). A comparison of the efficiency and accuracy of BILOG and LOGIST. Psychometrika, 52(2), 275–291. https://doi.org/10.1007/BF02294241
- Zwinderman, A. H., & van den Wollenberg, A. L. (1990). Robustness of marginal maximum likelihood estimation in the Rasch model. Applied Psychological Measurement, 14(1), 73–81. https://doi.org/10.1177/014662169001400107