References
- Acworth, P. A., Broadie, M., & Glasserman, P. (1998). A comparison of some Monte Carlo and quasi Monte Carlo techniques for option pricing. In H. Niederreiter, P. Hellekalek, G. Larcher, & P. Zinterhof (Eds.), Monte Carlo and Quasi-Monte Carlo methods 1996 (pp. 1–18). Springer.
- Akour, M., & AL-Omari, H. (2013). Empirical investigation of the stability of IRT item-parameters estimation. International Online Journal of Educational Sciences, 5(2), 291–301. https://eis.hu.edu.jo/deanshipfiles/pub106314725.pdf
- Bartolucci, F. (2007). A class of multidimensional IRT models for testing unidimensionality and clustering items. Psychometrika, 72(2), 141–157. https://doi.org/10.1007/s11336-005-1376-9
- Barton, M. A., & Lord, F. M. (1981). An upper asymptote for the three-parameter logistic item-response model (Research Report 18-21). Educational Testing Service. https://doi.org/10.1002/j.2333-8504.1981.tb01255.x
- Bashkov, B. M., & DeMars, C. E. (2017). Examining the performance of the Metropolis–Hastings Robbins–Monro algorithm in the estimation of multilevel multidimensional IRT models. Applied Psychological Measurement, 41(5), 323–337. https://doi.org/10.1177/0146621616688923
- Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. In F. M. Lord & M. R. Novick (Eds.), Statistical theories of mental test scores (pp. 397–479). Addison-Wesley.
- Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46(4), 443–459. https://doi.org/10.1007/BF02293801
- Bock, R. D., & Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35(2), 179–197. https://doi.org/10.1007/BF02291262
- Booth, J. G., & Hobert, J. P. (1999). Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 61(1), 265–285. https://doi.org/10.1111/1467-9868.00176
- Bulut, O., & Sünbül, Ö. (2017). Monte Carlo simulation studies in item response theory with the R programming language. Journal of Measurement and Evaluation in Education and Psychology, 8(3), 266–287. https://doi.org/10.21031/epod.305821
- Caflisch, R. E. (1998). Monte Carlo and quasi-Monte Carlo methods. Acta Numerica, 7, 1–49. https://doi.org/10.1017/S0962492900002804
- Cai, L. (2008). A Metropolis–Hastings Robbins–Monro algorithm for maximum likelihood nonlinear latent structure analysis with a comprehensive measurement model [Unpublished doctoral dissertation]. Department of Psychology, University of North Carolina at Chapel Hill.
- Cai, L. (2010a). High-dimensional exploratory item factor analysis by a Metropolis–Hastings Robbins–Monro algorithm. Psychometrika, 75(1), 33–57. https://doi.org/10.1007/s11336-009-9136-x
- Cai, L. (2010b). Metropolis-Hastings Robbins-Monro algorithm for confirmatory item factor analysis. Journal of Educational and Behavioral Statistics, 35(3), 307–335. https://doi.org/10.3102/1076998609353115
- Chalmers, R. P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1–29. https://doi.org/10.18637/jss.v048.i06
- Chalmers, R. P. (2015). Extended mixed‐effects item response models with the MH‐RM algorithm. Journal of Educational Measurement, 52(2), 200–222. https://doi.org/10.1111/jedm.12072
- Chalmers, R. P., & Flora, D. B. (2014). Maximum-likelihood estimation of noncompensatory IRT models with the MH-RM algorithm. Applied Psychological Measurement, 38(5), 339–358. https://doi.org/10.1177/0146621614520958
- Cheng, Y., & Liu, C. (2015). The effect of upper and lower asymptotes of IRT models on computerized adaptive testing. Applied Psychological Measurement, 39(7), 551–565. https://doi.org/10.1177/0146621615585850
- Chuah, S. C., Drasgow, F., & Luecht, R. (2006). How big is big enough? Sample size requirements for CAST item parameter estimation. Applied Measurement in Education, 19(3), 241–255. https://doi.org/10.1207/s15324818ame1903_5
- R Core Team. (2017). R: A language and environment for statistical computing [Computer software]. R Foundation for Statistical Computing.
- Culpepper, S. A. (2016). Revisiting the 4-parameter item response model: Bayesian estimation and application. Psychometrika, 81(4), 1142–1163. https://doi.org/10.1007/s11336-015-9477-6
- De Ayala, R. J. (1994). The influence of multidimensionality on the graded response model. Applied Psychological Measurement, 18(2), 155–170. https://doi.org/10.1177/014662169401800205
- De Ayala, R. J. (2009). The theory and practice of item response theory. The Guilford Press.
- De La Torre, J., & Hong, Y. (2010). Parameter estimation with small sample size: A higher-order IRT model approach. Applied Psychological Measurement, 34(4), 267–285. https://doi.org/10.1177/0146621608329501
- DeMars, C. (2010). Item response theory. Oxford University Press. https://doi.org/10.1093/acprof:oso/9780195377033.001.0001
- Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1), 1–22. https://doi.org/10.1111/j.2517-6161.1977.tb01600.x
- Embretson, S. E. (2010). Measuring psychological constructs: Advances in model-based approaches. American Psychological Association. https://doi.org/10.1037/12074-000
- Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Lawrence Erlbaum Associates, Inc., Publishers.
- Finch, H. (2010). Item parameter estimation for the MIRT model: Bias and precision of confirmatory factor analysis-based models. Applied Psychological Measurement, 34(1), 10–26. https://doi.org/10.1177/0146621609336112
- Finch, H., Habing, B. T., & Huynh, H. (2003, April). Comparison of NOHARM and conditional covariance methods of dimensionality assessment [Paper presentation]. Annual Meeting of the American Educational Research Association, Chicago, IL.
- Fisher, R. (1925). Theory of statistical estimation. Mathematical Proceedings of the Cambridge Philosophical Society, 22(5), 700–725. https://doi.org/10.1017/S0305004100009580
- Foley, B. P. (2010). Improving IRT parameter estimates with small sample sizes: Evaluating the efficacy of a new data augmentation technique [Doctoral dissertation]. University of Nebraska - Lincoln. https://digitalcommons.unl.edu/cehsdiss/75
- Gibbons, R. D., & Hedeker, D. R. (1992). Full-information item bi-factor analysis. Psychometrika, 57(3), 423–436. https://doi.org/10.1007/BF02295430
- Glas, C. A. W. (1999). Modification indices for the 2-PL and the nominal response model. Psychometrika, 64(3), 273–294. https://doi.org/10.1007/BF02294296
- Glas, C. A. W. (2001). Differential item functioning depending on general covariates. In A. Boomsma, M. A. J. Van Duijn, & T. A. B. Snijders (Eds.), Essays on item response theory (pp. 131–148). Springer. https://doi.org/10.1007/978-1-4613-0169-1_7
- Glas, C. A. W. (2002). Item calibration and parameter drift. In W. J. Van Der Linden & C. A. W. Glas (Eds.), Computerized adaptive testing: Theory and practice (pp. 183–199). Kluwer Academic Publishers. https://doi.org/10.1007/0-306-47531-6_10
- Graham, I. G., Kuo, F. Y., Nuyens, D., Scheichl, R., & Sloan, I. H. (2011). Quasi-Monte Carlo methods for elliptic PDEs with random coefficients and applications. Journal of Computational Physics, 230(10), 3668–3694. https://doi.org/10.1016/j.jcp.2011.01.023
- Haertel, E. H. (1990). Continuous and discrete latent structure models for item response data. Psychometrika, 55(3), 477–494. https://doi.org/10.1007/BF02294762
- Halton, J. H. (1960). On the efficiency of certain quasi-random sequences of points in evaluating multi-dimensional integrals. Numerische Mathematik, 2(1), 84–90. https://doi.org/10.1007/BF01386213
- Hambleton, R. K., & Swaminathan, H. (1985). Item response theory: Principles and applications. Kluwer-Nijhoff Publishing.
- Hastings, W. K. (1970). Monte Carlo simulation methods using Markov chains and their applications. Biometrika, 57(1), 97–109. https://doi.org/10.1093/biomet/57.1.97
- Hattie, J. (1985). Methodology review: Assessing unidimensionality of tests and items. Applied Psychological Measurement, 9(2), 139–164. https://doi.org/10.1177/014662168500900204
- Hickernell, F. J., & Hong, H. S. (2002). Quasi-Monte Carlo methods and their randomizations. In R. Chan, Y. K. Kwok, D. Yao, & Q. Zhang (Eds.), Applied probability, AMS/IP studies in advanced mathematics (Vol. 26, pp. 59–77). American Mathematical Society.
- Hitchcock, D. B. (2003). A history of the Metropolis–Hastings algorithm. The American Statistician, 57(4), 254–257. https://doi.org/10.1198/0003130032413
- Hulin, C. L., Lissak, R. I., & Drasgow, F. (1982). Recovery of two- and three-parameter logistic item characteristic curves: A Monte Carlo study. Applied Psychological Measurement, 6(3), 249–260. https://doi.org/10.1177/014662168200600301
- Jank, W. (2005a). Quasi-Monte Carlo sampling to improve the efficiency of Monte Carlo EM. Computational Statistics & Data Analysis, 48(4), 685–701. https://doi.org/10.1016/j.csda.2004.03.019
- Jank, W. (2005b, August). Stochastic variants of EM: Monte Carlo, quasi-Monte Carlo and more. Paper presented at the meeting of the American Statistical Association, Minneapolis. Abstract retrieved from http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.616.1796&rep=rep1&type=pdf
- Jiang, Z., & Templin, J. (2019). Gibbs samplers for logistic item response models via the Pólya–Gamma distribution: A computationally efficient data-augmentation strategy. Psychometrika, 84(2), 358–374. https://doi.org/10.1007/s11336-018-9641-x
- Kalkan, Ö. K., & Çuhadar, İ. (2020). An evaluation of 4PL IRT and DINA models for estimating pseudo-guessing and slipping parameters. Journal of Measurement and Evaluation in Education and Psychology, 11(2), 131–146. https://doi.org/10.21031/epod.660273
- Kuo, F. Y., Dunsmuir, W. T. M., Sloan, I. H., Wand, M. P., & Womersley, R. S. (2008). Quasi-Monte Carlo for highly structured generalised response models. Methodology and Computing in Applied Probability, 10(2), 239–275. https://doi.org/10.1007/s11009-007-9045-3
- L’Ecuyer, P., & Lemieux, C. (2002). Recent advances in randomized quasi-Monte Carlo methods. In M. Dror, P. L’Ecuyer, & F. Szidarovski (Eds.), Modeling uncertainty: An examination of stochastic theory, methods, and applications (pp. 419–474). Kluwer Academic Publishers. https://doi.org/10.1007/0-306-48102-2_20
- Li, F., Cohen, A. S., Kim, S. H., & Cho, S. J. (2009). Model selection methods for mixture dichotomous IRT models. Applied Psychological Measurement, 33(5), 353–373. https://doi.org/10.1177/0146621608326422
- Liao, W. W., Ho, R. G., Yen, Y. C., & Cheng, H. C. (2012). The four-parameter logistic item response theory model as a robust method of estimating ability despite aberrant responses. Social Behavior and Personality: An International Journal, 40(10), 1679–1694. https://doi.org/10.2224/sbp.2012.40.10.1679
- Litvakov, B. M. (1973). On a class of Robbins-Monro procedures. Information Sciences, 6, 33–47. https://doi.org/10.1016/0020-0255(73)90026-1
- Loken, E., & Rulison, K. L. (2010). Estimation of a four-parameter item response theory model. British Journal of Mathematical and Statistical Psychology, 63(3), 509–525. https://doi.org/10.1348/000711009X474502
- Magis, D. (2013). A note on the item information function of the four-parameter logistic model. Applied Psychological Measurement, 37(4), 304–315. https://doi.org/10.1177/0146621613475471
- Meng, X., Xu, G., Zhang, J., & Tao, J. (2020). Marginalized maximum a posteriori estimation for the four‐parameter logistic model under a mixture modelling framework. British Journal of Mathematical and Statistical Psychology, 73(S1), 51–82. https://doi.org/10.1111/bmsp.12185
- Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., & Teller, E. (1953). Equation of state calculations by fast computing machines. Journal of Chemical Physics, 21(6), 1087–1092. https://doi.org/10.1063/1.1699114
- Mislevy, R. J. (1986). Recent developments in the factor analysis of categorical variables. Journal of Educational Statistics, 11(1), 3–31. https://doi.org/10.3102/10769986011001003
- Monroe, S., & Cai, L. (2014). Estimation of a Ramsay-curve item response theory model by the Metropolis–Hastings Robbins–Monro algorithm. Educational and Psychological Measurement, 74(2), 343–369. https://doi.org/10.1177/0013164413499344
- Moskowitz, B., & Caflisch, R. E. (1996). Smoothness and dimension reduction in quasi-Monte Carlo methods. Mathematical and Computer Modelling, 23(8–9), 37–54. https://doi.org/10.1016/0895-7177(96)00038-6
- Muraki, E. (1990). Fitting a polytomous item response model to Likert-type data. Applied Psychological Measurement, 14(1), 59–71. https://doi.org/10.1177/014662169001400106
- Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm (ETS research report 92-06). Educational Testing Service. https://onlinelibrary.wiley.com/doi/pdf/10.1002/j.2333-8504.1992.tb01436.x
- Muraki, E., & Carlson, J. E. (1995). Full-information factor analysis for polytomous item responses. Applied Psychological Measurement, 19(1), 73–90. https://doi.org/10.1177/014662169501900109
- Neath, R. C. (2013). On convergence properties of the Monte Carlo EM algorithm. In G. Jones & X. Shen (Eds.), Advances in modern statistical theory and applications: A Festschrift in honor of Morris L. Eaton (pp. 43–62). Institute of Mathematical Statistics. https://doi.org/10.1214/12-IMSCOLL1003
- Niederreiter, H. (1992). Random number generation and quasi-Monte Carlo methods. SIAM.
- Osgood, D. W., McMorris, B. J., & Potenza, M. T. (2002). Analyzing multiple-item measures of crime and deviance I: Item response theory scaling. Journal of Quantitative Criminology, 18(3), 267–296. https://doi.org/10.1023/A:1016008004010
- Paskov, S., & Traub, J. (1995). Faster evaluation of financial derivatives. Journal of Portfolio Management, 22(1), 113–120. https://doi.org/10.3905/jpm.1995.409541
- Rijmen, F. (2010). Formal relations and an empirical comparison among the bi‐factor, the testlet, and a second‐order multidimensional IRT model. Journal of Educational Measurement, 47(3), 361–372. https://doi.org/10.1111/j.1745-3984.2010.00118.x
- Robbins, H., & Monro, S. (1951). A stochastic approximation method. The Annals of Mathematical Statistics, 22(3), 400–407. https://doi.org/10.1214/aoms/1177729586
- Robert, C. P. (2015). The Metropolis‐Hastings algorithm. ArXiv:1504.01896 [Stat]. http://arxiv.org/abs/1504.01896
- Roussos, L. A., Templin, J. L., & Henson, R. A. (2007). Skills diagnosis using IRT‐based latent class models. Journal of Educational Measurement, 44(4), 293–311. https://doi.org/10.1111/j.1745-3984.2007.00040.x
- Rubin, D. B., & Thomas, N. (2001). Using parameter expansion to improve the performance of the EM algorithm for multidimensional IRT population-survey models. In A. Boomsma, M. A. J. Van Duijn, & T. A. B. Snijders (Eds.), Essays on item response theory (Vol. 157, pp. 193–204). Lecture Notes in Statistics, Springer. https://doi.org/10.1007/978-1-4613-0169-1_11
- Rulison, K. L., & Loken, E. (2009). I've fallen and I can't get up: Can high-ability students recover from early mistakes in CAT? Applied Psychological Measurement, 33(2), 83–101. https://doi.org/10.1177/0146621608324023
- Schürer, R. (2003). A comparison between (quasi-) Monte Carlo and cubature rule based methods for solving high-dimensional integration problems. Mathematics and Computers in Simulation, 62(3–6), 509–517. https://doi.org/10.1016/S0378-4754(02)00250-1
- Taghipour, S., & Banjevic, D. (2013). Maximum likelihood estimation from interval censored recurrent event data. Computers & Industrial Engineering, 64(1), 143–152. https://doi.org/10.1016/j.cie.2012.09.012
- Tate, R. (2003). A comparison of selected empirical methods for assessing the structure of responses to test items. Applied Psychological Measurement, 27(3), 159–203. https://doi.org/10.1177/0146621603027003001
- Van Der Linden, W. J. (Ed.). (2016). Handbook of item response theory: Volume 1: Models. CRC Press. https://doi.org/10.1201/9781315374512
- Van Der Linden, W. J., & Hambleton, R. K. (Eds.). (1997). Handbook of modern item response theory. Springer Science & Business Media. https://doi.org/10.1007/978-1-4757-2691-6
- Von Davier, M., & Sinharay, S. (2010). Stochastic approximation methods for latent regression item response models. Journal of Educational and Behavioral Statistics, 35(2), 174–193. https://doi.org/10.3102/1076998609346970
- Waller, N. G., & Feuerstahler, L. (2017). Bayesian modal estimation of the four-parameter item response model in real, realistic, and idealized data sets. Multivariate Behavioral Research, 52(3), 350–370. https://doi.org/10.1080/00273171.2017.1292893
- Waller, N. G., & Reise, S. P. (2010). Measuring psychopathology with nonstandard item response theory models: Fitting the four-parameter model to the Minnesota Multiphasic Personality Inventory. In S. Embretson (Ed.), Measuring psychological constructs: Advances in model based approaches (pp. 147–173). American Psychological Association. https://doi.org/10.1037/12074-007
- Wang, C. (2015). On latent trait estimation in multidimensional compensatory item response models. Psychometrika, 80(2), 428–449. https://doi.org/10.1007/s11336-013-9399-0
- Wei, G. C., & Tanner, M. A. (1990). A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithms. Journal of the American Statistical Association, 85(411), 699–704. https://doi.org/10.1080/01621459.1990.10474930
- Woods, C. M. (2008). Ramsay-curve item response theory for the three-parameter logistic item response model. Applied Psychological Measurement, 32(6), 447–465. https://doi.org/10.1177/0146621607308014
- Yang, J. S., & Cai, L. (2014). Estimation of contextual effects through nonlinear multilevel latent variable modeling with a Metropolis–Hastings Robbins–Monro algorithm. Journal of Educational and Behavioral Statistics, 39(6), 550–582. https://doi.org/10.3102/1076998614559972
- Yen, Y. C., Ho, R. G., Liao, W. W., Chen, L. J., & Kuo, C. C. (2012). An empirical evaluation of the slip correction in the four parameter logistic models with computerized adaptive testing. Applied Psychological Measurement, 36(2), 75–87. https://doi.org/10.1177/0146621611432862