References
- Abdel-Fattah, A. F. A. (1994). Comparing BILOG and LOGIST estimates for normal, truncated normal, and beta ability distributions. Paper presented at the annual meeting of the American Educational Research Association, New Orleans, LA.
- Albert, J. H. (1992). Bayesian estimation of normal ogive item response curves using Gibbs sampling. Journal of Educational Statistics, 17(3), 251–269. https://doi.org/https://doi.org/10.3102/10769986017003251
- Azevedo, C. L., Bolfarine, H., & Andrade, D. F. (2011). Bayesian inference for a skew-normal IRT model under the centered parameterization. Computational Statistics & Data Analysis, 55(1), 353–365. https://doi.org/https://doi.org/10.1016/j.csda.2010.05.003
- Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46(4), 443–459. https://doi.org/https://doi.org/10.1007/BF02293801
- Bolt, D. M., Cohen, A. S., & Wollack, J. A. (2001). A mixture item response model for multiple-choice data. Journal of Educational and Behavioral Statistics, 26(4), 381–409. https://doi.org/https://doi.org/10.3102/10769986026004381
- Boulet, J. R. (1996). The effect of nonnormal ability distributions on IRT parameter estimation using full-information and limited-information methods [Dissertation online]. University of Ottawa (Canada).
- Cai, L. (2010). Metropolis-Hastings Robbins-Monro algorithm for confirmatory item factor analysis. Journal of Educational and Behavioral Statistics, 35(3), 307–335. https://doi.org/https://doi.org/10.3102/1076998609353115
- Chalmers, R. P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1–29. https://doi.org/https://doi.org/10.18637/jss.v048.i06
- Cohen, A. S., Kane, M. T., & Kim, S. H. (2001). The precision of simulation study results. Applied Psychological Measurement, 25(2), 136–145. https://doi.org/https://doi.org/10.1177/01466210122031966
- Davidian, M., & Gallant, A. R. (1993). The nonlinear mixed effects model with a smooth random effects density. Biometrika, 80(3), 475–488. https://doi.org/https://doi.org/10.1093/biomet/80.3.475
- De Ayala, R. J. (1994). The influence of multidimensionality on the graded response model. Applied Psychological Measurement, 18(2), 155–170. https://doi.org/https://doi.org/10.1177/014662169401800205
- de la Torre, J., & Song, H. (2009). Simultaneous estimation of overall and domain abilities: A higher-order IRT model approach. Applied Psychological Measurement, 33(8), 620–639. https://doi.org/https://doi.org/10.1177/0146621608326423
- Dorans, N. J., Pommerich, M., & Holland, P. W. (Eds.). (2007). Linking and aligning scores and scales. Springer.
- Ferrando, P. J. (2003). The accuracy of the E, N and P trait estimates: An empirical study using the EPQ-R. Personality and Individual Differences, 34(4), 665–679. https://doi.org/10.1016/S0191-8869(02)00053-3 https://doi.org/https://doi.org/10.1016/S0191-8869(02)00053-3
- Fox, J. P. (2010). Bayesian item response modeling: Theory and applications. Springer. https://doi.org/https://doi.org/10.1007/BF02294839
- Fox, J. P., & Glas, C. A. (2001). Bayesian estimation of a multilevel IRT model using Gibbs sampling. Psychometrika, 66(2), 271–288. https://doi.org/https://doi.org/10.1007/BF02294839
- Geweke, J. (1992). Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments. In J. M. Bernardo, A. F. M. Smith, A. P. Dawid, & J. O. Berger (Eds.), Bayesian Statistics (pp. 169–193). Oxford University Press.
- Junker, B. W., Patz, R. J., & VanHoudnos, N. M. (2016). Markov chain Monte Carlo for item response models. In W. J. van der Linden (Eds.), Handbook of item response theory, Volume Two: Statistical tools (pp. 271–312). Chapman & Hall/CRC Press.
- Kang, T., Cohen, A. S., & Sung, H. J. (2009). Model selection indices for polytomous items. Applied Psychological Measurement, 33(7), 499–518. https://doi.org/https://doi.org/10.1177/0146621608327800
- Kieftenbeld, V., & Natesan, P. (2012). Recovery of graded response model parameters: A comparison of marginal maximum likelihood and Markov chain Monte Carlo estimation. Applied Psychological Measurement, 36(5), 399–419. https://doi.org/https://doi.org/10.1177/0146621612446170
- Kirisci, L., & Hsu, T. C. (1995). The Robustness of BILOG to Violations of Assumptions of Unidimensionality of Test Items and Normality of Ability. In annual meeting of the NCME, San Francisco.
- Kirisci, L., Hsu, T., & Yu, L. (2001). Robustness of item parameter estimation programs to assumptions of unidimensionality and normality. Applied Psychological Measurement, 25(2), 146–162. https://doi.org/https://doi.org/10.1177/01466210122031975
- MathWorks. 2016. MATLAB - The Language of Technical Computing. Retrieved March 25, 2016, from http://www.mathworks.com/products/matlab, 2016a. .
- Meng, X. B., Tao, J., & Chang, H. H. (2015). A conditional joint modeling approach for locally dependent item responses and response times. Journal of Educational Measurement, 52(1), 1–27. https://doi.org/https://doi.org/10.1111/jedm.12060
- Molenaar, D., Dolan, C. V., & de Boeck, P. (2012). The heteroscedastic graded response model with a skewed latent trait: Testing statistical and substantive hypotheses related to skewed item category functions. Psychometrika, 77(3), 455–478. https://doi.org/https://doi.org/10.1007/s11336-012-9273-5
- Monroe, S. L., & Cai, L. (2014). Estimation of a Ramsay-curve item response theory model by the Metropolis–Hastings Robbins–Monro algorithm. Educational and Psychological Measurement, 74(2), 343–369. https://doi.org/https://doi.org/10.1177/0013164413499344
- Monroe, S. L. (2014). Multidimensional item factor analysis with semi-nonparametric latent densities. [Unpublished Doctoral Dissertation]. Department of Education, University of California.
- Papoulis, A., & Pillai, S. U. (2002). Probability, random variables, and stochastic processes. McGraw-Hill.
- Patz, R. J., & Junker, B. W. (1999a). A straightforward approach to Markov chain Monte Carlo methods for item response models. Journal of Educational and Behavioral Statistics, 24(2), 146–178. https://doi.org/https://doi.org/10.3102/10769986024002146
- Patz, R. J., & Junker, B. W. (1999b). Applications and extensions of MCMC in IRT: Multiple item types, missing data, and rated responses. Journal of Educational and Behavioral Statistics, 24(4), 342–366. https://doi.org/https://doi.org/10.3102/10769986024004342
- Plummer, M. (2010). JAGS Version 2.2. 0 user manual. http://surfnet.dl.sourceforge.net/project/mcmcjags/Manuals/2.x/jags_user_manual.pdf.
- Ramsay, J. O. (1991). Kernel smoothing approaches to nonparametric item characteristic curve estimation. Psychometrika, 56, 611–630. https://doi.org/10.1007/BF02294494
- Reckase, M. D. (2009). Unidimensional item response theory models. In Multidimensional item response theory (pp. 11–56). Springer.
- Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometric Monograph. Psychometrika, 34(S1), 1–97. https://doi.org/https://doi.org/10.1007/BF03372160
- Samejima, F. (2016). Graded response models. In W. J. van der Linden (Eds.), Handbook of item response theory (pp. 95–107). Chapman & Hall/CRC Press.
- Santos, J. R., Azevedo, C. L., & Bolfarine, H. (2013). A multiple group item response theory model with centered skew-normal latent trait distributions under a Bayesian framework. Journal of Applied Statistics, 40(10), 2129–2149. https://doi.org/https://doi.org/10.1080/02664763.2013.807331
- Segawa, E. (2005). A growth model for multilevel ordinal data. Journal of Educational and Behavioral Statistics, 30(4), 369–396. https://doi.org/https://doi.org/10.3102/10769986030004369
- van der Linden, W. J. (2006). A lognormal model for response times on test items. Journal of Educational and Behavioral Statistics, 31(2), 181–204. https://doi.org/https://doi.org/10.3102/10769986031002181
- Wang, C., Fan, Z., Chang, H. H., & Douglas, J. A. (2013). A semiparametric model for jointly analyzing response times and accuracy in computerized testing. Journal of Educational and Behavioral Statistics, 38(4), 381–417. https://doi.org/https://doi.org/10.3102/1076998612461831
- Wang, C., Su, S., & Weiss, D. J. (2018). Robustness of parameter estimation to assumptions of normality in the multidimensional graded response Model. Multivariate Behavioral Research, 53(3), 403–418. https://doi.org/https://doi.org/10.1080/00273171.2018.1455572
- Wang, C., Xu, G., & Shang, Z. (2018). A two-stage approach to differentiating normal and aberrant behavior in computer based testing. Psychometrika, 83(1), 223–254. https://doi.org/https://doi.org/10.1007/s11336-016-9525-x
- Woods, C. M. (2015). Estimating the latent density in unidimensional IRT to permit non-normality. In S. P. Reise, & D. A. Revicki (Eds.), Handbook of item response theory modeling: Applications to typical performance assessment (pp. 60–84). Routledge.
- Woods, C. M. (2006). Ramsay-curve item response theory (RC-IRT) to detect and correct for nonnormal latent variables. Psychological Methods, 11(3), 253–270. https://doi.org/https://doi.org/10.1037/1082-989X.11.3.253
- Woods, C. M. (2007a). Ramsay curve IRT for Likert-type data. Applied Psychological Measurement, 31(3), 195–212. https://doi.org/https://doi.org/10.1177/0146621606291567
- Woods, C. M. (2007b). Empirical histograms in item response theory with ordinal data. Educational and Psychological Measurement, 67(1), 73–87. https://doi.org/https://doi.org/10.1177/0013164406288163
- Woods, C. M. (2008). Ramsay-curve item response theory for the three-parameter logistic item response model. Applied Psychological Measurement, 32(6), 447–465. https://doi.org/https://doi.org/10.1177/0146621607308014
- Woods, C. M., & Lin, N. (2009). Item response theory with estimation of the latent density using Davidian curves. Applied Psychological Measurement, 33(2), 102–117. https://doi.org/https://doi.org/10.1177/0146621608319512
- Woods, C. M., & Thissen, D. (2006). Item response theory with estimation of the latent population distribution using spline-based densities. Psychometrika, 71(2), 281. https://doi.org/https://doi.org/10.1007/s11336-004-1175-8
- Yamamoto, K., Muraki, E. (1991). Non-linear transformation of IRT scale to account for the effect of non-normal ability distribution of the item parameter estimation. Paper presented at the annual meeting of the American Educational Research Association, Chicago, IL.
- Zhang, D., & Davidian, M. (2001). Linear mixed models with flexible distributions of random effects for longitudinal data. Biometrics, 57(3), 795–802. https://doi.org/https://doi.org/10.1111/j.0006-341x.2001.00795.x
- Zhang, X., Tao, J., Wang, C., & Shi, N. Z. (2019). Bayesian Model selection methods for multilevel IRT models: A comparison of five DIC-based indices. Journal of Educational Measurement, 56(1), 3–27. https://doi.org/https://doi.org/10.1111/jedm.12197
- Zwinderman, A. H., & Van den Wollenberg, A. L. (1990). Robustness of marginal maximum likelihood estimation in the Rasch model. Applied Psychological Measurement, 14(1), 73–81. https://doi.org/https://doi.org/10.1177/014662169001400107