Original Articles

Profile-likelihood Confidence Intervals in Item Response Theory Models

References

  • Albert, J. H. (1992). Bayesian estimation of normal ogive item response curves using Gibbs sampling. Journal of Educational Statistics, 17(3), 251–269.
  • Baker, F. B., & Kim, S. H. (2004). Item response theory: Parameter estimation techniques (2nd ed.). New York, NY: Dekker.
  • Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee's ability. In F. M. Lord & M. R. Novick (Eds.), Statistical theories of mental test scores (pp. 395–479). Reading, MA: Addison-Wesley.
  • Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46(4), 443–459.
  • Bock, R. D., Gibbons, R., & Muraki, E. (1988). Full-information item factor analysis. Applied Psychological Measurement, 12(3), 261–280.
  • Bock, R. D., & Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35(2), 179–197.
  • Bradley, J. V. (1978). Robustness? British Journal of Mathematical and Statistical Psychology, 31, 144–152.
  • Brown, C., Templin, J., & Cohen, A. (2015). Comparing the two- and three-parameter logistic models via likelihood ratio tests: A commonly misunderstood problem. Applied Psychological Measurement, 39(5), 335–348. doi:10.1177/0146621614563326
  • Cai, L. (2008). SEM of another flavour: Two new applications of the supplemented EM algorithm. British Journal of Mathematical and Statistical Psychology, 61, 309–329. doi:10.1348/000711007X249603
  • Cai, L. (2015). flexMIRT: A numerical engine for multilevel item factor analysis and test scoring [Computer software manual]. Vector Psychometric Group. (Version 3.0)
  • Chalmers, R. P. (2012). mirt: A multidimensional item response theory package for the R environment. Journal of Statistical Software, 48(6), 1–29. doi:10.18637/jss.v048.i06
  • Chalmers, R. P. (2015). Extended mixed-effects item response models with the MH-RM algorithm. Journal of Educational Measurement, 52(2), 200–222. doi:10.1111/jedm.12072
  • Chalmers, R. P. (2016a). Generating adaptive and non-adaptive test interfaces for multidimensional item response theory applications. Journal of Statistical Software, 71(5), 1–38. doi:10.18637/jss.v071.i05
  • Chalmers, R. P. (2016b). SimDesign: Structure for Organizing Monte Carlo Simulation Designs [Computer software manual]. Retrieved from https://CRAN.R-project.org/package=SimDesign (R package version 1.0).
  • Chalmers, R. P., Counsell, A., & Flora, D. B. (2016). It might not make a big DIF: Improved differential test functioning statistics that account for sampling variability. Educational and Psychological Measurement, 76(1), 114–140. doi:10.1177/0013164415584576
  • Chalmers, R. P., & Ng, V. (2017). Plausible-value imputation statistics for detecting item misfit. Applied Psychological Measurement, 41(5), 372–387. doi:10.1177/014662161769207
  • Cox, D. R., & Hinkley, D. V. (1974). Theoretical statistics. London, UK: Chapman & Hall.
  • Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1), 1–38.
  • Doebler, A., Doebler, P., & Holling, H. (2013). Optimal and most exact confidence intervals for person parameters in item response theory models. Psychometrika, 78(1), 98–115. doi:10.1007/s11336-012-9290-4
  • du Toit, M. (2003). IRT from SSI: BILOG-MG MULTILOG PARSCALE TESTFACT [Computer software manual]. Lincolnwood, IL.
  • Edwards, M. C. (2010). A Markov Chain Monte Carlo approach to confirmatory item factor analysis. Psychometrika, 75(3), 474–497. doi:10.1007/s11336-010-9161-9
  • Efron, B., & Tibshirani, R. J. (1998). An introduction to the bootstrap. New York: Chapman & Hall.
  • Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Mahwah, NJ: Erlbaum.
  • Gorsuch, R. L. (1983). Factor analysis (2nd ed.). Hillsdale, NJ: Erlbaum.
  • Johnson, T. R., & Kuhn, K. M. (2015). Simulation-based Bayesian inference for latent traits of item response models: Introduction to the ltbayes package for R. Behavior Research Methods, 47, 1309–1327. doi:10.3758/s13428-014-0540-5
  • Liu, Y., & Hannig, J. (2016). Generalized fiducial inference for binary logistic item response models. Psychometrika, 81(2), 290–324. doi:10.1007/s11336-015-9492-7
  • Liu, Y., & Hannig, J. (in press). Generalized fiducial inference for logistic graded response models. Psychometrika. Advance online publication. doi:10.1007/s11336-017-9554-0
  • Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
  • Louis, T. A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 44, 226–233.
  • Maydeu-Olivares, A., & Joe, H. (2005). Limited and full information estimation and goodness-of-fit testing in 2^n contingency tables: A unified framework. Journal of the American Statistical Association, 100, 1009–1020. doi:10.1198/016214504000002069
  • McLachlan, G. J., & Krishnan, T. (2008). The EM algorithm and extensions (2nd ed.). Hoboken, New Jersey: Wiley.
  • Meeker, W. Q., & Escobar, L. A. (1995). Teaching about approximate confidence regions based on maximum likelihood estimation. The American Statistician, 49(1), 48–53.
  • Meng, X.-L., & Rubin, D. B. (1991). Using EM to obtain asymptotic variance-covariance matrices: The SEM algorithm. Journal of the American Statistical Association, 86(416), 899–909.
  • Neale, M. C., & Miller, M. B. (1997). The use of likelihood-based confidence intervals in genetic models. Behavior Genetics, 27(2), 113–120.
  • Oakes, D. (1999). Direct calculation of the information matrix via the EM algorithm. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 61(2), 479–482.
  • Ogasawara, H. (2009). Asymptotic cumulants of the parameter estimators in item response theory. Computational Statistics, 24, 313–331. doi:10.1007/s00180-008-0118-8
  • Orlando, M., & Thissen, D. (2000). Likelihood-based item fit indices for dichotomous item response theory models. Applied Psychological Measurement, 24, 50–64.
  • Paek, I., & Cai, L. (2014). A comparison of item parameter standard error estimation procedures for unidimensional and multidimensional IRT modeling. Educational and Psychological Measurement, 74, 58–76. doi:10.1177/0013164413500277
  • Patz, R. J., & Junker, B. W. (1999). A straightforward approach to Markov Chain Monte Carlo methods for item response models. Journal of Educational and Behavioral Statistics, 24(2), 146–178.
  • Pawitan, Y. (2001). In all likelihood: Statistical modelling and inference using likelihood. New York: Oxford University Press.
  • Pek, J., & Wu, H. (2015). Profile likelihood-based confidence intervals and regions for structural equation models. Psychometrika, 80(4), 1123–1145. doi:10.1007/s11336-015-9461-1
  • R Core Team (2016). R: A language and environment for statistical computing [Computer software manual]. Vienna, Austria. Retrieved from https://www.R-project.org/
  • Reckase, M. D. (2009). Multidimensional item response theory. New York: Springer-Verlag.
  • Rubin, D. B. (1987). Multiple imputation for nonresponse in surveys. New York: J. Wiley & Sons.
  • Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika Monographs (17).
  • Sigal, M. J., & Chalmers, R. P. (2016). Play it again: Teaching statistics with Monte Carlo simulation. Journal of Statistics Education, 24(3), 1–21. doi:10.1080/10691898.2016.1246953
  • Tian, W., Cai, L., Thissen, D., & Xin, T. (2012). Numerical differentiation methods for computing error covariance matrices in item response theory modeling: An evaluation and a new proposal. Educational and Psychological Measurement, 73(3), 412–439. doi:10.1177/0013164412465875
  • Wirth, R., & Edwards, M. C. (2007). Item factor analysis: Current approaches and future directions. Psychological Methods, 12, 58–79. doi:10.1037/1082-989X.12.1.58
  • Wood, R., Wilson, D., Gibbons, R., Schilling, S., Muraki, E., & Bock, R. (2003). TESTFACT 4 for Windows: Test Scoring, Item Statistics, and Full-information Item Factor Analysis [Computer software]. Scientific Software International.
  • Wu, H., & Neale, M. C. (2012). Adjusted confidence intervals for a bounded parameter. Behavior Genetics, 42, 886–898. doi:10.1007/s10519-012-9560-z
  • Ye, Y. (1987). Interior algorithms for linear, quadratic, and linearly constrained non-linear programming (Unpublished doctoral dissertation). Department of EES, Stanford University, Stanford, CA.
  • Yuan, K.-H., Cheng, Y., & Patton, J. (2014). Information matrices and standard errors for MLEs of item parameters in IRT. Psychometrika, 79(2), 232–254. doi:10.1007/s11336-013-9334-4
  • Zimowski, M. F., Muraki, E., Mislevy, R. J., & Bock, R. D. (2003). BILOG-MG 3 for Windows: Multiple-group IRT analysis and test maintenance for binary items [Computer software]. Lincolnwood, IL: Scientific Software International, Inc.
