References
- American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. (2014). Standards for educational and psychological testing. Washington, D.C.: AERA.
- Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. In F. Lord & M. L. Novick (Eds.), Statistical theories of mental test scores (pp. 397–479). Reading, MA: Addison-Wesley Publishing Company.
- Carroll, J. B., & Schohan, B. (1953). Construction of comprehensive achievement examinations for navy officer candidate programs (Research Report NR 154-138). Pennsylvania: American Institute for Research.
- Conijn, J. M., Sijtsma, K., & Emons, W. H. M. (2016). Identifying person-fit latent classes, and explanation of categorical and continuous person misfit. Applied Psychological Measurement, 40, 128–141. doi:10.1177/01466216115611164
- Cui, Y., & Leighton, J. P. (2009). The hierarchy consistency index: Evaluating person fit for cognitive diagnostic assessment. Journal of Educational Measurement, 46(4), 429–449. doi:10.1111/jedm.2009.46.issue-4
- Cui, Y., & Li, J. (2015). Evaluating person fit for cognitive diagnostic assessment. Applied Psychological Measurement, 39, 223–238. doi:10.1177/0146621614557272
- Cui, Y., & Roberts, M. R. (2013). Validating student score inferences with person-fit statistic and verbal reports: A person-fit study for cognitive diagnostic assessment. Educational Measurement: Issues and Practice, 32, 34–42. doi:10.1111/emip.2013.32.issue-1
- de Ayala, R. J. (2009). The theory and practice of item response theory. New York, New York: Guilford Press.
- de la Torre, J., & Deng, W. (2008). Improving person-fit assessment by correcting the ability estimate and its reference distribution. Journal of Educational Measurement, 45(2), 159–177. doi:10.1111/j.1745-3984.2008.00058.x
- Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Mahwah, NJ: Lawrence Erlbaum Associates.
- Emons, W. H. M., Sijtsma, K., & Meijer, R. R. (2004). Testing hypotheses about person-response functions in person-fit analysis. Multivariate Behavioral Research, 39, 1–35. doi:10.1207/s15327906mbr3901_1
- Emons, W. H. M., Sijtsma, K., & Meijer, R. R. (2005). Global, local, and graphical person-fit analysis using person-response functions. Psychological Methods, 10(1), 101–119. doi:10.1037/1082-989X.10.1.101
- Engelhard, Jr., G. (2013a). Hanning (smoothing) of person response functions. Rasch Measurement Transactions, 26(4), 1392–1393.
- Engelhard, Jr., G. (2013b). Invariant measurement: Using Rasch models in the social behavioral, and health sciences. New York, NY: Routledge.
- Ferrando, P. J. (2007). Factor-analytic procedures for assessing response pattern scalability. Multivariate Behavioral Research, 42, 481–507. doi:10.1080/00273170701382583
- Ferrando, P. J. (2014). A general approach for assessing person fit and person reliability in typical- response measurement. Applied Psychological Measurement, 38, 166–183. doi:10.1177/0146621613497532
- Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. Newbury Park, California: Sage Publications Inc.
- Hendrawan, I., Glas, C. A. W., & Meijer, R. R. (2005). The effect of person misfit on classification decisions. Applied Psychological Measurement, 29, 26–44. doi:10.1177/0146621604270902
- Jennings, J. K. (2015). Calibrating test item banks for an introductory statistics course ( Unpublished master’s thesis). Athens, Georgia: The University of Georgia, Athens. Retrieved from http://dbs.galib.uga.edu/cgi-bin/getd.cgi?userid=galileo&action=search&_cc=1
- Karabatsos, G. (2000). A critique of Rasch residual fit statistics. Journal of Applied Measurement, 1(2), 152–176.
- Karabatsos, G. (2003). Comparing the aberrant response detection performance of thirty-six person fit statistics. Applied Measurement in Education, 16, 227–298. doi:10.1207/S15324818AME1604_2
- Lamprianou, I. (2013). The tendency of individuals to respond to high-stakes tests in idiosyncratic ways. Journal of Applied Measurement, 14(3), 299–317.
- Levine, M. V., & Rubin, D. B. (1979). Measuring the appropriateness of multiple-choice test scores. Journal of Educational Statistics, 4, 269–290. doi:10.2307/1164595
- Linacre, J. M. (2016). Winstepsinsert Register symbol Rasch measurement computer program. Beaverton Oregon:Winsteps.com
- Ludlow, L. H. (1986). Graphical analysis of item response theory residuals. Applied Psychological Measurement, 10, 217–229. doi:10.1177/014662168601000301
- Lumsden, J. (1977). Person reliability. Applied Psychological Measurement, 1, 477–482. doi:10.1177/014662167700100403
- Lumsden, J. (1978). Tests are perfectly reliable. British Journal of Mathematical and Statistical Psychology, 31, 19–26. doi:10.1111/j.2044-8317.1978.tb00568.x
- Meijer, R. R. (1996). Person-fit research: An Introduction. Applied Measurement in Education, 9, 3–8.
- Meijer, R. R., Egberink, I. J. L., Emons, W. H. M, Sijtsma, K. (2008.) Detection and validation of unscalable item score patterns using item response theory: An illustration with Harter’s self-perception profile for children. Journal of Personality Assessment, 90(3), 227–238. doi:10.1080/00223890701884921
- Meijer, R. R. (2003). Diagnosing item score patterns on a test using item response theory-based person-fit statistics. Psychological Methods, 8, 72–87. doi:10.1037/1082-989X.8.1.72
- Meijer, R. R., & Sijtsma, K. (2001). Methodology review: Evaluating person fit. Applied Psychological Measurement, 25, 107–135. doi:10.1177/01466210122031957
- Meijer, R. R., & van Krimpen-Stoop, E. M. (2010). Detecting person misfit in adaptive testing. In W. van Der Linden & C. Glas (Eds.), Elements of Adaptive Testing (pp. 315–329). New York, NY: Springer.
- Messick, S. (1994). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning (Research Report 94-45). Princeton, NJ: Educational Testing Service.
- Mosier, C. I. (1941). Psychophysics and mental test theory II: The constant process. Psychological Review, 48, 235–249. doi:10.1037/h0055909
- Petridou, A., & Williams, J. (2007). Accounting for aberrant test response patterns using multilevel models. Journal of Educational Measurement, 44, 227–247. doi:10.1111/jedm.2007.44.issue-3
- Petridou, A., & Williams, J. (2010). The extent of mismeasurement for aberrant examinees. Educational Assessment, 15, 42–68. doi:10.1080/10627191003673240
- Raju, N. S. (1988). The area between two item characteristic curves. Psychometrika, 53(4), 495–502. doi:10.1007/BF02294403
- Rasch, G. (1980). Probabilistic models for some intelligence and attainment tests. Chicago, IL: University of Chicago Press ( Original work published 1960).
- Reckase,M. D. (1979). Unifactor latent trait models applied to multifactor tests: Results and implications. Journal of Educational Statistics, 4, 207–230. doi:10.2307/1164671
- Reise, S. P. (2000). Using multilevel logistic regression to evaluate person-fit in IRT models. Multivariate Behavioral Research, 35, 543–568. doi:10.1207/S15327906MBR3504_06
- Reise, S. P., & Flannery, W. P. (1996). Assessing person-fit on measures of typical performance. Applied Measurement in Education, 9, 9–26. doi:10.1207/s15324818ame0901_3
- Rudner, L. M., Bracey, G., & Skaggs, G. (1996). The use of a person-fit statistic with one high-quality achievement test. Applied Measurement in Education, 9, 91–109. doi:10.1207/s15324818ame0901_8
- Rupp, A. A. (2013). A systematic review of the methodology for person fit research in item response theory: Lessons about generalizability of inferences from the design of simulated studies. Psychological Test and Assessment Modeling, 55(1), 3–38.
- Seo, D. G., & Weiss, D. J. (2013). lz person-fit index to identify misfit students with achievement test data. Educational and Psychological Measurement, 73, 994–1016. doi:10.1177/0013164413497015
- Sijtsma, K. (1986). A coefficient of deviance of response patterns. Kwantitatieve Methoden: Nieuwsbrief Voor Toegepaste Statistiek En Operationele Research, 7(22), 131–145.
- Sijtsma, K., & Meijer, R. R. (2001). The person response function as a tool in person-fit research. Psychometrika, 66, 191–208. doi:10.1007/BF02294835
- Sinharay, S. (2015). Assessment of person fit for mixed-format tests. Journal of Educational and Behavioral Statistics, 40, 343–365. doi:10.3102/1076998615589128
- Smith, R. M. (1985). A comparison of Rasch person analysis and robust estimators. Educational and Psychological Measurement, 45, 433–444. doi:10.1177/001316448504500301
- Smith, R. M. (1986). Person fit in the Rasch model. Educational and Psychological Measurement, 46, 359–372. doi:10.1177/001316448604600210
- Smith, R. M. (1990). Theory and practice of fit. Rasch Measurement Transactions, 3, 78.
- Smith, R. M. (2004). Fit analysis in latent trait measurement models. In E. Smith Jr. & R. Smith (Eds.), Introduction to Rasch Measurement (pp. 73–92). Maple Grove, MN: JAM Press.
- Smith, R. M., & Hedges, L. V. (1982). Comparison of likelihood ratio X2 and pearsonian X2 tests of fit in the Rasch model. Education Research and Perspectives, 9, 44–54.
- Smith, R. M., & Plackner, C. (2010). The family approach to assessing fit in Rasch measurement. In M. L. Garner, G. Engelhard Jr., W. P. Fisher Jr., & M. Wilson (Eds.), Advances in Rasch Measurement (Vol. 1, pp. 64–85). Maple Grove, MN: JAM Press.
- Tendeiro, J. N., & Meijer, R. R. (2012). A CUSUM to detect person misfit: A discussion and some alternatives for existing procedures. Applied Psychological Measurement, 36, 420–442. doi:10.1177/0146621612446305
- Tendeiro, J. N., & Meijer, R. R. (2014). Detection of invalid test scores: The usefulness of simple nonparametric statistics. Journal of Educational Measurement, 51, 239–259. doi:10.1111/jedm.12046
- Trabin, T. E., & Weiss, D. J. (1979). The person response curve: Fit of individuals to item characteristic curve models. (Research Report 79-7). Minneapolis, Minnesota: University of Minnesota, Department of Psychology, Psychometric Methods Program.
- Velleman, P. F., & Hoaglin, D. C. (1981). Smoothing data. In P. Velleman & D. Hoaglin (Eds.), Applications, basics, and computing of exploratory data analysis (pp. 159–199). Boston, MA: Duxbury Press.
- Walker, A. A., Engelhard, Jr., G., Royal, K. D., & Hedgpeth, M. W. (2016). Exploring aberrant responses using person fit and person response functions. Journal of Applied Measurement, 17(2), 194–208.
- Wolfe, E. W. (2013). A bootstrap approach to evaluating person and item fit to the Rasch model. Journal of Applied Measurement, 14(1), 1–9.
- Wright, B. D., & Masters, G. N. (1982). Rating Scale Analysis: Rasch Measurement. Chicago, IL: MESA Press.
- Wright, B. D., Mead, R. J., & Ludlow, L. H. (1980). KIDMAP:(Research Memorandum #29). Chicago, Illinois: Statistical Laboratory Department of Education University of Chicago. Retrieved from http://www.rasch.org/memo29.pdf
- Wright, B. D., & Stone, M. H. (1979). Best Test Design: Rasch Measurement. Chicago, IL: MESA Press.