References
- Borsboom, D., & Markus, K. A. (2013). Truth and evidence in validity theory. Journal of Educational Measurement, 50(1), 110–114. https://doi.org/https://doi.org/10.1111/jedm.12006
- Brown, J. D. (1996). Testing in language programs. Prentice Hall Regents.
- Chalhoub-Deville, M., & O’Sullivan, B. (2020). Validity: Theoretical development and integrated arguments. British Council Monographs.
- Chalhoub-Deville, M. (2009a). The intersection of test impact, validation, and educational reform policy. Annual Review of Applied Linguistics, 29(March), 118–131. https://doi.org/https://doi.org/10.1017/S0267190509090102
- Chalhoub-Deville, M. (2016). Validity theory: Reform policies, accountability testing, and consequences. Language Testing, 33(453–472), 453–472. https://doi.org/https://doi.org/10.1177/0265532215593312
- Chalhoub-Deville, M. (2009b). Standards-based assessment in the U.S.: Social and educational impact. In L. Taylor & C. Weir (Eds.), Language testing matters: Investigating the wider social and educational impact of assessment (pp. 281–300). Cambridge University Press.
- Chalhoub-Deville, M. (2020). Towards a model of validity in accountability testing. In M. K. Wolf (Ed.), Assessing English language proficiency in U.S. K–12 schools (pp. 245–264). Routledge.
- Cizek, G. J. (2020). Validity: An integrated approach to test score meaning and use. Routledge.
- Cronbach, L. J. (1988). Five perspectives on validity argument. In H. Wainer & H. I. Braun (Eds.), Test validity (pp. 3–18). Lawrence Erlbaum Associates.
- Field, J. (2019). Rethinking the second language listening test: From theory into practice. Equinox.
- Gebru, T., Morgenstern, J., Vecchione, B., Wortman Vaughan, J., Wallach, H., Daum, H., III, & Crawford, K. (2019). Datasheets for datasets. (Working Paper arXiv:1803.09010v7). https://arxiv.org/pdf/1803.09010v7.pdf
- Kane, M. T. (1992). An argument-based approach to validation. Psychological Bulletin, 112(3), 527–535. https://doi.org/https://doi.org/10.1037/0033-2909.112.3.527
- Kane, M. T. (2013). Validating the interpretations and uses of test scores. Journal of Educational Measurement, 50(1), 1–73. Special Issue: Validity. https://doi.org/https://doi.org/10.1111/jedm.12000
- Kane, M. T. (2006). Validation. In R. Brennan (Ed.), Educational measurement (4th ed., pp. 17–64). American Council on Education and Praeger.
- Khalifa, H., & Weir, C. J. (2009). Examining reading: Research and practice in assessing second language reading. Cambridge University Press.
- McNamara, T. (2013). Values in language assessment. In C. A. Chapelle (Ed.), The encyclopaedia of applied linguistics (Vol. 10, pp. 6027–6032). Wiley-Blackwell.
- Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 13–103). American Council on Education and Macmillan.
- Mislevy, R. J., & Haertel, G. D. (2007). Implications of evidence-centered design for educational testing. Educational Measurement: Issues and Practice, 25(4), 6–20. https://doi.org/https://doi.org/10.1111/j.1745-3992.2006.00075.x
- Mislevy, R., Almond, R., & Lukas, J. (2003, July). A brief introduction to evidence-centered design. ETS Research Report No. RR-03-16.
- Mislevy, R. (2007). Validity by design. Educational Researcher, 36(8), 463–469. https://doi.org/https://doi.org/10.3102/0013189X07311660
- Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., Spitzer, E., Raji, I. D., & Gebru, T. (2019, January 29–31). Model cards for model reporting. FAT* '19: Proceedings of the Conference on Fairness, Accountability, and Transparency, January 2019, January 29–31, Pages 220–229 https://doi.org/https://doi.org/10.1145/3287560.3287596.
- O’Sullivan, B., Dunlea, J., Spiby, R., Westbrook, C., & Dunn, K. (2020). Aptis general technical manual, v 2.2. London: British council. Retrieved September 2, 2021, from https://www.britishcouncil.org/sites/default/files/aptis_technical_manual_v_2.2_final.pdf
- O’Sullivan, B., Patel, M., & Mundy, G. (2020). Understanding and measuring assessment as cultural relations. British Council Perspectives on English Language Education & Policy. British Council. Retrieved August 26, 2021, from https://www.britishcouncil.org/sites/default/files/perspectives_language_testing_as_cultural_relations.pdf
- O’Sullivan, B., & Weir, C. J. (2011). Language testing and validation. In B. O’Sullivan (Ed.), Language testing theories and practices (pp. 13–32). Palgrave Macmillan.
- O’Sullivan, B. (2019). Considering validity. In C. Roever & G. Wigglesworth (Eds.), Social perspectives on language testing: Papers in honour of Tim McNamara (pp. 199–216). Peter Lang.
- O’Sullivan, B. (2016). Validity: What is it and who is it for? In Y.-N. Leung (Ed.), Epoch making in English teaching and learning: Evolution, innovation, and revolution (pp. 157–175). Crane Publishing Company Ltd.
- Shepard, L. A. (1993). Evaluating test validity. Review of Research in Education, 19(1), 405–450. https://doi.org/10.3102/0091732X019001405
- Weir, C. J. (2005). Language testing and validation: An evidence-based approach. Palgrave.