References
- American Educational Research Association (AERA), American Psychological Association (APA), & National Council on Measurement in Education (NCME). (2014). Standards for educational and psychological testing. Washington, DC: AERA.
- Bejar, I., Douglas, D., Jamieson, J., Nissan, S., & Turner, J. (2000). TOEFL 2000 listening framework: A working paper (TOEFL Monograph Series, MS-19). Retrieved from https://www.ets.org/Media/Research/pdf/RM-00-07.pdf
- Brown, A. (2006). An examination of the rating process in the revised IELTS speaking test. IELTS Research Reports, 6, 41–69. Retrieved from https://www.ielts.org/-/media/research-reports/ielts_rr_volume06_report2.ashx
- Brown, A., Iwashita, N., & McNamara, T. (2005). An examination of rater orientations and test taker performance on English-for-academic-purposes speaking tasks (TOEFL Monograph Series, MS-29). Retrieved from https://www.ets.org/Media/Research/pdf/RR-05-05.pdf
- Burgoon, J. K., Guerrero, L. K., & Floyd, K. (2016). Nonverbal communication. New York, NY: Routledge.
- Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Hillsdale, NJ: Erlbaum.
- Conlan, C. J., Bardsley, W. N., & Martinson, S. H. (1994). A study of intra-rater reliability of assessments of live versus audio-recorded interviews in the IELTS Speaking component. A report of a study commissioned by the International Editing Committee of the IELTS.
- Creswell, J. W., & Plano Clark, V. L. (2011). Designing and conducting mixed methods research (2nd ed.). Thousand Oaks, CA: Sage Publications.
- Douglas, D., & Hegelheimer, V. (2007). Assessing language using computer technology. Annual Review of Applied Linguistics, 27, 115–132. doi:https://doi.org/10.1017/S0267190508070062
- Eckes, T. (2015). Introduction to many-facet Rasch measurement: Analyzing and evaluating rater-mediated assessments (2nd ed.). Frankfurt, Germany: Peter Lang.
- Galaczi, E., & Taylor, L. (2018). Interactional competence: Conceptualisations, operationalisations, and outstanding questions. Language Assessment Quarterly, 15(3), 219–236. doi:https://doi.org/10.1080/15434303.2018.1453816
- Galaczi, E. D. (2014). Interactional competence across proficiency levels: How do learners manage interaction in paired speaking tests? Applied Linguistics, 35(5), 553–574. doi:https://doi.org/10.1093/applin/amt017
- Galaczi, E. D., Lim, G., & Khabbazbashi, N. (2012). Descriptor salience and clarity in rating scale development and evaluation. Paper presented at Language Testing Forum, University of Bristol, UK.
- Lee, H., Patel, M., Lynch, J., & Galaczi, E. D. (in press). Development of the IELTS video call speaking test: Phase 4 operational trial. IELTS Partnership Research Papers.
- Linacre, J. M. (2013). Facets computer program for many-facet Rasch measurement, version 3.71.3. Beaverton, Oregon. Retrieved from Winsteps.com
- Linacre, J. M. (2019). A user’s guide to FACETS. Retrieved fromhttps://www.winsteps.com/a/Facets-Manual.pdf
- Linacre, J. M. (2020). Winsteps Rasch measurement computer program: User’s guide. Retrieved from https://www.winsteps.com/a/Winsteps-Manual.pdf
- Mackey, A., & Gass, S. (2016). Second language research: Methodology and design. Oxon, UK: Routledge.
- May, L. (2011). Interaction in a paired speaking test: The rater’s perspective. Frankfurt, Germany: Peter Lang.
- McGurk, H., & MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 264(5588), 746–748. doi:https://doi.org/10.1038/264746a0
- McNamara, T. (1997). ‘Interaction’ in second language performance assessment: Whose performance? Applied Linguistics, 18(4), 444–446. doi:https://doi.org/10.1093/applin/18.4.446
- Nakatsuhara, F., Inoue, C., Berry, V., & Galaczi, E. (2017). Exploring the use of video-conferencing technology in the assessment of spoken language: A mixed-methods study. Language Assessment Quarterly, 14(1), 1–18. doi:https://doi.org/10.1080/15434303.2016.1263637
- Nakatsuhara, F., Inoue, C., & Taylor, L. (2017). An investigation into double-marking methods: Comparing live, audio and video rating of performance on the IELTS speaking test. IELTS Research Reports Online Series, 2017/1, 1–49. Retrieved from https://www.ielts.org/-/media/research-reports/ielts_online_rr_2017-1.ashx
- Raffler-Engel, W. (1980). Kinesics and paralinguistics: A neglected factor in second language research and teaching. Canadian Modern Language Review, 36(2), 225–237. doi:https://doi.org/10.3138/cmlr.36.2.225
- Salaberry, R., & Kunitz, S. (Eds.). (2019). Teaching and testing L2 interactional competence: Bridging theory and practice. NY: Routledge.
- Styles, P. (1993). Inter- and intra-rater reliability of assessments of “live” versus audio- and video-recorded interviews in the IELTS Speaking test. A report on a project conducted at the British Council centre in Brussels.
- Taylor, L., & Falvey, P. (Eds.). (2007). IELTS collected papers: Research in speaking and writing assessment. Studies in language testing 19. Cambridge: UCLES/Cambridge University Press.
- Taylor, L., & Galaczi, E. (2011). Scoring validity. In L. Taylor (Ed.), Examining speaking: Research and practice in assessing second language speaking. Studies in language testing 30 (pp. 171–233). Cambridge: UCLES/Cambridge University Press.
- Vo, S. T. (2019). Effects of task types on interactional competence in oral communication assessment (Doctoral dissertation). Iowa State University. ProQuest Dissertations and Theses Global.
- Wagner, E. (2008). Video listening tests: What are they measuring? Language Assessment Quarterly, 5(3), 218–243. doi:https://doi.org/10.1080/15434300802213015
- Wagner, E. (2010). The effect of the use of video texts on ESL listening test-taker performance. Language Testing, 27(4), 493–513. doi:https://doi.org/10.1177/0265532209355668
- Wind, S. A., & Peterson, M. E. (2018). A systematic review of methods for evaluating rating quality in language assessment. Language Testing, 35(2), 161–192. doi:https://doi.org/10.1177/0265532216686999
- Wolfe, E. W. (2013). A bootstrap approach to evaluating person and item fit to the Rasch model. Journal of Applied Measurement, 14(1), 1–9.
- Wright, B. D., & Linacre, J. M. (1994). Reasonable mean-square fit values. Rasch Measurement Transactions, 8(3), 370. https://www.rasch.org/rmt/rmt83b.htm
- Young, R. F. (2011). Interactional competence in language learning, teaching, and testing. In E. Hinkel (Ed.), Handbook of research in second language teaching and learning (Vol. 2, pp. 426–443). New York, NY: Routledge.