
Developing assessment scales for large-scale speaking tests: a multiple-method approach

Pages 217-237 | Received 03 Mar 2010, Accepted 17 Mar 2011, Published online: 02 Aug 2011
