Comparing Yes/No Angoff and Bookmark Standard Setting Methods in the Context of English Assessment

Pages 331-350 | Published online: 15 Aug 2013
