Using modern test theory to maintain standards in public qualifications in England

Pages 628-647 | Received 05 Jan 2012, Accepted 22 Jun 2012, Published online: 13 Jul 2012

