
Ofqual’s Reliability Programme: a case study exploring the potential to improve public understanding and confidence

Pages 93-113 | Published online: 30 Jan 2013
 

Abstract

In May 2008, Ofqual established a two-year programme of research to investigate the nature and extent of (un)reliability within the qualifications, examinations and assessments that it regulated. It was particularly concerned to improve understanding of, and confidence in, this technically complex and politically sensitive phenomenon. The following article presents an account of this programme, from the perspective of one of its initiators, the author. It describes: the context prior to the programme, where little information on (un)reliability was routinely available to the public; the rationale for the programme, in terms of the tension between improving public understanding and the concomitant threat of decreasing public confidence; and ways in which aspects of the programme were constructed through media reports. It concludes with lessons learned from running the programme and with an extended discussion of the challenge of talking about reliability and error.

Acknowledgements

This paper was produced with the support of my employer, Cambridge Assessment, although the views expressed are entirely my own. I would like to thank Andrew Boyle, Isabel Nisbet, Dennis Opposs and John Gardner for very helpful comments on earlier drafts.

Notes

1. While the term reliability has a narrow technical meaning which is generally accepted, there is no generally accepted term to express the overall error that has been described here as measurement inaccuracy. In Newton (2005a), I distinguished ‘measurement inaccuracy’ (i.e. error as the difference between correct and incorrect assessment result) from ‘human error’ (i.e. error as the violation of an assessment procedure). I described both as aspects of ‘assessment error’. There is no clear relationship between measurement inaccuracy and human error since measurement inaccuracy can (and will) arise in the absence of human error, and human error may not actually result in measurement inaccuracy.

2. This will require the translation of reliability statistics into classification accuracy statistics, with the proviso that these are likely to be underestimates.

