References
- Brown , G. , Bull , J. and Pendlebury , M. 1997 . Assessing student learning in higher education , London : Routledge .
- Burton , R. F. 2001 . Quantifying the effects of chance in multiple choice and true/false tests: question selection and guessing of answers . Assessment & Evaluation in Higher Education , 26 (1) : 41 – 50 .
- Burton , R. F. 2004a . Multiple choice and true/false tests: reliability measures and some implications of negative marking . Assessment & Evaluation in Higher Education , 29 (5) : 585 – 595 .
- Burton , R. F. 2004b . Can item response theory help us improve our tests? . Medical Education , 38 (4) : 338 – 339 .
- Burton , R. F. 2005 . Multiple‐choice and true/false tests: myths and misapprehensions . Assessment & Evaluation in Higher Education , 30 (1) : 63 – 70 .
- Burton , R. F. and Miller , D. J. 1999 . Statistical modelling of multiple‐choice and true/false tests: ways of considering, and of reducing, the uncertainties attributable to guessing . Assessment & Evaluation in Higher Education , 24 (4) : 399 – 411 .
- Case , S. M. , Swanson , D. B. and Ripkey , D. R. 1994 . Comparison of items in five‐option and extended‐matching formats for assessment of diagnostic skills . Academic Medicine , 69 (Suppl) : S1 – S3 .
- Cronbach , L. J. 1951 . Coefficient alpha and the internal structure of tests . Psychometrika , 16 (3) : 297 – 334 .
- Ebel , R. L. 1979 . Essentials of educational measurement , (3rd edn) , Englewood Cliffs, NJ : Prentice‐Hall .
- Embretson , S. E. 1999 . “ Issues in the measurement of cognitive abilities ” . In The new rules of measurement , Edited by: Embretson , S. E. and Herschberger , S. L. 1 – 15 . Mahwah, NJ : Lawrence Erlbaum Associates .
- Feldt , L. S. 1984 . Some relationships between the binomial error model and classical test theory . Educational and Psychological Measurement , 44 : 883 – 891 .
- Feldt , L. S. and Brennan , R. L. 1989 . “ Reliability ” . In Educational measurement , (3rd edn) , Edited by: Linn , R. L. 105 – 146 . New York : Macmillan .
- Gardner‐Medwin , A. R. and Gahan , M. . Formative and summative confidence‐based assessment . Proceedings of the 7th International Computer‐Aided Assessment Conference . July 2003 , Loughborough, UK. pp. 147 – 155 . Available online at: http://s‐d.lboro.ac.uk/caanew/past Conferences/2003/procedings/gardner‐medwin.pdf (accessed 7 May 2005)
- Gulliksen , H. 1945 . The relation of item difficulty and inter‐item correlation to test variance and reliability . Psychometrika , 10 (2) : 79 – 91 .
- Hambleton , R. K. 1984 . “ Determining test length ” . In A guide to criterion‐referenced test construction , Edited by: Berk , R.A. 144 – 168 . Baltimore and London : The Johns Hopkins University Press .
- Kuder , G. F. and Richardson , M. W. 1937 . The theory of the estimation of test reliability . Psychometrika , 2 (3) : 151 – 160 .
- Lord , F. M. and Novick , M. R. 1968 . Statistical theories of mental test scores , Reading, MA : Addison‐Wesley .
- Millman , J. 1973 . Passing scores and test lengths for domain‐referenced measures . Review of Educational Research , 43 : 205 – 216 .
- Posey , C. 1932 . Luck and examination grades . Journal of Engineering Education , 60 : 292 – 296 .
- Wilcox , R. R. 1980 . Determining the length of a criterion‐referenced test . Applied Psychological Measurement , 4 (4) : 425 – 446 .
- Wood , R. 1991 . Assessment and testing , Cambridge : Cambridge University Press .
- Wright , B. D. 1999 . “ Fundamental measurement for psychology ” . In The new rules of measurement , Edited by: Embretson , S. E. and Herschberger , S. L. 65 – 104 . Mahwah, NJ : Lawrence Erlbaum Associates .