REFERENCES
- Automotive Industry Action Group . ( 2002 ). Measurement System Analysis; Reference Manual, , 3rd ed. Detroit , MI : Automotive Industry Action Group .
- Bartko , J. J. , Carpenter , W. T. ( 1976 ). On the Methods and Theory of Reliability . Journal of Nervous and Mental Disease 163 : 307 – 317 . [PUBMED] [INFOTRIEVE] [CSA] [CROSSREF]
- Bartholomew , D. J. , Knott , M. ( 1999 ). Latent Variable Models and Factor Analysis . London : Arnold .
- Boyles , R. A. ( 2001 ). Gauge Capability for Pass-Fail Inspection . Technometrics 43 ( 2 ): 223 – 229 . [CSA] [CROSSREF]
- Brennan , R. L. , Prediger , D. J. ( 1981 ). Coefficient Kappa: Some uses, Misuses, and Alternatives . Educational and Psychological Measurement 41 : 687 – 699 . [CSA] [CROSSREF]
- Cicchetti , D. V. , Feinstein , A. R. ( 1990 ). High Agreement but Low Kappa: II. Resolving the Paradoxes . Journal of Clinical Epidemiology 43 ( 6 ): 551 – 558 . [PUBMED] [INFOTRIEVE] [CSA] [CROSSREF]
- Cohen , J. ( 1960 ). A Coefficient of Agreement for Nominal Scales . Educational and Psychological Measurement 20 : 37 – 46 . [CSA] [CROSSREF]
- Conger , A. J. ( 1980 ). Integration and Generalization of Kappas for Multiple Raters . Psychological Bulletin 88 ( 2 ): 322 – 328 . [CSA] [CROSSREF]
- Dunn , G. ( 1989 ). Design and Analysis of Reliability Studies . New York : Oxford University Press .
- Elffers , H. ( 2001 ). How High should Cohen's Kappa be when Dealing with Unbalanced Categories? (In Dutch) . Kwantitatieve Methoden 66 : 33 – 41 . [CSA]
- Everitt , B. S. ( 1968 ). Moments of the Statistics Kappa and weighted Kappa . The British Journal of Mathematical and Statistical Psychology 21 ( 1 ): 97 – 103 . [CSA]
- Feinstein , A. R. , Cicchetti , D. V. ( 1990 ). High Agreement but Low Kappa: I. The Problems of two Paradoxes . Journal of Clinical Epidemiology 43 ( 6 ): 543 – 549 . [PUBMED] [INFOTRIEVE] [CSA] [CROSSREF]
- Fleiss , J. L. ( 1965 ). Estimating the Accuracy of Dichotomous Judgments . Psychometrika 30 : 469 – 479 . [PUBMED] [INFOTRIEVE] [CSA]
- Fleiss , J. L. ( 1971 ). Measuring Nominal Scale Agreement among many Raters . Psychological Bulletin 76 ( 5 ): 378 – 382 . [CSA]
- Futrell , D. ( 1995 ). When Quality Is a Matter of Taste, Use Reliability Indexes . Quality Progress May , 81 – 86 . [CSA]
- Goodman , L. A. , Kruskal , W. H. ( 1954 ). Measures of Association for Cross Classifications . Journal of American Statistical Association 49 : 732 – 764 . [CSA]
- Hubert , L. H. ( 1977 ). Kappa Revisited . Psychological Bulletin 84 ( 2 ): 289 – 297 . [CSA] [CROSSREF]
- Landis , J. R. , Koch , G. G. ( 1975a ). A Review of the Statistical Methods in the Analysis of Data Arising from Observer Reliability Studies (Part I) . Statistica Neerlandica 29 : 101 – 123 . [CSA] [CROSSREF]
- Landis , J. R. , Koch , G. G. ( 1975b ). A Review of the Statistical Methods in the Analysis of Data Arising from Observer Reliability Studies (Part II) . Statistica Neerlandica 29 : 151 – 161 . [CSA]
- Landis , J. R. , Koch , G. G. ( 1977 ). The Measurement of Observer Agreement for Categorical Data . Biometrics 33 : 159 – 174 . [PUBMED] [INFOTRIEVE] [CSA] [CROSSREF]
- McLachlan , G. J. , Krishnan , T. ( 1997 ). The EM Algorithm and Extensions . New York : Wiley .
- Shrout , P. E. , Fleiss , J. L. ( 1979 ). Intraclass Correlations: Uses in Assessing Rater Reliability . Psychological Bulletin 86 ( 2 ): 420 – 428 . [CSA] [CROSSREF]
- Tanner , M. A. , Young , M. A. ( 1985 ). Modeling Agreement among Raters . Journal of the American Statistical Association 80 : 175 – 180 . [CSA]
- Uebersax , J. S. ( 1988 ). Validity Inferences from Interobserver Agreement . Psychological Bulletin 104 ( 3 ): 405 – 416 . [CSA] [CROSSREF]
- Wheeler , D. J. , Lyday , R. W. ( 1989 ). Evaluating the Measurement Process . Knoxville , TN : SPC Press .
- †Current affiliation: Department of Statistics, Free University of Amsterdam.
- §Current affiliation: Organon NV.