Search in:

Patient Related Outcome Measures Volume 14, 2023 - Issue

Submit an article Journal homepage

Open access

321

Views

CrossRef citations to date

Altmetric

METHODOLOGY

Studies on Reliability and Measurement Error of Measurements in Medicine – From Design to Statistics Explained for Medical Researchers

Lidwine B Mokkink1 Amsterdam UMC, Vrije Universiteit Amsterdam, Epidemiology and Data Science, Amsterdam, the Netherlands;2 Amsterdam Public Health Research Institute, Amsterdam, the NetherlandsCorrespondence[email protected]

https://orcid.org/0000-0001-6489-2827 View further author information

Iris Eekhout1 Amsterdam UMC, Vrije Universiteit Amsterdam, Epidemiology and Data Science, Amsterdam, the Netherlands;2 Amsterdam Public Health Research Institute, Amsterdam, the Netherlands;3 Child Health, Netherlands Organisation for Applied Scientific Research, Leiden, the NetherlandsView further author information

Maarten Boers1 Amsterdam UMC, Vrije Universiteit Amsterdam, Epidemiology and Data Science, Amsterdam, the Netherlands;2 Amsterdam Public Health Research Institute, Amsterdam, the Netherlands

https://orcid.org/0000-0002-6969-283X View further author information

Cees PM van der Vleuten4 Department of Educational Development and Research, School of Health Professions Education, Faculty of Health, Medicine and Life Sciences, Maastricht University, Maastricht, the NetherlandsView further author information

Henrica CW de Vet1 Amsterdam UMC, Vrije Universiteit Amsterdam, Epidemiology and Data Science, Amsterdam, the Netherlands;2 Amsterdam Public Health Research Institute, Amsterdam, the NetherlandsView further author information

Pages 193-212 | Received 24 Nov 2022, Accepted 27 May 2023, Published online: 07 Jul 2023

Cite this article
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

References

Stenroth L, Sefa S, Arokoski J, Toyras J. Does magnetic resonance imaging provide superior reliability for achilles and patellar tendon cross-sectional area measurements compared with ultrasound imaging? Ultrasound Med Biol. 2019;45(12):3186–3198. doi:10.1016/j.ultrasmedbio.2019.08.001
PubMed Web of Science ®Google Scholar
Mokkink LB, Boers M, van der Vleuten CPM, et al. COSMIN risk of bias tool to assess the quality of studies on reliability or measurement error of outcome measurement instruments: a Delphi study. BMC Med Res Methodol. 2020;20(1). doi:10.1186/s12874-020-01179-5
PubMed Web of Science ®Google Scholar
Mokkink LB, Terwee CB, Patrick DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737–745. doi:10.1016/j.jclinepi.2010.02.006
PubMed Web of Science ®Google Scholar
de Vet HC, Terwee CB, Knol DL, Bouter LM. When to use agreement versus reliability measures. J Clin Epidemiol. 2006;59:1033–1039. doi:10.1016/j.jclinepi.2005.10.015
PubMed Web of Science ®Google Scholar
McGraw KO, Wong SP. Forming inferences about some intraclass correlation coefficients. Psychol Methods. 1996;1:30–46. doi:10.1037/1082-989X.1.1.30
Web of Science ®Google Scholar
Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86:420–428. doi:10.1037/0033-2909.86.2.420
PubMed Web of Science ®Google Scholar
Brennan RL. Generalizability theory. Statistics for Social Science and Public Policy. Springer-Verlag; 2001.
Google Scholar
Shavelson RJ, Webb NM. Generalizability theory. A Primer. Vol 1. Measurement Methods for the Social Science. Sage Publishing; 1991.
Google Scholar
Bloch R, Norman G. Generalizability theory for the perplexed: a practical introduction and guide: AMEE Guide No. 68. Med Teach. 2012;34(11):960–992. doi:10.3109/0142159X.2012.703791
PubMed Web of Science ®Google Scholar
Eekhout I, Mokkink LB. ICC & SEM power: sample size decision assistant for studies on reliability and measurement error; 2022. Available from: https://iriseekhout.shinyapps.io/ICCpower/. Accessed June 21, 2023.
Google Scholar
Mokkink LB, HCWd V, Diemeer S, Eekhout I. Sample size recommendations for studies on reliability and measurement error: an online application based on simulation studies. Health Serv Outcomes Res Method. 2022. doi:10.1007/s10742-022-00293-9
Web of Science ®Google Scholar
Rose M, Bjorner JB, Gandek B, Bruce B, Fries JF, Ware JE. The PROMIS physical function item bank was calibrated to a standardized metric and shown to improve measurement efficiency. J Clin Epidemiol. 2014;67(5):516–526. doi:10.1016/j.jclinepi.2013.10.024
PubMed Web of Science ®Google Scholar
Fischer JS, Jak AJ, Kniker JE, Rudick RA, Cutter G. Multiple Sclerosis Functional Composite (MSFC). Administration and Scoring Manual. National Multiple Sclerosis Society; 2001.
Google Scholar
Holen JC, Saltvedt I, Fayers PM, Hjermstad MJ, Loge JH, Kaasa S. Doloplus-2, a valid tool for behavioural pain assessment? BMC Geriatr. 2007;7:29. doi:10.1186/1471-2318-7-29
PubMedGoogle Scholar
Butland RJ, Pang J, Gross ER, Woodcock AA, Geddes DM. Two-, six-, and 12-minute walking tests in respiratory disease. Br Med J. 1982;284(6329):1607–1608. doi:10.1136/bmj.284.6329.1607
PubMed Web of Science ®Google Scholar
Ware JE, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care. 1992;30(6):473–483. doi:10.1097/00005650-199206000-00002
PubMed Web of Science ®Google Scholar
Aaronson NK, Muller M, Cohen PD, et al. Translation, validation, and norming of the Dutch language version of the SF-36 Health Survey in community and chronic disease populations. J Clin Epidemiol. 1998;51(11):1055–1068. doi:10.1016/s0895-4356(98)00097-3
PubMed Web of Science ®Google Scholar
Gellhorn AC, Carlson MJ. Inter-rater, intra-rater, and inter-machine reliability of quantitative ultrasound measurements of the patellar tendon. Ultrasound Med Biol. 2013;39(5):791–796. doi:10.1016/j.ultrasmedbio.2012.12.001
PubMed Web of Science ®Google Scholar
Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155–163. doi:10.1016/j.jcm.2016.02.012
PubMed Web of Science ®Google Scholar
White E, Armstrong BK, Saracci R. Principles of Exposure Measurement in Epidemiology. Collecting, Evaluating, and Improving Measures of Disease Risk Factors. Oxford University Press; 2008.
Google Scholar
Liljequist D, Elfving B, Skavberg Roaldsen K. Intraclass correlation - A discussion and demonstration of basic features. PLoS One. 2019;14(7):e0219854. doi:10.1371/journal.pone.0219854
PubMed Web of Science ®Google Scholar
Eekhout I. Agree: agreement and reliability between multiple raters. R package version 0.1.8. Available from: https://github.com/iriseekhout/Agree/. Accessed March 8, 2022.
Google Scholar
Eekhout I, Mokkink LB. Estimating ICCs and SEMs with multilevel models. Available from: https://www.iriseekhout.com/r/agree/. Accessed January 20, 2022.
Google Scholar
Streiner DL, Norman G. Health measurement scales. In: A Practical Guide to Their Development and Use. 4th ed. Oxford University Press; 2008.
Google Scholar
de Vet HC, Terwee CB, Mokkink L, Knol DL. Measurement in Medicine. Practical Guides to Biostatistics and Epidemiology. Cambridge University Press; 2011.
Google Scholar
Skeie EJ, Borge JA, Leboeuf-Yde C, Bolton J, Wedderkopp N. Reliability of diagnostic ultrasound in measuring the multifidus muscle. Chiropr Man Therap. 2015;23:15. doi:10.1186/s12998-015-0059-6
PubMedGoogle Scholar
Kottner J, Audige L, Brorson S, et al. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. J Clin Epidemiol. 2011;64(1):96–106. doi:10.1016/j.jclinepi.2010.03.002
PubMed Web of Science ®Google Scholar
Gagnier JJ, Lai J, Mokkink LB, Terwee CB. COSMIN reporting guideline for studies on measurement properties of patient-reported outcome measures. Qual Life Res. 2021;30:2197–2218. doi:10.1007/s11136-021-02822-4
PubMed Web of Science ®Google Scholar
Demetrashvili N, Wit EC, van den Heuvel ER. Confidence intervals for intraclass correlation coefficients in variance components models. Stat Methods Med Res. 2016;25(5):2359–2376. doi:10.1177/0962280214522787
PubMed Web of Science ®Google Scholar
Efron B. Better bootstrap confidence intervals. J Am Stat Assoc. 1987;82(397):171–185. doi:10.1080/01621459.1987.10478410
Web of Science ®Google Scholar
Loy A, Korobova J. Bootstrapping clustered data in R using lmeresampler. arXiv. 2021;20:54.
Google Scholar
de Vet HCW. Guide for the calculation of ICC in SPSS. Available from: http://www.clinimetrics.nl/images/upload/files/Chapter%205/chapter%205_5_Calculation%20of%20ICC%20in%20SPSS.pdf. Accessed July 14, 2021.
Google Scholar
Brennan RL. urGENOVA. University of Iowa; 2021. Available from: https://education.uiowa.edu/research-centers/center-advanced-studies-measurement-and-assessment/computer-programs. Accessed June 21, 2023.
Google Scholar
Terwee CB, Peipert JD, Chapman R, et al. Minimal important change (MIC): a conceptual clarification and systematic review of MIC estimates of PROMIS measures. Qual Life Res. 2021;30(10):2729–2754. doi:10.1007/s11136-021-02925-y
PubMed Web of Science ®Google Scholar
Zou GY. Sample size formulas for estimating intraclass correlation coefficients with precision and assurance. Stat Med. 2012;31:3972–3981. doi:10.1002/sim.5466
PubMed Web of Science ®Google Scholar

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Studies on Reliability and Measurement Error of Measurements in Medicine – From Design to Statistics Explained for Medical Researchers

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

Studies on Reliability and Measurement Error of Measurements in Medicine – From Design to Statistics Explained for Medical Researchers

References

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date