References
- Adnan, K., and O. Bulut. 2014. “Crossed Random-Effect Modeling: Examining the Effects of Teacher Experience and Rubric Use in Performance Assessments.” Eurasian Journal of Educational Research 14 (57): 1–28. doi:https://doi.org/10.14689/ejer.2014.57.4.
- Andrade, H., and Y. Du. 2005. “Student Perspectives on Rubric-Referenced Assessment.” Practical Assessment, Research and Evaluation 10 (3): 1–11. doi: https://doi.org/10.7275/g367-ye94
- Bloxham, S., P. Boyd, and S. Orr. 2011. “Mark my Words: The Role of Assessment Criteria in UK Higher Education Grading Practices.” Studies in Higher Education 36 (6): 655–670. doi:https://doi.org/10.1080/03075071003777716.
- Bourke, S., and A. P. Holbrook. 2013. “Examining PhD and Research Masters Theses.” Assessment & Evaluation in Higher Education 38 (4): 407–416. doi:https://doi.org/10.1080/02602938.2011.638738.
- de Kleijn, R. A. M., M. T. Mainhard, P. C. Meijer, A. Pilot, and M. Brekelmans. 2012. “Master’s Thesis Supervision: Relations between Perceptions of the Supervisor-Student Relationship, Final Grade, Perceived Supervisor Contribution to Learning and Student Satisfaction.” Studies in Higher Education 37 (8): 925–939. doi:https://doi.org/10.1080/03075079.2011.556717.
- de Vries, A., and B. D. Ripley. 2016. gdendro: Create dendrograms and tree diagrams using “ggplot2” (0.1-20). https://github.com/andrie/ggdendro
- Goulden, N. R., and C. J. G. Griffin. 1995. “The Meaning of Grades Based on Faculty and Student Metaphors.” Communication Education 44 (2): 110–125. doi:https://doi.org/10.1080/03634529509379003.
- Grainger, P., L. Adie, and K. Weir. 2016. “Quality Assurance of Assessment and Moderation Discourses Involving Sessional Staff.” Assessment & Evaluation in Higher Education 41 (4): 548–559. doi:https://doi.org/10.1080/02602938.2015.1030333.
- Grömping, U. 2006. “Relative Importance for Linear Regression in R: The Package Relaimpo.” Journal of Statistical Software 17 (1): 1–27. doi:https://doi.org/10.18637/jss.v017.i01.
- Kapborg, I., and C. Berterö. 2002. “Critiquing Bachelor Candidates' Theses: Are the Criteria Useful?” International Nursing Review 49 (2): 122–128. doi:https://doi.org/10.1046/j.1466-7657.2002.00123.x.
- Kiley, M., and G. Mullins. 2004. “Examining the Examiners: How Inexperienced Examiners Approach the Assessment of Research Theses.” International Journal of Educational Research 41 (2): 121–135. doi:https://doi.org/10.1016/j.ijer.2005.04.009.
- Lundgren, S. M., M. Halvarsson, and B. Robertsson. 2008. “Quality Assessment and Comparison of Grading between Examiners and Supervisors of Bachelor Theses in Nursing Education.” Nurse Education Today 28 (1): 24–32. doi:https://doi.org/10.1016/j.nedt.2007.02.009.
- Panadero, E., J. Alonso-Tapia, and E. Reche. 2013. “Rubrics vs. Self-Assessment Scripts Effect on Self-Regulation, Performance and Self-Efficacy in Pre-Service Teachers.” Studies in Educational Evaluation 39 (3): 125–132. doi:https://doi.org/10.1016/j.stueduc.2013.04.001.
- Pedersen, T. L. 2019. patchwork: The composer of plots (R-package version 1.0.0). https://cran.r-project.org/package=patchwork
- Prins, F. J., R. de Kleijn, and J. van Tartwijk. 2017. “Students’ Use of a Rubric for Research Theses.” Assessment & Evaluation in Higher Education 42 (1): 128–150. doi:https://doi.org/10.1080/02602938.2015.1085954.
- Reddy, Y. M., and H. Andrade. 2010. “A Review of Rubric Use in Higher Education.” Assessment & Evaluation in Higher Education 35 (4): 435–448. doi:https://doi.org/10.1080/02602930902862859.
- Revelle, W. 2019. psych: Procedures for psychological, psychometric, and personality Research (R package version 1.9.12). https://cran.r-project.org/package=psych
- Sadler, D. R. 2009. “Indeterminacy in the Use of Preset Criteria for Assessment and Grading.” Assessment & Evaluation in Higher Education 34 (2): 159–179. doi:https://doi.org/10.1080/02602930801956059.
- Tai, J., R. Ajjawi, D. Boud, P. Dawson, and E. Panadero. 2018. “Developing Evaluative Judgement: Enabling Students to Make Decisions about the Quality of Work.” Higher Education 76 (3): 467–481. doi:https://doi.org/10.1007/s10734-017-0220-3.
- Timmerman, B. E., D. C. Strickland, R. L. Johnson, and J. R. Payne. 2011. “Development of a ‘Universal’ Rubric for Assessing Undergraduates’ Scientific Reasoning Skills Using Scientific Writing.” Assessment & Evaluation in Higher Education 36 (5): 509–547. doi:https://doi.org/10.1080/02602930903540991.
- van der Vleuten, C. P. M., L. W. T. Schuwirth, E. W. Driessen, J. Dijkstra, D. Tigelaar, L. K. J. Baartman, and J. van Tartwijk. 2012. “A Model for Programmatic Assessment Fit for Purpose.” Medical Teacher 34 (3): 205–214. doi:https://doi.org/10.3109/0142159X.2012.652239.
- Warnes, G. R., B. Bolker, L. Bonebakker, R. Gentleman, W. Huber, A. Liaw, T. Lumley, et al. 2019. gplots: Various R programming tools for plotting data (3.0.3). https://github.com/talgalili/gplots
- Wickham, H. 2016. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag. https://doi.org/978-3-319-24277-4.
- Wilke, C. O. 2019. cowplot: Streamlined plot theme and plot annotations for ggplot2. https://github.com/wilkelab/cowplot
- Williams, L., and S. Kemp. 2019. “Independent Markers of Master’s Theses Show Low Levels of Agreement.” Assessment & Evaluation in Higher Education 44 (5): 764–771. doi:https://doi.org/10.1080/02602938.2018.1535052.