References
- Atkinson, R. C., & Geiser, S. (2009). Reflections on a century of college admissions tests. Educational Researcher, 38, 665–676.
- Black, P., & Wiliam, D. (1998). Assessment and classroom learning. Assessment in Education, 5, 7–74.
- Bonner, S. (2012, April). Validating interpretations and actions based on classroom assessment. Symposium presentation at the annual meeting of the National Council on Measurement in Education, Vancouver, British Columbia, Canada.
- Bowers, A. J. (2010). Analyzing the longitudinal K–12 grading histories of entire cohorts of students: Grades, data driven decision making, dropping out and hierarchical cluster analysis. Practical Assessment, Research &Evaluation, 15(7), Available from http://pareonline.net/getvn.asp?v = 15&n = 7.
- Bowers, A. J., & Sprott, R. (2012). Examining the multiple trajectories associated with dropping out of high school: A growth mixture model analysis. Journal of Educational Research, 105, 176–195.
- Bowers, A. J., Sprott, R., & Taff, S. (2013). Do we know who will drop out? A review of the predictors of dropping out of high school: Precision, sensitivity and specificity. The High School Journal, 96, 77–100.
- Bowman, N. A. (2011). Examining systematic errors in predictors of college student self-reported gains. New Directions for Institutional Research, 2011(150), 7–19.
- Brimi, H. M. (2011). Reliability of grading high school work in English. Practical Assessment, Research & Evaluation, 16(17), Retrieved from http://pareonline.net/getvn.asp?v = 16&n = 17.
- Brookhart, S. M. (1993). Teachers' grading practices: Meaning and values. Journal of Educational Measurement, 30, 123–142.
- Brookhart, S. M. (1994). Teachers' grading: Practice and theory. Applied Measurement in Education, 7, 279–301.
- Brookhart, S. M. (2003). Developing measurement theory for classroom assessment purposes and uses. Educational Measurement: Issues and Practice, 22(4), 5–12.
- Brookhart, S. M. (2013a). Grading. In J. H. McMillan (Ed.), SAGE handbook of research on classroom assessment (pp. 257–271). Thousand Oaks, CA: Sage.
- Brookhart, S. M. (2013b). The use of teacher judgement for summative assessment in the USA. Assessment in Education, 20, 69–90.
- Cizek, G. J., Fitzgerald, S. M., & Rachor, R. E. (1995–1996). Teachers' assessment practices: Preparation, isolation, and the kitchen sink. Educational Assessment, 3, 159–179.
- Cronbach, L. J. (1971). Test validation. In R. L. Thorndike (Ed.), Educational measurement (2nd ed.) (pp. 443–507). Washington, DC: American Council on Education.
- Crooks, T. J. (1988). The impact of classroom evaluation practices on students. Review of Educational Research, 58, 438–481.
- Cross, L. H., & Frary, R. B. (1999). Hodgepodge grading: Endorsed by students and teachers alike. Applied Measurement in Education, 12, 53–72.
- Dorans, N. J. (2012). The contestant perspective on taking tests: Emanations from the statue within. Educational Measurement: Issues and Practice, 31(4), 20–37.
- Dweck, C. S. (2000). Self-theories: Their role in motivation, personality, and development. New York, NY: Psychology Press.
- Entwisle, D. R., & Alexander, K. L. (1988). Factors affecting achievement test scores and marks of black and white first graders. Elementary School Journal, 88, 449–471.
- Frary, R. B., Cross, L. H., & Weber, L. J. (1993). Testing and grading practices and opinions of secondary teachers of academic subjects: Implications for instruction in measurement. Educational Measurement: Issues and Practice, 12(3), 23–30.
- Friedman, S. J., & Manley, M. (1991, April). Grading practices in the secondary school: Perceptions of the stakeholders. Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago, IL.
- Griswold, P. A., & Griswold, M. M. (1992, April). The grading contingency: Graders' beliefs and expectations and the assessment ingredients. Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA.
- Gullickson, A. R. (1985). Student evaluation techniques and their relationship to grade and curriculum. Journal of Educational Research, 79, 96–100.
- Guskey, T. R. (2015). On your mark. Bloomington, IN: Solution Tree Press.
- Haertel, E., & Herman, J. (2005, June). A historical perspective on validity arguments for accountability testing. National Center for Research on Evaluation, Standards, and Student Testing, University of California, Los Angeles. Retrieved from ERIC database. (ED488709).
- Healy, K. L. (1935). A study of the factors involved in the rating of pupils' compositions. Journal of Experimental Education, 4, 50–53.
- Hoge, R. D., & Coladarci, T. (1989). Teacher-based judgments of academic achievement: A review of literature. Review of Educational Research, 59, 297–313.
- Hulten, C. E. (1925). The personal element in teachers' marks. Journal of Educational Research, 12, 49–55.
- Kane, M. T. (2006). Validation. In R. L. Brennan (Ed.), Educational measurement (4th ed., pp. 17–64). Westport, CT: American Council on Education/Praeger.
- Kane, M. T. (2012, March). Validity, fairness, and testing. Paper presented at the symposium Educational Assessment, Accountability, and Equity: Conversations on Validity around the World, Teachers College, Columbia University, New York, NY.
- Kelly, F. J. (1914). Teachers' marks: Their variability and standardization (Contributions to Education No. 66). New York, NY: Teachers College, Columbia University.
- Klapp Lekholm, A., & Cliffordson, C. (2008). Discrepancies between school grades and test scores at individual and school level: Effects of gender and family background. Educational Research and Evaluation, 14, 181–199.
- Klapp Lekholm, A., & Cliffordson, C. (2009). Effects of student characteristics on grades in compulsory school. Educational Research and Evaluation, 15, 1–23.
- Kuncel, N. R., Credé, M., & Thomas, L. L. (2005). The validity of self-reported grade-point averages, class ranks, and test scores: A meta-analysis and review of the literature. Review of Educational Research, 75, 63–82.
- Lauterbach, C. E. (1928). Some factors affecting teachers' marks. Journal of Educational Psychology, 19, 266–271.
- Llosa, L. (2008). Building and supporting a validity argument for a standards-based classroom assessment of English proficiency based on teacher judgments. Educational Measurement: Issues and Practice, 27(3), 32–42.
- Manke, M. P., & Loyd, B. H. (1990). An investigation of non-achievement related factors influencing teachers' grading practices. Paper presented at the annual meeting of the National Council on Measurement in Education, Boston, MA.
- Manke, M. P., & Loyd, B. H. (1991). A study of teachers' understanding of their grading practices. Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago, IL.
- McMillan, J. H. (2001). Secondary teachers' classroom assessment and grading practices. Educational Measurement: Issues and Practice, 20, 20–32.
- McMillan, J. H., Myran, S., & Workman, D. (2002). Elementary teachers' classroom assessment and grading practices. Journal of Educational Research, 95, 203–213.
- Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 13–103). New York, NY: Macmillan.
- Natriello, G. (1987). The impact of evaluation processes on students. Educational Psychologist, 22, 155–175.
- Nava, F. J. G., & Loyd, B. H. (1992, April). An investigation of achievement and non-achievement criteria in elementary and secondary school grading. Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA.
- O'Connor, K. (2009). How to grade for learning (3rd ed.). Thousand Oaks, CA: Corwin.
- Pattison, E., Grodsky, E., & Muller, C. (2013). Is the sky falling? Grade inflation and the signaling power of grades. Educational Researcher, 42, 259–265.
- Randall, J., & Engelhard, G. (2009a). Differences between teachers' grading practices in elementary and middle schools. Journal of Educational Research, 102, 175–185.
- Randall, J., & Engelhard, G. (2009b). Examining teacher grades using Rasch measurement theory. Journal of Educational Measurement, 46, 1–18.
- Randall, J., & Engelhard, G. (2010). Examining the grading practices of teachers. Teaching and Teacher Education, 26, 1372–1380.
- Rugg, H. O. (1918). Teachers' marks and the reconstruction of the marking system. The Elementary School Journal, 18, 701–719.
- Sawyer, R. (2013). Beyond correlations: Usefulness of high school GPA and test scores in making college admissions decisions. Applied Measurement in Education, 26, 89–112.
- Silberstein, N. (1922). The variability of teachers' marks. English Journal, 11, 414–424.
- Sims, V. M. (1933). Reducing the variability of essay examination marks through eliminating variations in standards of grading. Journal of Educational Research, 26, 637–647.
- Smallwood, M. L. (1935). An historical study of examinations and grading systems in early American universities (Harvard Studies in Education #24). Cambridge, MA: Harvard University Press.
- Smith, A. Z., & Dobbin, J. E. (1960). Marks and marking systems. In C. W. Harris (Ed.), Encyclopedia of educational research (3rd ed., pp. 783–791). New York, NY: Macmillan.
- Smith, E. R., Tyler, R. W., & the Evaluation Staff (1942). Appraising and recording student progress The Adventure in American Education Series (Vol. III). New York, NY: Harper & Brothers.
- Smith, J. K. (2003). Reconsidering reliability in classroom assessment and grading. Educational Measurement: Issues and Practice, 22(4), 26–33.
- Starch, D., & Elliott, E. C. (1912). Reliability of the grading of high-school work in English. School Review, 20, 442–457.
- Starch, D., & Elliott, E. C. (1913a). Reliability of grading work in mathematics. School Review, 21, 254–259.
- Starch, D., & Elliott, E. C. (1913b). Reliability of grading work in history. School Review, 21, 676–681.
- Stiggins, R. J., Frisbie, D. A., & Griswold, P. A. (1989). Inside high school grading practices: Building a research agenda. Educational Measurement: Issues and Practice, 8(2), 5–14.
- Thorsen, C., & Cliffordson, C. (2012). Teachers' grade assignment and the predictive validity of criterion-referenced grades. Educational Research and Evaluation, 18, 153–172.
- Toulmin, S. (1958). The uses of argument. Cambridge, UK: Cambridge University Press.
- White, K. R. (1982). The relation between socioeconomic status and academic achievement. Psychological Bulletin, 91, 461–481.
- Wood, P., Bennett, T., Wood, J., & Bennett, C. (1990). Grading and evaluation practices and policies of school teachers. Bowling Green, OH: Bowling Green State University. Retrieved from ERIC database. (ED319782).
- *Banker, H. J. (1927a). The significance of teachers' marks (Part 1). Journal of Educational Research, 16, 159–171.
- *Banker, H. J. (1927b). The significance of teachers' marks (Part 2). Journal of Educational Research, 16, 271–284.
- Bowers, A. J. (2011). What's in a grade? The multidimensional nature of what teacher-assigned grades assess in high school. Educational Research and Evaluation, 17, 141–159.
- Brennan, R. T., Kim, J., Wenz-Gross, M., & Siperstein, G. N. (2001). The relative equitability of high-stakes testing versus teacher-assigned grades: An analysis of the Massachusetts Comprehensive Assessment System (MCAS). Harvard Educational Review, 71, 173–216.
- Carey, T., & Carifio, J. (2012). The minimum grading controversy: Results of a quantitative study of seven years of grading data from an urban high school. Educational Researcher, 41, 201–208.
- *Carter, R. S. (1952). How invalid are marks assigned by teachers? Journal of Educational Psychology, 43, 218–228.
- *Carter, R. S. (1953). Non-intellectual variables Involved in teachers marks. Journal of Educational Research, 47, 81–95.
- *D'Agostino, J., & Welsh, M. (2007, April). Standards-based progress reports and standards-based assessment score convergence. Paper presented at the annual meeting of the American Educational Research Association, Chicago, IL.
- Fish, E. (1969). The relationships of teachers' assigned marks to tested achievement among educationally and culturally disadvantaged children in the elementary grades (Final Report, Project No. 8-F-113). Retrieved from ERIC database. (ED035709).
- Guskey, T. R. (2011). Stability and change in high school grades. NSAAP Bulletin, 95, 85–98.
- Halliwell, J. W. (1960). The relationship of certain factors to marketing practices in individualized reporting programs. Journal of Educational Research, 54, 76–78.
- Hausdorff, H., & Farr, S. D. (1965). The effect of grading practices on the marks of gifted sixth grade children. Journal of Educational Research, 59, 169–172.
- Knafle, J. D. (1972). The relationship of behavior ratings to grades earned by female high school students. Journal of Educational Research, 66, 106–110.
- McCandless, B. R., Roberts, A., & Starnes, T. (1972). Teachers' marks, achievement test scores, and aptitude relations with respect to social class, race, and sex. Journal of Educational Psychology, 63, 153–159.
- Miner, B. C. (1967). Three factors of school achievement. Journal of Educational Research, 60, 370–376.
- Moore, C. C. (1939). The elementary school mark. Pedagogical Seminary and Journal of Genetic Psychology, 54, 285–294.
- Office of Research. (1994). What do grades mean? Differences across schools. Office of Educational Research and Improvement, U.S. Department of Education: Washington, DC.
- Resnick, J. (1951). A study of some relationships between high school grades and certain aspects of adjustment. Journal of Educational Research, 44, 321–340.
- Ross, C. C., & Hooks, N. T. (1930). How shall we predict high school achievement? Journal of Educational Research, 22, 184–196.
- Russell, I. L., & Thalman, W. A. (1955). Personality, does it influence teachers' marks? Journal of Educational Research, 48, 561–564.
- Sobel, F. S. (1936). Teachers' marks and objective tests as indicators of adjustment. Teachers College Record, 38, 239–240.
- Stewart, J. L. (1920). Uniformity of teachers' marks versus variability. School Review, 28, 529–533.
- Swineford, F. (1947). Examination of the purported unreliability of teachers' marks. Elementary School Journal, 47, 516–521.
- Terwilliger, J. S. (1968). Individual differences in the marking practices of secondary school teachers. Journal of Educational Measurement, 5, 9–15.
- Unzicker, S. P. (1925). Teachers' marks and intelligence. Journal of Educational Research, 11, 123–131.
- *Welsh, M. E., & D'Agostino, J. V. (2009). Fostering consistency between standards-based grades and large-scale assessment results. In T. R. Guskey (Ed.), Practical solutions for serious problems in standards-based grading (pp. 75–104). Thousand Oaks, CA: Corwin.
- *Welsh, M. E., D'Agostino, J. V., & Kaniskan, B. (2013). Grading as a reform effort: Do standards-based grades converge with test scores? Educational Measurement: Issues and Practice, 32, 26–36.
- Willingham, W. W., Pollack, J. M., & Lewis, C. (2002). Grades and test scores: Accounting for observed differences. Journal of Educational Measurement, 39, 1–37.