Search in:

Advanced search

Journal of Research on Educational Effectiveness Volume 15, 2022 - Issue 3

Submit an article Journal homepage

287

Views

CrossRef citations to date

Altmetric

Intervention, Evaluation, and Policy Studies

The Effects of Higher-Stakes Teacher Evaluation on Office Disciplinary Referrals

David D. Liebowitza Department of Educational Methodology, Policy and Leadership, University of Oregon, Eugene, Oregon, USACorrespondence[email protected]

https://orcid.org/0000-0001-7375-6034

Lorna Portera Department of Educational Methodology, Policy and Leadership, University of Oregon, Eugene, Oregon, USA

Dylan Braggb Educational and Community Supports, College of Education, University of Oregon, Eugene, Oregon, USA

Pages 475-509 | Received 15 Jul 2020, Accepted 15 Oct 2021, Published online: 26 Jan 2022

Cite this article
https://doi.org/10.1080/19345747.2021.2015496
CrossMark

Full Article
Figures & data
References
Supplemental
Citations
Metrics
Reprints & Permissions

References

Abadie, A., Athey, S., Imbens, G., & Wooldridge, J. (2017). When should you adjust standard errors for clustering? NBER Working Paper Series No. 24003, Cambridge, MA.
Google Scholar
Anderson, K. P. (2020). Academic, attendance, and behavioral outcomes of a suspension reduction policy: Lessons for school leaders and policy makers. Educational Administration Quarterly, 56(3), 435–471. https://doi.org/https://doi.org/10.1177/0013161X19861138
Web of Science ®Google Scholar
Athey, S., & Imbens, G. (2018). Design-based analysis in difference-in-differences settings with staggered adoption. NBER Working Paper Series No. 24963, Cambridge, MA. https://doi.org/https://doi.org/10.3386/w24963
Google Scholar
Bacher-Hicks, A., Billings, S. B., & Deming, D. J. (2019). The school to prison pipeline: Long-run impacts of school suspensions on adult crime. NBER Working Paper Series No. 26257, Cambridge, MA.
Google Scholar
Bertrand, M., Duflo, E., & Mullainathan, S. (2004). How much should we trust differences-in-differences estimates? The Quarterly Journal of Economics, 119(1), 249–275. https://doi.org/https://doi.org/10.1162/003355304772839588
Web of Science ®Google Scholar
Bezinque, A., Garcia, K., Darling, K., & Stuart-Cassel, V. (2018). Compendium of school discipline laws and regulations for the 50 states, Washington, D.C. and the U.S. territories. Washington, DC. http://safesupportivelearning.ed.gov/school-discipline-compendium
Google Scholar
Borusyak, K., & Jaravel, X. (2017). Revisiting event study designs (SSRN Working Paper). SSRN Working Papers. https://doi.org/https://doi.org/10.2139/ssrn.2826228
Google Scholar
Bragg, D. (2019). School-wide information system: Dataset D0098. University of Oregon.
Google Scholar
Brehm, M., Imberman, S. A., & Lovenheim, M. F. (2017). Achievement effects of individual performance incentives in a teacher merit pay tournament. Labour Economics, 44, 133–150. https://doi.org/https://doi.org/10.1016/j.labeco.2016.12.008
Web of Science ®Google Scholar
Burgess, S., Rawal, S., & Taylor, E. S. (2021). Teacher peer observation and student test scores: Evidence from a field experiment in English secondary schools. Journal of Labor Economics, 39(4), 1155–1186. https://doi.org/https://doi.org/10.1086/712997
Web of Science ®Google Scholar
Callaway, B., & Sant’Anna, P. H. C. (2021). Difference-in-differences with multiple time periods. Journal of Econometrics, 225(2), 200–230. https://doi.org/https://doi.org/10.1016/j.jeconom.2020.12.001
Web of Science ®Google Scholar
Cameron, A., Gelbach, J. B., & Miller, D. L. (2011). Robust inference with multiway clustering. Journal of Business & Economic Statistics, 29(2), 238–249. https://doi.org/https://doi.org/10.1198/jbes.2010.07136
Web of Science ®Google Scholar
Carrell, S. E., & Hoekstra, M. L. (2010). Externalities in the classroom: How children exposed to domestic violence affect everyone’s kids. American Economic Journal: Applied Economics, 2(1), 211–228. https://doi.org/https://doi.org/10.1257/app.2.1.211
Web of Science ®Google Scholar
Chakrabarti, R. (2014). Incentives and responses under no child left behind: Credible threats and the role of competition. Journal of Public Economics, 110, 124–146. https://doi.org/https://doi.org/10.1016/J.JPUBECO.2013.08.005
Web of Science ®Google Scholar
Chiang, H. (2009). How accountability pressure on failing schools affects student achievement. Journal of Public Economics, 93(9–10), 1045–1057. https://doi.org/https://doi.org/10.1016/J.JPUBECO.2009.06.002
Web of Science ®Google Scholar
Cohen, J., Hutt, E., Berlin, R., & Wiseman, E. (2020). The change we cannot see: Instructional quality and classroom observation in the era of common core. Educational Policy, 1–27. https://doi.org/https://doi.org/10.1177/0895904820951114.
Google Scholar
Connally, K., & Tooley, M. (2016). Beyond ratings: Re-envisioning state teacher evaluation systems as tools for professional growth.
Google Scholar
Cullen, J. B., Koedel, C., & Parsons, E. (2019). The compositional effect of rigorous teacher evaluation on workforce quality. Education Finance and Policy, 1–85. https://doi.org/https://doi.org/10.1162/edfp_a_00292.
Google Scholar
Curran, F. C. (2016). Estimating the effect of state zero tolerance laws on exclusionary discipline, racial discipline gaps, and student behavior. Educational Evaluation and Policy Analysis, 38(4), 647–668. https://doi.org/https://doi.org/10.3102/0162373716652728
Web of Science ®Google Scholar
Danielson, C. (1996). Enhancing professional practice: A framework for teaching. ASCD.
Google Scholar
de Chaisemartin, C., & D’Haultfoeuille, X. (2018). Fuzzy difference-in-differences. The Review of Economic Studies, 85(2), 999–1028. https://doi.org/https://doi.org/10.1093/restud/rdx049
Google Scholar
de Chaisemartin, C., & D’Haultfoeuille, X. (2020). Two-way fixed effects estimators with heterogeneous treatment effects. American Economic Review, 110(9), 2964–2996. https://doi.org/https://doi.org/10.1257/aer.20181169
Web of Science ®Google Scholar
Dee, T. S., & Wyckoff, J. (2015). Incentives, selection, and teacher performance: Evidence from IMPACT. Journal of Policy Analysis and Management, 34(2), 267–297. https://doi.org/https://doi.org/10.1002/pam.21818
Web of Science ®Google Scholar
Deming, D. J., Cohodes, S., Jennings, J., & Jencks, C. (2016). School accountability, postsecondary attainment, and earnings. Review of Economics and Statistics, 98(5), 848–862. https://doi.org/https://doi.org/10.1162/REST_a_00598
Web of Science ®Google Scholar
Deming, D. J., & Figlio, D. (2016). Accountability in US education: Applying lessons from K–12 experience to higher education. Journal of Economic Perspectives, 30(3), 33–56. https://doi.org/https://doi.org/10.1257/jep.30.3.33
Web of Science ®Google Scholar
Dixit, A. (2002). Incentives and organizations in the public sector: An interpretive review. The Journal of Human Resources, 37(4), 696–727. https://doi.org/https://doi.org/10.2307/3069614
Web of Science ®Google Scholar
Donaldson, M. L. (2021). Multidisciplinary perspectives on teacher evaluation: Understanding the research and theory. Routledge.
Google Scholar
Donaldson, M. L., & Woulfin, S. (2018). From tinkering to going “rogue”: How principals use agency when enacting new teacher evaluation systems. Educational Evaluation and Policy Analysis, 40(4), 531–556. https://doi.org/https://doi.org/10.3102/0162373718784205
Web of Science ®Google Scholar
Duncan, A. (2012, July 23). The Tennessee Story. The Huffington Post. https://www.huffpost.com/entry/the-tennessee-story_b_1695467
Google Scholar
Eren, O. (2019). Teacher incentives and student achievement: Evidence from an advancement program. Journal of Policy Analysis and Management, 38(4), 867–890. https://doi.org/https://doi.org/10.1002/pam.22146
Web of Science ®Google Scholar
Ferman, B., & Pinto, C. (2019). Inference in differences-in-differences with few treated groups and heteroskedasticity. The Review of Economics and Statistics, 101(3), 452–467. https://doi.org/https://doi.org/10.1162/rest_a_00759
Web of Science ®Google Scholar
Figlio, D. N. (2006). Testing, crime and punishment. Journal of Public Economics, 90(4–5), 837–851. https://doi.org/https://doi.org/10.1016/J.JPUBECO.2005.01.003
Web of Science ®Google Scholar
Ford, T. G., Van Sickle, M. E., Clark, L. V., Fazio-Brunson, M., & Schween, D. C. (2017). Teacher self-efficacy, professional commitment, and high-stakes teacher evaluation policy in Louisiana. Educational Policy, 31(2), 202–248. https://doi.org/https://doi.org/10.1177/0895904815586855
Web of Science ®Google Scholar
Freyaldenhoven, S., Hansen, C., & Shapiro, J. M. (2019). Pre-event trends in the panel event-study design. American Economic Review, 109(9), 3307–3338. https://doi.org/https://doi.org/10.1257/aer.20180609
Web of Science ®Google Scholar
Garet, M. S., Wayne, A. J., Brown, S., Rickles, J., Song, M., & Manzeseke, D. (2017). The impact of providing performance feedback to teachers and principals (NCESS 2018-4001).
Google Scholar
Gibbons, C. E., Suárez Serrato, J. C., & Urbancic, M. B. (2018). Broken or fixed effects? Journal of Econometric Methods, 8(1). https://doi.org/https://doi.org/10.1515/jem-2017-0002.
Google Scholar
Gilmour, A. F., Majeika, C. E., Sheaffer, A. W., & Wehby, J. H. (2019). The coverage of classroom management in teacher evaluation rubrics. Teacher Education and Special Education, 42(2), 161–174. https://doi.org/https://doi.org/10.1177/0888406418781918
Web of Science ®Google Scholar
Goodman-Bacon, A. (2021). Difference-in-differences with variation in treatment timing. Journal of Econometrics, 225(2), 254–277. https://doi.org/https://doi.org/10.1016/j.jeconom.2021.03.014
Web of Science ®Google Scholar
Greflund, S., McIntosh, K., Mercer, S. H., & May, S. L. (2014). Examining disproportionality in school discipline for aboriginal students in schools implementing PBIS. Canadian Journal of School Psychology, 29(3), 213–235. https://doi.org/https://doi.org/10.1177/0829573514542214
Google Scholar
Hamilton, L. S., Berends, M., & Stecher, B. M. (2005). Teachers’ responses to standards-based accountability (Rand Working Papers No. WR-259-EDU), Santa Monica, CA. https://www.rand.org/pubs/working_papers/WR259.html
Google Scholar
Holbein, J. B., & Ladd, H. F. (2017). Accountability pressure: Regression discontinuity estimates of how No Child Left Behind influenced student behavior. Economics of Education Review, 58, 55–67. https://doi.org/https://doi.org/10.1016/J.ECONEDUREV.2017.03.005
Web of Science ®Google Scholar
Horner, R. H., Sugai, G., Smolkowski, K., Eber, L., Nakasato, J., Todd, A. W., & Esperanza, J. (2009). A randomized, wait-list controlled effectiveness trial assessing school-wide positive behavior support in elementary schools. Journal of Positive Behavior Interventions, 11(3), 133–144. https://doi.org/https://doi.org/10.1177/1098300709332067
Web of Science ®Google Scholar
Hoselton, R. (2018). SWIS 2017-18 summary report. Eugene, OR.
Google Scholar
IES. (2014). State requirements for teacher evaluation policies promoted by race to the top, Washington, DC. https://ies.ed.gov/ncee/pubs/20144016/pdf/20144016.pdf
Google Scholar
Imai, K., & Kim, I. S. (2020). On the use of two-way fixed effects regression models for causal inference with panel data. Political Analysis, 1–11. https://doi.org/https://doi.org/10.1017/pan.2020.33
Google Scholar
Jacob, R. T., Doolittle, F., Kemple, J., & Somers, M.-A. (2019). A framework for learning from null results. Educational Researcher, 48(9), 580–589. https://doi.org/https://doi.org/10.3102/0013189X19891955
Web of Science ®Google Scholar
Jacobs, S., & Doherty, K. (2015). State of the states 2015: Evaluating teaching, leading and learning.
Google Scholar
Kennedy-Lewis, B. L., & Murphy, A. S. (2016). Listening to “frequent flyers”: What persistently disciplined students have to say about being labeled as “bad. Teachers College Record: The Voice of Scholarship in Education, 118(1), 1–40. https://doi.org/https://doi.org/10.1177/016146811611800106
Web of Science ®Google Scholar
Kraft, M. A., Brunner, E. J., Dougherty, S. M., & Schwegman, D. J. (2020). Teacher accountability reforms and the supply and quality of new teachers. Journal of Public Economics, 188, 104212. https://doi.org/https://doi.org/10.1016/j.jpubeco.2020.104212
Web of Science ®Google Scholar
Kraft, M. A., & Gilmour, A. F. (2016). Can principals promote teacher development as evaluators? A case study of principals’ views and experiences. Educational Administration Quarterly, 52(5), 711–753. https://doi.org/https://doi.org/10.1177/0013161X16653445
PubMed Web of Science ®Google Scholar
Kraft, M. A., & Gilmour, A. F. (2017). Revisiting the widget effect: Teacher evaluation reforms and the distribution of teacher effectiveness. Educational Researcher, 46(5), 234–249. https://doi.org/https://doi.org/10.3102/0013189X17718797
Web of Science ®Google Scholar
Lacoe, J., & Steinberg, M. P. (2018). Do suspensions affect student outcomes? Educational Evaluation and Policy Analysis, 0(0). https://doi.org/https://doi.org/10.3102/0162373718794897
Google Scholar
Ladd, H. F., & Lauen, D. L. (2010). Status versus growth: The distributional effects of school accountability policies. Journal of Policy Analysis and Management, 29(3), 426–450. https://doi.org/https://doi.org/10.1002/pam.20504
Web of Science ®Google Scholar
Lazear, E. (2001). Educational production. The Quarterly Journal of Economics, 116(3), 777–803. https://doi.org/https://doi.org/10.1162/00335530152466232
Web of Science ®Google Scholar
Liebowitz, D. D. (2020). Teacher evaluation for growth and accountability: Under what conditions does it improve student outcomes? (Unpublished Working Paper). Eugene, OR. https://scholar.harvard.edu/files/dliebowitz/files/teacher_eval_review_oct_2020.pdf
Google Scholar
Loeb, S., Miller, L. C., & Wyckoff, J. (2015). Performance screens for school improvement. Educational Researcher, 44(4), 199–212. https://doi.org/https://doi.org/10.3102/0013189X15584773
Web of Science ®Google Scholar
Macartney, H. (2016). The dynamic effects of educational accountability. Journal of Labor Economics, 34(1), 1–28. https://doi.org/https://doi.org/10.1086/682333
Web of Science ®Google Scholar
Macartney, H., McMillan, R., & Petronijevic, U. (2019). Teacher value-added and economic agency (NBER Working Paper Series No. 24747). Cambridge, MA. https://doi.org/https://doi.org/10.3386/w24747
Google Scholar
Mashburn, A. J., Pianta, R. C., Hamre, B. K., Downer, J. T., Barbarin, O. A., Bryant, D., Burchinal, M., Early, D. M., & Howes, C. (2008). Measures of classroom quality in prekindergarten and children’s development of academic, language, and social skills. Child Development, 79(3), 732–749. https://doi.org/https://doi.org/10.1111/j.1467-8624.2008.01154.x.
PubMed Web of Science ®Google Scholar
McIntosh, K., Mercer, S., Hume, A., Frank, J. L., Turri, M., & Mathews, S. (2013). Factors related to sustained implementation of schoolwide positive behavior support. Exceptional Children, 79(3), 293–311.
Web of Science ®Google Scholar
Mercer, S. H., McIntosh, K., & Hoselton, R. (2017). Comparability of fidelity measures for assessing tier 1 school-wide positive behavioral interventions and supports. Journal of Positive Behavior Interventions, 19(4), 195–204. https://doi.org/https://doi.org/10.1177/1098300717693384
Web of Science ®Google Scholar
Miller, D., Shenhav, N., & Grosz, M. (2019). Selection into identification in fixed effects models, with application to head start (NBER Working Paper Series No. 26174). Cambridge, MA. https://doi.org/https://doi.org/10.3386/w26174
Google Scholar
NCTQ. (2011). State of the states: Trends and early lessons on teacher evaluation and effectiveness policies.
Google Scholar
NCTQ. (2016). State-by-state evaluation timeline briefs.
Google Scholar
NCTQ. (2017). State teacher policy database. https://www.nctq.org/yearbook
Google Scholar
Neal, D., & Schanzenbach, D. (2010). Left behind by design: Proficiency counts and test-based accountability. Review of Economics and Statistics, 92(2), 263–283. https://doi.org/https://doi.org/10.1162/rest.2010.12318
Web of Science ®Google Scholar
Ozek, U. (2012). One day too late? Mobile students in the era of accountability (CALDER Working Paper Series No. 82). caldercenter.org/sites/default/files/WP 82 Final.pdf
Google Scholar
Phipps, A., & Wiseman, E. (2019). Enacting the rubric: Teacher improvements in windows of high-stakes observation. Education Finance and Policy, 1–51. https://doi.org/https://doi.org/10.1162/edfp_a_00295.
Google Scholar
Phipps, A. R. (2021). Unintended consequences of teacher performance pay: A theory on incentives and evidence from Washington, D.C. (unpublished Working Paper). https://sites.google.com/view/aaronphippsecon/research
Google Scholar
Pope, N. G. (2019). The effect of teacher ratings on teacher performance. Journal of Public Economics, 172, 84–110. https://doi.org/https://doi.org/10.1016/J.JPUBECO.2019.01.001
Web of Science ®Google Scholar
Rafa, A. (2019). The status of school discipline in state policy. Denver, CO. www.ecs.org/wp-content/uploads/The-Status-of-School-Discipline-in-State-Policy.pdf
Google Scholar
Reback, R. (2008). Teaching to the rating: School accountability and the distribution of student achievement. Journal of Public Economics, 92(5–6), 1394–1415. https://doi.org/https://doi.org/10.1016/J.JPUBECO.2007.05.003
Web of Science ®Google Scholar
Reback, R., Rockoff, J., & Schwartz, H. L. (2014). Under pressure: Job security, resource allocation, and productivity in schools under No Child Left Behind. American Economic Journal: Economic Policy, 6(3), 207–241. https://doi.org/https://doi.org/10.1257/pol.6.3.207
Web of Science ®Google Scholar
Rothstein, J. (2015). Teacher quality policy when supply matters. American Economic Review, 105(1), 100–130. https://doi.org/https://doi.org/10.1257/aer.20121242
Web of Science ®Google Scholar
Skiba, R. J. (2015). Interventions to address racial/ethnic disparities in school discipline: Can systems reform be race-neutral? In Race and social problems (pp. 107–124). New York: Springer.
Google Scholar
Sorensen, L. C., Bushway, S. D., & Gifford, E. J. (2021). Getting tough? The effects of discretionary principal discipline on student outcomes. Education Finance and Policy, 1–74. https://doi.org/https://doi.org/10.1162/edfp_a_00341
Google Scholar
Stecher, B., Holtzman, D., Garet, M., Hamilton, L., Engberg, J., Steiner, E., … Chambers, J. (2018). Improving teaching effectiveness: Final report: The intensive partnerships for effective teaching through 2015–2016. RAND Corporation.
Google Scholar
Steinberg, M. P., & Donaldson, M. L. (2016). The new educational accountability: Understanding the landscape of teacher evaluation in the post-NCLB era. Education Finance and Policy, 11(3), 340–359. https://doi.org/https://doi.org/10.1162/EDFP_a_00186
Web of Science ®Google Scholar
Steinberg, M. P., & Lacoe, J. (2018). Reforming school discipline: School-level policy implementation and the consequences for suspended students and their peers. American Journal of Education, 125(1), 29–77. https://doi.org/https://doi.org/10.1086/699811
Web of Science ®Google Scholar
Steinberg, M. P., & Sartain, L. (2015). Does teacher evaluation improve school performance? Experimental evidence from Chicago’s excellence in teaching project. Education Finance and Policy, 10(4), 535–572. https://doi.org/https://doi.org/10.1162/EDFP_a_00173
Web of Science ®Google Scholar
Strunk, K. O., Barrett, N., & Lincove, J. A. (2017). When tenure ends: The short-run effects of the elimination of Louisiana’s teacher employment protections on teacher exit and retirement. https://educationresearchalliancenola.org/files/publications/041217-Strunk-Barrett-Lincove-When-Tenure-Ends.pdf
Google Scholar
Taylor, E. S., & Tyler, J. H. (2012). The effect of evaluation on teacher performance. American Economic Review, 102(7), 3628–3651. https://doi.org/https://doi.org/10.1257/aer.102.7.3628
Web of Science ®Google Scholar
U.S. Department of Education, & U.S. Department of Justice. (2014). Dear colleague letter on the nondiscriminatory administration of school discipline.
Google Scholar
Vogell, H. (2011, July 26). Investigation into APS cheating finds unethical behavior across every level. Atlanta Journal-Constitution, p. 1. https://www.ajc.com/news/local/investigation-into-aps-cheating-finds-unethical-behavior-across-every-level
Google Scholar
Welsh, R. O., & Little, S. (2018). The school discipline dilemma: A comprehensive review of disparities and alternative approaches. Review of Educational Research, 88(5), 752–794. https://doi.org/https://doi.org/10.3102/0034654318791582
Web of Science ®Google Scholar
Wieczorek, D., Clark, B., & Theoharis, G. (2019). Principals’ instructional feedback practices during race to the top. Leadership and Policy in Schools, 18(3), 357–381. https://doi.org/https://doi.org/10.1080/15700763.2017.1398336
Web of Science ®Google Scholar
Winters, M. A., & Cowen, J. M. (2013). Who would stay, who would be dismissed? An empirical consideration of value-added teacher retention policies. Educational Researcher, 42(6), 330–337. https://doi.org/https://doi.org/10.3102/0013189X13496145
Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

The Effects of Higher-Stakes Teacher Evaluation on Office Disciplinary Referrals

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

The Effects of Higher-Stakes Teacher Evaluation on Office Disciplinary Referrals

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date