References
- Abadie, A., Athey, S., Imbens, G., & Wooldridge, J. (2017). When should you adjust standard errors for clustering? NBER Working Paper Series No. 24003, Cambridge, MA.
- Anderson, K. P. (2020). Academic, attendance, and behavioral outcomes of a suspension reduction policy: Lessons for school leaders and policy makers. Educational Administration Quarterly, 56(3), 435–471. https://doi.org/10.1177/0013161X19861138
- Athey, S., & Imbens, G. (2018). Design-based analysis in difference-in-differences settings with staggered adoption. NBER Working Paper Series No. 24963, Cambridge, MA. https://doi.org/10.3386/w24963
- Bacher-Hicks, A., Billings, S. B., & Deming, D. J. (2019). The school to prison pipeline: Long-run impacts of school suspensions on adult crime. NBER Working Paper Series No. 26257, Cambridge, MA.
- Bertrand, M., Duflo, E., & Mullainathan, S. (2004). How much should we trust differences-in-differences estimates? The Quarterly Journal of Economics, 119(1), 249–275. https://doi.org/10.1162/003355304772839588
- Bezinque, A., Garcia, K., Darling, K., & Stuart-Cassel, V. (2018). Compendium of school discipline laws and regulations for the 50 states, Washington, D.C. and the U.S. territories. Washington, DC. http://safesupportivelearning.ed.gov/school-discipline-compendium
- Borusyak, K., & Jaravel, X. (2017). Revisiting event study designs (SSRN Working Paper). https://doi.org/10.2139/ssrn.2826228
- Bragg, D. (2019). School-wide information system: Dataset D0098. University of Oregon.
- Brehm, M., Imberman, S. A., & Lovenheim, M. F. (2017). Achievement effects of individual performance incentives in a teacher merit pay tournament. Labour Economics, 44, 133–150. https://doi.org/10.1016/j.labeco.2016.12.008
- Burgess, S., Rawal, S., & Taylor, E. S. (2021). Teacher peer observation and student test scores: Evidence from a field experiment in English secondary schools. Journal of Labor Economics, 39(4), 1155–1186. https://doi.org/10.1086/712997
- Callaway, B., & Sant’Anna, P. H. C. (2021). Difference-in-differences with multiple time periods. Journal of Econometrics, 225(2), 200–230. https://doi.org/10.1016/j.jeconom.2020.12.001
- Cameron, A., Gelbach, J. B., & Miller, D. L. (2011). Robust inference with multiway clustering. Journal of Business & Economic Statistics, 29(2), 238–249. https://doi.org/10.1198/jbes.2010.07136
- Carrell, S. E., & Hoekstra, M. L. (2010). Externalities in the classroom: How children exposed to domestic violence affect everyone’s kids. American Economic Journal: Applied Economics, 2(1), 211–228. https://doi.org/10.1257/app.2.1.211
- Chakrabarti, R. (2014). Incentives and responses under No Child Left Behind: Credible threats and the role of competition. Journal of Public Economics, 110, 124–146. https://doi.org/10.1016/J.JPUBECO.2013.08.005
- Chiang, H. (2009). How accountability pressure on failing schools affects student achievement. Journal of Public Economics, 93(9–10), 1045–1057. https://doi.org/10.1016/J.JPUBECO.2009.06.002
- Cohen, J., Hutt, E., Berlin, R., & Wiseman, E. (2020). The change we cannot see: Instructional quality and classroom observation in the era of Common Core. Educational Policy, 1–27. https://doi.org/10.1177/0895904820951114
- Connally, K., & Tooley, M. (2016). Beyond ratings: Re-envisioning state teacher evaluation systems as tools for professional growth.
- Cullen, J. B., Koedel, C., & Parsons, E. (2019). The compositional effect of rigorous teacher evaluation on workforce quality. Education Finance and Policy, 1–85. https://doi.org/10.1162/edfp_a_00292
- Curran, F. C. (2016). Estimating the effect of state zero tolerance laws on exclusionary discipline, racial discipline gaps, and student behavior. Educational Evaluation and Policy Analysis, 38(4), 647–668. https://doi.org/10.3102/0162373716652728
- Danielson, C. (1996). Enhancing professional practice: A framework for teaching. ASCD.
- de Chaisemartin, C., & D’Haultfoeuille, X. (2018). Fuzzy difference-in-differences. The Review of Economic Studies, 85(2), 999–1028. https://doi.org/10.1093/restud/rdx049
- de Chaisemartin, C., & D’Haultfoeuille, X. (2020). Two-way fixed effects estimators with heterogeneous treatment effects. American Economic Review, 110(9), 2964–2996. https://doi.org/10.1257/aer.20181169
- Dee, T. S., & Wyckoff, J. (2015). Incentives, selection, and teacher performance: Evidence from IMPACT. Journal of Policy Analysis and Management, 34(2), 267–297. https://doi.org/10.1002/pam.21818
- Deming, D. J., Cohodes, S., Jennings, J., & Jencks, C. (2016). School accountability, postsecondary attainment, and earnings. Review of Economics and Statistics, 98(5), 848–862. https://doi.org/10.1162/REST_a_00598
- Deming, D. J., & Figlio, D. (2016). Accountability in US education: Applying lessons from K–12 experience to higher education. Journal of Economic Perspectives, 30(3), 33–56. https://doi.org/10.1257/jep.30.3.33
- Dixit, A. (2002). Incentives and organizations in the public sector: An interpretive review. The Journal of Human Resources, 37(4), 696–727. https://doi.org/10.2307/3069614
- Donaldson, M. L. (2021). Multidisciplinary perspectives on teacher evaluation: Understanding the research and theory. Routledge.
- Donaldson, M. L., & Woulfin, S. (2018). From tinkering to going “rogue”: How principals use agency when enacting new teacher evaluation systems. Educational Evaluation and Policy Analysis, 40(4), 531–556. https://doi.org/10.3102/0162373718784205
- Duncan, A. (2012, July 23). The Tennessee Story. The Huffington Post. https://www.huffpost.com/entry/the-tennessee-story_b_1695467
- Eren, O. (2019). Teacher incentives and student achievement: Evidence from an advancement program. Journal of Policy Analysis and Management, 38(4), 867–890. https://doi.org/10.1002/pam.22146
- Ferman, B., & Pinto, C. (2019). Inference in differences-in-differences with few treated groups and heteroskedasticity. The Review of Economics and Statistics, 101(3), 452–467. https://doi.org/10.1162/rest_a_00759
- Figlio, D. N. (2006). Testing, crime and punishment. Journal of Public Economics, 90(4–5), 837–851. https://doi.org/10.1016/J.JPUBECO.2005.01.003
- Ford, T. G., Van Sickle, M. E., Clark, L. V., Fazio-Brunson, M., & Schween, D. C. (2017). Teacher self-efficacy, professional commitment, and high-stakes teacher evaluation policy in Louisiana. Educational Policy, 31(2), 202–248. https://doi.org/10.1177/0895904815586855
- Freyaldenhoven, S., Hansen, C., & Shapiro, J. M. (2019). Pre-event trends in the panel event-study design. American Economic Review, 109(9), 3307–3338. https://doi.org/10.1257/aer.20180609
- Garet, M. S., Wayne, A. J., Brown, S., Rickles, J., Song, M., & Manzeske, D. (2017). The impact of providing performance feedback to teachers and principals (NCEE 2018-4001).
- Gibbons, C. E., Suárez Serrato, J. C., & Urbancic, M. B. (2018). Broken or fixed effects? Journal of Econometric Methods, 8(1). https://doi.org/10.1515/jem-2017-0002
- Gilmour, A. F., Majeika, C. E., Sheaffer, A. W., & Wehby, J. H. (2019). The coverage of classroom management in teacher evaluation rubrics. Teacher Education and Special Education, 42(2), 161–174. https://doi.org/10.1177/0888406418781918
- Goodman-Bacon, A. (2021). Difference-in-differences with variation in treatment timing. Journal of Econometrics, 225(2), 254–277. https://doi.org/10.1016/j.jeconom.2021.03.014
- Greflund, S., McIntosh, K., Mercer, S. H., & May, S. L. (2014). Examining disproportionality in school discipline for Aboriginal students in schools implementing PBIS. Canadian Journal of School Psychology, 29(3), 213–235. https://doi.org/10.1177/0829573514542214
- Hamilton, L. S., Berends, M., & Stecher, B. M. (2005). Teachers’ responses to standards-based accountability (RAND Working Papers No. WR-259-EDU), Santa Monica, CA. https://www.rand.org/pubs/working_papers/WR259.html
- Holbein, J. B., & Ladd, H. F. (2017). Accountability pressure: Regression discontinuity estimates of how No Child Left Behind influenced student behavior. Economics of Education Review, 58, 55–67. https://doi.org/10.1016/J.ECONEDUREV.2017.03.005
- Horner, R. H., Sugai, G., Smolkowski, K., Eber, L., Nakasato, J., Todd, A. W., & Esperanza, J. (2009). A randomized, wait-list controlled effectiveness trial assessing school-wide positive behavior support in elementary schools. Journal of Positive Behavior Interventions, 11(3), 133–144. https://doi.org/10.1177/1098300709332067
- Hoselton, R. (2018). SWIS 2017-18 summary report. Eugene, OR.
- IES. (2014). State requirements for teacher evaluation policies promoted by Race to the Top, Washington, DC. https://ies.ed.gov/ncee/pubs/20144016/pdf/20144016.pdf
- Imai, K., & Kim, I. S. (2020). On the use of two-way fixed effects regression models for causal inference with panel data. Political Analysis, 1–11. https://doi.org/10.1017/pan.2020.33
- Jacob, R. T., Doolittle, F., Kemple, J., & Somers, M.-A. (2019). A framework for learning from null results. Educational Researcher, 48(9), 580–589. https://doi.org/10.3102/0013189X19891955
- Jacobs, S., & Doherty, K. (2015). State of the states 2015: Evaluating teaching, leading and learning.
- Kennedy-Lewis, B. L., & Murphy, A. S. (2016). Listening to “frequent flyers”: What persistently disciplined students have to say about being labeled as “bad.” Teachers College Record: The Voice of Scholarship in Education, 118(1), 1–40. https://doi.org/10.1177/016146811611800106
- Kraft, M. A., Brunner, E. J., Dougherty, S. M., & Schwegman, D. J. (2020). Teacher accountability reforms and the supply and quality of new teachers. Journal of Public Economics, 188, 104212. https://doi.org/10.1016/j.jpubeco.2020.104212
- Kraft, M. A., & Gilmour, A. F. (2016). Can principals promote teacher development as evaluators? A case study of principals’ views and experiences. Educational Administration Quarterly, 52(5), 711–753. https://doi.org/10.1177/0013161X16653445
- Kraft, M. A., & Gilmour, A. F. (2017). Revisiting the widget effect: Teacher evaluation reforms and the distribution of teacher effectiveness. Educational Researcher, 46(5), 234–249. https://doi.org/10.3102/0013189X17718797
- Lacoe, J., & Steinberg, M. P. (2018). Do suspensions affect student outcomes? Educational Evaluation and Policy Analysis, 0(0). https://doi.org/10.3102/0162373718794897
- Ladd, H. F., & Lauen, D. L. (2010). Status versus growth: The distributional effects of school accountability policies. Journal of Policy Analysis and Management, 29(3), 426–450. https://doi.org/10.1002/pam.20504
- Lazear, E. (2001). Educational production. The Quarterly Journal of Economics, 116(3), 777–803. https://doi.org/10.1162/00335530152466232
- Liebowitz, D. D. (2020). Teacher evaluation for growth and accountability: Under what conditions does it improve student outcomes? (Unpublished Working Paper). Eugene, OR. https://scholar.harvard.edu/files/dliebowitz/files/teacher_eval_review_oct_2020.pdf
- Loeb, S., Miller, L. C., & Wyckoff, J. (2015). Performance screens for school improvement. Educational Researcher, 44(4), 199–212. https://doi.org/10.3102/0013189X15584773
- Macartney, H. (2016). The dynamic effects of educational accountability. Journal of Labor Economics, 34(1), 1–28. https://doi.org/10.1086/682333
- Macartney, H., McMillan, R., & Petronijevic, U. (2019). Teacher value-added and economic agency (NBER Working Paper Series No. 24747). Cambridge, MA. https://doi.org/10.3386/w24747
- Mashburn, A. J., Pianta, R. C., Hamre, B. K., Downer, J. T., Barbarin, O. A., Bryant, D., Burchinal, M., Early, D. M., & Howes, C. (2008). Measures of classroom quality in prekindergarten and children’s development of academic, language, and social skills. Child Development, 79(3), 732–749. https://doi.org/10.1111/j.1467-8624.2008.01154.x
- McIntosh, K., Mercer, S., Hume, A., Frank, J. L., Turri, M., & Mathews, S. (2013). Factors related to sustained implementation of schoolwide positive behavior support. Exceptional Children, 79(3), 293–311.
- Mercer, S. H., McIntosh, K., & Hoselton, R. (2017). Comparability of fidelity measures for assessing tier 1 school-wide positive behavioral interventions and supports. Journal of Positive Behavior Interventions, 19(4), 195–204. https://doi.org/10.1177/1098300717693384
- Miller, D., Shenhav, N., & Grosz, M. (2019). Selection into identification in fixed effects models, with application to Head Start (NBER Working Paper Series No. 26174). Cambridge, MA. https://doi.org/10.3386/w26174
- NCTQ. (2011). State of the states: Trends and early lessons on teacher evaluation and effectiveness policies.
- NCTQ. (2016). State-by-state evaluation timeline briefs.
- NCTQ. (2017). State teacher policy database. https://www.nctq.org/yearbook
- Neal, D., & Schanzenbach, D. (2010). Left behind by design: Proficiency counts and test-based accountability. Review of Economics and Statistics, 92(2), 263–283. https://doi.org/10.1162/rest.2010.12318
- Ozek, U. (2012). One day too late? Mobile students in the era of accountability (CALDER Working Paper Series No. 82). caldercenter.org/sites/default/files/WP 82 Final.pdf
- Phipps, A., & Wiseman, E. (2019). Enacting the rubric: Teacher improvements in windows of high-stakes observation. Education Finance and Policy, 1–51. https://doi.org/10.1162/edfp_a_00295
- Phipps, A. R. (2021). Unintended consequences of teacher performance pay: A theory on incentives and evidence from Washington, D.C. (Unpublished Working Paper). https://sites.google.com/view/aaronphippsecon/research
- Pope, N. G. (2019). The effect of teacher ratings on teacher performance. Journal of Public Economics, 172, 84–110. https://doi.org/10.1016/J.JPUBECO.2019.01.001
- Rafa, A. (2019). The status of school discipline in state policy. Denver, CO. www.ecs.org/wp-content/uploads/The-Status-of-School-Discipline-in-State-Policy.pdf
- Reback, R. (2008). Teaching to the rating: School accountability and the distribution of student achievement. Journal of Public Economics, 92(5–6), 1394–1415. https://doi.org/10.1016/J.JPUBECO.2007.05.003
- Reback, R., Rockoff, J., & Schwartz, H. L. (2014). Under pressure: Job security, resource allocation, and productivity in schools under No Child Left Behind. American Economic Journal: Economic Policy, 6(3), 207–241. https://doi.org/10.1257/pol.6.3.207
- Rothstein, J. (2015). Teacher quality policy when supply matters. American Economic Review, 105(1), 100–130. https://doi.org/10.1257/aer.20121242
- Skiba, R. J. (2015). Interventions to address racial/ethnic disparities in school discipline: Can systems reform be race-neutral? In Race and social problems (pp. 107–124). New York: Springer.
- Sorensen, L. C., Bushway, S. D., & Gifford, E. J. (2021). Getting tough? The effects of discretionary principal discipline on student outcomes. Education Finance and Policy, 1–74. https://doi.org/10.1162/edfp_a_00341
- Stecher, B., Holtzman, D., Garet, M., Hamilton, L., Engberg, J., Steiner, E., … Chambers, J. (2018). Improving teaching effectiveness: Final report: The intensive partnerships for effective teaching through 2015–2016. RAND Corporation.
- Steinberg, M. P., & Donaldson, M. L. (2016). The new educational accountability: Understanding the landscape of teacher evaluation in the post-NCLB era. Education Finance and Policy, 11(3), 340–359. https://doi.org/10.1162/EDFP_a_00186
- Steinberg, M. P., & Lacoe, J. (2018). Reforming school discipline: School-level policy implementation and the consequences for suspended students and their peers. American Journal of Education, 125(1), 29–77. https://doi.org/10.1086/699811
- Steinberg, M. P., & Sartain, L. (2015). Does teacher evaluation improve school performance? Experimental evidence from Chicago’s Excellence in Teaching Project. Education Finance and Policy, 10(4), 535–572. https://doi.org/10.1162/EDFP_a_00173
- Strunk, K. O., Barrett, N., & Lincove, J. A. (2017). When tenure ends: The short-run effects of the elimination of Louisiana’s teacher employment protections on teacher exit and retirement. https://educationresearchalliancenola.org/files/publications/041217-Strunk-Barrett-Lincove-When-Tenure-Ends.pdf
- Taylor, E. S., & Tyler, J. H. (2012). The effect of evaluation on teacher performance. American Economic Review, 102(7), 3628–3651. https://doi.org/10.1257/aer.102.7.3628
- U.S. Department of Education, & U.S. Department of Justice. (2014). Dear colleague letter on the nondiscriminatory administration of school discipline.
- Vogell, H. (2011, July 26). Investigation into APS cheating finds unethical behavior across every level. Atlanta Journal-Constitution, p. 1. https://www.ajc.com/news/local/investigation-into-aps-cheating-finds-unethical-behavior-across-every-level
- Welsh, R. O., & Little, S. (2018). The school discipline dilemma: A comprehensive review of disparities and alternative approaches. Review of Educational Research, 88(5), 752–794. https://doi.org/10.3102/0034654318791582
- Wieczorek, D., Clark, B., & Theoharis, G. (2019). Principals’ instructional feedback practices during Race to the Top. Leadership and Policy in Schools, 18(3), 357–381. https://doi.org/10.1080/15700763.2017.1398336
- Winters, M. A., & Cowen, J. M. (2013). Who would stay, who would be dismissed? An empirical consideration of value-added teacher retention policies. Educational Researcher, 42(6), 330–337. https://doi.org/10.3102/0013189X13496145