References
- Atkinson, M. J., & Wade, T. D. (2015). Mindfulness-based prevention for eating disorders: A school-based cluster randomized controlled study. International Journal of Eating Disorders, 48(7), 1024–1037. https://doi.org/10.1002/eat.22416
- Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), v067i01. https://doi.org/10.18637/jss.v067.i01
- Berk, R. A. (2005). Randomized experiments as the bronze standard. Journal of Experimental Criminology, 1(4), 417–433.
- Bloom, H. S. (2005). Randomizing groups to evaluate place-based programs. In H. S. Bloom (Ed.), Learning more from social experiments: Evolving analytic approaches (pp. 115–172). Russell Sage Foundation.
- Bloom, H. S., Bos, J. M., & Lee, S. W. (1999). Using cluster random assignment to measure program impacts. Statistical implications for the evaluation of education programs. Evaluation Review, 23(4), 445–469. https://doi.org/10.1177/0193841X9902300405
- Bloom, H. S., Richburg-Hayes, L., & Black, A. R. (2007). Using covariates to improve precision: Empirical guidance for studies that randomize schools to measure the impacts of educational interventions. Educational Evaluation and Policy Analysis, 29(1), 30–59. https://doi.org/10.3102/0162373707299550
- Bradshaw, C. P., Koth, C. W., Bevans, K. B., Ialongo, N., & Leaf, P. J. (2008). The impact of school-wide positive behavioral interventions and supports (PBIS) on the organizational health of elementary schools. School Psychology Quarterly, 23(4), 462–473. https://doi.org/10.1037/a0012883
- Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Academic Press.
- Cohen, J. (1992). A power primer. Psychological Bulletin, 112(1), 155–159. https://doi.org/10.1037/0033-2909.112.1.155
- Cox, K., & Kelcey, B. (2019). Optimal design of cluster- and multisite-randomized studies using fallible outcome measures. Evaluation Review, 43(3–4), 189–225. https://doi.org/10.1177/0193841X19870878
- Dong, N., Kelcey, B., & Spybrook, J. (2018). Power analyses for moderator effects in three-level cluster randomized trials. The Journal of Experimental Education, 86(3), 489–514. https://doi.org/10.1080/00220973.2017.1315714
- Fitzmaurice, G. M., Laird, N. M., & Ware, J. H. (2011). Applied longitudinal data analysis (2nd ed.). John Wiley & Sons, Inc.
- Glass, J. E., Bobb, J. F., Lee, A. K., Richards, J. E., Lapham, G. T., Ludman, E., Achtmeyer, C., Caldeiro, R. M., Parrish, R., Williams, E. C., Lozano, P., & Bradley, K. A. (2018). Study protocol: A cluster-randomized trial implementing sustained patient-centered alcohol-related care (SPARC trial). Implementation Science, 13(1), 108. https://doi.org/10.1186/s13012-018-0795-9
- Groves, R. M., Fowler, F. J., Couper, M. P., Lepkowski, J. M., Singer, E., & Tourangeau, R. (2009). Survey methodology (2nd ed.). John Wiley & Sons, Inc.
- Hedges, L., & Hedberg, E. C. (2007). Intraclass correlation values for planning group-randomized trials in education. Educational Evaluation and Policy Analysis, 29(1), 60–87. https://doi.org/10.3102/0162373707299706
- Hill, C. J., Bloom, H. S., Black, A. R., & Lipsey, M. W. (2007). Empirical benchmarks for interpreting effect sizes in research. MDRC. https://www.mdrc.org/sites/default/files/full_84.pdf
- Hox, J. J., & Maas, C. J. M. (2001). The accuracy of multilevel structural equation modeling with pseudobalanced groups and small samples. Structural Equation Modeling: A Multidisciplinary Journal, 8(2), 157–174. https://doi.org/10.1207/S15328007SEM0802_1
- Hoyle, R., & Gottfredson, N. C. (2015). Sample size considerations in prevention research applications of multilevel modeling and structural equation modeling. Prevention Science: The Official Journal of the Society for Prevention Research, 16(7), 987–996. https://doi.org/10.1007/s11121-014-0489-8
- Institute of Education Sciences. (2003). Identifying and implementing educational practices supported by rigorous evidence: A user friendly guide. Coalition for Evidence-Based Policy. https://ies.ed.gov/ncee/pdf/evidence_based.pdf
- Kelcey, B., Spybrook, J., & Dong, N. (2019). Sample size planning for cluster-randomized interventions probing multilevel mediation. Prevention Science: The Official Journal of the Society for Prevention Research, 20(3), 407–418. https://doi.org/10.1007/s11121-018-0921-6
- Kirk, R. (1982). Experimental design: Procedures for the behavioral sciences (2nd ed.). Brooks/Cole.
- Konstantopoulos, S. (2010). Power analysis in two-level unbalanced designs. The Journal of Experimental Education, 78(3), 291–317. https://doi.org/10.1080/00220970903292876
- Konstantopoulos, S. (2012). The impact of covariates on statistical power in cluster randomized designs: Which level matters more? Multivariate Behavioral Research, 47(3), 392–420. https://doi.org/10.1080/00273171.2012.673898
- Lam, A. C., Ruzek, E. A., Schenke, K., Conley, A. M., & Karabenick, S. A. (2015). Student perceptions of classroom achievement goal structure: Is it appropriate to aggregate? Journal of Educational Psychology, 107(4), 1102–1115.
- Liu, X. (2003). Statistical power and optimum sample allocation ratio for treatment and control having unequal costs per unit of randomization. Journal of Educational and Behavioral Statistics, 28(3), 231–248. https://doi.org/10.3102/10769986028003231
- Maas, C. J., & Hox, J. J. (2005). Sufficient sample sizes for multilevel modeling. Methodology, 1(3), 86–92. https://doi.org/10.1027/1614-2241.1.3.86
- Manatunga, A. K., Hudgens, M. G., & Chen, S. (2001). Sample size estimation in cluster randomized studies with varying cluster size. Biometrical Journal, 43(1), 75–86. https://doi.org/10.1002/1521-4036(200102)43:1<75::AID-BIMJ75>3.0.CO;2-N
- Murray, D. M. (1998). Design and analysis of group-randomized trials. Oxford University Press, Inc.
- Murray, D. M., & Short, B. (1995). Intra-class correlation among measures related to alcohol use by young adults: Estimates, correlates, and applications in intervention studies. Journal of Studies on Alcohol, 56(6), 681–692. https://doi.org/10.15288/jsa.1995.56.681
- R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
- Raudenbush, S. W. (1993). Hierarchical linear models and experimental design. In L. K. Edwards (Ed.), Applied analysis of variance in behavioral science (pp. 459–495). Marcel Dekker.
- Raudenbush, S. W. (1997). Statistical analysis and optimal design for cluster randomized trials. Psychological Methods, 2(2), 173–185. https://doi.org/10.1037/1082-989X.2.2.173
- Raudenbush, S. W., & Liu, X. (2000). Statistical power and optimal design for multisite randomized trials. Psychological Methods, 5(2), 199–213. https://doi.org/10.1037/1082-989x.5.2.199
- Resnicow, K., Davis, M., Smith, M., Baranowski, T., Lin, L. S., Baranowski, J., Doyle, C., & Wang, D. T. (1998). Results of the TeachWell worksite wellness program. American Journal of Public Health, 88(2), 250–257. https://doi.org/10.2105/AJPH.88.2.250
- Scherbaum, C. A., & Ferreter, J. M. (2009). Estimating statistical power and required sample sizes for organizational research using multilevel modeling. Organizational Research Methods, 12(2), 347–367.
- Schochet, P. Z. (2008). Statistical power for randomized assignment evaluation of education programs. Journal of Educational and Behavioral Statistics, 33(1), 62–87. https://doi.org/10.3102/1076998607302714
- Shadish, W. R., Galindo, R., Wong, V. C., Steiner, P. M., & Cook, T. D. (2011). A randomized experiment comparing random and cutoff-based assignment. Psychological Methods, 16(2), 179–191. https://doi.org/10.1037/a0023345
- Snijders, T. A. B., & Bosker, R. J. (1993). Standard errors and sample sizes for two-level research. Journal of Educational Statistics, 18(3), 237–259. https://doi.org/10.3102/10769986018003237
- Snijders, T., & Bosker, R. (1999). Multilevel analysis: An introduction to basic and advanced multilevel modeling. SAGE.
- Spybrook, J. (2014). Detecting intervention effects across context: An examination of the precision of cluster randomized trials. The Journal of Experimental Education, 82(3), 334–357. https://doi.org/10.1080/00220973.2013.813364
- Spybrook, J., Shi, R., & Kelcey, B. (2016). Progress in the past decade: An examination of the precision of cluster randomized trials funded by the U.S. Institute of Education Sciences. International Journal of Research & Method in Education, 39(3), 255–267.
- Spybrook, J., Bloom, H., Congdon, R., Hill, C., Martinez, A., & Raudenbush, S. (2011). Optimal design plus empirical evidence: Documentation for the “Optimal Design” software. http://hlmsoft.net/od/od-manual-20111016-v300.pdf
- Tolan, P., Elreda, L. M., Bradshaw, C. P., Downer, J. T., & Ialongo, N. (2020). Randomized trial testing the integration of the Good Behavior Game and MyTeachingPartner™: The moderating role of distress among new teachers on student outcomes. Journal of School Psychology, 78, 75–95. https://doi.org/10.1016/j.jsp.2019.12.002
- Usami, S. (2014). Generalized sample size determination formulas for experimental research with hierarchical data. Behavior Research Methods, 46(2), 346–356. https://doi.org/10.3758/s13428-013-0387-1
- Van Breukelen, G. J. P., Candel, M. J. J., & Berger, M. P. F. (2007). Relative efficiency of unequal versus equal cluster sizes in cluster randomized and multicenter trials. Statistics in Medicine, 26(13), 2589–2603. https://doi.org/10.1002/sim.2740