References
- Agresti, A. (2007). An introduction to categorical data analysis (2nd ed). Hoboken, NJ: Wiley.
- Allison, P. (2004). Convergence problems in logistic regression. In M. Altman, J. Gill, & M. McDonald (Eds.), Numerical issues in statistical computing for the social scientist (pp. 238–252). New York, NY: Wiley.
- Altman, D. G., & Bland, J. M. (1994). Statistics notes: Quartiles, quintiles, centiles, and other quantiles. BMJ, 309(6960), 996–996. doi:https://doi.org/10.1136/bmj.309.6960.996
- Angrist, J. D. (2006). Instrumental variables methods in experimental criminological research: What, why and how. Journal of Experimental Criminology, 2(1), 23–44. doi:https://doi.org/10.1007/s11292-005-5126-x
- Angrist, J. D., & Pischke, J.-S. (2014). Mastering ’metrics: The path from cause to effect. Princeton, NJ: Princeton University Press.
- Barros, A. J., & Hirakata, V. N. (2003). Alternatives for logistic regression in cross-sectional studies: An empirical comparison of models that directly estimate the prevalence ratio. BMC Medical Research Methodology, 3(1), 21. doi:https://doi.org/10.1186/1471-2288-3-21
- Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 48. doi:https://doi.org/10.18637/jss.v067.i01
- Blizzard, L., & Hosmer, W. (2006). Parameter estimation and goodness-of-fit in log binomial regression. Biometrical Journal, 48(1), 5–22. doi:https://doi.org/10.1002/bimj.200410165
- Carsey, T. M., & Harden, J. J. (2013). Monte Carlo simulation and resampling methods for social science. Thousand Oaks, CA: Sage Publications.
- Carter, R. E., Lipsitz, S. R., & Tilley, B. C. (2005). Quasi-likelihood estimation for relative risk regression models. Biostatistics, 6(1), 39–44. doi:https://doi.org/10.1093/biostatistics/kxh016
- Chen, H., Cohen, P., & Chen, S. (2010). How big is a big odds ratio? Interpreting the magnitudes of odds ratios in epidemiological studies. Communications in Statistics - Simulation and Computation, 39(4), 860–864. doi:https://doi.org/10.1080/03610911003650383
- Chen, W., Qian, L., Shi, J., & Franklin, M. (2018). Comparing performance between log-binomial and robust Poisson regression models for estimating risk ratios under model misspecification. BMC Medical Research Methodology, 18(1), 1–12. doi:https://doi.org/10.1186/s12874-018-0519-5
- Cheung, Y. B. (2007). A modified least-squares regression approach to the estimation of risk difference. American Journal of Epidemiology, 166(11), 1337–1344. doi:https://doi.org/10.1093/aje/kwm223
- Clarke, P. (2008). When can group level clustering be ignored? Multilevel models versus single-level models with sparse data. Journal of Epidemiology & Community Health, 62(8), 752–758. doi:https://doi.org/10.1136/jech.2007.060798
- Cleary, P. D., & Angel, R. (1984). The analysis of relationships involving dichotomous dependent variables. Journal of Health and Social Behavior, 25(3), 334–348. doi:https://doi.org/10.2307/2136429
- Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences. Mahwah, NJ: Lawrence Erlbaum Associates.
- Coxe, S., West, S. G., & Aiken, L. S. (2009). The analysis of count data: A gentle introduction to Poisson regression and its alternatives. Journal of Personality Assessment, 91(2), 121–136. doi:https://doi.org/10.1080/00223890802634175
- Cummings, P. (2009). The relative merits of risk ratios and odds ratios. Archives of Pediatrics & Adolescent Medicine, 163, 438–445. doi:https://doi.org/10.1001/archpediatrics.2009.31
- Davies, H. T. O., Crombie, I. K., & Tavakoli, M. (1998). When can odds ratios mislead? BMJ (Clinical Research ed.), 316(7136), 989–991. doi:https://doi.org/10.1136/bmj.316.7136.989
- Deddens, J. A., & Petersen, M. R. (2008). Approaches for estimating prevalence ratios. Occupational and Environmental Medicine, 65(7), 501–506. doi:https://doi.org/10.1136/oem.2007.034777
- Deke, J. (2014). Using the linear probability model to estimate impacts on binary outcomes in randomized controlled trials. Washington, DC: U.S. Department of Health and Human Services. Retrieved from https://www.hhs.gov/ash/oah/oah-initiatives/assets/lpm-tabrief.pdf
- DeMaris, A. (1995). A tutorial in logistic regression. Journal of Marriage and the Family, 57(4), 956–968. doi:https://doi.org/10.2307/353415
- Dey, E. L., & Astin, A. W. (1993). Statistical alternatives for studying college student retention: A comparative analysis of logit, probit, and linear regression. Research in Higher Education, 34(5), 569–581. doi:https://doi.org/10.1007/BF00991920
- Dong, N., & Maynard, R. (2013). PowerUp!: A tool for calculating minimum detectable effect sizes and minimum required sample sizes for experimental and quasi-experimental design studies. Journal of Research on Educational Effectiveness, 6(1), 24–67. doi:https://doi.org/10.1080/19345747.2012.673143
- Donoghoe, M., & Marschner, I. C. (2018). logbin: An R package for relative risk regression using the log-binomial model. Journal of Statistical Software, 86(9), 1–22. doi:https://doi.org/10.18637/jss.v086.i09
- Dynarski, S., Hyman, J., & Schanzenbach, D. W. (2013). Experimental evidence on the effect of childhood investments on postsecondary attainment and degree completion. Journal of Policy Analysis and Management, 32(4), 692–717. doi:https://doi.org/10.1002/pam.21715
- Finch, H., & Schneider, M. K. (2007). Classification accuracy of neural networks vs. discriminant analysis, logistic regression, and classification and regression trees. Methodology, 3(2), 47–57. doi:https://doi.org/10.1027/1614-2241.3.2.47
- Foster, E. M. (1997). Instrumental variables for logistic regression: An illustration. Social Science Research, 26(4), 487–504. doi:https://doi.org/10.1006/ssre.1997.0606
- Fox, J. (1991). Regression diagnostics: An introduction (Vol. 79). Newbury Park, CA: Sage.
- Gardner, W., Mulvey, E. P., & Shaw, E. C. (1995). Regression analyses of counts and rates: Poisson, overdispersed Poisson, and negative binomial models. Psychological Bulletin, 118(3), 392. doi:https://doi.org/10.1037//0033-2909.118.3.392
- Gerbing, D. W., & Anderson, J. C. (1987). Improper solutions in the analysis of covariance structures: Their interpretability and a comparison of alternate respecifications. Psychometrika, 52(1), 99–111. doi:https://doi.org/10.1007/BF02293958
- Giles, D. (2012, June 1). Another gripe about the linear probability model. Retrieved from http://davegiles.blogspot.com/2012/06/another-gripe-about-linear-probability.html
- Greenland, S., Mansournia, M. A., & Altman, D. G. (2016). Sparse data bias: A problem hiding in plain sight. BMJ, 352, i1981. doi:https://doi.org/10.1136/bmj.i1981
- Hayes, A. F., & Cai, L. (2007). Using heteroskedasticity-consistent standard error estimators in OLS regression: An introduction and software implementation. Behavior Research Methods, 39(4), 709–722. doi:https://doi.org/10.3758/BF03192961
- Heckman, J. J., & Snyder, J. M. (1997). Linear probability models of the demand for attributes with an empirical application to estimating the preferences of legislators. The RAND Journal of Economics, 28, S142–S189. doi:https://doi.org/10.2307/3087459
- Heinze, G. (2006). A comparative investigation of methods for logistic regression with separated or nearly separated data. Statistics in Medicine, 25(24), 4216–4226. doi:https://doi.org/10.1002/sim.2687
- Hellevik, O. (2009). Linear versus logistic regression when the dependent variable is a dichotomy. Quality & Quantity, 43, 59–74. doi:https://doi.org/10.1007/s11135-007-9077-3
- Horrace, W. C., & Oaxaca, R. L. (2006). Results on the bias and inconsistency of ordinary least squares for the linear probability model. Economics Letters, 90(3), 321–327. doi:https://doi.org/10.1016/j.econlet.2005.08.024
- Hosmer, D. W., & Lemeshow, S. (2004). Applied logistic regression (2nd ed.). New York, NY: Wiley.
- Huang, F. L., & Cornell, D. G. (2015). The impact of definition and question order on the prevalence of bullying victimization using student self-reports. Psychological Assessment, 27(4), 1484–1493. doi:https://doi.org/10.1037/pas0000149.
- Huang, F. L., & Cornell, D. G. (2012). Pick your Poisson: A tutorial on analyzing counts of student victimization data. Journal of School Violence, 11(3), 187–206. doi:https://doi.org/10.1080/15388220.2012.682010
- Huang, F. L., & Moon, T. R. (2013). What are the odds of that? A primer on understanding logistic regression. Gifted Child Quarterly, 57(3), 197–204. doi:https://doi.org/10.1177/0016986213490022
- King, G., & Zeng, L. (2001). Logistic regression in rare events data. Political Analysis, 9(2), 137–163. doi:https://doi.org/10.1093/oxfordjournals.pan.a004868
- Knol, M. J., Cessie, S. L., Algra, A., Vandenbroucke, J. P., & Groenwold, R. H. H. (2012). Overestimation of risk ratios by odds ratios in trials and cohort studies: Alternatives to logistic regression. Canadian Medical Association Journal, 184(8), 895–899. doi:https://doi.org/10.1503/cmaj.101715
- Land, K. C., McCall, P. L., & Parker, K. F. (1994). Logistic versus hazards regression analyses in evaluation research: An exposition and application to the North Carolina Court Counselors’ Intensive Protective Supervision project. Evaluation Review, 18(4), 411–437. doi:https://doi.org/10.1177/0193841X9401800403
- Lasorsa, D. L. (2003). Question-order effects in surveys: The case of political interest, news attention, and knowledge. Journalism & Mass Communication Quarterly, 80, 499–512. doi:https://doi.org/10.1177/107769900308000302
- Lei, P.-W., & Koehly, L. M. (2003). Linear discriminant analysis versus logistic regression: A comparison of classification errors in the two-group case. The Journal of Experimental Education, 72(1), 25–49. doi:https://doi.org/10.1080/00220970309600878
- Liberman, A. M. (2005). How much more likely? The implications of odds ratios for probabilities. American Journal of Evaluation, 26(2), 253–266. doi:https://doi.org/10.1177/1098214005275825
- Long, J. S., & Ervin, L. H. (2000). Using heteroscedasticity consistent standard errors in the linear regression model. The American Statistician, 54, 217–224. doi:https://doi.org/10.1080/00031305.2000.10474549
- Lottes, I. L., DeMaris, A., & Adler, M. A. (1996). Using and interpreting logistic regression: A guide for teachers and students. Teaching Sociology, 24(3), 284–298. doi:https://doi.org/10.2307/1318743
- Lumley, T., Diehr, P., Emerson, S., & Chen, L. (2002). The importance of the normality assumption in large public health data sets. Annual Review of Public Health, 23(1), 151–169. doi:https://doi.org/10.1146/annurev.publhealth.23.100901.140546
- Luo, J., Zhang, J., & Sun, H. (2014). Estimation of relative risk using a log-binomial model with constraints. Computational Statistics, 29(5), 981–1003. doi:https://doi.org/10.1007/s00180-013-0476-8
- Marschner, I. C. (2011). glm2: Fitting generalized linear models with convergence problems. The R Journal, 3(2), 12–15. doi:https://doi.org/10.32614/RJ-2011-012
- Marschner, I. C., & Gillett, A. C. (2012). Relative risk regression: Reliable and flexible methods for log-binomial models. Biostatistics, 13(1), 179–192. doi:https://doi.org/10.1093/biostatistics/kxr030
- McNutt, L.-A., Wu, C., Xue, X., & Hafner, J. P. (2003). Estimating the relative risk in cohort studies and clinical trials of common outcomes. American Journal of Epidemiology, 157(10), 940–943. doi:https://doi.org/10.1093/aje/kwg074
- Moineddin, R., Matheson, F. I., & Glazier, R. H. (2007). A simulation study of sample size for multilevel logistic regression models. BMC Medical Research Methodology, 7(1), 34. doi:https://doi.org/10.1186/1471-2288-7-34
- Mood, C. (2010). Logistic regression: Why we cannot do what we think we can do, and what we can do about it. European Sociological Review, 26(1), 67–82. doi:https://doi.org/10.1093/esr/jcp006
- Murnane, R. J., & Willett, J. B. (2011). Methods matter: Improving causal inference in educational and social science research. New York, NY: Oxford University Press.
- Muthén, L. K., & Muthén, B. O. (2002). How to use a Monte Carlo study to decide on sample size and determine power. Structural Equation Modeling: A Multidisciplinary Journal, 9(4), 599–620. doi:https://doi.org/10.1207/S15328007SEM0904_8
- Oldendick, R. (2008). Encyclopedia of survey research methods. Thousand Oaks, CA: Sage Publications, Inc. Retrieved from http://srmo.sagepub.com/view/encyclopedia-of-survey-research-methods/n428.xml
- Pearce, N. (2004). Effect measures in prevalence studies. Environmental Health Perspectives, 112(10), 1047–1050. doi:https://doi.org/10.1289/ehp.6927
- Pinheiro, J., Bates, D., DebRoy, S., Sarkar, D, & R Core Team. (2014). nlme: Linear and nonlinear mixed effects models. Retrieved from http://CRAN.R-project.org/package=nlme
- Pischke, J. S. (2012). Probit better than LPM? Retrieved from http://www.mostlyharmlesseconometrics.com/2012/07/probit-better-than-lpm/
- R Core Team. (2017). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from http://www.R-project.org/
- Robertson, D. A., King-Kallimanis, B. L., & Kenny, R. A. (2016). Negative perceptions of aging predict longitudinal decline in cognitive function. Psychology and Aging, 31(1), 71. doi:https://doi.org/10.1037/pag0000061
- Robinson, L. D., & Jewell, N. P. (1991). Some surprising results about covariate adjustment in logistic regression models. International Statistical Review/Revue Internationale de Statistique, 59, 227–240. doi:https://doi.org/10.2307/1403444
- Schwartz, L. M., Woloshin, S., & Welch, H. G. (1999). Misunderstandings about the effects of race and sex on physicians’ referrals for cardiac catheterization. New England Journal of Medicine, 341(4), 279–283. doi:https://doi.org/10.1056/NEJM199907223410411
- Skov, T., Deddens, J., Petersen, M. R., & Endahl, L. (1998). Prevalence proportion ratios: Estimation and hypothesis testing. International Journal of Epidemiology, 27, 91–95. doi:https://doi.org/10.1093/ije/27.1.91
- Solberg, M. E., & Olweus, D. (2003). Prevalence estimation of school bullying with the Olweus Bully/Victim Questionnaire. Aggressive Behavior, 29(3), 239–268. doi:https://doi.org/10.1002/ab.10047
- Spellman, B. A. (2015). A short (personal) future history of revolution 2.0. Perspectives on Psychological Science, 10(6), 886–899. doi:https://doi.org/10.1177/1745691615609918
- Troncoso Skidmore, S., & Thompson, B. (2013). Bias and precision of some classical ANOVA effect sizes when assumptions are violated. Behavior Research Methods, 45(2), 536–546. doi:https://doi.org/10.3758/s13428-012-0257-2
- Vansteelandt, S., Bowden, J., Babanezhad, M., & Goetghebeur, E. (2011). Instrumental variables for logistic regression: An illustration. Statistical Science, 26(3), 403–422. doi:https://doi.org/10.1214/11-STS360
- von Hippel, P. (2015). Linear vs logistic probability models: Which is better, and when? Retrieved from http://statisticalhorizons.com/linear-vs-logistic
- von Hippel, P. (2017). When can you fit a linear probability model? More often than you think. Retrieved from http://statisticalhorizons.com/when-can-you-fit
- Wacholder, S. (1986). Binomial regression in GLIM: Estimating risk ratios and risk differences. American Journal of Epidemiology, 123(1), 174–184. doi:https://doi.org/10.1093/oxfordjournals.aje.a114212
- Wang, X. (2014). Firth logistic regression for rare variant association tests. Frontiers in Genetics, 5, 187. doi:https://doi.org/10.3389/fgene.2014.00187
- Williamson, T., Eliasziw, M., & Fick, G. H. (2013). Log-binomial models: Exploring failed convergence. Emerging Themes in Epidemiology, 10(1), 14. doi:https://doi.org/10.1186/1742-7622-10-14
- Zeileis, A. (2006). Object-oriented computation of sandwich estimators. Journal of Statistical Software, 16(9), 1–16. doi:https://doi.org/10.18637/jss.v016.i09
- Zhang, J., & Yu, K. F. (1998). What's the relative risk? A method of correcting the odds ratio in cohort studies of common outcomes. JAMA, 280(19), 1690–1691. doi:https://doi.org/10.1001/jama.280.19.1690
- Zou, G. (2004). A modified Poisson regression approach to prospective studies with binary data. American Journal of Epidemiology, 159(7), 702–706. doi:https://doi.org/10.1093/aje/kwh090
- Zou, G. Y., & Donner, A. (2013). Extension of the modified Poisson regression model to prospective studies with correlated binary data. Statistical Methods in Medical Research, 22(6), 661–670. doi:https://doi.org/10.1177/0962280211427759