3,117
Views
1
CrossRef citations to date
0
Altmetric
Articles

Expanding the Scope of Statistical Computing: Training Statisticians to Be Software Engineers

ORCID Icon &
Pages S7-S15 | Published online: 22 Mar 2021

References

  • Barry, T. (2018), “Collections in R: Review and Proposal,” The R Journal, 10, 455–471, DOI: 10.32614/RJ-2018-037.
  • Baumer, B. S., Çetinkaya-Rundel, M., Bray, A., Loi, L., and Horton, N. J. (2014), “R Markdown: Integrating a Reproducible Analysis Tool Into Introductory Statistics,” Technology Innovations in Statistics Education, 8, available at https://escholarship.org/uc/item/90b2f5xh.
  • Behnel, S., Bradshaw, R., Citro, C., Dalcin, L., Seljebotn, D. S., and Smith, K. (2011), “Cython: The Best of Both Worlds,” Computing in Science Engineering, 13, 31–39, DOI: 10.1109/MCSE.2010.118.
  • Beller, M., Bacchelli, A., Zaidman, A., and Juergens, E. (2014), “Modern Code Reviews in Open-Source Projects: Which Problems Do They Fix?,” in Proceedings of the 11th Working Conference on Mining Software Repositories, pp. 202–211. DOI: 10.1145/2597073.2597082.
  • Bentley, J. L. (1975), “Multidimensional Binary Search Trees Used for Associative Searching,” Communications of the ACM, 18, 509–517, DOI: 10.1145/361002.361007.
  • Bion, R., Chang, R., and Goodman, J. (2018), “How R Helps Airbnb Make the Most of Its Data,” The American Statistician, 72, 46–52, DOI: 10.1080/00031305.2017.1392362.
  • Bissi, W., Neto, A. G. S. S., and Emer, M. C. F. P. (2016), “The Effects of Test Driven Development on Internal Quality, External Quality and Productivity: A Systematic Review,” Information and Software Technology, 74, 45–54, DOI: 10.1016/j.infsof.2016.02.004.
  • Breiman, L. (2001), “Random Forests,” Machine Learning, 45, 5–32, DOI: 10.1023/A:1010933404324.
  • Bryan, J. (2018), “Excuse Me, Do You Have a Moment to Talk About Version Control?,” The American Statistician, 72, 20–27, DOI: 10.1080/00031305.2017.1399928.
  • Çetinkaya-Rundel, M., and Rundel, C. (2018), “Infrastructure and Tools for Teaching Computing Throughout the Statistical Curriculum,” The American Statistician, 72, 58–65, DOI: 10.1080/00031305.2017.1397549.
  • CMU Career & Professional Development Center (2018), “First Destination Outcomes: Dietrich College Statistics & Data Science, Bachelor’s,” available at https://www.cmu.edu/career/documents/2018_one_pagers/dc/Bachelors/%20Stats.pdf.
  • Eddelbuettel, D., and Francois, R. (2011), “Rcpp: Seamless R and C++ Integration,” Journal of Statistical Software, 40, 1–18, DOI: 10.18637/jss.v040.i08.
  • Eklund, A., Nichols, T. E., and Knutsson, H. (2016), “Cluster Failure: Why fMRI Inferences for Spatial Extent Have Inflated False-Positive Rates,” Proceedings of the National Academy of Sciences of the United States of America, 113, 7900–7905, DOI: 10.1073/pnas.1602413113.
  • Fiksel, J., Jager, L. R., Hardin, J. S., and Taub, M. A. (2019), “Using GitHub Classroom to Teach Statistics,” Journal of Statistics Education, 27, 110–119, DOI: 10.1080/10691898.2019.1617089.
  • Freeman, S., Eddy, S. L., McDonough, M., Smith, M. K., Okoroafor, N., Jordt, H., and Wenderoth, M. P. (2014), “Active Learning Increases Student Performance in Science, Engineering, and Mathematics,” Proceedings of the National Academy of Sciences of the United States of America, 111, 8410–8415, DOI: 10.1073/pnas.1319030111.
  • Gray, A. G., and Moore, A. W. (2003), “Nonparametric Density Estimation: Toward Computational Tractability,” in Proceedings of the 2003 SIAM International Conference on Data Mining, pp. 203–211. DOI: 10.1137/1.9781611972733.19.
  • Greenhouse, J. B., and Seltman, H. J. (2018), “On Teaching Statistical Practice: From Novice to Expert,” The American Statistician, 72, 147–154, DOI: 10.1080/00031305.2016.1270230.
  • Johnson, N. A. (2013), “A Dynamic Programming Algorithm for the Fused Lasso and L0-Segmentation,” Journal of Computational and Graphical Statistics, 22, 246–260, DOI: 10.1080/10618600.2012.681238.
  • Jordan, M. I. (2013), “On Statistics, Computation and Scalability,” Bernoulli, 19, 1378–1390, DOI: 10.3150/12-BEJSP17.
  • Kulik, C.-L. C., Kulik, J. A., and Bangert-Drowns, R. L. (1990), “Effectiveness of Mastery Learning Programs: A Meta-Analysis,” Review of Educational Research, 60, 265–299, DOI: 10.3102/00346543060002265.
  • Leisch, F. (2002), “Sweave: Dynamic Generation of Statistical Reports Using Literate Data Analysis,” in Compstat 2002—Proceedings in Computational Statistics, eds. W. Härdle and B. Rönz, Heidelberg: Physica Verlag, pp. 575–580.
  • Liu, F. T., Ting, K. M., and Zhou, Z.-H. (2012), “Isolation-Based Anomaly Detection,” ACM Transactions on Knowledge Discovery From Data, 6, 1–39, DOI: 10.1145/2133360.2133363.
  • Mäntylä, M. V., and Lassenius, C. (2009), “What Types of Defects Are Really Discovered in Code Reviews?,” IEEE Transactions on Software Engineering, 35, 430–448, DOI: 10.1109/TSE.2008.71.
  • National Academies of Sciences, Engineering, and Medicine (2018), Data Science for Undergraduates: Opportunities and Options, Washington, DC: The National Academies Press.
  • Nolan, D., and Temple Lang, D. (2009), “Integrating Computing Into the Statistics Curricula,” available at https://www.stat.berkeley.edu/statcur/.
  • ——— (2010), “Computing in the Statistics Curricula,” The American Statistician, 64, 97–107, DOI: 10.1198/tast.2010.09132.
  • Rigby, P. C., and Bird, C. (2013), “Convergent Contemporary Software Peer Review Practices,” in Proceedings of the 9th Joint Meeting on Foundations of Software Engineering, pp. 202–212.
  • Rossini, A. J. (2001), “Literate Statistical Practice,” in Proceedings of the 2nd International Workshop on Distributed Statistical Computing, eds. K. Hornik and F. Leisch.
  • Sadowski, C., Söderberg, E., Church, L., Sipko, M., and Bacchelli, A. (2018), “Modern Code Review: A Case Study at Google,” in Proceedings of the 40th International Conference on Software Engineering: Software Engineering in Practice, pp. 181–190.
  • Wang, A. L.-C. (2003), “An Industrial-Strength Audio Search Algorithm,” in Proceedings of the 4th International Conference on Music Information Retrieval.
  • Wayne, K. (2016), “Autocomplete-Me,” in SIGCSE Nifty Assignments, available at http://nifty.stanford.edu/2016/wayne-autocomplete-me/.
  • Wickham, H. (2011), “testthat: Get Started With Testing,” The R Journal, 3, 5–10. DOI: 10.32614/RJ-2011-002.
  • ——— (2014), “Tidy Data,” Journal of Statistical Software, 59, 1–23, DOI: 10.18637/jss.v059.i10.
  • Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D. A., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L., Miller, E., Bache, S. M., Müller, K., Ooms, J., Robinson, D., Seidel, D. P., Spinu, V., Takahashi, K., Vaughan, D., Wilke, C., Woo, K., and Yutani, H. (2019), “Welcome to the Tidyverse,” Journal of Open Source Software, 4, 1686, DOI: 10.21105/joss.01686.
  • Williams, L., Maximilien, E. M., and Vouk, M. (2003), “Test-Driven Development as a Defect-Reduction Practice,” in 14th International Symposium on Software Reliability Engineering, pp. 34–45.
  • Xie, Y. (2015), Dynamic Documents With R and knitr (2nd ed.), Boca Raton, FL: Chapman and Hall/CRC.
  • Xie, Y., Allaire, J. J., and Grolemund, G. (2018), R Markdown: The Definitive Guide, Boca Raton, FL: Chapman and Hall/CRC.