References
- Anderson, E. (1935), “The Irises of the Gaspe Peninsula,” Bulletin of the American Iris Society 59, 2–5.
- Bureau of Labor Statistics, U.S. Department of Labor. (2021a), “National Longitudinal Survey of Youth 1979 Cohort, 1979-2016 (Rounds 1-28),” Produced and distributed by the Center for Human Resource Research (CHRR), The Ohio State University. Columbus, OH. Available at https://www.nlsinfo.org/bibliography-citing-nls-data.
- Bureau of Labor Statistics, U.S. Department of Labor. (2021b), “National Longitudinal Survey of Youth 1979 Cohort, Topical Guide to the Data,” available at https://www.nlsinfo.org/content/cohorts/nlsy79/topical-guide/employment/work-experience.
- Chang, W., Cheng, J., Allaire, J. J., Xie, Y., and McPherson, J. (2020), shiny: Web Application Framework for R. Available at https://CRAN.R-project.org/package=shiny.
- Chatfield, C. 1985. “The Initial Examination of Data,” Journal of the Royal Statistical Society, Series A, 148, 214–253.
- Cooksey, E. C. (2017), “Using the National Longitudinal Surveys of Youth (NLSY) to Conduct Life Course Analyses,” in Handbook of Life Course Health Development, eds. R. M. Lerner, N. Halfon, and C. B. Forrest, pp. 561–577, Cham: Springer. DOI: 10.1007/978-3-319-47143-3_23..
- Dasu, T., and Johnson, T. (2003), Exploratory Data Mining and Data Cleaning. Wiley Series in Probability and Statistics, Hoboken: Wiley.
- Firke, S. (2020), janitor: Simple Tools for Examining and Cleaning Dirty Data. Available at https://CRAN.R-project.org/package=janitor.
- Fullilove, M. T. (1998), “Comment: Abandoning “Race” as a Variable in Public Health Research–an Idea Whose Time Has Come,” American Journal of Public Health, 88, 1297–1298.
- Grimshaw, S. D. (2015), “A Framework for Infusing Authentic Data Experiences Within Statistics Courses,” The American Statistician 69, 307–314. DOI: 10.1080/00031305.2015.1081106..
- Heidari, S., Babor, T. F., De Castro, P., Tort, S., and Curno, M. (2016), “Sex and Gender Equity in Research: Rationale for the SAGER Guidelines and Recommended Use,” Research Integrity and Peer Review 1, 1–9. 10.1186/s41073-016-0007-6.
- Henry, L., and Hadley, W. (2020), purrr: Functional Programming Tools. Available at https://CRAN.R-project.org/package=purrr.
- Horst, A., Marie, A., Hill, P., and Gorman, K. B. (2020), Palmerpenguins: Palmer Archipelago (Antarctica) Penguin Data. Available at DOI: 10.5281/zenodo.3960218..
- Huebner, M., Werner, V., and Cessie, S. L. (2016), “A Systematic Approach to Initial Data Analysis Is Good Research Practice,” The Journal of Thoracic and Cardiovascular Surgery 151, 25–27. DOI: 10.1016/j.jtcvs.2015.09.085.
- Ilk, O. (2004), “Exploratory Multivariate Longitudinal Data Analysis and Models for Multivariate Longitudinal Binary Data,” PhD thesis, Iowa State University. DOI: 10.31274/rtd-180813-11012.
- Kennedy, L., Khanna, K., Simpson, D., and Gelman, A. (2020), “Using Sex and Gender in Survey Adjustment,” available at https://arxiv.org/abs/2009.14401.
- Kim, A. Y., Ismay, C., and Chunn, J. (2018), “The Fivethirtyeight R Package: “Tame Data” Principles for Introductory Statistics and Data Science Courses,” Technology Innovations in Statistics Education, 11, 1–22. DOI: 10.5070/T511103589..
- Koller, M. (2016), “robustlmm: An R Package for Robust Estimation of Linear Mixed-Effects Models,” Journal of Statistical Software 75, 1–24. DOI: 10.18637/jss.v075.i06.
- Moncrief, M. (2015), “By the Numbers – the Average Australian Doesn’t Exist… Not a Single One of Us Is ‘Normal’,” available at https://bit.ly/smh-not-normal.
- Office of Management and Budget. (1997), “Revisions to the Standards for the Classification of Federal Data on Race and Ethnicity,” available at https://www.govinfo.gov/content/pkg/FR-1997-10-30/pdf/97-28653.pdf.
- Open Knowledge Foundation. (2021), “Open Definition. Defining Open in Open Data, Open Content, and Open Knowledge,” available at http://opendefinition.org/od/2.1/en/.
- Pedersen, T. L. (2020), patchwork: The Composer of Plots. https://CRAN.R-project.org/package=patchwork.
- Pergamit, M. R., Pierret, C. R., Rothstein, D. S., and Veum, J. R. (2001), “Data Watch: The National Longitudinal Surveys,” The Journal of Economic Perspectives, 15, 239–53. DOI: 10.1257/jep.15.2.239.
- R Core Team. (2020), R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing. Available at https://www.R-project.org/.
- Singer, J. D., and Willett, J. B. (2003), Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence, Oxford: Oxford University Press.
- Stodel, M. (2020), “Stop Using Iris,” available at https://www.meganstodel.com/posts/no-to-iris/.
- Tierney, N., Cook, D., and Prvan, T. (2020), brolgar: BRowse Over Longitudinal Data Graphically and Analytically in R, available at https://github.com/njtierney/brolgar.
- Tukey, J. W. (1977), Exploratory Data Analysis. Addison-Wesley Series in Behavioral Science. Reading, MA: Addison-Wesley Pub. Co.
- Unwin, A., and Kleinman, K. (2021), “The Iris Data Set: In Search of the Source of Virginica,” Significance 18, 26–29. DOI: 10.1111/1740-9713.01589..
- van der Loo, M. P. J., and de Jonge, E. (2021), “Data Validation Infrastructure for R,” Journal of Statistical Software, 97, 1–31. DOI: 10.18637/jss.v097.i10..
- van der Loo, M., and de Jonge, E. (2018), Statistical Data Cleaning with Applications in R, Hoboken: Wiley.
- Venables, W. N., and Ripley, B. D. (2002), Modern Applied Statistics with S (4th ed.), New York: Springer. Available at http://www.stats.ox.ac.uk/pub/MASS4.
- Wang, E., Cook, D., and Hyndman, R. J. (2020), “A New Tidy Data Structure to Support Exploration and Modeling of Temporal Data,” Journal of Computational and Graphical Statistics 29, 466–478. DOI: 10.1080/10618600.2019.1695624..
- Wickham, H. (2011), “The Split-Apply-Combine Strategy for Data Analysis,” Journal of Statistical Software, 40, 1–29. DOI: 10.18637/jss.v040.i01..
- Wickham, H. (2014), “Tidy Data,” Journal of Statistical Software 59, 1–23.
- Wickham, H. (2016), ggplot2: Elegant Graphics for Data Analysis, New York: Springer-Verlag. Available at https://ggplot2.tidyverse.org.
- Wickham, H. (2019), stringr: Simple, Consistent Wrappers for Common String Operations. Available at https://CRAN.R-project.org/package=stringr.
- Wickham, H. (2020), tidyr: Tidy Messy Data. Available at https://CRAN.R-project.org/package=tidyr.
- Wickham, H., Averick, M., Bryan, J., Chang, W., D’Agostino McGowan, L., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L., Miller, E., Bache, S. M., Müller, K., Ooms, J., Robinson, D., Seidel, D. P., Spinu, V., Takahashi, K., Vaughan, D., Wilke, C., Woo, K., and Yutani, H. (2019), “Welcome to the Tidyverse,” Journal of Open Source Software, 4, 1686. DOI: 10.21105/joss.01686..
- Wickham, H., François, R., Henry, L., and Müller, K. (2020), dplyr: A Grammar of Data Manipulation. Available at https://CRAN.R-project.org/package=dplyr.
- Wickham, H., and Hester, J. (2020), readr: Read Rectangular Text Data. Available at https://CRAN.R-project.org/package=readr.
- Wolpin, K. I. (2005), “National Longitudinal Survey of Youth 1979 Cohort, 1979-2016 (Rounds 1-28),” Published by Bureau of Labor Statistics, U.S. Department of Labor. Available at https://www.bls.gov/opub/mlr/2005/02/art3full.pdf.
- Xie, Y. (2014), “Knitr: A Comprehensive Tool for Reproducible Research in R,” in Implementing Reproducible Computational Research, eds. V. Stodden, F. Leisch, and R. D. Peng, pp. 3–31, Boca Raton, FL: Chapman and Hall/CRC. Available at http://www.crcpress.com/product/isbn/9781466561595.
- Xie, Y., Dervieux, C., and Riederer, E. (2020), R Markdown Cookbook. Boca Raton, Florida: Chapman; Hall/CRC. Available at https://bookdown.org/yihui/rmarkdown-cookbook.
- Zhu, H. (2019), kableExtra: Construct Complex Table with ’kable’ and Pipe Syntax. Available at https://CRAN.R-project.org/package=kableExtra.