References
- Antonie, L., K. Inwood, D. J. Lizotte, and J. Andrew Ross. 2014. Tracking people over time in 19th century Canada for longitudinal analysis. Machine Learning 95(1):129–46. no. :
- Bloothooft, G., P. Christen, K. Mandemakers, and M. Schraagen. 2015. Population reconstruction. Cham: Springer.
- Breiman, L. 2001. Random forests. Machine Learning 45(1):5–32.
- Christen, P. 2012. Data matching: Concepts and techniques for record linkage, entity resolution, and duplicate detection. Berlin: Springer Science & Business Media.
- Cilliers, J. A. 2016. “A demographic history of settler South Africa.” Thesis, Stellenbosch University. http://ir.nrf.ac.za/handle/10907/497.
- Clark, G., N. Cummins, Y. Hao, and D. D. Vidal. 2015. Surnames: A new source for the history of social mobility. Explorations in Economic History 55(1):3–24.
- Dong, H., C. Campbell, S. Kurosu, W. Yang, and J. Z. Lee. 2015. New sources for comparative social science: Historical population panel data from East Asia. Demography 52 (3):1061–88.
- Feigenbaum, J. J. 2016. “Automated census record linking: A machine learning approach.” http://scholar.harvard.edu/jfeigenbaum/publications/ automated-census-record-linking.
- Ferrie, J. P. 1996. A new sample of males linked from the public use microdata sample of the 1850 U.S. Federal census of population to the 1860 U.S. Federal census manuscript schedules. Historical Methods: A Journal of Quantitative and Interdisciplinary History 29(4):141–56. no. : https://doi.org/10.1080/01615440.1996.10112735.
- Fourie, J. 2016. The data revolution in African economic history. Journal of Interdisciplinary History 47 (2):193–212.
- Fu, Z., H. Boot, P. Christen, and J. Zhou. 2014. Automatic record linkage of individuals and households in historical census data. International Journal of Humanities and Arts Computing 8(2):204–25. no. :
- Goeken, R., L. Huynh, T. A. Lynch, and R. Vick. 2011. New methods of census record linking. Historical Methods: A Journal of Quantitative and Interdisciplinary History 44(1):7–14. no. : https://doi.org/10.1080/01615440.2010.517152.
- Guell, M., J. V. R. Mora, and C. I. Telmer. 2015. The informational content of surnames, the evolution of intergenerational mobility and assortative mating. The Review of Economic Studies 82(2):693–735.
- Hastie, T., R. Tibshirani, and J. H. Friedman. 2009. The elements of statistical learning: data mining, inference, and prediction. Second edition, corrected 7th printing. Springer series in statistics. New York: Springer.
- Hautaniemi, S. I., D. L. Anderton, and A. Swedlund. 2000. Methods and validity of a panel study using record linkage: Matching death records to a geographic census sample in two Massachusetts towns, 1850– 1912. Historical Methods: A Journal of Quantitative and Interdisciplinary History 33(1):16–29. no. : Accessed June 28, 2018. https://doi.org/10.1080/01615440009598943.
- Heckman, J. J. 1979. Sample selection bias as a specification error. Econometrica 47(1):153–61.
- James, G., D. Witten, T. Hastie, and R. Tibshirani. 2013. An introduction to statistical learning. Vol. 6. New York: Springer.
- Liaw, A., and M. Wiener. 2002. Classification and regression by random-forest. R News 2(3):18–22. http://CRAN.R-project.org/doc/Rnews/.
- Little, R. J., and D. B. Rubin. 1987. Statistical analysis with missing data. New York: Wiley.
- Loo, M. P. J. V D. 2014. The stringdist package for approximate string matching. The R Journal 6(1):111–22. http://CRAN.R- project. org/package = stringdist.
- Massey, C. G. 2017. Playing with matches: an assessment of accuracy in linked historical data. Historical Methods: A Journal of Quantitative and Interdisciplinary History 1–15. doi.org/10.1080/01615440.2017.1288598
- Meyer, D., E. Dimitriadou, K. Hornik, A. Weingessel, and F. Leisch. 2017. e1071: Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071), TU Wien. https://CRAN.R-project.org/package=e1071.
- Potgieter, M., and J. Visagie. 1974. Inventaris van Opgaafrolle. Cape Town: Cape Town Archives Repository.
- R Core Team 2015. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.
- Rosenwaike, I., M. E. Hill, S. H. Preston, and I. T. Elo. 1998. Linking death certificates to early census records: the african american matched records sample. Historical Methods: A Journal of Quantitative and Interdisciplinary History 31(2):65–74. Accessed June 28, 2018. http://www.tandfonline.com/doi/abs/10.1080/01615449809601189.
- Ruggles, S. 2002. Linking historical censuses: A new approach. History and Computing 14(1–2):213–24. Accessed June 28, 2018. https://www.euppublishing.com/doi/abs/10.3366/hac.2002.14.1-2.213.
- Ruggles, S. 2012. The future of historical family demography. Annual Review of Sociology 38(1):423–41. http://dx.doi.org/10.1146/annurevsoc- 071811-145533.
- Ruggles, S. 2014. Big microdata for population research. Demography 51(1):287–97.
- Ruggles, S., C. A. Fitch, and E. Roberts. 2018. Historical census record linkage. Annual Review of Sociology 44 (1)null. Accessed June 28, 2018. https://doi.org/10.1146/annurev-soc-073117-041447.
- Solon, G., S. J. Haider, and J. M. Wooldridge. 2015. What are We weighting for? Journal of Human Resources 50(2):301–16. no. :
- Venables, W. N., and B. D. Ripley. 2002. Modern applied statistics with S. Fourth. New York: Springer. http://www.stats.ox.ac.uk/pub/MASS4.
- Vick, R., and L. Huynh. 2011. The effects of standardizing names for record linkage: Evidence from the United States and Norway. Historical Methods: A Journal of Quantitative and Interdisciplinary History 44(1):15–24. no. : http://dx.doi.org/10.1080/01615440.2010.514849.
- Wisselgren, M. J., S. Edvinsson, M. Berggren, and M. Larsson. 2014. Testing methods of record linkage on Swedish censuses. Historical Methods: A Journal of Quantitative and Interdisciplinary History 47(3):138–51. no. : https://doi.org/10.1080/01615440.2014.913967.