382
Views
0
CrossRef citations to date
0
Altmetric
Articles

Fast Bayesian Record Linkage With Record-Specific Disagreement Parameters

ORCID Icon

References

  • Abramitzky, R., Mill, R., and Pérez, S. (2018), Linking Individuals Across Historical Sources: A Fully Automated Approach, Working Paper No. 24324. National Bureau of Economic Research.
  • Belin, T. R., and Rubin, D. B. (1995), “A Method for Calibrating False-Match Rates in Record Linkage,” Journal of the American Statistical Association, 90, 694–707. DOI: 10.1080/01621459.1995.10476563.
  • Christen, P. (2012), “A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication,” IEEE Transactions on Knowledge and Data Engineering, 24, 1537–1555. DOI: 10.1109/TKDE.2011.127.
  • Cohen, W. W., Ravikumar, P., and Fienberg, S. E. (2003), “A Comparison of String Distance Metrics for Name-Matching Tasks,” Proceedings of the 2003 International Conference on Information Integration on the Web, AAAI Press, pp. 73–78.
  • Costa, D. L. (2013), “Leaders: Privilege, Sacrifice, Opportunity, and Personnel Economics in the American Civil War,” The Journal of Law, Economics, and Organization, 30, 437–462. DOI: 10.1093/jleo/ewt005.
  • Eddelbuettel, D., and François, R. (2011), “Rcpp: Seamless R and C++ Integration,” Journal of Statistical Software, 40, 1–18. DOI: 10.18637/jss.v040.i08.
  • Enamorado, T., Fifield, B., and Imai, K. (2019), “Using a Probabilistic Model to Assist Merging of Large-Scale Administrative Records,” American Political Science Review, 113, 353–371. DOI: 10.1017/S0003055418000783.
  • Fellegi, I. P., and Sunter, A. B. (1969), “A Theory for Record Linkage,” Journal of the American Statistical Association, 64, 1183–1210. DOI: 10.1080/01621459.1969.10501049.
  • Fogel, R. W., Costa, D. L., Haines, M., Lee, C., Nguyen, L., Pope, C., Rosenberg, I., Scrimshaw, N., Trussell, J., Wilson, S., Wimmer, L. T., Kim, J.,Bassett, J., Burton, J., and Yetter, N. (2000), “Aging of Veterans of the Union Army: Version M-5.”
  • Fortini, M., Liseo, B., Nuccitelli, A., and Scanu, M. (2001, 06), “On Bayesian Record Linkage,” Research in Official Statistics, 1, 185–198.
  • Jaro, M. A. (1989), “Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida,” Journal of the American Statistical Association, 84, 414–420. DOI: 10.1080/01621459.1989.10478785.
  • Larsen, M. D. (2002), “Comments on Hierarchical Bayesian Record Linkage,” Proceedings of the Section on Survey Research Methods, pp. 1995–2000.
  • Larsen, M. D. (2005), “Hierarchical Bayesian Record Linkage Theory,” Proceedings of the Section on Survey Research Methods, pp. 3277–3284.
  • Larsen, M. D. (2010), “Record Linkage Modeling in Federal Statistical Databases,” in FCSM Research Conference, Washington DC: Federal Committee on Statistical Methodology.
  • Larsen, M. D., and Rubin, D. B. (2001), “Iterative Automated Record Linkage Using Mixture Models,” Journal of the American Statistical Association, 96, 32–41. DOI: 10.1198/016214501750332956.
  • Marchant, N. G., Kaplan, A., Elazar, D. N., Rubinstein, B. I. P., and Steorts, R. C. (2021), “D-Blink: Distributed End-to-End Bayesian Entity Resolution,” Journal of Computational and Graphical Statistics, 30, 406–421. DOI: 10.1080/10618600.2020.1825451.
  • McVeigh, B. S., and Murray, J. S. (2017), “Practical Bayesian Inference for Record Linkage,” arXiv: 1710.10558.
  • McVeigh, B. S., Spahn, B. T., and Murray, J. S. (2020), “Scaling Bayesian Probabilistic Record Linkage with Post-Hoc Blocking: An Application to the California Great Registers,” arXiv: 1905.05337.
  • Murray, J. (2016), “Probabilistic Record Linkage and Deduplication after Indexing, Blocking, and Filtering,” Journal of Privacy and Confidentiality, 7, 3–24.
  • Newcombe, H. B., Kennedy, J. M., Axford, S. J., and James, A. P. (1959), “Automatic Linkage of Vital Records,” Science, 130, 954–959. DOI: 10.1126/science.130.3381.954.
  • R Core Team (2019), R: A Language and Environment for Statistical Computing (Computer software manual). Vienna, Austria: R Core Team. Available at https://www.R-project.org/.
  • Sadinle, M. (2017), “Bayesian Estimation of Bipartite Matchings for Record Linkage,” Journal of the American Statistical Association, 112, 600–612. DOI: 10.1080/01621459.2016.1148612.
  • Sadinle, M., and Feinberg, S. E. (2013), “A Generalized Fellegi Sunter Framework for Multiple Record Linkage With Application to Homicide Record Systems,” Journal of the American Statistical Association, 108, 385–397. DOI: 10.1080/01621459.2012.757231.
  • Steorts, R. C. (2015), “Entity Resolution With Empirically Motivated Priors,” Bayesian Analysis, 10, 849–875. DOI: 10.1214/15-BA965SI.
  • Tancredi, A., and Liseo, B. (2011), “A Hierarchical Bayesian Approach to Record Linkage and Population Size Problems,” Annals of Applied Statics, 5, 1553–1585.
  • Thibaudeau, Y. (1993), “The Discrimination Power of Dependency Structures in Record Linkage,” in Survey Methodology (Vol. 19), ed. M.P. Singh, Ottawa: Statistics Canada.
  • Winkler, W. E. (1988), “Using the EM Algorithm for Weight Computation in the Fellegi-Sunter Model of Record Linkage,” Proceedings of the Section on Survey Research Methods, pp. 667–671.
  • Winkler, W. E. (1989), “Near Automatic Weight Computation in the Fellegi-Sunter Model of Record Linkage,” Proceedings of the Bureau of the Census Annual Research Conference, pp. 145–155.
  • Winkler, W. E. (1990), “String Comparator Metrics and Enhanced Decision Rules in the Fellegi-Sunter Model of Record Linkage,” in Proceedings of the Section on Survey Research Methods, Alexandria, VA: ASA, pp. 354–359.
  • Wortman, J. P. H. (2019), “Record Linkage Methods with Applications to Causal Inference and Election Voting Data,” ProQuest Dissertations and Theses, p. 125.
  • Xu, H., Li, X., Shen, C., Hui, S. L. and Grannis, S. (2019), “Incorporating Conditional Dependence in Latent Class Models for Probabilistic Record Linkage: Does It Matter?” Annals of Applied Statistics, 13, 1753–1790.
  • Yancey, W. E. (2000), “Frequency-Dependent Probability Measures for Record Linkage,” in Proceedings of the Section on Survey Research Methods, American Statistical Association, pp. 752–757.
  • Zanella, G. (2020), “Informed Proposals for Local MCMC in Discrete Spaces,” Journal of the American Statistical Association, 115, 852–865. DOI: 10.1080/01621459.2019.1585255.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.