187
Views
1
CrossRef citations to date
0
Altmetric
General

Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org

ORCID Icon, , , , ORCID Icon &
Pages 370-380 | Received 03 Oct 2022, Accepted 04 Mar 2023, Published online: 13 Apr 2023
 

Abstract

This article introduces a novel evaluation methodology for entity resolution algorithms. It is motivated by PatentsView.org, a public-use patent data exploration platform that disambiguates patent inventors using an entity resolution algorithm. We provide a data collection methodology and tailored performance estimators that account for sampling biases. Our approach is simple, practical, and principled—key characteristics that allow us to paint the first representative picture of PatentsView’s disambiguation performance. The results are used to inform PatentsView’s users of the reliability of the data and to allow the comparison of competing disambiguation algorithms.

Authors’ Contributions

Olivier Binette led the evaluation project and wrote the majority of the manuscript. Sokhna A York and Emma Hickerson carried out the data collection by manually reviewing inventor clusters. Youngsoo Baek provided bias adjustment and uncertainty quantification for ratio estimators. Sarvo Madhavan was a technical advisor and contributed to code. Christina Jones was an advisor and project manager. All authors provided input on the manuscript.

Data Availability Statement

All data and code used for this article are available as part of the PatentsView-Evaluation Python package (version 1.0.1) at https://github.com/PatentsView/PatentsView-Evaluation/releases/tag/1.0.1.

Disclosure Statement

The authors report there are no competing interests to declare.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 106.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.