542
Views
1
CrossRef citations to date
0
Altmetric
Theory and Methods

Doubly Robust Capture-Recapture Methods for Estimating Population Size

ORCID Icon, ORCID Icon & ORCID Icon
Pages 1309-1321 | Received 01 Jul 2021, Accepted 22 Feb 2023, Published online: 12 Apr 2023
 

Abstract

Estimation of population size using incomplete lists has a long history across many biological and social sciences. For example, human rights groups often construct partial lists of victims of armed conflicts, to estimate the total number of victims. Earlier statistical methods for this setup often use parametric assumptions, or rely on suboptimal plug-in-type nonparametric estimators; but both approaches can lead to substantial bias, the former via model misspecification and the latter via smoothing. Under an identifying assumption that two lists are conditionally independent given measured covariates, we make several contributions. First, we derive the nonparametric efficiency bound for estimating the capture probability, which indicates the best possible performance of any estimator, and sheds light on the statistical limits of capture-recapture methods. Then we present a new estimator, that has a double robustness property new to capture-recapture, and is near-optimal in a nonasymptotic sense, under relatively mild nonparametric conditions. Next, we give a confidence interval construction method for total population size from generic capture probability estimators, and prove nonasymptotic near-validity. Finally, we apply them to estimate the number of killings and disappearances in Peru during its internal armed conflict between 1980 and 2000. Supplementary materials for this article are available online.

Supplementary Materials

In the supplement, we provide the proofs for all results presented in this paper along with additional simulation results and details about the real data.

Acknowledgments

The authors thank Mark van der Laan, and Robin Mejia for very helpful discussions.

Additional information

Funding

Manjari Das and Edward Kennedy gratefully acknowledge support from the Berkman Faculty Development Fund, and Edward Kennedy from NSF Grant DMS1810979.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.