827
Views
2
CrossRef citations to date
0
Altmetric
Theory and Methods

Inference for Multivariate Regression Model Based on Synthetic Data Generated Using Plug-in Sampling

, , , &
Pages 720-733 | Received 18 Dec 2015, Accepted 05 Mar 2021, Published online: 27 Apr 2021
 

Abstract

In this article, the authors derive the likelihood-based exact inference for singly and multiply imputed synthetic data in the context of a multivariate regression model. The synthetic data are generated via the Plug-in Sampling method, where the unknown parameters in the model are set equal to the observed values of their point estimators based on the original data, and synthetic data are drawn from this estimated version of the model. Simulation studies are carried out in order to confirm the theoretical results. The authors provide exact test procedures, which in case multiple synthetic datasets are permissible, are compared with the asymptotic results of Reiter. An application using 2000 U.S. Current Population Survey public use data is discussed. Furthermore, properties of the proposed methodology are evaluated in scenarios where some of the conditions that were used to derive the methodology do not hold, namely for nonnormal and discrete distributed random variables, cases in which the inferential procedures developed still show very good performances.

Acknowledgments

Ricardo Moura sincerely thanks the faculty of the Department of Mathematics and Statistics at UMBC (University of Maryland, Baltimore County) for their support and encouragement. Part of Ricardo Moura’s work was also completed during a ‘Summer at Census’ visit to the US Census Bureau for which he is thankful to Dr. Tommy Wright. The authors also want to earnestly thank the constructive contributions of the Editors-in-Chief, Associate Editors and reviewers who dealt with the manuscript since its first submission.

Additional information

Funding

This work is funded by national funds through FCT - Fundação para a Ciência e a Tecnologia, I.P., under the scope of projects UID/MAT/00297/2013 and UIDB/00297/2020 (Center for Mathematics and Applications), Ricardo Moura’s research was also supported by a Fulbright Research Grant.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.