1,565
Views
11
CrossRef citations to date
0
Altmetric
Theory and Methods

A Distributed and Integrated Method of Moments for High-Dimensional Correlated Data Analysis

ORCID Icon &
Pages 805-818 | Received 04 Jan 2018, Accepted 23 Feb 2020, Published online: 02 Apr 2020
 

Abstract

This article is motivated by a regression analysis of electroencephalography (EEG) neuroimaging data with high-dimensional correlated responses with multilevel nested correlations. We develop a divide-and-conquer procedure implemented in a fully distributed and parallelized computational scheme for statistical estimation and inference of regression parameters. Despite significant efforts in the literature, the computational bottleneck associated with high-dimensional likelihoods prevents the scalability of existing methods. The proposed method addresses this challenge by dividing responses into subvectors to be analyzed separately and in parallel on a distributed platform using pairwise composite likelihood. Theoretical challenges related to combining results from dependent data are overcome in a statistically efficient way using a meta-estimator derived from Hansen’s generalized method of moments. We provide a rigorous theoretical framework for efficient estimation, inference, and goodness-of-fit tests. We develop an R package for ease of implementation. We illustrate our method’s performance with simulations and the analysis of the EEG data, and find that iron deficiency is significantly associated with two auditory recognition memory related potentials in the left parietal-occipital region of the brain. Supplementary materials for this article are available online.

Supplementary Materials

Additional technical details, proofs of theorems, simulations and data analysis results are in the supplementary materials, along with an R package.

Acknowledgments

The authors are grateful for the constructive comments given by the associate editor and the anonymous reviewers that led to a significant improvement of the article.

Additional information

Funding

This research was funded by grants NSF DMS1811734, NIH R01ES024732, and NIH P01ES022844.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.