Search in:

Journal of the American Statistical Association Volume 114, 2019 - Issue 528

Submit an article Journal homepage

942

Views

CrossRef citations to date

Altmetric

Theory and Methods

Tuning-Free Heterogeneous Inference in Massive Networks

Zhao Rena Department of Statistics, University of Pittsburgh, Pittsburgh, PA; Correspondence[email protected]
View further author information

Yongjian Kangb Data Sciences and Operations Department, Marshall School of Business, University of Southern California, Los Angeles, CAView further author information

Yingying Fanb Data Sciences and Operations Department, Marshall School of Business, University of Southern California, Los Angeles, CAView further author information

Jinchi Lvb Data Sciences and Operations Department, Marshall School of Business, University of Southern California, Los Angeles, CAView further author information

Pages 1908-1925 | Received 02 Jul 2017, Accepted 06 Oct 2018, Published online: 11 Apr 2019

Cite this article
https://doi.org/10.1080/01621459.2018.1537920
CrossMark

Sample our Mathematics & Statistics journals, sign in here to start your FREE access for 14 days

Full Article
Figures & data
References
Supplemental
Citations
Metrics
Reprints & Permissions
Read this article /doi/full/10.1080/01621459.2018.1537920?needAccess=true

Abstract

Heterogeneity is often natural in many contemporary applications involving massive data. While posing new challenges to effective learning, it can play a crucial role in powering meaningful scientific discoveries through the integration of information among subpopulations of interest. In this article, we exploit multiple networks with Gaussian graphs to encode the connectivity patterns of a large number of features on the subpopulations. To uncover the underlying sparsity structures across subpopulations, we suggest a framework of large-scale tuning-free heterogeneous inference, where the number of networks is allowed to diverge. In particular, two new tests, the chi-based and the linear functional-based tests, are introduced and their asymptotic null distributions are established. Under mild regularity conditions, we establish that both tests are optimal in achieving the testable region boundary and the sample size requirement for the latter test is minimal. Both theoretical guarantees and the tuning-free property stem from efficient multiple-network estimation by our newly suggested heterogeneous group square-root Lasso for high-dimensional multi-response regression with heterogeneous noises. To solve this convex program, we further introduce a scalable algorithm that enjoys provable convergence to the global optimum. Both computational and theoretical advantages are elucidated through simulation and real data examples. Supplementary materials for this article are available online.

Keywords:

Efficiency
Heterogeneous group square-root Lasso
Heterogeneous learning
High dimensionality
Large-scale inference
Multiple networks
Scalability
Sparsity

Supplementary Material

The online supplementary materials contain a scalable HGSL algorithm with provable convergence, the proofs of Theorems 2.1-3.1 and Propositions 2.1-2.3, as well as the proofs of key lemmas and additional technical details. Additional computational cost comparison with existing methods is also provided.

Acknowledgments

Part of this work was completed while the last two authors visited the Departments of Statistics at University of California, Berkeley and Stanford University. These authors sincerely thank both departments for their hospitality.

Additional information

Funding

This work was supported by NSF Grant DMS-1812030, NIH funding: NIH Grant 1R01GM131407-01, NSF CAREER Awards DMS-0955316, and DMS-1150318, a grant from the Simons Foundation, and Adobe Data Science Research Award.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Related Research Data

The Group Square-Root Lasso: Theoretical Properties and Fast Algorithms

Source: Institute of Electrical and Electronics Engineers (IEEE)

Adaptive estimation of a quadratic functional by model selection

Source: Institute of Mathematical Statistics

Scaled sparse linear regression

Source: arXiv

Scalable Algorithms for Data and Network Analysis

Source: Now Publishers

Asymptotic normality and optimalities in estimation of large Gaussian graphical models

Source: arXiv

Regression Shrinkage and Selection via the Lasso

Source: Wiley

Innovated scalable efficient estimation in ultra-large Gaussian graphical models

Source: The Institute of Mathematical Statistics

Sparse inverse covariance estimation with the graphical lasso

Source: Oxford University Press (OUP)

Covariance and precision matrix estimation for high-dimensional time series

Source: Institute of Mathematical Statistics

RANK: Large-Scale Inference with Graphical Nonlinear Knockoffs.

Source: Taylor & Francis

The Benefit of Group Sparsity

Source: Institute of Mathematical Statistics

High-dimensional graphs and variable selection with the Lasso

Source: The Institute of Mathematical Statistics

High trans-ethnic replicability of GWAS results implies common causal variants.

Source: Public Library of Science (PLoS)

High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergence

Source: Institute of Mathematical Statistics

Graphical Models, Exponential Families, and Variational Inference

Source: Now Publishers

Square-Root Lasso: Pivotal Recovery of Sparse Signals via Conic Programming

Source: Elsevier BV

Joint estimation of multiple graphical models

Source: Oxford University Press (OUP)

The joint graphical lasso for inverse covariance estimation across multiple classes

Source: arXiv

Time varying undirected graphs

Source: Carnegie Mellon University

Estimation of a Multi-dimensional Log-concave Density

Source: Wiley

A Constrained ℓ1 Minimization Approach to Sparse Precision Matrix Estimation

Source: Informa UK Limited

Linking provided by

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Tuning-Free Heterogeneous Inference in Massive Networks

Related Research Data

Information for

Open access

Opportunities

Help and information

Tuning-Free Heterogeneous Inference in Massive Networks

Abstract

Supplementary Material

Acknowledgments

Additional information

Funding

Reprints and Corporate Permissions

Academic Permissions

Related Research Data

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature