Search in:

Journal of the American Statistical Association Volume 114, 2019 - Issue 525

Submit an article Journal homepage

1,560

Views

CrossRef citations to date

Altmetric

Theory and Methods

Group SLOPE – Adaptive Selection of Groups of Predictors

Damian BrzyskiDepartment of Epidemiology and Biostatistics, Indiana University, Bloomington, IN;Institute of Mathematics, Jagiellonian University, Cracow, PolandView further author information

Alexej GossmannBioinnovation PhD Program, Tulane University, New Orleans, LAView further author information

Weijie SuDepartment of Statistics, University of Pennsylvania, Philadelphia, PAView further author information

Małgorzata BogdanInstitute of Mathematics, University of Wroclaw, Wroclaw, PolandView further author information

Pages 419-433 | Received 01 Nov 2016, Published online: 06 Aug 2018

Cite this article
https://doi.org/10.1080/01621459.2017.1411269
CrossMark

Sample our Mathematics & Statistics journals, sign in here to start your FREE access for 14 days

Full Article
Figures & data
References
Supplemental
Citations
Metrics
Reprints & Permissions
Read this article /doi/full/10.1080/01621459.2017.1411269?needAccess=true

ABSTRACT

Sorted L-One Penalized Estimation (SLOPE; Bogdan et al. Citation2013, Citation2015) is a relatively new convex optimization procedure, which allows for adaptive selection of regressors under sparse high-dimensional designs. Here, we extend the idea of SLOPE to deal with the situation when one aims at selecting whole groups of explanatory variables instead of single regressors. Such groups can be formed by clustering strongly correlated predictors or groups of dummy variables corresponding to different levels of the same qualitative predictor. We formulate the respective convex optimization problem, group SLOPE (gSLOPE), and propose an efficient algorithm for its solution. We also define a notion of the group false discovery rate (gFDR) and provide a choice of the sequence of tuning parameters for gSLOPE so that gFDR is provably controlled at a prespecified level if the groups of variables are orthogonal to each other. Moreover, we prove that the resulting procedure adapts to unknown sparsity and is asymptotically minimax with respect to the estimation of the proportions of variance of the response variable explained by regressors from different groups. We also provide a method for the choice of the regularizing sequence when variables in different groups are not orthogonal but statistically independent and illustrate its good properties with computer simulations. Finally, we illustrate the advantages of gSLOPE in the context of Genome Wide Association Studies. R package grpSLOPE with an implementation of our method is available on The Comprehensive R Archive Network.

KEYWORDS:

Asymptotic minimax
False discovery rate
Group selection
Model selection
Multiple regression
SLOPE

Acknowledgments

The authors thank Ewout van den Berg, Emmanuel J. Candès, Jan Mielniczuk and Chiara Sabatti for helpful remarks and suggestions. D. B. would like to thank Professor Jerzy Ombach for significant help with the process of obtaining access to the data.

Additional information

Funding

D. B. and M. B. are supported by European Union’s 7th Framework Programme for research, technological development and demonstration under Grant Agreement no 602552 and by the Polish Ministry of Science and Higher Education according to agreement 2932/7.PR/2013/2. Additionally, D.B. acknowledges the support from NIMH grant R01MH108467.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Related Research Data

Controlling the false discovery rate via knockoffs

Source: Institute of Mathematical Statistics

Asymptotic Bayes-optimality under sparsity of some multiple testing procedures

Source: Institute of Mathematical Statistics

Some optimality properties of FDR controlling rules under sparsity

Source: Institute of Mathematical Statistics

Sparse Optimization with Least-Squares Constraints

Source: Society for Industrial & Applied Mathematics (SIAM)

SLOPE is adaptive to unknown sparsity and asymptotically minimax

Source: Institute of Mathematical Statistics

Variance component model to account for sample structure in genome-wide association studies

Source: Springer Science and Business Media LLC

Controlling the Rate of GWAS False Discoveries

Source: Genetics Society of America

Adapting to unknown sparsity by controlling the false discovery rate

Source: Institute of Mathematical Statistics

Early Life Factors and Blood Pressure at Age 31 Years in the 1966 Northern Finland Birth Cohort

Source: Ovid Technologies (Wolters Kluwer Health)

A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems

Source: Society for Industrial & Applied Mathematics (SIAM)

Block-Sparse Recovery via Convex Optimization

Source: Institute of Electrical and Electronics Engineers (IEEE)

A new look at the statistical model identification

Source: Institute of Electrical and Electronics Engineers (IEEE)

Asymptotic Analysis of Complex LASSO via Complex Approximate Message Passing (CAMP)

Source: Institute of Electrical and Electronics Engineers (IEEE)

Group SLOPE - adaptive selection of groups of predictors^*

Source: Figshare

Genome-wide association analysis of metabolic traits in a birth cohort from a founder population

Source: Springer Nature

SLOPE—Adaptive variable selection via convex optimization

Source: Institute of Mathematical Statistics

On false discovery rate thresholding for classification under sparsity

Source: Institute of Mathematical Statistics

Group SLOPE – Adaptive Selection of Groups of Predictors

Source: Figshare

Estimating the Dimension of a Model

Source: Institute of Mathematical Statistics

A new look at the statistical model identification

Source: Institute of Electrical and Electronics Engineers (IEEE)

Model selection and estimation in regression with grouped variables

Source: Wiley

Probing the Pareto Frontier for Basis Pursuit Solutions

Source: Society for Industrial & Applied Mathematics (SIAM)

PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Source: The American Society of Human Genetics. Published by Elsevier Inc.

A Sparse-Group Lasso

Source: Informa UK Limited

Genetic model testing and statistical power in population‐based association studies of quantitative traits

Source: Wiley

Lessons learned from IDeAl — 33 recommendations from the IDeAl-net about design and analysis of small population clinical trials

Source: BMC

Linking provided by

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Group SLOPE – Adaptive Selection of Groups of Predictors

Related Research Data

Information for

Open access

Opportunities

Help and information

Group SLOPE – Adaptive Selection of Groups of Predictors

ABSTRACT

Acknowledgments

Additional information

Funding

Reprints and Corporate Permissions

Academic Permissions

Related Research Data

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature