Original Articles

ADMM for multiaffine constrained optimization

Pages 257-303 | Received 28 May 2019, Accepted 18 Oct 2019, Published online: 06 Nov 2019
 

ABSTRACT

We expand the scope of the alternating direction method of multipliers (ADMM). Specifically, we show that ADMM, when employed to solve problems with multiaffine constraints that satisfy certain verifiable assumptions, converges to the set of constrained stationary points if the penalty parameter in the augmented Lagrangian is sufficiently large. When the Kurdyka–Łojasiewicz (K–Ł) property holds, this is strengthened to convergence to a single constrained stationary point. Our analysis applies under assumptions that we have endeavoured to make as weak as possible. It applies to problems that involve nonconvex and/or nonsmooth objective terms, in addition to the multiaffine constraints that can involve multiple (three or more) blocks of variables. To illustrate the applicability of our results, we describe examples including nonnegative matrix factorization, sparse learning, risk parity portfolio selection, nonconvex formulations of convex problems and neural network training. In each case, our ADMM approach encounters only subproblems that have closed-form solutions.
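The paper's scheme handles multiaffine constraints coupling three or more blocks; as background for the mechanics the abstract refers to, the following is only a minimal sketch of classical two-block ADMM on a toy problem (minimize x² + z² subject to x + z = 1), not the paper's multiaffine method. The penalty value and iteration count here are arbitrary illustrative choices.

```python
# Toy two-block ADMM sketch (illustrative only, not the paper's scheme):
#   minimize x^2 + z^2  subject to  x + z = 1.
# Augmented Lagrangian: x^2 + z^2 + lam*(x + z - 1) + (rho/2)*(x + z - 1)^2.
rho = 10.0          # penalty parameter (must be "sufficiently large" in general)
x = z = lam = 0.0
for _ in range(200):
    # x-update: closed-form minimizer of the augmented Lagrangian in x
    x = (rho * (1 - z) - lam) / (2 + rho)
    # z-update: same closed form, using the fresh x
    z = (rho * (1 - x) - lam) / (2 + rho)
    # dual ascent step on the multiplier
    lam += rho * (x + z - 1)

print(x, z, lam)  # both primal variables approach 0.5; lam approaches -1
```

Here each subproblem has a closed-form solution, mirroring the property the abstract highlights for the paper's applications; the analysis in the paper addresses the far harder setting where the constraint couples blocks multiaffinely and the objective may be nonconvex and nonsmooth.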

2010 MATHEMATICS SUBJECT CLASSIFICATIONS:

Acknowledgements

We thank Qing Qu, Yuqian Zhang and John Wright for helpful discussions about applications of multiaffine ADMM. We thank Wotao Yin for his feedback, and for bringing the paper [Citation23] to our attention. We thank Yenson Lau for discussions about numerical experiments.

Disclosure statement

No potential conflict of interest was reported by the authors.

Notes

1 [Citation23] shows that every limit point of ADMM for the problem (NMF) is a constrained stationary point, but does not show that such limit points necessarily exist.

2 See Section 5.4 for the definitions of λ_min and λ_++.

3 Note that we have deliberately excluded the index 0: Assumption 4.2.2 is not required to hold for X0.

4 That is, either (1a) and (1b) hold, or (2a) and (2b) hold.

5 As an illustrative example, a problem may be formulated with the constraints X0X1 + Z1 = 0, X0 + P1(X1) + Z2 = 0, X0X2 + Z3 = 0, P2(X2) + Z4 = 0, where P1, P2 are injective linear maps. The notation A(X, Z0) + Q(Z>) denotes the concatenation of these equations, which can also be seen naturally as a system of four constraints. In this case, the indices r(·) ∈ {1, 2, 3, 4}, and 4.2.2(2a) is satisfied by the second constraint X0 + P1(X1) + Z2 = 0 for the variables X0, X1 (i.e. r(0) = r(1) = 2 and R0 = I, R1 = P1), and by the fourth constraint P2(X2) + Z4 = 0 for X2.
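To make "multiaffine" concrete: each constraint in the footnote above is affine in every block of variables when the other blocks are held fixed, though not jointly affine. A small hedged check of this for the bilinear term X0X1 (the matrices and sizes below are arbitrary test data, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(1)
X0, X0b = rng.standard_normal((2, 3, 3))   # two test values for the X0 block
X1 = rng.standard_normal((3, 3))           # a fixed value for the X1 block
a, b = 2.0, -0.5

def A(X0, X1):
    """The bilinear term X0 @ X1 appearing in the first constraint."""
    return X0 @ X1

# Linear in X0 when X1 is fixed ...
assert np.allclose(A(a * X0 + b * X0b, X1), a * A(X0, X1) + b * A(X0b, X1))
# ... but not jointly linear: scaling both blocks scales the output by a^2.
assert np.allclose(A(a * X0, a * X1), a**2 * A(X0, X1))
print("multiaffine check passed")
```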

6 Note that X0 is included here, unlike in Assumption 4.2.

7 For linear constraints A1x1 + ⋯ + Anxn = b, the equivalent statement is that Im(An) ⊇ ⋃_{i=1}^{n−1} Im(Ai), i.e. Im(Ai) ⊆ Im(An) for each i &lt; n.
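This image-containment condition can be checked numerically: Im(Ai) ⊆ Im(An) holds exactly when appending the columns of Ai to An leaves the rank unchanged. A hedged sketch with made-up matrices (the helper name is ours, not the paper's):

```python
import numpy as np

rng = np.random.default_rng(0)
An = rng.standard_normal((4, 4))   # generically full rank: Im(An) is all of R^4
Ai = rng.standard_normal((4, 2))   # its image is then contained in Im(An)

def image_contained(Ai, An):
    """True iff Im(Ai) is a subset of Im(An), via a rank comparison."""
    return np.linalg.matrix_rank(np.hstack([An, Ai])) == np.linalg.matrix_rank(An)

print(image_contained(Ai, An))     # True for the full-rank An above

# A case where the condition fails: Im(An_thin) is only a line in R^4.
An_thin = np.outer(np.arange(1.0, 5.0), [1.0])   # rank 1
print(image_contained(Ai, An_thin))              # False for generic Ai
```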

8 In the sense that this exact assumption is always necessary and cannot be replaced.

9 When j > m1, θj(X) is constant as a function of Y.

10 To see this, define the prox-Lagrangian LP(U, Y, W, O) = L(U, Y, W) + ‖Y − O‖²_S. By definition, Y+ decreases the prox-Lagrangian with O = Yk, so LP(U, Y+, W, Yk) ≤ LP(U, Yk, W, Yk) = L(U, Yk, W), and the desired result follows.

11 To motivate the sub-blocks (Y0, Y1) in 6.13, one should look to the decomposition of ψ(Z) in Assumption 4.1, where we can take Y0 = {Z0} and Y1 = Z>. Intuitively, Y1 is a sub-block such that ψ is a smooth function of Y1, and which is ‘absorbing’ in the sense that for any U+ and Y0+, there exists Y1 making the solution feasible.

12 (2) is assumed to hold for the iterates U+ and Y+ generated by ADMM as the minimal required condition, but one should not, in general, think of this property as being specifically related to the iterates of the algorithm. In the cases we consider, it will be a property of the function f and the constraint C that for any point (Ũ, Ỹ), there exists Ŷ1 ∈ argmin_{Y1} {f_Ũ(Ỹ0, Y1) : C_Ũ(Ỹ0, Y1) = b_Ũ} such that ‖Ŷ1 − Ỹ1‖² ≤ ζ‖C_Ũ(Ỹ) − b_Ũ‖².

13 To clarify the definition of Yˆ1, the sub-block for Y0 is fixed to the value of Y0+ on the given iteration, and then Yˆ1 is obtained by minimizing fU+(Y0+,Y1) for the Y1 sub-block over the feasible region CU+(Y0+,Y1)=bU+.

Additional information

Funding

The research of Wenbo Gao and Donald Goldfarb was supported in part by the National Science Foundation under NSF Grants CCF-1527809 and CCF-1838061. The research of Frank E. Curtis was supported in part by the National Science Foundation under NSF Grant CCF-1618717 and by the U.S. Department of Energy under DOE Grant DE-SC0010615.

Notes on contributors

Wenbo Gao

Wenbo Gao is a PhD student in the Department of Industrial Engineering and Operations Research at Columbia University.

Donald Goldfarb

Donald Goldfarb is the Alexander and Hermine Avanessians Professor of Industrial Engineering and Operations Research at Columbia University. He has been a faculty member at Columbia Engineering since 1982. He served as interim dean of the School in 2012–2013, executive vice dean in 2011–2012, acting dean in 1994–1995, and chair of the IEOR department from 1984 to 2002.

His research interests include algorithms for linear, quadratic, semidefinite, convex and general nonlinear programming, network flows, large sparse systems, and applications in robust optimization, imaging, machine learning, and finance. He has published more than 100 technical papers and served on the editorial boards of several journals, including editor in chief of Mathematical Programming, editor of the SIAM (Society for Industrial and Applied Mathematics) Journal on Optimization and the SIAM Journal on Numerical Analysis, and associate editor of Operations Research and Mathematics of Computation. He has been a member of the councils of the Mathematical Programming Society and the American Mathematical Society, numerous technical society program and award committees, and advisory committees to various universities and government research agencies.

Before coming to Columbia, he held positions as professor and acting chair in the Department of Computer Science at the City College of New York, visiting professor in the Department of Computer Science and at the School of Operations Research and Industrial Engineering at Cornell University, and assistant research scientist at the Courant Institute of Mathematical Sciences of New York University. He earned a BChE from Cornell in 1963 and MA and PhD from Princeton in 1965 and 1966, respectively.

Frank E. Curtis

Frank E. Curtis is an Associate Professor in the Department of Industrial and Systems Engineering at Lehigh University, where he has been employed since 2009. He received his Bachelor's degree from the College of William and Mary in 2003 with a double major in Mathematics and Computer Science, received his Master's in 2004 and PhD in 2007 from the Department of Industrial Engineering and Management Science at Northwestern University, and spent two years as a Postdoctoral Researcher in the Courant Institute of Mathematical Sciences at New York University from 2007 until 2009.

His research focuses on the design, analysis, and implementation of numerical methods for solving large-scale nonlinear optimization problems. He received an Early Career Award from the Advanced Scientific Computing Research program of the US Department of Energy, and has received funding from various programs of the US National Science Foundation, including through a TRIPODS Institute grant awarded to him and his collaborators at Lehigh, Northwestern, and Boston University.

He currently serves as an Associate Editor for Mathematical Programming, SIAM Journal on Optimization, Mathematics of Operations Research, and Mathematical Programming Computation. He served as the Vice Chair for Nonlinear Programming for the INFORMS Optimization Society from 2010 until 2012, and is currently very active in professional societies and groups related to mathematical optimization, including INFORMS, the Mathematical Optimization Society, and the SIAM Activity Group on Optimization.
