References
- A. Agafonov, D. Kamzolov, P. Dvurechensky, and A. Gasnikov, Inexact tensor methods and their application to stochastic convex optimization, arXiv preprint: 2012.15636, 2020.
- L. Bottou, F. Curtis, and J. Nocedal, Optimization methods for large-scale machine learning, SIAM Rev. 60 (2018), pp. 223–311.
- E.G. Birgin, J.L. Gardenghi, J.M. Martínez, S.A. Santos, and Ph.L. Toint, Worst-case evaluation complexity for unconstrained nonlinear optimization using high-order regularized models, Math. Program. 163(1–2) (2017), pp. 359–368.
- R. Bollapragada, R.H. Byrd, and J. Nocedal, Exact and inexact subsampled Newton methods for optimization, IMA J. Numer. Anal. 39(2) (2018), pp. 545–578.
- J. Bolte, A. Daniilidis, and A. Lewis, The Łojasiewicz inequality for nonsmooth subanalytic functions with applications to subgradient dynamical systems, SIAM J. Optim. 17(4) (2007), pp. 1205–1223.
- S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, Distributed optimization and statistical learning via the alternating direction method of multipliers, Found. Trends Mach. Learn. 3(1) (2011), pp. 1–122.
- C. Cartis, N.I.M. Gould, and Ph.L. Toint, A concise second-order complexity analysis for unconstrained optimization using high-order regularized models, Optim. Methods Softw. 35(2) (2020), pp. 243–256.
- C.C. Chang and C.J. Lin, LIBSVM: a library for support vector machines, ACM Trans. Intell. Syst. Technol. 2(3) (2011), pp. 1–27.
- A. Defazio, F. Bach, and S. Lacoste-Julien, SAGA: a fast incremental gradient method with support for non-strongly convex composite objectives, Adv. Neural Inf. Process. Syst. 27 (2014), pp. 1646–1654.
- N. Doikov and Yu. Nesterov, Local convergence of tensor methods, Math. Program. 193(1) (2022), pp. 315–336.
- N. Doikov and Yu. Nesterov, Inexact tensor methods with dynamic accuracies, in International Conference on Machine Learning, 2020, pp. 2577–2586.
- A. Gasnikov, P. Dvurechensky, E. Gorbunov, E. Vorontsova, D. Selikhanovych, C.A. Uribe, B. Jiang, H. Wang, S. Zhang, S. Bubeck, and Q. Jiang, Near optimal methods for minimizing convex functions with Lipschitz pth derivatives, in Conference on Learning Theory, 2019, pp. 1392–1393.
- I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning, MIT Press, 2016.
- A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis, Wiley, 2001.
- R. Johnson and T. Zhang, Accelerating stochastic gradient descent using predictive variance reduction, in Advances in Neural Information Processing Systems, 2013, pp. 315–323.
- D. Kovalev, K. Mishchenko, and P. Richtárik, Stochastic Newton and cubic Newton methods with simple local linear-quadratic rates, in Advances in Neural Information Processing Systems, 2019.
- A. Lucchi and J. Kohler, A stochastic tensor method for non-convex optimization, arXiv preprint: 1911.10367, 2019.
- A. Mokhtari, M. Eisen, and A. Ribeiro, IQN: an incremental quasi-Newton method with local superlinear convergence rate, SIAM J. Optim. 28(2) (2018), pp. 1670–1698.
- E. Moulines and F. Bach, Non-asymptotic analysis of stochastic approximation algorithms for machine learning, Adv. Neural Inf. Process. Syst. 24 (2011), pp. 451–459.
- J. Mairal, Incremental majorization–minimization optimization with application to large-scale machine learning, SIAM J. Optim. 25(2) (2015), pp. 829–855.
- Yu. Nesterov, Implementable tensor methods in unconstrained convex optimization, Math. Program. 186(1–2) (2021), pp. 157–183.
- Yu. Nesterov, Inexact basic tensor methods for some classes of convex optimization problems, Optim. Methods Softw. 37(3) (2022), pp. 878–906. doi:10.1080/10556788.2020.1854252
- Yu. Nesterov, Gradient methods for minimizing composite functions, Math. Program. 140(1) (2013), pp. 125–161.
- I. Necoara, General convergence analysis of stochastic first order methods for composite optimization, J. Optim. Theory Appl. 189(1) (2021), pp. 66–95.
- I. Necoara, V. Nedelcu, and I. Dumitrache, Parallel and distributed optimization methods for estimation and control in networks, J. Process Control 21(5) (2011), pp. 756–766.
- I. Necoara and D. Lupu, General higher-order majorization–minimization algorithms for (non)convex optimization, arXiv preprint: 2010.13893, 2020.
- A. Nemirovski, A. Juditsky, G. Lan, and A. Shapiro, Robust stochastic approximation approach to stochastic programming, SIAM J. Optim. 19(4) (2009), pp. 1574–1609.
- Yu. Nesterov and B.T. Polyak, Cubic regularization of Newton method and its global performance, Math. Program. 108(1) (2006), pp. 177–205.
- L.M. Nguyen, J. Liu, K. Scheinberg, and M. Takáč, SARAH: A novel method for machine learning problems using stochastic recursive gradient, in International Conference on Machine Learning, 2017.
- A. Rodomanov and Yu. Nesterov, Smoothness parameter of power of Euclidean norm, J. Optim. Theory Appl. 185(2) (2020), pp. 303–326.
- L. Rosasco, S. Villa, and B.C. Vu, Convergence of stochastic proximal gradient algorithm, Appl. Math. Optim. 82(3) (2020), pp. 891–917.
- N. Tripuraneni, M. Stern, C. Jin, J. Regier, and M.I. Jordan, Stochastic cubic regularization for fast nonconvex optimization, Adv. Neural Inf. Process. Syst. 31 (2018), pp. 2899–2908.
- D. Zhou, P. Xu, and Q. Gu, Stochastic variance-reduced cubic regularized Newton method, in International Conference on Machine Learning, 2018, pp. 5985–5994.
- Hyperspectral Remote Sensing Scenes, http://www.ehu.eus/ccwintco/index.php/Hyperspectral-Remote-Sensing-Scenes.
- A. Wächter and L.T. Biegler, On the implementation of a primal–dual interior point filter line search algorithm for large-scale nonlinear programming, Math. Program. 106(1) (2006), pp. 25–57.