193
Views
6
CrossRef citations to date
0
Altmetric
Original Articles

A Bregman extension of quasi-Newton updates I: an information geometrical framework

&
Pages 96-123 | Received 22 Nov 2010, Accepted 03 Aug 2011, Published online: 04 Oct 2011
 

Abstract

We study quasi-Newton methods from the viewpoint of information geometry. Fletcher has studied a variational problem which derives approximate Hessian update formulae of quasi-Newton methods. We point out that the variational problem is identical to the optimization of the Kullback–Leibler (KL) divergence, which is a discrepancy measure between two probability distributions. The KL-divergence introduces a differential geometrical structure on the set of positive-definite matrices, and the geometric view helps our intuitive understanding of the Hessian update in quasi-Newton methods. Then, we introduce the Bregman divergence as an extension of the KL-divergence. As well as the KL-divergence, the Bregman divergence introduces the information geometrical structure on the set of positive-definite matrices. We derive extended quasi-Newton update formulae based on the variational problem of the Bregman divergence. From the geometrical viewpoint, we study the invariance property of Hessian update formulae. We also propose an extension of the sparse quasi-Newton update. Especially, we point out that the sparse quasi-Newton method is closely related to statistical algorithm such as em-algorithm and boosting. We show that the information geometry is a useful tool not only to better understand the numerical algorithm but also to design new update formulae in quasi-Newton methods.

Acknowledgements

The authors are grateful to Dr Nobuo Yamashita of Kyoto university for the helpful comments. T. Kanamori was partially supported by Grant-in-Aid for Young Scientists (20700251).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.