1,027
Views
49
CrossRef citations to date
0
Altmetric
Articles: Graphical Statistics

Contribution Biplots

Pages 107-122 | Received 01 Jul 2010, Published online: 27 Mar 2013
 

Abstract

To interpret the biplot, it is necessary to know which points—usually variables—are the ones that are important contributors to the solution, especially when there are many variables involved. This information can be calculated separately as part of the biplot's numerical results, but this means that a table has to be consulted along with the graphical display. We propose a new scaling of the display, called the contribution biplot, which incorporates this diagnostic information directly into the display itself, showing visually the important contributors and thus facilitating the biplot interpretation and often simplifying the graphical representation considerably. The contribution biplot can be applied to a wide variety of analyses, such as correspondence analysis, principal component analysis, log-ratio analysis, and various forms of discriminant analysis, and, in fact, to any method based on dimension reduction through the singular value decomposition. In the contribution biplot, one set of points, usually the rows of a data matrix, optimally represents the spatial positions of the cases or sample units, according to an appropriate distance measure. The other set of points, usually the columns of the data matrix, is represented by vectors that are related to their contributions to the low-dimensional solution. A fringe benefit is that often only one common scale for the row and column points is needed on the principal axes, thus avoiding the problem of enlarging or contracting the scale of one set of points to make the biplot legible. Furthermore, the contribution biplot also solves the problem in correspondence analysis and log-ratio analysis of low-frequency categories that are located on the periphery of the map, giving the false impression that they are important, when they are in fact contributing minimally to the solution. This article has supplementary materials online.

ACKNOWLEDGMENTS

The research support of the BBVA Foundation in Madrid, Spain, is gratefully acknowledged, as well as partial support from the Spanish Ministry of Science and Innovation grants MTM2008-00642 and MTM2009-09063. The associate editor's constructive comments on the earlier versions of this article led to major improvements, as did the comprehensive reports of two reviewers. The Calanus glacialis fatty acid data were kindly provided by Janne Søreide of UNIS (The University Centre in Svalbard).

Additional information

Notes on contributors

Michael Greenacre

Michael Greenacre, Professor of Statistics, Department of Economics and Business, Universitat Pompeu Fabra, Barcelona, Spain, and Barcelona Graduate School of Economics, Barcelona, Spain (E-mail: [email protected]).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.