Bagged K-Means Clustering of Metabolome Data

J. A. Hageman Biosystems Data Analysis, Swammerdam Institute for Life Sciences (SILS), Universiteit van Amsterdam, Amsterdam, The Netherlands

R. A. van den Berg TNO Quality of Life, AJ Zeist, The Netherlands

J. A. Westerhuis Biosystems Data Analysis, Swammerdam Institute for Life Sciences (SILS), Universiteit van Amsterdam, Amsterdam, The Netherlands

H. C. J. Hoefsloot Biosystems Data Analysis, Swammerdam Institute for Life Sciences (SILS), Universiteit van Amsterdam, Amsterdam, The Netherlands

A. K. Smilde Biosystems Data Analysis, Swammerdam Institute for Life Sciences (SILS), Universiteit van Amsterdam, Amsterdam, The Netherlands

Abstract

Clustering of metabolomics data can be hampered by noise originating from biological variation, physical sampling error, and analytical error. Data analysis methods that are not designed to handle noisy data yield suboptimal solutions. Bootstrap aggregating (bagging) is a resampling technique that can deal with noise and improve accuracy. This paper demonstrates the possibilities of bagged clustering applied to metabolomics data. The metabolomics data used in this paper are computer-generated with the human red blood cell model. This model can be perturbed in several ways; in this paper, inhibition experiments are mimicked by reducing enzyme activity to 10% of its original value. When bagged K-means clustering is compared with ordinary K-means, fewer metabolites switch clusters under the influence of heteroscedastic noise if bagging is used. This favors bagged K-means over ordinary K-means clustering when dealing with noisy metabolomics data. A special validation scheme, independent of the addition of noise, has been devised to demonstrate the positive effects of bagging on clustering.
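The abstract does not spell out the bagging procedure, so the following is only a minimal sketch of one common voting-style scheme, not the authors' implementation. It assumes that rows (metabolites) are resampled with replacement, that K-means is fitted to each resample, that cluster labels are aligned to a reference fit by centroid distance, and that each metabolite receives the label it is assigned most often. The function names (`kmeans_centers`, `nearest`, `bagged_kmeans`), the parameter defaults, and the synthetic data with noise proportional to the signal are all illustrative assumptions; the data are not output of the red blood cell model.

```python
import itertools
import numpy as np


def nearest(X, centers):
    """Index of the closest centroid (squared Euclidean distance) for each row."""
    d = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)


def kmeans_centers(X, k, rng, n_iter=100):
    """Plain Lloyd's K-means; returns a (k, n_features) array of centroids."""
    centers = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = nearest(X, centers)
        new = centers.copy()
        for j in range(k):
            if np.any(labels == j):  # keep the old centroid if a cluster empties
                new[j] = X[labels == j].mean(axis=0)
        if np.allclose(new, centers):
            break
        centers = new
    return centers


def bagged_kmeans(X, k, n_boot=50, seed=0):
    """Majority-vote cluster labels over K-means fits on bootstrap resamples."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    n = len(X)
    ref = kmeans_centers(X, k, rng)  # reference fit fixes the label order
    votes = np.zeros((n, k), dtype=int)
    for _ in range(n_boot):
        sample = X[rng.integers(0, n, size=n)]  # rows drawn with replacement
        centers = kmeans_centers(sample, k, rng)
        # Relabel: pick the ordering of centroids closest to the reference.
        # Trying all k! orderings is only practical for small k.
        best = min(itertools.permutations(range(k)),
                   key=lambda p: ((centers[list(p)] - ref) ** 2).sum())
        labels = nearest(X, centers[list(best)])
        votes[np.arange(n), labels] += 1
    return votes.argmax(axis=1)


# Illustrative data: two groups of "metabolites" with heteroscedastic noise,
# i.e. a noise standard deviation proportional to the signal (assumed 10%).
rng = np.random.default_rng(1)
signal = np.vstack([np.full((10, 5), 1.0), np.full((10, 5), 10.0)])
noisy = signal + rng.normal(size=signal.shape) * 0.1 * signal
labels = bagged_kmeans(noisy, k=2)
```

Voting over many resampled fits means that a single noisy fit cannot by itself move a metabolite to another cluster, which is the intuition behind the reduced cluster switching reported above.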

ACKNOWLEDGMENTS

The authors would like to thank Dr. U. Thissen (TNO, Quality of Life, The Netherlands) for contributing to the Matlab implementation of the red blood cell model and Dr. J. Snoep (Vrije Universiteit, Department of Molecular Cell Physiology, The Netherlands) for sharing the scheme used in and .
