Abstract
Cluster significance analysis is a tool that allows the identification of ‘embedded clusters’ in QSAR datasets. It is successfully applied to an eye irritation data set to show that these data are indeed asymmetric. The method identifies five parameters that form an embedded cluster of eye irritants amongst non irritants, although full separation is not achieved. This method has considerable potential to identify potential non-linearity in toxicology data sets and for parameter reduction. It is shown also that this can be obtained relatively quickly with an analysis performed on 100,000 subsets containing the same information as an analysis on 1,000,000 subsets.