560
Views
108
CrossRef citations to date
0
Altmetric
General Paper

On the suitability of resampling techniques for the class imbalance problem in credit scoring

, &
Pages 1060-1070 | Received 01 Nov 2011, Accepted 01 Aug 2012, Published online: 21 Dec 2017
 

Abstract

In real-life credit scoring applications, the case in which the class of defaulters is under-represented in comparison with the class of non-defaulters is a very common situation, but it has still received little attention. The present paper investigates the suitability and performance of several resampling techniques when applied in conjunction with statistical and artificial intelligence prediction models over five real-world credit data sets, which have artificially been modified to derive different imbalance ratios (proportion of defaulters and non-defaulters examples). Experimental results demonstrate that the use of resampling methods consistently improves the performance given by the original imbalanced data. Besides, it is also important to note that in general, over-sampling techniques perform better than any under-sampling approach.

Acknowledgements

This work has partially been supported by the Spanish Ministry of Education and Science under grant TIN2009–14205 and the Generalitat Valenciana under grant PROMETEO/2010/028.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.