Research Article

A Hybrid Machine Learning Model for Credit Approval

Pages 1439-1465 | Received 28 Dec 2020, Accepted 13 Sep 2021, Published online: 12 Oct 2021

ABSTRACT

Incorrect decision-making in financial institutions is very likely to cause financial crises. In recent years, many studies have demonstrated that artificial intelligence techniques can serve as alternative methods for credit scoring. Previous studies showed that prediction models built using hybrid approaches perform better than those built using single approaches, and that feature selection or instance selection techniques should be incorporated into model building to improve prediction performance. In this study, we integrate feature selection, instance selection, and decision tree techniques to propose a new approach to predicting credit approval. Experimental results obtained on three credit datasets show that the proposed approach is superior to five traditional machine learning approaches on the evaluation measures. In addition, the proposed approach has a lower cost effect than the five traditional methods; that is, it generates lower costs, such as monetary losses.

Introduction

Credit-risk evaluation decisions are important for the financial institutions involved due to the high level of risk associated with wrong decisions. The ability to accurately predict credit failure is a very important issue in financial decision-making. Incorrect decision-making in financial institutions is very likely to cause financial crises (Tsai Citation2014). The purpose of credit scoring is to classify the applicants into two types: applicants with good credit and applicants with bad credit. Even a 1% improvement in the accuracy of the credit scoring of applicants with bad credit can greatly decrease the losses of financial institutions (Hand and Henley Citation1997).

In recent years, many studies have demonstrated that artificial intelligence (AI) techniques can be used as alternative methods for credit scoring, such as artificial neural networks (ANN) (Guotai, Abedin, and Moula Citation2017; Tsai Citation2014; Wang et al. Citation2011), decision trees (DT) (Tsai Citation2014; Wang et al. Citation2011), and support vector machines (SVM) (Chen and Li Citation2014; Zhong et al. Citation2014). Previous studies showed that prediction models built using hybrid approaches, such as classifiers with clustering, perform better than single approaches (classifiers only) (Hsieh Citation2005; Luo, Cheng, and Hsieh Citation2009; Ping and Yongheng Citation2011). Moreover, some previous studies revealed that feature selection (Catal and Diri Citation2009; Lee Citation2009; Saeys, Inza, and Larrañaga Citation2007; Tsai Citation2009; Tsai and Hsiao Citation2010) or instance selection (Sun and Li Citation2011; Tsai and Cheng Citation2012) should be incorporated into building prediction models to improve prediction performance.

In this study, we propose a new approach, integrating feature selection, instance selection, and a classifier to build a prediction model for credit approval. The proposed framework is shown in Figure 1. Its process is as follows. First, a measure (gain ratio) is used for feature selection. Second, a clustering method, expectation maximization (EM), is applied to cluster the training dataset into k clusters. Finally, a classification method, the C4.5 algorithm, is used to build k decision tree classifiers, one for each cluster of instances.

Figure 1. The proposed framework.

In the experiment, we use precision, true positive rate (TPR), accuracy, and F1 to evaluate the performance differences between the proposed approach and five traditional machine learning approaches: DT (decision tree), MLP (multilayer perceptron), NB (naive Bayes), RF (random forest), and SVM (support vector machine). In addition, we perform a cost-effectiveness analysis (CEA) comparing the cost effect of the proposed approach with that of the five traditional approaches.

The rest of this paper is organized as follows. Section 2 reviews related work. The proposed clustering-based decision tree is illustrated in Section 3. The evaluation criteria are illustrated in Section 4. Case studies based on real data are used to demonstrate the experimental results in Section 5. Section 6 discusses the conclusions and offers suggestions for future work.

Related Work

Decision Tree Related Works

Classification is an important problem in the field of data mining. In classification, we are given a set of example records, called the training data set, with each record consisting of several attributes. One of the categorical attributes, called the class label, indicates the class to which each record belongs. The objective of classification is to utilize the training data set to build a model of the class label such that it can be used to classify new data whose class labels are unknown.

Many types of models have been built for classification, such as neural networks, statistical models, genetic models, and decision tree models (Han, Kamber, and Pei Citation2006). Classification trees, also called decision trees, are especially attractive in a data mining environment for several reasons (Breiman et al. Citation1984). First, due to their intuitive representation, the resulting classification model is easy for human beings to assimilate (Mehta, Agrawal, and Rissanen Citation1996). Second, decision trees do not require any parameter settings from the user and thus are especially suited for exploratory knowledge discovery. Third, decision trees can be constructed relatively quickly compared to other methods (Shafer, Agrawal, and Mehta Citation1996). Finally, the accuracy of decision trees is comparable or superior to that of other classification models (Lim, Loh, and Shih Citation1998).

Related to decision tree classifiers, Quinlan (Citation1986) proposed a decision tree algorithm known as Iterative Dichotomiser 3 (ID3). Later, Quinlan (Citation1987) proposed C4.5 (a successor of ID3), which became a benchmark work to which newer supervised learning algorithms are often compared. Breiman et al. (Citation1984) proposed the classification and regression tree (CART) algorithm, which describes the generation of binary decision trees. Other algorithms for decision tree induction include SLIQ (Mehta, Agrawal, and Rissanen Citation1996), SPRINT (Shafer, Agrawal, and Mehta Citation1996), BOAT (Gehrke et al. Citation1999) and so on. The efficiency of existing decision tree algorithms, such as ID3, C4.5, and CART, has been well established for relatively small data sets. The SPRINT and SLIQ algorithms can both handle categorical and continuous valued attributes and are also suitable for very large training sets. The BOAT algorithm can be used for incremental updates. That is, BOAT can take new insertions and deletions for the training data and update the decision tree to reflect these changes (Han, Kamber, and Pei Citation2006).

An attribute selection measure is a heuristic for selecting the splitting criterion that separates a given data partition of class-labeled training tuples into individual classes. The presence of redundant attributes does not adversely affect the accuracy of decision trees. An attribute is redundant if it is strongly correlated with another attribute in the data. One of the two redundant attributes is not used for splitting once the other attribute has been chosen. However, if the data set contains many irrelevant attributes, i.e., attributes that are not useful for the classification task, then some of these may be accidentally chosen during the tree-growing process, resulting in a decision tree that is larger than necessary (Tan, Steinbach, and Kumar Citation2006).

Feature selection methods can improve the accuracy of decision trees by eliminating irrelevant attributes. Pushpalatha and Rajalakshmi (Citation2018) analyzed the importance of attribute selection techniques on a credit approval dataset; according to their study, logistic regression with the CFSSubsetEval attribute selection method yields better performance than other techniques. Pristyanto, Adi, and Sunyoto (Citation2019) proposed a feature selection model for increasing the accuracy of specific classifiers by comparing several existing feature selection models and classifiers.

Studies on the development and improvement of classification techniques are abundant. Owing to space limitations, we refer readers to Ngai, Xiu, and Chau (Citation2009) and Ngai et al. (Citation2011) for literature reviews on classification.

Clustering Related Works

The k-means (KM) approach has been widely used in pattern recognition problems, and several variations and improvements to the original algorithm have been made. MacQueen's (Citation1967) k-means algorithm is widely used because of its simplicity, and it has been shown to converge to a local minimum (Selim and Ismail Citation1984). However, there is no guarantee of an optimal clustering, since convergence depends on the initial seeds selected (Looney Citation2002). The k-means algorithm is therefore not always the best choice for clustering, owing to its poor time performance and other requirements: it typically requires that clusters be spherical, that the data be free of noise, and that it be properly initialized (Estivill-Castro and Yang Citation2004).

Expectation maximization (EM) (Dempster, Laird, and Rubin Citation1977) is an improved version of the k-means algorithm, with better performance. It is a statistical technique for maximum likelihood estimation using mixture models. The EM algorithm is the most frequently used technique for estimating class conditional probability density functions (PDF) (Abd-Almageed, El-Osery, and Smith Citation2003).

EM clusters data in a manner different than in the k-means method. Unlike distance-based or hard membership algorithms (such as k-Means), EM is known to be an appropriate optimization algorithm for constructing proper statistical models of data (Bradley and Fayyad Citation1998). However, convergence to a local rather than the global optima is a problem arising due to its iterative nature. This means that the method is sensitive to the initial conditions, and thus not robust. To overcome the initialization problem, several methods for determining ‘good’ initial parameters for EM have been suggested, mainly based on sub-sampling, voting, and two-stage clustering (Meila and Heckerman Citation1998).

EM aims at finding clusters such that the maximum likelihood of each cluster’s parameters is obtained. EM starts with an initial estimate for the missing variables and iterates to find the maximum likelihood (ML) for these variables. Maximum likelihood methods estimate the parameters by values that maximize the sample’s probability for an event. EM is typically used with mixture models. The goal of the EM algorithm is to maximize the overall probability or likelihood of the data, given the (final) clusters. Unlike the classic implementation of k-means clustering, the general EM algorithm can be applied to both continuous and categorical variables (Bradley, Fayyad, and Reina Citation1998).

Research on the development and improvement of clustering techniques is likewise abundant. Owing to space limitations, we refer readers to Levin (Citation2015), Reddy and Ussenaiah (Citation2012), and Xu and Tian (Citation2015) for literature reviews on clustering.

Credit Scoring

The purpose of credit scoring is to classify the applicants into two types: applicants with good credit and applicants with bad credit. Applicants with good credit are very likely to repay their financial obligation. Those with bad credit have a high possibility of defaulting. The accuracy of credit scoring is critical to financial institutions’ profitability. Even a 1% improvement in the accuracy of credit scoring of applicants with bad credit can greatly decrease the losses of financial institutions (Hand and Henley Citation1997).

Credit scoring was originally evaluated subjectively according to personal experiences. However, with the tremendous increase of applicants, it is impossible to conduct the work manually. Statistical techniques and artificial intelligence (AI) techniques, which are the two major categories of automatic credit scoring techniques, have been investigated in prior studies (Huang et al. Citation2004). In addition, Huang et al. (Citation2004) found that AI techniques are superior to statistical techniques in dealing with credit scoring problems, especially for nonlinear pattern classification. Muniyandi, Rajeswari, and Rajaram (Citation2012) proposed an anomaly detection method using “k-Means + C4.5,” a method to cascade k-means clustering and C4.5 decision tree methods for classifying anomalous and normal activities in a computer network.

In recent years, many studies have demonstrated that AI techniques, such as artificial neural networks (ANN) (Chang and Yeh Citation2012; Hájek Citation2011; Tsai Citation2014; Wang et al. Citation2011), stochastic gradient boosting (Orlova Citation2021), decision trees (DT) (Inyaem and Chuaytem Citation2020; Tsai Citation2014; Wang et al. Citation2011), ensemble model (Nalić, Martinović, and Žagar Citation2020; Zhang et al. Citation2021), and support vector machines (SVM) (Chen and Li Citation2014; Kim and Ahn Citation2012; Tsai Citation2014; Wang et al. Citation2011; Wang and Ma Citation2012; Yeh, Lin, and Hsu Citation2012; Zhong et al. Citation2014) can be used as alternative methods for credit scoring. For completely understanding the previous studies, the survey of Dastile, Celik, and Potsane (Citation2020) proffers the literature review with respect to this issue.

Table 1 compares hybrid learning models related to credit rating techniques and evaluation methods. In the related works, previous hybrid models are generally compared with single machine learning techniques to reach a final conclusion.

Table 1. Comparison of works

From the above discussion, we know that (1) hybrid data mining approaches are popular for building prediction models (classifiers); (2) feature selection techniques are integrated for building classifiers; and (3) instance selection techniques are integrated for building classifiers. However, feature selection and instance selection techniques are not integrated together for building prediction models (classifiers).

In general, a data-driven approach is sensitive to the distribution of the dataset; hence, no single machine learning algorithm outperforms all others. Researchers therefore apply different data-preprocessing strategies to remedy the problem. The first technique, feature selection, gauges the representativeness of each feature in order to identify the most discriminative ones. The second, instance selection, reduces the size of the dataset and filters out noise and outliers. However, past studies in the field of credit scoring employ only one of these techniques in their research models. This study therefore combines the respective advantages of both to strengthen the power of data-driven approaches. The idea is similar to the mix-and-match strategy for COVID-19 vaccines: AstraZeneca and Moderna produce antibodies against COVID-19 in different ways, so governments adopt both to protect national health more comprehensively. In the same spirit, our strategy aims to ensure that the proposed model can discover the best rules/patterns from any type of dataset.

In addition, the cost-effect criterion has not been considered in previous studies to evaluate the performance of prediction models. Therefore, we attempt to integrate feature selection, instance selection, and decision tree techniques to propose a new approach to predicting credit approval.

The Proposed Approach

In this study, we first use the gain ratio to determine clustering attributes (feature selection). We then apply the EM clustering technique to cluster the training data into k clusters (instance selection). For each cluster of the dataset, we use the C4.5 decision tree algorithm to build a decision tree classifier (prediction model construction). Finally, k C4.5 decision trees are generated from the k clusters; that is, each cluster of the dataset is used to build one C4.5 decision tree. When predicting the class labels of unseen data, the cluster IDs of the unseen records are first determined by the EM clustering algorithm (for example, cluster ID = 2). The C4.5 decision tree of that specific cluster (cluster ID = 2) is then used to predict the class labels.
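To make this flow concrete, the following is a minimal Python sketch of the training and prediction steps, using scikit-learn stand-ins: GaussianMixture for EM clustering and DecisionTreeClassifier (a CART-style learner) in place of C4.5, which scikit-learn does not provide. The feature selection step is assumed to have already produced the column indices `selected_cols`; this is an illustration of the idea, not the paper's exact implementation.

```python
import numpy as np
from sklearn.mixture import GaussianMixture
from sklearn.tree import DecisionTreeClassifier

def fit_cbdt(X_train, y_train, selected_cols, k=3, seed=0):
    """Cluster the training data into k clusters, then fit one tree per cluster."""
    Xs = X_train[:, selected_cols]                    # clustering attributes only
    em = GaussianMixture(n_components=k, random_state=seed).fit(Xs)
    cluster_ids = em.predict(Xs)
    trees = {}
    for c in range(k):                                # one C4.5-style tree per cluster
        mask = cluster_ids == c                       # (assumes every cluster is non-empty)
        tree = DecisionTreeClassifier(criterion="entropy", random_state=seed)
        trees[c] = tree.fit(X_train[mask], y_train[mask])
    return em, trees

def predict_cbdt(em, trees, X_new, selected_cols):
    """Route each unseen record to its cluster's tree for prediction."""
    cluster_ids = em.predict(X_new[:, selected_cols])
    preds = np.empty(len(X_new), dtype=int)           # assumes integer class labels
    for c, tree in trees.items():
        mask = cluster_ids == c
        if mask.any():
            preds[mask] = tree.predict(X_new[mask])
    return preds
```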

Decision Tree

Quinlan (Citation1986) proposed a decision tree algorithm known as ID3. Later, Quinlan (Citation1987) proposed C4.5 (a successor of ID3), which became a benchmark work to which newer supervised learning algorithms are often compared. In this section, we illustrate two popular measures, information gain and gain ratio, used to select the splitting attribute.

Information Gain

ID3 uses information gain as its attribute selection measure. Let D, the partition, be a training set of class-labeled tuples. Suppose the class label attribute has m distinct values defining m distinct classes, Ci (for i = 1, 2, …, m). Let Ci, D be the set of tuples of class Ci in D. Let |D| and |Ci, D| denote the number of tuples in D and Ci, D, respectively. Let node N represent the tuples of partition D. The attribute with the highest information gain is chosen as the splitting attribute for node N. This attribute minimizes the information needed to classify the tuples in the resulting partitions, reflecting the “impurity” in these partitions. The expected information needed to classify a tuple in D is given by

(1) $\mathrm{Info}(D) = -\sum_{i=1}^{m} p_i \log_2(p_i)$

where pi is the probability that an arbitrary tuple in D belongs to class Ci and is estimated by |Ci, D|/|D|. A log function to base 2 is used, because the information is encoded in bits. Info(D) is just the average amount of information needed to identify the class label of a tuple in D. Info(D) is also known as the entropy of D.

Suppose we want to partition the tuples in D on some attribute A having v distinct values, {a1, a2, …, av}, as observed from the training data. Attribute A can be used to split D into v partitions or subsets, {D1, D2, …, Dv}, where Dj contains those tuples in D that have outcome aj of A. These partitions correspond to the branches grown from node N. After partitioning, it is quite likely that the partitions will be impure (e.g., may contain a collection of tuples from different classes rather than from a single class). How much more information do we still need (after the partitioning) in order to arrive at an exact classification? The amount is measured by

(2) $\mathrm{Info}_A(D) = \sum_{j=1}^{v} \frac{|D_j|}{|D|} \times \mathrm{Info}(D_j)$

where $|D_j|/|D|$ is the weight of the jth partition, and $\mathrm{Info}_A(D)$ is the expected information required to classify a tuple from D based on the partitioning by A. The smaller the expected information required, the greater the purity of the partitions.

Information gain is defined as the difference between the original information requirement (i.e., based on just the proportion of classes) and the new requirement (i.e., obtained after partitioning based on attribute A). That is,

(3) $\mathrm{Gain}(A) = \mathrm{Info}(D) - \mathrm{Info}_A(D)$

Gain(A) tells us how much would be gained by branching based on A. If attribute A holds the highest information gain, Gain(A), it is chosen as the splitting attribute at node N. That is, we partition based on attribute A for the “best classification,” so as to minimize the amount of information still required to finish classifying the tuples (i.e., minimum InfoA(D)).

Gain Ratio

The information gain measure is biased toward tests with many outcomes. That is, it prefers to select attributes having a large number of values, such as customer_ID. The information required to classify data set D based on such a partition would be $\mathrm{Info}_{customer\_ID}(D) = 0$. Therefore, the information gained by partitioning on this attribute is maximal. However, such a partitioning is useless for classification.

C4.5, a successor of ID3, uses an extension of information gain known as the gain ratio in an attempt to overcome this bias. A kind of normalization is applied to the information gain using a "split information" value, defined analogously to Info(D) as

(4) $\mathrm{SplitInfo}_A(D) = -\sum_{j=1}^{v} \frac{|D_j|}{|D|} \times \log_2\!\left(\frac{|D_j|}{|D|}\right)$

This value represents the potential information generated by splitting the training data set, D, into v partitions, corresponding to the v outcomes of a test on attribute A. Note that, for each outcome, the number of tuples having that outcome with respect to the total number of tuples in D is considered. This method differs from information gain, which measures the information with respect to classification that is acquired based on the same partitioning. The attribute with the maximum gain ratio is selected to be the splitting attribute. The gain ratio is defined as

(5) $\mathrm{GainRatio}(A) = \dfrac{\mathrm{Gain}(A)}{\mathrm{SplitInfo}_A(D)}$
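As a worked sketch of equations (1)-(5), the following Python functions compute entropy, information gain, and gain ratio for a categorical attribute, using only the standard library; attribute values and class labels are passed as plain lists.

```python
import math
from collections import Counter

def entropy(labels):                        # Info(D), equation (1)
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def split_entropy(values, labels):          # Info_A(D), equation (2)
    n = len(labels)
    total = 0.0
    for v in set(values):
        subset = [l for x, l in zip(values, labels) if x == v]
        total += len(subset) / n * entropy(subset)
    return total

def gain(values, labels):                   # Gain(A), equation (3)
    return entropy(labels) - split_entropy(values, labels)

def gain_ratio(values, labels):             # GainRatio(A), equations (4)-(5)
    n = len(labels)
    split_info = -sum((c / n) * math.log2(c / n)
                      for c in Counter(values).values())
    return gain(values, labels) / split_info if split_info > 0 else 0.0

# An ID-like attribute earns maximal gain, but its large SplitInfo drives the
# gain ratio down -- exactly the bias correction C4.5 makes over ID3.
```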

The Expectation Maximization (EM) Clustering Method

Clustering analysis is an important activity for dealing with large amounts of data. Automated clustering can be used to identify dense and sparse regions in an object space and, therefore, discover overall distribution patterns and interesting correlations among the data attributes. Therefore, clustering is also called data segmentation in some applications, because clustering partitions large data sets into groups according to their similarity.

The most well-known and commonly used partitioning methods are k-means, expectation maximization (EM), and their variations. The k-means algorithm takes an input parameter, k, and partitions a set of n objects into k clusters so that intra-cluster similarity is high while inter-cluster similarity is low. It is a popular unsupervised clustering method that requires an initial set of cluster seeds to start the clustering and does not guarantee convergence to an optimal clustering. EM is an improvement on the k-means algorithm that offers better performance; it is a statistical technique for maximum likelihood estimation using mixture models.

The EM algorithm is an iterative statistical technique for maximum likelihood estimation in settings with incomplete data. Given a model of data generation and data with some missing values, EM locally maximizes the likelihood of the model parameters and estimates the missing values. The steps for our implementation of EM are described below. Initially, we guess the mean and standard deviation. Then, the EM algorithm searches for an ML hypothesis through the following iterative scheme (Nasser, Alkhaldi, and Vert Citation2006).

Step 1 (Initialization): initialize the hypothesis $\theta^0 = (\mu_1^0, \mu_2^0, \ldots, \mu_k^0)$, i.e.,

(6) $\theta_k^0 = \mu_k^0$

where k is the number of Gaussians; $\theta^0$ is the estimate at the 0th iteration; $\mu$ is the mean.

Step 2 (Expectation step): estimate the expected values of the hidden variables $z_{ik}$ (mean and standard deviation) using the current hypothesis $\theta^t = (\mu_1^t, \mu_2^t, \ldots, \mu_k^t)$:

(7) $E(z_{ik}) = \dfrac{\exp\left[-\frac{(x_i - \mu_k^t)^2}{2\sigma^2}\right]}{\sum_{j=1}^{K} \exp\left[-\frac{(x_i - \mu_j^t)^2}{2\sigma^2}\right]}$

where t is the number of iterations; $E(z_{ik})$ is the expected value of the hidden variable; k indexes the Gaussian component; $\mu$ is the mean; $\sigma$ is the standard deviation.

Step 3 (Maximization step): provide a new estimate of the parameters:

(8) $\mu_k^{t+1} = \dfrac{\sum_{i=1}^{n} E(z_{ik})\, x_i}{\sum_{i=1}^{n} E(z_{ik})}$

Step 4 (Convergence step): if $\|\theta^{t+1} - \theta^t\| < e$, stop (finish iteration); otherwise, go to Step 2.
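The following is a minimal Python sketch of this iteration for one-dimensional data, under the simplifying assumptions implicit in equations (6)-(8): k Gaussian components with a fixed, shared standard deviation sigma and no mixing-weight updates. A full EM implementation would re-estimate those quantities as well.

```python
import numpy as np

def em_means(x, k, sigma=1.0, eps=1e-6, max_iter=200, seed=0):
    """Estimate k component means by EM, per equations (6)-(8)."""
    x = np.asarray(x, dtype=float)
    rng = np.random.default_rng(seed)
    mu = rng.choice(x, size=k, replace=False)      # step 1: initialize theta^0
    for _ in range(max_iter):
        # Step 2 (E-step): expected memberships E(z_ik), equation (7).
        logits = -((x[:, None] - mu[None, :]) ** 2) / (2 * sigma ** 2)
        w = np.exp(logits)
        w /= w.sum(axis=1, keepdims=True)
        # Step 3 (M-step): new means, equation (8).
        mu_new = (w * x[:, None]).sum(axis=0) / w.sum(axis=0)
        # Step 4: convergence test ||theta^{t+1} - theta^t|| < e.
        if np.linalg.norm(mu_new - mu) < eps:
            return mu_new
        mu = mu_new
    return mu
```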

The Proposed Framework

The proposed framework is shown in Figure 1 and involves three phases. We introduce a clustering-based decision tree (CBDT) algorithm to implement the framework, as shown in Figure 2. In the first phase, the gain-ratio measure is used to evaluate features (attributes) for feature selection; we rank the important features according to this measure. In the second phase, the EM clustering approach is used to cluster instances into different clusters for instance selection. In the third phase, the C4.5 decision tree method is employed to build a prediction model (decision tree) for each cluster determined in the previous phase.

Phase 1. Determine the clustering attributes:

Figure 2. The CBDT algorithm.

For the training data, we first calculate the gain-ratio values of all attributes and then select the clustering attributes used to cluster the data into k clusters in the second phase.

Phase 2. Cluster the training data into k clusters by using the EM clustering method (see Figure 3):

Figure 3. EM_Clustering Subroutine.

The EM algorithm is used to cluster the data into k clusters based on the clustering attributes generated from Phase 1.

Phase 3. Build decision trees for each cluster's training data (see Figure 4):

Figure 4. Generate_Decision_Tree Subroutine.

After k clusters of training data are generated in Phase 2, we build the decision trees for each cluster. Since k clusters were formed in Phase 2, we build k decision trees; note that each decision tree is built from a different cluster of training data.

The EM_Clustering subroutine is summarized in Figure 3. It takes two parameters, D and clustering_attributes. We refer to D as a data partition; initially, it is the complete set of training records. The clustering_attributes are the attributes, selected in Phase 1, on which the records are clustered.

The EM_Clustering subroutine starts with an initial guess for the mean and standard deviation of the data partition (D) according to the clustering_attributes (step 1). The recursive clustering stops only when $\|\theta^{t+1} - \theta^t\| < e$ (step 5); otherwise, it repeats the expectation and maximization steps. In the expectation step, the expected values of the hidden variables (mean and standard deviation) are estimated using the current hypothesis (step 3). The maximization step then provides a new estimate of the parameters (step 4).

The decision tree algorithm, C4.5, adopts a greedy approach in which decision trees are constructed in a top-down, recursive, divide-and-conquer manner. Most algorithms for decision tree induction follow such a top-down approach, which starts with a set of records and their associated class labels. The training set is recursively partitioned into smaller subsets as the tree is built. A basic decision tree algorithm is summarized in Figure 4. It is quite straightforward; the strategy is as follows.

The Generate_Decision_Tree subroutine has three parameters: D, attribute_list, and Attribute_selection_method. We refer to D as a data partition; initially, it is the complete set of training records. Attribute_selection_method specifies a heuristic procedure for selecting the attribute that "best" discriminates between the given records according to class.

The tree starts as a single node, N, which represents the training records in D (step 1). The partition of class-labeled training records at node N is the set of records following a path from the root of the tree to node N; this set is sometimes referred to in the literature as the family of records at node N. The recursive partitioning stops only when one of the following terminating conditions is true: (1) the records in D are all of the same class, in which case node N becomes a leaf labeled with that class (steps 2 and 3); (2) there are no remaining attributes on which the records may be further partitioned (step 4), in which case majority voting is employed (step 5), converting node N into a leaf labeled with the most common class in D; or (3) there are no records for a given branch, that is, a partition Dj is empty (step 12), in which case a leaf is created with the majority class in D (step 13).

Step 6 utilizes the Attribute_selection_method to determine the splitting criterion. The splitting criterion tells us which attribute to test at node N by determining the “best” way to separate or partition the records in D into individual classes. Step 7 labels node N with the splitting criterion. A branch is grown from node N for each outcome of the splitting criterion. The records in D are partitioned accordingly (steps 10 to 11).
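A compact Python sketch of this control flow is given below for categorical attributes; the tree is represented as nested dictionaries, and the attribute-selection heuristic (e.g., the gain_ratio function sketched earlier) is passed in as `select`. This illustrates the steps just described rather than reproducing the exact pseudocode of Figure 4.

```python
from collections import Counter

def generate_decision_tree(D, attribute_list, select):
    """D is a list of (record_dict, class_label) pairs."""
    labels = [label for _, label in D]
    if len(set(labels)) == 1:                      # steps 2-3: pure partition -> class leaf
        return {"leaf": labels[0]}
    majority = Counter(labels).most_common(1)[0][0]
    if not attribute_list:                         # steps 4-5: no attributes left -> majority leaf
        return {"leaf": majority}
    best = max(attribute_list,                     # steps 6-7: choose and record the splitter
               key=lambda a: select([r[a] for r, _ in D], labels))
    remaining = [a for a in attribute_list if a != best]
    branches = {}
    for v in set(r[best] for r, _ in D):           # steps 10-11: one branch per observed outcome
        Dj = [(r, l) for r, l in D if r[best] == v]
        branches[v] = generate_decision_tree(Dj, remaining, select)
    # Steps 12-13: when branching over an attribute's full domain, an empty
    # partition Dj would instead receive a leaf labeled with `majority`.
    return {"split_on": best, "branches": branches}
```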

Experimental Results

We conducted several experiments to evaluate the proposed approach. Three credit datasets (German credit data, Australian credit approval, and credit-approval), obtained from the UCI machine learning repository (Dua and Graff Citation2019), are used to evaluate the performance of the proposed approach.

For each run of cross-validation, we first apply the C4.5 decision tree algorithm to build a single decision tree as the baseline model. In addition, several machine learning methods are used for comparison with the proposed CBDT approach: MLP, NB, RF, and SVM. The five methods (DT, MLP, NB, RF, and SVM) were implemented in Python. The two hybrid techniques, feature selection and instance selection (the EM method), were implemented using SQL Server BI.
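As an illustration of the baseline setup, a minimal sketch of the 3-fold comparison with scikit-learn classifiers follows; the hyperparameters shown are illustrative defaults rather than the settings of Table 3, and `X, y` are assumed to hold the preprocessed features and labels.

```python
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC

baselines = {
    "DT": DecisionTreeClassifier(),
    "MLP": MLPClassifier(max_iter=1000),
    "NB": GaussianNB(),
    "RF": RandomForestClassifier(),
    "SVM": SVC(),
}
for name, model in baselines.items():
    # X, y: feature matrix and class labels loaded and preprocessed beforehand.
    scores = cross_val_score(model, X, y, cv=3, scoring="f1")
    print(f"{name}: mean F1 = {scores.mean():.3f}")
```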

Furthermore, we investigate whether the two hybrid techniques (feature selection and instance selection) can improve the prediction performance of the other three methods (MLP, NB, and SVM). Since RF (random forest) is an ensemble method, we do not integrate the two hybrid techniques into it. We therefore generate three new methods (CBMLP, CBNB, and CBSVM) by integrating the two hybrid techniques with MLP, NB, and SVM.

Comparisons of the six approaches are listed in Table 2. The parameter settings used to construct the proposed CBDT model are shown in Table 3.

Table 2. Comparison of the six approaches

Table 3. Parameter settings of the proposed CBDT model

Evaluation Criteria

The confusion matrix is a useful tool for analyzing how well a classifier recognizes tuples of different classes. A confusion matrix for two classes is shown in Table 4. Given two classes, we can talk in terms of positive tuples versus negative tuples. True positives are the positive tuples that were correctly labeled by the classifier, while true negatives are the negative tuples that were correctly labeled. False positives are the negative tuples that were incorrectly labeled; similarly, false negatives are the positive tuples that were incorrectly labeled.

Table 4. Confusion matrix for positive and negative tuples

TP: the number of true positives; FP: the number of false positives;

TN: the number of true negatives; FN: the number of false negatives.

The true positive rate (TPR) is the proportion of positive tuples that are correctly identified. The accuracy of a classifier on a given test dataset is the percentage of test tuples that the classifier classifies correctly. Recall, also referred to as the true positive rate (TPR), measures the fraction of positive examples correctly predicted by the classifier. Precision is the fraction of the records declared positive by the classifier that actually turn out to be positive. Recall (TPR) and precision are combined into another metric known as the F1 measure. These measures are defined as follows (Tan, Steinbach, and Kumar Citation2006):

(9) $\mathrm{Recall} = \mathrm{TPR} = \dfrac{TP}{TP + FN}$

(10) $\mathrm{Precision} = \dfrac{TP}{TP + FP}$

(11) $\mathrm{Accuracy} = \dfrac{TP + TN}{TP + FP + FN + TN}$

(12) $F1 = \dfrac{2 \times \mathrm{Recall} \times \mathrm{Precision}}{\mathrm{Recall} + \mathrm{Precision}}$

In this study, we also apply cost-effectiveness analysis (CEA), which compares relative costs and outcomes (effects), to evaluate performance. A cost matrix is shown in Table 5. The cost-effect measure is defined as follows.

(13) $\mathrm{CostEffect} = \dfrac{TP \times TPC + FP \times FPC + FN \times FNC + TN \times TNC}{TP + FP + FN + TN}$

Table 5. Cost matrix

TPC: the cost of true positives; FPC: the cost of false positives;

TNC: the cost of true negatives; FNC: the cost of false negatives.

In the experiment, we use Precision, TPR, F1, and Accuracy to evaluate the performance differences between the proposed CBDT approach and the other five approaches listed in Table 2. Moreover, we also apply cost-effectiveness analysis (CEA). A cost matrix (see Table 6) is provided on the Statlog (German credit data) dataset website, in which the rows represent the actual classification and the columns the predicted classification. It is worse to classify a customer as good when they are bad (cost = 5) than to classify a customer as bad when they are good (cost = 1).

Table 6. Cost matrix for credit approval
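A short sketch tying equations (9)-(13) together: the metrics are computed from the four confusion-matrix counts, with default costs following the Statlog cost matrix above under the assumption that the positive class denotes a bad-credit applicant, so a bad applicant approved as good is a false negative (cost 5) and a good applicant rejected is a false positive (cost 1).

```python
def evaluate(tp, fp, fn, tn, tpc=0.0, fpc=1.0, fnc=5.0, tnc=0.0):
    """Compute the five evaluation measures from confusion-matrix counts."""
    recall = tp / (tp + fn)                                   # equation (9)
    precision = tp / (tp + fp)                                # equation (10)
    accuracy = (tp + tn) / (tp + fp + fn + tn)                # equation (11)
    f1 = 2 * recall * precision / (recall + precision)        # equation (12)
    cost_effect = (tp * tpc + fp * fpc + fn * fnc + tn * tnc) / (
        tp + fp + fn + tn)                                    # equation (13)
    return {"TPR": recall, "Precision": precision,
            "Accuracy": accuracy, "F1": f1, "CostEffect": cost_effect}
```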

Statlog (German Credit Data) Dataset

There are 1000 instances in the German-credit-data dataset. This file has been edited, and several indicator variables are added to make it suitable for algorithms that cannot cope with categorical variables. Several attributes that are ordered categorically (such as attribute 17) have been coded as integers.
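For reference, a small loading sketch follows. The URL is the customary UCI repository path for the numeric, indicator-coded version of this file and is an assumption to be verified; the last column holds the class label (1 = good, 2 = bad).

```python
import pandas as pd

# Assumed UCI path for the indicator-coded ("numeric") German credit file.
URL = ("https://archive.ics.uci.edu/ml/machine-learning-databases/"
       "statlog/german/german.data-numeric")
df = pd.read_csv(URL, sep=r"\s+", header=None)   # 1000 rows, whitespace-separated
X = df.iloc[:, :-1].to_numpy()                   # indicator/integer attributes
y = df.iloc[:, -1].to_numpy()                    # class label: 1 = good, 2 = bad
```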

Firstly, we compare the proposed CBDT approach with the other five methods (DT, MLP, NB, RF, and SVM) in prediction performance. The average experimental results of 3-fold cross-validation are shown in Table 7. From Table 7, it can be seen that the proposed CBDT approach achieves a higher F1 (0.92) and Accuracy (0.89) than the other five methods. Note also that the CBDT approach has a lower CostEffect (0.28) than the other five methods; that is, it generates the lowest costs (such as monetary losses) among the six approaches.

Table 7. The average experiment results compared with the other 5 methods

Secondly, we investigate whether the two hybrid techniques (feature selection and instance selection) can improve the prediction performance of the other three methods (MLP, NB, and SVM). The average experimental results for the three new methods (CBMLP, CBNB, and CBSVM) are shown in Table 8. From Table 8, it can be seen that CBSVM, which integrates the two hybrid techniques, outperforms the original SVM in all measures (Precision, TPR, F1, Accuracy, and CostEffect).

Table 8. The average experiment results of the new three methods

Finally, we compare the proposed CBDT approach with the new methods (CBMLP, CBNB, and CBSVM) in prediction performance. The average experimental results on the Statlog (German credit data) dataset for 3-fold cross-validation are shown in Table 9. From Table 9, it can be seen that the proposed CBDT approach still achieves a higher F1 (0.92) and Accuracy (0.89) than the three new methods, and its CostEffect (0.28) remains the lowest.

Table 9. The average experiment results compared with the other three methods

Statlog (Australian Credit Approval) Dataset

There are 690 instances in the Australian-credit-approval dataset. This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because there is a good mix of attributes: continuous, nominal with small numbers of values, and nominal with larger numbers of values. All attributes have been coded as integers.

Firstly, we compare the proposed CBDT approach with the other five methods (DT, MLP, NB, RF, and SVM) in prediction performance. The average experimental results of 3-fold cross-validation are shown in Table 10. From Table 10, it can be seen that the proposed CBDT approach achieves a higher F1 (0.92) and Accuracy (0.93) than the other five methods. Note also that the CBDT approach has a lower CostEffect (0.20) than the other five methods; that is, it generates the lowest costs (such as monetary losses) among the six approaches.

Table 10. The average experiment results compared with the other five methods

Secondly, we investigate whether the two hybrid techniques (feature selection and instance selection) can improve the prediction performance of the other three methods (MLP, NB, and SVM). The average experimental results for the three new methods (CBMLP, CBNB, and CBSVM) are shown in Table 11. From Table 11, it can be seen that CBSVM, which integrates the two hybrid techniques, outperforms the original SVM in all measures (Precision, TPR, F1, Accuracy, and CostEffect). In addition, CBNB slightly outperforms the original NB in Precision, F1, and CostEffect.

Table 11. The average experiment results of the new three methods

Finally, we compare the proposed CBDT approach with the new methods (CBMLP, CBNB, and CBSVM) in prediction performance. The average experimental results on the Statlog (Australian credit approval) dataset for 3-fold cross-validation are shown in Table 12. From Table 12, it can be seen that the CBSVM approach achieves the highest F1 (0.94) and Accuracy (0.94), slightly outperforming the proposed CBDT approach. However, the proposed CBDT approach, with a CostEffect of 0.20, still has the lowest cost effect among the compared methods.

Table 12. The average experiment results compared with the other three methods

Credit-approval Dataset

There are 690 instances in the credit-approval dataset. This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because there is a good mix of attributes: continuous, nominal with small numbers of values, and nominal with larger numbers of values. There are also a few missing values. After data preprocessing, only 653 instances remained, and the categorical data were transformed into numerical data.

Firstly, we compare the proposed CBDT approach with the other five methods (DT, MLP, NB, RF, and SVM) in prediction performance. The average experimental results of 3-fold cross-validation are shown in Table 13. From Table 13, it can be seen that the proposed CBDT approach achieves a higher F1 (0.93) and Accuracy (0.94) than the other five methods. The performance of the CBDT approach and DT are similar, with CBDT slightly ahead. Note also that the CBDT approach has a lower CostEffect (0.16) than the other five methods; that is, it generates the lowest costs (such as monetary losses) among the six approaches.

Table 13. The average experiment results compared with the other five methods

Secondly, we investigate whether the two hybrid techniques (feature selection and instance selection) can improve the prediction performance of the other three methods (MLP, NB, and SVM). The average experimental results for the three new methods (CBMLP, CBNB, and CBSVM) are shown in Table 14. From Table 14, it can be seen that CBMLP, which integrates the two hybrid techniques, outperforms the original MLP in all measures (Precision, TPR, F1, Accuracy, and CostEffect). In addition, CBSVM outperforms the original SVM in Precision and CostEffect.

Table 14. The average experiment results of the new three methods

Finally, we compare the proposed CBDT approach with the new methods (CBMLP, CBNB, and CBSVM) in prediction performance. The average experimental results on the credit-approval dataset for 3-fold cross-validation are shown in Table 15. From Table 15, it can be seen that the proposed CBDT approach achieves a higher F1 (0.93) and Accuracy (0.94) than the three new methods, and its CostEffect (0.16) remains the lowest.

Table 15. The average experiment results compared with the other three methods

Comparison with Other past Studies

Several previous studies used the same public datasets (the German credit data or the Australian credit approval data) but performed only feature selection with various machine learning algorithms (Ilter, Deniz, and Kocadagli Citation2021; Jadhav, He, and Jenkins Citation2018; Nalić, Martinović, and Žagar Citation2020; Pławiak et al. Citation2020; Tripathi, Edla, and Cheruku Citation2018). Although they used identical datasets to verify performance, their results are difficult to compare directly with ours because the ideas underlying the respective models differ. In summary, the accuracy reported by past studies is between 0.75 and 0.9 on the German credit data and between 0.71 and 0.9 on the Australian credit approval data. The performance of our model is superior to that of these past studies, despite the different ideas behind the respective algorithms.

Conclusion and Future Works

This study makes several contributions. We propose a new approach that integrates feature selection, instance selection, and classifiers to build a prediction model for credit approval. Firstly, a measure (gain ratio) is used for feature selection. Secondly, a clustering method (EM) is applied to cluster the training dataset into k clusters in advance. Finally, the C4.5 algorithm is used to build k decision tree classifiers for k clusters of instances. When predicting the class labels of previously unseen records, the EM clustering method is applied to determine which cluster-based decision tree should be used to predict the class labels of previously unseen records.

Experimental results on the three credit datasets show that the proposed CBDT approach is superior to the other five methods (DT, MLP, NB, RF, and SVM) in F1, Accuracy, and CostEffect. Furthermore, the two hybrid techniques (feature selection and instance selection) were integrated with three methods (MLP, NB, and SVM) to form three new methods (CBMLP, CBNB, and CBSVM). The results show that CBSVM is superior to the original SVM in F1, Accuracy, and CostEffect. Finally, the results show that the proposed CBDT approach is superior to the three new methods (CBMLP, CBNB, and CBSVM) in F1, Accuracy, and CostEffect on two of the three datasets.

In practical use, managers in the banking and auditing industries can draw on these hybrid ideas, combining feature and instance selection, when establishing their credit approval information systems. Such systems can evaluate the trustworthiness of customers more precisely, greatly reducing the expense of bad debts. In addition, managers might rethink their data-collection strategy for credit approval/scoring, focusing on the significant features and instances to reduce managerial costs as they implement digital transformation in their industries.

Several issues remain to be addressed in the future. First, we used the gain-ratio measure to determine the attributes for clustering; other attribute selection approaches could be adopted instead. Second, we used the EM algorithm to cluster records; other clustering algorithms, such as CLIQUE, could be adopted. Finally, it would be helpful to design a more efficient algorithm for this task.

Disclosure Statement

No potential conflict of interest was reported by the author(s).

References
