Full article: Multi-class misclassification cost matrix for credit ratings in peer-to-peer lending

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Online peer-to-peer (P2P) lending is a new form of loans. Different from traditional banks, lenders provide loans to borrowers directly through P2P platforms. Since many P2P loans are unsecured personal loans, credit rating of loans is vital to control default risk and improve profit for lenders and platforms. Standard binary classifiers are inappropriate in P2P lending because there are multiple credit classes and misclassification costs vary largely across classes in P2P lending. Though there are a few works that studied cost-sensitive classifiers in P2P lending, none of them have analyzed this issue from the perspective of multi-class classifications and measured misclassification costs of different credit grades using real losses and opportunity costs. The objective of this paper is to model credit rating in P2P lending as a cost-sensitive multi-class classification problem. We proposed a misclassification cost matrix for P2P credit grading with a set of equations and models to calculate the costs. An experiment using publicly available data from Lending Club was conducted to validate the usefulness of the proposed misclassification cost matrix. The results showed that the cost-sensitive classifiers can significantly reduce the total cost, which is essential for the survival and profitability of P2P platforms.

Keywords:

1. Introduction

In the past decade, online peer-to-peer (P2P) lending, as a popular form of personal loan, has emerged in credit market. It transfers traditional way of face-to-face personal loans through online services (Bachmann et al., Citation2011). P2P lending is an electronic marketplace where individual lenders provide loans to individual borrowers. It is pervasive, convenient, efficient, and low-cost without the involvement of traditional financial institutions (Guo, Zhou, Luo, Liu, & Xiong, Citation2016).

Since the first lending platform Zopa was established in UK in February 2005, an increasing number of P2P lending platforms, such as Prosper, Smava, and Lending Club, have been developed all around the world (Ge, Feng, Gu, & Zhang, Citation2017) and accumulated data and management experiences. Comparing with traditional banking systems, P2P lending has some characteristics. First, P2P platforms facilitate transactions by connecting borrowers and lenders directly. Borrowers fill in electronic loan application forms, including amounts, terms, purposes, and personal information (such as age, job, address, and credit card). Platforms provide available financial situations and credit histories of borrowers to lenders, who will decide whether to grant a loan and an interest rate. Platforms use various approaches to help lenders set interest rates. Some platforms carry out an auction at which a borrower set her/his maximum interest rate and lenders give their bids (Galloway, Citation2009). Another approach is to assign interest rates automatically using borrowers’ credit grades, which are calculated based on borrowers’ characteristics (Collier & Hampshire, Citation2010). Generally, better credit grades are associated with lower interest rates. Second, P2P lending platforms charge service fees for transactions (Klafft, Citation2008), instead of charging borrowers higher interest rates than the cost of the money as traditional financial institutions. P2P lending process benefits both borrowers and lenders. While borrowers can borrow money at lower costs than traditional financial institutions, lenders can make more money than putting their money in banks. This benefit comes with the risk of borrowers’ defaulting on the loans because many P2P loans are unsecured personal loans and most lenders have little knowledge about credit risk management (Xia, Liu, & Liu, Citation2017).

To control default rates and risks, P2P lending platforms built classification models to evaluate credit risks of loans and borrowers and suggest appropriate interest rates for loan applications. The quality of credit classification models is vital to the credit risk management and sustainability of P2P lending platforms. Using experiences from financial institutions, P2P lending platforms adopt and develop classification algorithms to categorize borrowers into different credit grades based on their characteristics and credit history, and recognize potential borrowers who are likely to default (Lessmann, Baesens, Seow, & Thomas, Citation2015; Florez-Lopez & Ramon-Jeronimo, Citation2014; Marqués, García, & Sánchez, Citation2013).

Though it is a common practice in traditional credit rating to use standard cost-insensitive binary classification algorithms (Li, Kou, Peng, & Shi, Citation2017; Morente-Molinera, Mezei, Carlsson, & Herrera-Viedma, Citation2017), such as logistic regression, neural networks, and decision trees (Butaru et al., Citation2016; Luo, Wu, & Wu, Citation2017), they are not appropriate in P2P lending for the following reasons. First, there are more than two classes of credit grades in P2P lending and each credit grade implies a certain level of risk. Thus multi-class classification should be considered in P2P credit grading. Second, P2P loan data are imbalanced. The number of samples in different credit grades varies dramatically. For instance, the number of ideal borrowers in the best grade or high-risk borrowers in the worst grade is much smaller than the other grade groups. Third, misclassification costs are not uniform across classes in P2P lending. In general, the cost of classifying a loan with bad credit as a good one is usually greater than classifying a good one as bad (Chen, Ribeiro, & Chen, Citation2016). In a multi-class credit-grading scenario, classifying a sample of grade C into grade A is more costly than classifying B into A. Therefore, standard cost-insensitive multi-class classification, in which all errors have the same cost, is not suitable for credit rating in P2P lending.

Cost-sensitive multi-class classifiers fit well for credit rating in P2P lending. Cost-sensitive classifiers were developed for imbalanced data classification (Elkan, Citation2001; Hu et al., Citation2015; Sun, Shang, & Li, Citation2014). Various cost-sensitive classifiers have been proposed for credit rating (Bahnsen, Aouada, & Ottersten, Citation2015; Chao & Peng, Citation2018; Marqués et al., Citation2013; Sahin, Bulkan, & Duman, Citation2013). The goal of cost-sensitive classifier is to minimize total costs measured by a misclassification cost matrix (Guan, Yuan, Ma, Khattak, & Chow, Citation2017), which is not only necessary but also important for cost-sensitive classification problems.

Though there are a few works in P2P lending (Xia et al., Citation2017; Xu, Chen, & Chau, Citation2016) that studied cost-sensitive classifiers, none of them have analyzed this issue from the perspective of multi-class classifications and measured misclassification costs of different credit grades using real losses and opportunity costs associated with P2P lending. How to measure the misclassification costs of different credit grades is a useful but understudied problem. Serrano-Cinca and Gutiérrez-Nieto (Citation2016) showed that loan profitability outperformed loan default probability in P2P lending, which proved the importance of considering both interest rates and the probability of default in P2P credit scoring.

Misclassification costs are losses of lenders’ earnings due to misclassifying credit grades of loans. It equals to the difference between the return of a loan when it is correctly classified and the return of a loan when it is misclassified as other credit grade. The difference can be one of the following situations: Equation(1)(1) $E R_{i} = (1 - P D_{i}) \cdot (1 + I_{i}) + P D_{i} \cdot (1 - Lgd) = 1 + I_{i} - P D_{i} \cdot (I_{i} + Lgd)$ (1) If a loan is classified to a better credit grade with a lower interest rate, the risk to default of the loan is underestimated and the interest rate of the loan is set lower than it should be, which means that the interest maybe insufficient to cover the risk that the lender bears. The lender will lose potential returns that they could have gotten, including an unpaid risk that the borrower should pay for the higher-risk loan. Equation(2)(2) $P D^{'} {(j | i)}_{i > j} = P D_{i} + β \cdot (I_{i} - I_{j})$ (2) If a loan is classified to a worse credit grade with a higher interest rate, borrowers might be scared away or it may increase their chance to default, which causes opportunity costs and financial losses to lenders.

The objective of this paper is to propose a multi-class cost matrix that measures misclassification costs of P2P credit grading by considering real losses and opportunity costs associated with P2P lending. We developed a set of equations and models to calculate misclassification costs. The parameters in the proposed equations and models are designed to calculate the cost matrix and support P2P lending platforms’ operations. A case study using data from Lending Club is conducted to demonstrate the performances of the proposed cost matrix using several well-known cost-sensitive classifiers. The results show that the proposed cost matrix can not only reveal the sources of losses caused by misclassifications, but also reduce the total costs for real-world P2P platforms, which is better than cost-insensitive classification algorithms.

The rest of this paper is organized as follows. Section 2 reviews related works. Section 3 proposes an abstract structure of credit grades, and misclassification costs which measure real financial losses in P2P lending. Section 4 analyzes the range of parameters in the misclassification cost matrix and explains their managerial implications. Section 5 conducts an experiment using data from Lending Club. Section 6 concludes the paper with limitations and future research directions.

2. Related works

The goal of most classifiers is to maximize accuracy and minimize misclassifications. Various classification methods have been proposed for credit rating and risk management (Santana, Lanzarini, & Bariviera, Citation2018; Huang & Kou, Citation2014; Kou, Peng, & Wang, Citation2014; Lanzarini, Villa Monte, Bariviera, & Jimbo Santana, Citation2017; Peng, Wang, Kou, & Shi, Citation2011; Wu & Kou, Citation2016;). Standard classifiers treat the costs of misclassifications the same, which is not true in real credit risk management (Fiore, De Santis, Perla, Zanetti, & Palmieri, Citation2017; Tapkan, Özbakır, Kulluk, & Baykasoğlu, Citation2016). Many researches support the use of cost-sensitive classifiers in credit rating. Sahin et al. (Citation2013) proposed a cost-sensitive decision tree approach with varying misclassification costs. It is successfully used in credit card fraud detection to decrease financial losses. Alejo, García, Marqués, Sánchez, and Antonio-Velázquez (Citation2013) improved the Multilayer Perceptron neural network using three misclassification cost functions and can be used to improve the prediction effectively in credit rating. Bahnsen, Aouada, and Ottersten (Citation2014, Bahnsen et al., Citation2015) suggested example-dependent cost-sensitive methods and proposed logistic regression and decision trees for credit scoring.

Misclassification cost can be described by a cost matrix C = (c_ij)_n×n, where c_ij indicates the cost due to misclassifying an instance of class i as class j, and n is the number of classes (Domingos, Citation1999). In credit rating, the measurement of misclassification costs in C is not only a basic component of cost-sensitive classification, but also vital for high quality credit rating. Real financial indicators, like profit-based or financial loss-related measures, are well aligned with the objectives in credit rating (Maldonado, Bravo, Lopez, & Perez, Citation2017; Serrano-Cinca & Gutiérrez-Nieto, Citation2016; Verbraken, Bravo, Weber, & Baesens, Citation2014). Beling, Covaliu, and Oliver (Citation2005) set the cost of a false negative to a loan’s interest rate charged to the customer int_r, the cost of a false positive to the loss given default Lgd, and both the costs of true positive and true negative are set to zero. Following this notation, this paper regards default loans as negative instances and good loans as positive instances. shows the cost matrix (Beling et al., Citation2005).

Multi-class misclassification cost matrix for credit ratings in peer-to-peer lending

Abstract

1. Introduction

2. Related works

Table 1. Cost matrix proposed by Beling et al. (Citation2005).

Table 2. Cost matrix proposed by Hand et al. (Citation2008).

Table 3. Cost matrix proposed by Bahnsen et al. (Citation2013).

Table 4. Example-dependent cost matrix proposed by Bahnsen et al. (Citation2014).

Table 5. Cost matrix in Xia et al. (Citation2017).

3. Misclassification cost measures

3.1. Modeling risks and profits in P2P lending

Table 6. Structure of credit grades in P2P lending.

3.2. Misclassification cost matrix

Table 7. Misclassification cost matrix C for credit rating in P2P lending.

3.2.1. Lower triangular submatrix C1: Prediction (j) better than actual (i)

3.2.2. Upper triangular submatrix C2: Actual (i) better than prediction (j)

4. Parameters analysis

4.1. Loss given default: Lgd

4.2. PD’s revised coefficient: β

4.3. Borrowers’ churn rate: α

5. Experiment: A case study on lending club

5.1. Data collection

Table 8. Number of instances in lending club data.

Table 9. Structure of credit grades on lending club data.

5.2. Setting parameters and managerial implications

Table 10. Actual annualized returns provided by lending clubTable Footnote*.

Table 11. Revised PD on lending club.

5.3. Cost matrix

Table 12. Misclassification cost matrix on lending club.

5.4. Sensitivity analysis

Table 13. Cosine similarities of cost matrixes when the parameters are discrete values.

5.5. Cost-sensitive credit rating

Table 14. Confusion matrix NTable Footnote*.

Table 15. Classification results using proposed cost matrix.

6. Conclusion and discussion

Disclosure statement

Additional information

Funding

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

3.2.1. Lower triangular submatrix C¹: Prediction (j) better than actual (i)

3.2.2. Upper triangular submatrix C²: Actual (i) better than prediction (j)