Search in:

Applied Artificial Intelligence

An International Journal

Volume 35, 2021 - Issue 15

Submit an article Journal homepage

Free access

2,814

Views

CrossRef citations to date

Altmetric

Research Article

A Hybrid Machine Learning Model for Credit Approval

Cheng-Hsiung Wenga Department of Artificial Intelligence and Health Management, Central Taiwan University of Science and Technology, Taichung, Taiwan, Republic of China;b Department of Information Management, National Chin-Yi University of Technology, Taichung, Taiwan, Republic of China

Cheng-Kui Huangc Department of Business Administration, National Chung Cheng University, Taiwan, Republic of ChinaCorrespondence[email protected]

https://orcid.org/0000-0001-8994-3598

Pages 1439-1465 | Received 28 Dec 2020, Accepted 13 Sep 2021, Published online: 12 Oct 2021

Cite this article
https://doi.org/10.1080/08839514.2021.1982475
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF View EPUB EPUB

References

Abd-Almageed, W., A. El-Osery, and C. E. Smith (2003, October). Non-parametric expectation maximization: A learning automata approach. In Systems, Man and Cybernetics, 2003. IEEE International Conference on (Vol. 3, pp. 2996–1465). IEEE, Washington DC.
Google Scholar
Bradley, P. S., U. Fayyad, and C. Reina (1998). Scaling EM (expectation-maximization) clustering to large databases (pp. 9-15). Redmond: Technical Report MSR-TR-98-35, Microsoft Research.
Google Scholar
Bradley, P. S., and U. M. Fayyad. July 1998. Refining Initial points for K-means clustering. ICML 98:91–99.
Google Scholar
Breiman, L., J. Friedman, C. J. Stone, and R. A. Olshen. 1984. Classification and regression trees. Belmont, CA, Wadsworth: CRC press.
Google Scholar
Catal, C., and B. Diri. 2009. Investigating the effect of dataset size, metrics sets, and feature selection techniques on software fault prediction problem. Information Sciences 179 (8):1040–58. doi:https://doi.org/10.1016/j.ins.2008.12.001.
Web of Science ®Google Scholar
Chang, S. Y., and T. Y. Yeh. 2012. An artificial immune classifier for credit scoring analysis. Applied Soft Computing 12 (2):611–18. doi:https://doi.org/10.1016/j.asoc.2011.11.002.
Web of Science ®Google Scholar
Chen, C. C., and S. T. Li. 2014. Credit rating with a monotonicity-constrained support vector machine model. Expert Systems with Applications 41 (16):7235–47. doi:https://doi.org/10.1016/j.eswa.2014.05.035.
Web of Science ®Google Scholar
Chen, F. L., and F. C. Li. 2010. Combination of feature selection approaches with SVM in credit scoring. Expert Systems with Applications 37 (7):4902–09. doi:https://doi.org/10.1016/j.eswa.2009.12.025.
Web of Science ®Google Scholar
Dastile, X., T. Celik, and M. Potsane. 2020. Statistical and machine learning models in credit scoring: A systematic literature survey. Applied Soft Computing 91:106263. doi:https://doi.org/10.1016/j.asoc.2020.106263.
Web of Science ®Google Scholar
Dempster, A. P., N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological) 39 (1):1–38. doi:https://doi.org/10.1111/j.2517-6161.1977.tb01600.x.
Web of Science ®Google Scholar
Dua, D. and Graff, C. (2019). UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science. http://archive.ics.uci.edu/ml
Google Scholar
Estivill-Castro, V., and J. Yang. 2004. Fast and robust general purpose clustering algorithms. Data Mining and Knowledge Discovery 8 (2):127–50. doi:https://doi.org/10.1023/B:DAMI.0000015869.08323.b3.
Web of Science ®Google Scholar
Gehrke, J., V. Ganti, R. Ramakrishnan, and W. Y. Loh. 1999 June. BOAT—optimistic decision tree construction. ACM SIGMOD Record. Vol. 28, No. 2, pp. 169-180, Philadelphia, PA: ACM.
Google Scholar
Guotai, C., M. Z. Abedin, and F. E. Moula. 2017. Modeling credit approval data with neural networks: An experimental investigation and optimization. Journal of Business Economics and Management 18 (2):224–40. doi:https://doi.org/10.3846/16111699.2017.1280844.
Web of Science ®Google Scholar
Hájek, P. 2011. Municipal credit rating modelling by neural networks. Decision Support Systems 51 (1):108–18. doi:https://doi.org/10.1016/j.dss.2010.11.033.
Web of Science ®Google Scholar
Han, J., M. Kamber, and J. Pei. 2006. Data mining, southeast asia edition: Concepts and techniques. Morgan kaufmann, San Francisco.
Google Scholar
Hand, D. J., and W. E. Henley. 1997. Statistical classification methods in consumer credit scoring: A review. Journal of the Royal Statistical Society: Series A (Statistics in Society) 160 (3):523–41. doi:https://doi.org/10.1111/j.1467-985X.1997.00078.x.
Google Scholar
Hsieh, N. C. 2005. Hybrid mining approach in the design of credit scoring models. Expert Systems with Applications 28 (4):655–65. doi:https://doi.org/10.1016/j.eswa.2004.12.022.
Web of Science ®Google Scholar
Hsieh, N. C., and L. P. Hung. 2010. A data driven ensemble classifier for credit scoring analysis. Expert Systems with Applications 37 (1):534–45. doi:https://doi.org/10.1016/j.eswa.2009.05.059.
Web of Science ®Google Scholar
Huang, C. L., M. C. Chen, and C. J. Wang. 2007. Credit scoring with a data mining approach based on support vector machines. Expert Systems with Applications 33 (4):847–56. doi:https://doi.org/10.1016/j.eswa.2006.07.007.
Web of Science ®Google Scholar
Huang, Z., H. Chen, C. J. Hsu, W. H. Chen, and S. Wu. 2004. Credit rating analysis with support vector machines and neural networks: A market comparative study. Decision Support Systems 37 (4):543–58. doi:https://doi.org/10.1016/S0167-9236(03)00086-1.
Web of Science ®Google Scholar
Huysmans, J., B. Baesens, J. Vanthienen, and T. Van Gestel. 2006. Failure prediction with self-organizing maps. Expert Systems with Applications 30 (3):479–87. doi:https://doi.org/10.1016/j.eswa.2005.10.005.
Web of Science ®Google Scholar
Ilter, D., E. Deniz, and O. Kocadagli. 2021. Hybridized artificial neural network classifiers with a novel feature selection procedure based genetic algorithms and information complexity in credit scoring. Applied Stochastic Models in Business and Industry 37 (2):203–28. doi:https://doi.org/10.1002/asmb.2614.
Google Scholar
Inyaem, U., and S. Chuaytem. 2020. Machine learning apply for financial credit approval to filter selected customer in domain specific bank. Science and Technology RMUTT Journal 10:1.
Google Scholar
Jadhav, S., H. He, and K. Jenkins. 2018. Information gain directed genetic algorithm wrapper feature selection for credit rating. Applied Soft Computing 69:541–53. doi:https://doi.org/10.1016/j.asoc.2018.04.033.
Web of Science ®Google Scholar
Kim, K. J., and H. Ahn. 2012. A corporate credit rating model using multi-class support vector machines with an ordinal pairwise partitioning approach. Computers & Operations Research 39 (8):1800–11. doi:https://doi.org/10.1016/j.cor.2011.06.023.
Web of Science ®Google Scholar
Koutanaei, F. N., H. Sajedi, and M. Khanbabaei. 2015. A hybrid data mining model of feature selection algorithms and ensemble learning classifiers for credit scoring. Journal of Retailing and Consumer Services 27:11–23. doi:https://doi.org/10.1016/j.jretconser.2015.07.003.
Web of Science ®Google Scholar
Lee, M. C. 2009. Using support vector machine with a hybrid feature selection method to the stock trend prediction. Expert Systems with Applications 36 (8):10896–904. doi:https://doi.org/10.1016/j.eswa.2009.02.038.
Web of Science ®Google Scholar
Lee, T. S., C. C. Chiu, C. J. Lu, and I. F. Chen. 2002. Credit scoring using the hybrid neural discriminant technique. Expert Systems with Applications 23 (3):245–54. doi:https://doi.org/10.1016/S0957-4174(02)00044-1.
Web of Science ®Google Scholar
Levin, M. S. 2015. Combinatorial clustering: Literature review, methods, examples. Journal of Communications Technology and Electronics 60 (12):1403–28. doi:https://doi.org/10.1134/S1064226915120177.
Web of Science ®Google Scholar
Lim, T. S., W. Y. Loh, and Y. S. Shih (1998). An empirical comparison of decision trees and other classification methods.
Google Scholar
Looney, C. G. 2002. Interactive clustering and merging with a new fuzzy expected value. Pattern Recognition 35 (11):2413–23. doi:https://doi.org/10.1016/S0031-3203(01)00213-8.
Web of Science ®Google Scholar
Luo, S. T., B. W. Cheng, and C. H. Hsieh. 2009. Prediction model building with clustering-launched classification and support vector machines in credit scoring. Expert Systems with Applications 36 (4):7562–66. doi:https://doi.org/10.1016/j.eswa.2008.09.028.
Web of Science ®Google Scholar
MacQueen, J. (1967, June). Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability ( Vol. 1, No. 14, pp. 281-297), University of California Press, Berkeley, USA.
Google Scholar
Malhotra, R., and D. K. Malhotra. 2002. Differentiating between good credits and bad credits using neuro-fuzzy systems. European Journal of Operational Research 136 (1):190–211. doi:https://doi.org/10.1016/S0377-2217(01)00052-2.
Web of Science ®Google Scholar
Mehta, M., R. Agrawal, and J. Rissanen. 1996. SLIQ: A fast scalable classifier for data mining. In Advances in Database Technology—EDBT’96, 18–32. Avignon, France: Springer Berlin Heidelberg.
Google Scholar
Meila, M., and D. Heckerman. 1998. An experimental comparison of several clustering and initialization methods. arXiv preprint. arXiv:1301.7401.
Google Scholar
Muniyandi, A. P., R. Rajeswari, and R. Rajaram. 2012. Network anomaly detection by cascading k-Means clustering and C4. 5 decision tree algorithm. Procedia Engineering 30:174–82. doi:https://doi.org/10.1016/j.proeng.2012.01.849.
Google Scholar
Nalić, J., G. Martinović, and D. Žagar. 2020. New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers. Advanced Engineering Informatics 45:101130. doi:https://doi.org/10.1016/j.aei.2020.101130.
Web of Science ®Google Scholar
Nasser, S., R. Alkhaldi, and G. Vert (2006, July). A modified fuzzy k-means clustering using expectation maximization. In Fuzzy Systems, 2006 IEEE International Conference on (pp. 231–35). IEEE, Vancouver,BC, Canada.
Google Scholar
Ngai, E. W. T., L. Xiu, and D. C. K. Chau. 2009. Application of data mining techniques in customer relationship management: A literature review and classification. Expert Systems with Applications 36 (2):2592–602. doi:https://doi.org/10.1016/j.eswa.2008.02.021.
Web of Science ®Google Scholar
Ngai, E. W. T., Y. Hu, Y. H. Wong, Y. Chen, and X. Sun. 2011. The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature. Decision Support Systems 50 (3):559–69. doi:https://doi.org/10.1016/j.dss.2010.08.006.
Web of Science ®Google Scholar
Orlova, E. V. 2021. Methodology and models fo: r individuals’ creditworthiness management using digital footprint data and machine learning methods. Mathematics 9 (15):1820. doi:https://doi.org/10.3390/math9151820.
Web of Science ®Google Scholar
Ping, Y., and L. Yongheng. 2011. Neighborhood rough set and SVM based hybrid credit scoring classifier. Expert Systems with Applications 38 (9):11300–04. doi:https://doi.org/10.1016/j.eswa.2011.02.179.
Web of Science ®Google Scholar
Pławiak, P., M. Abdar, J. Pławiak, V. Makarenkov, and U. R. Acharya. 2020. DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring. Information Sciences 516:401–18. doi:https://doi.org/10.1016/j.ins.2019.12.045.
Web of Science ®Google Scholar
Pristyanto, Y., S. Adi, and A. Sunyoto (2019, July). The effect of feature selection on classification algorithms in credit approval. In 2019 International Conference on Information and Communications Technology (ICOIACT) (pp. 451–56). IEEE, Yogyakarta, Indonesia.
Google Scholar
Pushpalatha, D., and S. Rajalakshmi. 2018. Comparative analysis of machine learning and attribute selection techniques for credit approval data. International Journal of Pure and Applied Mathematics 118 (20):305–11.
Google Scholar
Quinlan, J. R. 1986. Induction of decision trees. Machine Learning 1 (1):81–106. doi:https://doi.org/10.1007/BF00116251.
Google Scholar
Quinlan, J. R. 1987. Simplifying decision trees. International Journal of Man-machine Studies 27 (3):221–34. doi:https://doi.org/10.1016/S0020-7373(87)80053-6.
Google Scholar
Reddy, B. G. O., and M. Ussenaiah. 2012. Literature survey on clustering techniques. IOSR Journal of Computer Engineering 3 (1):1–12. doi:https://doi.org/10.9790/0661-0310112.
Google Scholar
Saeys, Y., I. Inza, and P. Larrañaga. 2007. A review of feature selection techniques in bioinformatics. bioinformatics 23 (19):2507–17. doi:https://doi.org/10.1093/bioinformatics/btm344.
PubMed Web of Science ®Google Scholar
Selim, S. Z., and M. A. Ismail. 1984. K-means-type algorithms: A generalized convergence theorem and characterization of local optimality. Pattern Analysis and Machine Intelligence, IEEE Transactions On (1):81–87. doi:https://doi.org/10.1109/TPAMI.1984.4767478.
Google Scholar
Shafer, J., R. Agrawal, and M. Mehta (1996, September). SPRINT: A scalable parallel classi er for data mining. In Proc. 1996 Int. Conf. Very Large Data Bases (pp. 544–55), Bombay, India.
Google Scholar
Sun, J., and H. Li. 2011. Dynamic financial distress prediction using instance selection for the disposal of concept drift. Expert Systems with Applications 38 (3):2566–76. doi:https://doi.org/10.1016/j.eswa.2010.08.046.
Web of Science ®Google Scholar
Tan, P. N., M. Steinbach, and V. Kumar. 2006. Introduction to data mining(Vol. 1). Boston: Pearson Addison Wesley.
Google Scholar
Tripathi, D., D. R. Edla, and R. Cheruku. 2018. Hybrid credit scoring model using neighborhood rough set and multi-layer ensemble classification. Journal of Intelligent & Fuzzy Systems 34 (3):1543–49. doi:https://doi.org/10.3233/JIFS-169449.
Web of Science ®Google Scholar
Tsai, C. F. 2009. Feature selection in bankruptcy prediction. Knowledge-Based Systems 22 (2):120–27. doi:https://doi.org/10.1016/j.knosys.2008.08.002.
Web of Science ®Google Scholar
Tsai, C.-F. 2014. Combining cluster analysis with classifier ensembles to predict financial distress. Information Fusion 16:46–58. doi:https://doi.org/10.1016/j.inffus.2011.12.001.
Web of Science ®Google Scholar
Tsai, C.-F., and K.-C. Cheng. 2012. Simple instance selection for bankruptcy prediction. Knowledge-Based Systems 27:333–42. doi:https://doi.org/10.1016/j.knosys.2011.09.017.
Web of Science ®Google Scholar
Tsai, C.-F., and Y.-C. Hsiao. 2010. Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches. Decision Support Systems 50 (1):258–69. doi:https://doi.org/10.1016/j.dss.2010.08.028.
Web of Science ®Google Scholar
Wang, G., J. Hao, J. Ma, and H. Jiang. 2011. A comparative assessment of ensemble learning for credit scoring. Expert Systems with Applications 38 (1):223–30. doi:https://doi.org/10.1016/j.eswa.2010.06.048.
Web of Science ®Google Scholar
Wang, G., and J. Ma. 2012. A hybrid ensemble approach for enterprise credit risk assessment based on support vector machine. Expert Systems with Applications 39 (5):5325–31. doi:https://doi.org/10.1016/j.eswa.2011.11.003.
Web of Science ®Google Scholar
Xu, D., and Y. Tian. 2015. A comprehensive survey of clustering algorithms. Annals of Data Science 2 (2):165–93. doi:https://doi.org/10.1007/s40745-015-0040-1.
Google Scholar
Yeh, -C.-C., F. Lin, and C.-Y. Hsu. 2012. A hybrid KMV model, random forests and rough set theory approach for credit rating. Knowledge-Based Systems 33:166–72. doi:https://doi.org/10.1016/j.knosys.2012.04.004.
Web of Science ®Google Scholar
Zhang, W., D. Yang, S. Zhang, J. H. Ablanedo-Rosas, X. Wu, and Y. Lou. 2021. A novel multi-stage ensemble model with enhanced outlier adaptation for credit scoring. Expert Systems with Applications 165:113872. doi:https://doi.org/10.1016/j.eswa.2020.113872.
Web of Science ®Google Scholar
Zhao, Z., S. Xu, B. H. Kang, M. M. J. Kabir, Y. Liu, and R. Wasinger. 2015. Investigation and improvement of multi-layer perceptron neural networks for credit scoring. Expert Systems with Applications 42 (7):3508–16. doi:https://doi.org/10.1016/j.eswa.2014.12.006.
Web of Science ®Google Scholar
Zhong, H., C. Miao, Z. Shen, and Y. Feng. 2014. Comparing the learning effectiveness of BP, ELM, I-ELM, and SVM for corporate credit ratings. Neurocomputing 128:285–95. doi:https://doi.org/10.1016/j.neucom.2013.02.054.
Web of Science ®Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

A Hybrid Machine Learning Model for Credit Approval

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

A Hybrid Machine Learning Model for Credit Approval

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date