References
- Abd-Almageed, W., A. El-Osery, and C. E. Smith (2003, October). Non-parametric expectation maximization: A learning automata approach. In Systems, Man and Cybernetics, 2003. IEEE International Conference on (Vol. 3, pp. 2996–3001). IEEE, Washington DC.
- Bradley, P. S., U. Fayyad, and C. Reina (1998). Scaling EM (expectation-maximization) clustering to large databases (pp. 9-15). Redmond: Technical Report MSR-TR-98-35, Microsoft Research.
- Bradley, P. S., and U. M. Fayyad. July 1998. Refining Initial points for K-means clustering. ICML 98:91–99.
- Breiman, L., J. Friedman, C. J. Stone, and R. A. Olshen. 1984. Classification and regression trees. Belmont, CA, Wadsworth: CRC Press.
- Catal, C., and B. Diri. 2009. Investigating the effect of dataset size, metrics sets, and feature selection techniques on software fault prediction problem. Information Sciences 179 (8):1040–58. doi:https://doi.org/10.1016/j.ins.2008.12.001.
- Chang, S. Y., and T. Y. Yeh. 2012. An artificial immune classifier for credit scoring analysis. Applied Soft Computing 12 (2):611–18. doi:https://doi.org/10.1016/j.asoc.2011.11.002.
- Chen, C. C., and S. T. Li. 2014. Credit rating with a monotonicity-constrained support vector machine model. Expert Systems with Applications 41 (16):7235–47. doi:https://doi.org/10.1016/j.eswa.2014.05.035.
- Chen, F. L., and F. C. Li. 2010. Combination of feature selection approaches with SVM in credit scoring. Expert Systems with Applications 37 (7):4902–09. doi:https://doi.org/10.1016/j.eswa.2009.12.025.
- Dastile, X., T. Celik, and M. Potsane. 2020. Statistical and machine learning models in credit scoring: A systematic literature survey. Applied Soft Computing 91:106263. doi:https://doi.org/10.1016/j.asoc.2020.106263.
- Dempster, A. P., N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological) 39 (1):1–38. doi:https://doi.org/10.1111/j.2517-6161.1977.tb01600.x.
- Dua, D., and C. Graff (2019). UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science. http://archive.ics.uci.edu/ml
- Estivill-Castro, V., and J. Yang. 2004. Fast and robust general purpose clustering algorithms. Data Mining and Knowledge Discovery 8 (2):127–50. doi:https://doi.org/10.1023/B:DAMI.0000015869.08323.b3.
- Gehrke, J., V. Ganti, R. Ramakrishnan, and W. Y. Loh. June 1999. BOAT—optimistic decision tree construction. ACM SIGMOD Record. Vol. 28, No. 2, pp. 169–180, Philadelphia, PA: ACM.
- Guotai, C., M. Z. Abedin, and F. E. Moula. 2017. Modeling credit approval data with neural networks: An experimental investigation and optimization. Journal of Business Economics and Management 18 (2):224–40. doi:https://doi.org/10.3846/16111699.2017.1280844.
- Hájek, P. 2011. Municipal credit rating modelling by neural networks. Decision Support Systems 51 (1):108–18. doi:https://doi.org/10.1016/j.dss.2010.11.033.
- Han, J., M. Kamber, and J. Pei. 2006. Data mining, Southeast Asia edition: Concepts and techniques. San Francisco: Morgan Kaufmann.
- Hand, D. J., and W. E. Henley. 1997. Statistical classification methods in consumer credit scoring: A review. Journal of the Royal Statistical Society: Series A (Statistics in Society) 160 (3):523–41. doi:https://doi.org/10.1111/j.1467-985X.1997.00078.x.
- Hsieh, N. C. 2005. Hybrid mining approach in the design of credit scoring models. Expert Systems with Applications 28 (4):655–65. doi:https://doi.org/10.1016/j.eswa.2004.12.022.
- Hsieh, N. C., and L. P. Hung. 2010. A data driven ensemble classifier for credit scoring analysis. Expert Systems with Applications 37 (1):534–45. doi:https://doi.org/10.1016/j.eswa.2009.05.059.
- Huang, C. L., M. C. Chen, and C. J. Wang. 2007. Credit scoring with a data mining approach based on support vector machines. Expert Systems with Applications 33 (4):847–56. doi:https://doi.org/10.1016/j.eswa.2006.07.007.
- Huang, Z., H. Chen, C. J. Hsu, W. H. Chen, and S. Wu. 2004. Credit rating analysis with support vector machines and neural networks: A market comparative study. Decision Support Systems 37 (4):543–58. doi:https://doi.org/10.1016/S0167-9236(03)00086-1.
- Huysmans, J., B. Baesens, J. Vanthienen, and T. Van Gestel. 2006. Failure prediction with self-organizing maps. Expert Systems with Applications 30 (3):479–87. doi:https://doi.org/10.1016/j.eswa.2005.10.005.
- Ilter, D., E. Deniz, and O. Kocadagli. 2021. Hybridized artificial neural network classifiers with a novel feature selection procedure based genetic algorithms and information complexity in credit scoring. Applied Stochastic Models in Business and Industry 37 (2):203–28. doi:https://doi.org/10.1002/asmb.2614.
- Inyaem, U., and S. Chuaytem. 2020. Machine learning apply for financial credit approval to filter selected customer in domain specific bank. Science and Technology RMUTT Journal 10:1.
- Jadhav, S., H. He, and K. Jenkins. 2018. Information gain directed genetic algorithm wrapper feature selection for credit rating. Applied Soft Computing 69:541–53. doi:https://doi.org/10.1016/j.asoc.2018.04.033.
- Kim, K. J., and H. Ahn. 2012. A corporate credit rating model using multi-class support vector machines with an ordinal pairwise partitioning approach. Computers & Operations Research 39 (8):1800–11. doi:https://doi.org/10.1016/j.cor.2011.06.023.
- Koutanaei, F. N., H. Sajedi, and M. Khanbabaei. 2015. A hybrid data mining model of feature selection algorithms and ensemble learning classifiers for credit scoring. Journal of Retailing and Consumer Services 27:11–23. doi:https://doi.org/10.1016/j.jretconser.2015.07.003.
- Lee, M. C. 2009. Using support vector machine with a hybrid feature selection method to the stock trend prediction. Expert Systems with Applications 36 (8):10896–904. doi:https://doi.org/10.1016/j.eswa.2009.02.038.
- Lee, T. S., C. C. Chiu, C. J. Lu, and I. F. Chen. 2002. Credit scoring using the hybrid neural discriminant technique. Expert Systems with Applications 23 (3):245–54. doi:https://doi.org/10.1016/S0957-4174(02)00044-1.
- Levin, M. S. 2015. Combinatorial clustering: Literature review, methods, examples. Journal of Communications Technology and Electronics 60 (12):1403–28. doi:https://doi.org/10.1134/S1064226915120177.
- Lim, T. S., W. Y. Loh, and Y. S. Shih (1998). An empirical comparison of decision trees and other classification methods.
- Looney, C. G. 2002. Interactive clustering and merging with a new fuzzy expected value. Pattern Recognition 35 (11):2413–23. doi:https://doi.org/10.1016/S0031-3203(01)00213-8.
- Luo, S. T., B. W. Cheng, and C. H. Hsieh. 2009. Prediction model building with clustering-launched classification and support vector machines in credit scoring. Expert Systems with Applications 36 (4):7562–66. doi:https://doi.org/10.1016/j.eswa.2008.09.028.
- MacQueen, J. (1967, June). Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability (Vol. 1, No. 14, pp. 281–297), University of California Press, Berkeley, USA.
- Malhotra, R., and D. K. Malhotra. 2002. Differentiating between good credits and bad credits using neuro-fuzzy systems. European Journal of Operational Research 136 (1):190–211. doi:https://doi.org/10.1016/S0377-2217(01)00052-2.
- Mehta, M., R. Agrawal, and J. Rissanen. 1996. SLIQ: A fast scalable classifier for data mining. In Advances in Database Technology—EDBT’96, 18–32. Avignon, France: Springer Berlin Heidelberg.
- Meila, M., and D. Heckerman. 1998. An experimental comparison of several clustering and initialization methods. arXiv preprint. arXiv:1301.7401.
- Muniyandi, A. P., R. Rajeswari, and R. Rajaram. 2012. Network anomaly detection by cascading k-Means clustering and C4.5 decision tree algorithm. Procedia Engineering 30:174–82. doi:https://doi.org/10.1016/j.proeng.2012.01.849.
- Nalić, J., G. Martinović, and D. Žagar. 2020. New hybrid data mining model for credit scoring based on feature selection algorithm and ensemble classifiers. Advanced Engineering Informatics 45:101130. doi:https://doi.org/10.1016/j.aei.2020.101130.
- Nasser, S., R. Alkhaldi, and G. Vert (2006, July). A modified fuzzy k-means clustering using expectation maximization. In Fuzzy Systems, 2006 IEEE International Conference on (pp. 231–35). IEEE, Vancouver, BC, Canada.
- Ngai, E. W. T., L. Xiu, and D. C. K. Chau. 2009. Application of data mining techniques in customer relationship management: A literature review and classification. Expert Systems with Applications 36 (2):2592–602. doi:https://doi.org/10.1016/j.eswa.2008.02.021.
- Ngai, E. W. T., Y. Hu, Y. H. Wong, Y. Chen, and X. Sun. 2011. The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature. Decision Support Systems 50 (3):559–69. doi:https://doi.org/10.1016/j.dss.2010.08.006.
- Orlova, E. V. 2021. Methodology and models for individuals’ creditworthiness management using digital footprint data and machine learning methods. Mathematics 9 (15):1820. doi:https://doi.org/10.3390/math9151820.
- Ping, Y., and L. Yongheng. 2011. Neighborhood rough set and SVM based hybrid credit scoring classifier. Expert Systems with Applications 38 (9):11300–04. doi:https://doi.org/10.1016/j.eswa.2011.02.179.
- Pławiak, P., M. Abdar, J. Pławiak, V. Makarenkov, and U. R. Acharya. 2020. DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring. Information Sciences 516:401–18. doi:https://doi.org/10.1016/j.ins.2019.12.045.
- Pristyanto, Y., S. Adi, and A. Sunyoto (2019, July). The effect of feature selection on classification algorithms in credit approval. In 2019 International Conference on Information and Communications Technology (ICOIACT) (pp. 451–56). IEEE, Yogyakarta, Indonesia.
- Pushpalatha, D., and S. Rajalakshmi. 2018. Comparative analysis of machine learning and attribute selection techniques for credit approval data. International Journal of Pure and Applied Mathematics 118 (20):305–11.
- Quinlan, J. R. 1986. Induction of decision trees. Machine Learning 1 (1):81–106. doi:https://doi.org/10.1007/BF00116251.
- Quinlan, J. R. 1987. Simplifying decision trees. International Journal of Man-machine Studies 27 (3):221–34. doi:https://doi.org/10.1016/S0020-7373(87)80053-6.
- Reddy, B. G. O., and M. Ussenaiah. 2012. Literature survey on clustering techniques. IOSR Journal of Computer Engineering 3 (1):1–12. doi:https://doi.org/10.9790/0661-0310112.
- Saeys, Y., I. Inza, and P. Larrañaga. 2007. A review of feature selection techniques in bioinformatics. Bioinformatics 23 (19):2507–17. doi:https://doi.org/10.1093/bioinformatics/btm344.
- Selim, S. Z., and M. A. Ismail. 1984. K-means-type algorithms: A generalized convergence theorem and characterization of local optimality. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-6 (1):81–87. doi:https://doi.org/10.1109/TPAMI.1984.4767478.
- Shafer, J., R. Agrawal, and M. Mehta (1996, September). SPRINT: A scalable parallel classifier for data mining. In Proc. 1996 Int. Conf. Very Large Data Bases (pp. 544–55), Bombay, India.
- Sun, J., and H. Li. 2011. Dynamic financial distress prediction using instance selection for the disposal of concept drift. Expert Systems with Applications 38 (3):2566–76. doi:https://doi.org/10.1016/j.eswa.2010.08.046.
- Tan, P. N., M. Steinbach, and V. Kumar. 2006. Introduction to data mining (Vol. 1). Boston: Pearson Addison Wesley.
- Tripathi, D., D. R. Edla, and R. Cheruku. 2018. Hybrid credit scoring model using neighborhood rough set and multi-layer ensemble classification. Journal of Intelligent & Fuzzy Systems 34 (3):1543–49. doi:https://doi.org/10.3233/JIFS-169449.
- Tsai, C. F. 2009. Feature selection in bankruptcy prediction. Knowledge-Based Systems 22 (2):120–27. doi:https://doi.org/10.1016/j.knosys.2008.08.002.
- Tsai, C.-F. 2014. Combining cluster analysis with classifier ensembles to predict financial distress. Information Fusion 16:46–58. doi:https://doi.org/10.1016/j.inffus.2011.12.001.
- Tsai, C.-F., and K.-C. Cheng. 2012. Simple instance selection for bankruptcy prediction. Knowledge-Based Systems 27:333–42. doi:https://doi.org/10.1016/j.knosys.2011.09.017.
- Tsai, C.-F., and Y.-C. Hsiao. 2010. Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches. Decision Support Systems 50 (1):258–69. doi:https://doi.org/10.1016/j.dss.2010.08.028.
- Wang, G., J. Hao, J. Ma, and H. Jiang. 2011. A comparative assessment of ensemble learning for credit scoring. Expert Systems with Applications 38 (1):223–30. doi:https://doi.org/10.1016/j.eswa.2010.06.048.
- Wang, G., and J. Ma. 2012. A hybrid ensemble approach for enterprise credit risk assessment based on support vector machine. Expert Systems with Applications 39 (5):5325–31. doi:https://doi.org/10.1016/j.eswa.2011.11.003.
- Xu, D., and Y. Tian. 2015. A comprehensive survey of clustering algorithms. Annals of Data Science 2 (2):165–93. doi:https://doi.org/10.1007/s40745-015-0040-1.
- Yeh, C.-C., F. Lin, and C.-Y. Hsu. 2012. A hybrid KMV model, random forests and rough set theory approach for credit rating. Knowledge-Based Systems 33:166–72. doi:https://doi.org/10.1016/j.knosys.2012.04.004.
- Zhang, W., D. Yang, S. Zhang, J. H. Ablanedo-Rosas, X. Wu, and Y. Lou. 2021. A novel multi-stage ensemble model with enhanced outlier adaptation for credit scoring. Expert Systems with Applications 165:113872. doi:https://doi.org/10.1016/j.eswa.2020.113872.
- Zhao, Z., S. Xu, B. H. Kang, M. M. J. Kabir, Y. Liu, and R. Wasinger. 2015. Investigation and improvement of multi-layer perceptron neural networks for credit scoring. Expert Systems with Applications 42 (7):3508–16. doi:https://doi.org/10.1016/j.eswa.2014.12.006.
- Zhong, H., C. Miao, Z. Shen, and Y. Feng. 2014. Comparing the learning effectiveness of BP, ELM, I-ELM, and SVM for corporate credit ratings. Neurocomputing 128:285–95. doi:https://doi.org/10.1016/j.neucom.2013.02.054.