1,693
Views
37
CrossRef citations to date
0
Altmetric
Articles

Mining Semantic Soft Factors for Credit Risk Evaluation in Peer-to-Peer Lending

, , &

References

  • Abellán, J.; and Castellano, J.G. A comparative study on base classifiers in ensemble methods for credit scoring. Expert Systems with Applications, 73, (May 2017), 1–10.
  • Abu-Errub, A. Arabic text classification algorithm using TFIDF and chi square measurements. International Journal of Computer Applications, 93, 6 (May 2014), 40–45.
  • Baesens, B.; Van Gestel, T.; Viaene, S.; Stepanova, M.; Suykens, J.; and Vanthienen, J. Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society, 54, 6 ( June 2003), 627–635.
  • Bertomeu, J.; and Marinovic, I. A Theory of hard and soft information. The Accounting Review, 91, 1 ( January 2016), 1–20.
  • Blei, D.M.; Ng, A.Y.; and Jordan, M.I. Latent Dirichlet allocation. Journal of Machine Learning Research, 3, (January 2003), 993–1022.
  • Bormuth, J.R. Development of Readability Analysis. Final Report, Project No. 7-0052, The University of Chicago, Chicago, Illinois, 1969.
  • Bouma, G. Normalized (pointwise) mutual information in collocation extraction. In Proceedings of German Society for Computational Linguistics, Potsdam, German. 2009, pp. 31-40.
  • Carneiro, N.; Figueira, G.; and Costa, M. A data mining based system for credit-card fraud detection in e-tail. Decision Support Systems, 95, (March 2017), 91–101.
  • Ciresan, D.C., Giusti, A., Gambardella, L.M., and Schmidhuber, J. Deep neural networks segment neuronal membranes in electron microscopy images. In Proceedings of Advances in Neural Information Processing Systems, Lake Tahoe, Nevada, United States. 2012, pp. 2843-2851.
  • Coleman, E.B. Developing a technology of written instruction: Some determiners of the complexity of prose. In Ernst Z. Rothkopf and Paul E. Johnson (Eds.), Verbal Learning Research and the Technology of Written Instruction. 1971, pp. 155-204. New York, United States: Teachers College Press.
  • Devi, C.R.D.; and Chezian, R.M. A relative evaluation of the performance of ensemble learning in credit scoring. In Proceedings of the 2016 IEEE International Conference on Advances in Computer Applications, Coimbatore, India. 2016, pp. 161–165.
  • Dorfleitner, G.; Priberny, C.; Schuster, S.; Stoiber, J.; Weber, M.; de Castro, I.; and Kammler, J. Description-text related soft information in peer-to-peer lending – Evidence from two leading European platforms. Journal of Banking & Finance, 64, (March 2016), 169–187.
  • Dumais, S.T. Latent semantic analysis. Annual Review of Information Science and Technology, 38, 1 (September 2005), 188–230.
  • Emekter, R.; Tu, Y.; Jirasakuldech, B.; and Lu, M. Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending. Applied Economics, 47, 1 ( January 2015), 54–70.
  • Fayed, H.A.; and Atiya, A.F. Speed up grid-search for parameter selection of support vector machines. Applied Soft Computing, 80, (July 2019), 202–210.
  • Finlay, S. Multiple classifier architectures and their application to credit risk assessment. European Journal of Operational Research, 210, 2 (April 2011), 368–378.
  • Gao, Q.; Lin, M.; and Sias, R.W. Words matter: The role of texts in online credit markets (September 24, 2018). Available at SSRN: http://dx.doi.org/10.2139/ssrn.2446114.
  • Ge, R.; Feng, J.; Gu, B.; and Zhang, P. Predicting and deterring default with social media information in peer-to-peer lending. Journal of Management Information Systems, 34, 2 (April 2017), 401–424.
  • Guo, Y., Zhou, W., Luo, C., Liu, C., and Xiong, H. Instance-based credit risk assessment for investment decisions in P2P lending. European Journal of Operational Research, 249, 2 (March 2016), 417–426.
  • Hand, D.J. Measuring classifier performance: A coherent alternative to the area under the ROC curve. Machine Learning, 77, 1 (October 2009), 103–123.
  • Harris, T. Quantitative credit risk assessment using support vector machines: Broad versus Narrow default definitions. Expert Systems with Applications, 40, 11 (September 2013), 4404–4413.
  • Hore, C.; Asahara, M.; and Matsumoto, Y. Automatic extraction of fixed multiword expressions. In Proceedings of the International Conference on Natural Language Processing. 2005, pp. 565-575.
  • Iyer, R.; Khwaja, A.I.; Luttmer, E.F.P.; and Shue, K. Screening peers softly: Inferring the quality of small borrowers. Management Science, 62, 6 ( June 2016), 1554–1577.
  • Jiang, C., Wang, Z., Wang, R., and Ding, Y. Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending. Annals of Operations Research, 266, 1–2 ( July 2018), 511–529.
  • Jiang, Y.; (Chad) Ho, Y.-C.; Yan, X.; and Tan, Y. Investor platform choice: Herding, platform attributes, and regulations. Journal of Management Information Systems, 35, 1 ( January 2018), 86–116.
  • Kiruthika; and Dilsha, M. A neural network approach for microfinance credit scoring. Journal of Statistics and Management Systems, 18, 1–2 ( March 2015), 121–138.
  • Larrimore, L.; Jiang, L.; Larrimore, J.; Markowitz, D.; and Gorski, S. Peer to peer lending: The relationship between language features, trustworthiness, and persuasion success. Journal of Applied Communication Research, 39, 1 (February 2011), 19–37.
  • Lessmann, S.; Baesens, B.; Seow, H.-V.; and Thomas, L.C. Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. European Journal of Operational Research, 247, 1 (November 2015), 124–136.
  • Liberti, J.M.; and Petersen, M.A. Information: Hard and soft. The Review of Corporate Finance Studies, 8, 1 (March 2019), 1–41.
  • Lin, M.; Prabhala, N.R.; and Viswanathan, S. Judging borrowers by the company they keep: Friendship networks and information asymmetry in online peer-to-peer lending. Management Science, 59, 1 ( January 2013), 17–35.
  • Manning, C.; Surdeanu, M.; Bauer, J.; Finkel, J.; Bethard, S.; and McClosky, D. The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Baltimore, Maryland, United States. 2014, pp. 55–60.
  • Martens, D.; Provost, F.; Clark, J.; and Junqué de Fortuny, E. Mining massive fine-grained behavior data to improve predictive analytics. MIS Quarterly, 40, 4 ( April 2016), 869–888.
  • Mikolov, T.; Sutskever, I.; Chen, K.; Corrado, G.; and Dean, J. Distributed representations ofwords and phrases and their compositionality. In Proceedings of Advances in Neural Information Processing Systems, Lake Tahoe, Nevada, United States. 2013, pp. 3111-3119.
  • Mikolov, T.; Yih, W.T.; and Zweig, G. Linguistic regularities in continuous spaceword representations. In Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Atlanta, Georgia, United States. 2013, pp. 746–751.
  • Oreski, S.; Oreski, D.; and Oreski, G. Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment. Expert Systems with Applications, 39, 16 (November 2012), 12605–12617.
  • Pennington, J.; Socher, R.; and Manning, C. Glove: Global Vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, Doha, Qatar. 2014, pp. 1532–1543.
  • Pötzsch, S.; and Böhme, R. The role of soft information in trust building: Evidence from online social lending. In Proceedings of the International Conference on Trust and Trustworthy Computing, Berlin, Germany. 2010, pp. 381-395.
  • Senter, R.J.; and Smith, E.A. Automated Readability Index. Technical Report AMRL-TR-66-220, University of Cincinnati, Cincinnati, Ohio, 1967.
  • Shmueli, G. To explain or to predict? Statistical Science, 25, 3 (August 2010), 289–310.
  • Sievert, C.; and Shirley, K. LDAvis: A method for visualizing and interpreting topics. In Proceedings of the Workshop on Interactive Language Learning, Visualization, and Interfaces, Baltimore, Maryland, United States. 2014, pp. 63-70.
  • Stepanova, M.; and Thomas, L. Survival analysis methods for personal loan data. Operations Research, 50, 2 (April 2002), 277–289.
  • Thomas, Lyn C. Consumer Credit Models: Pricing, Profit and portfolios: Pricing, Profit and Portfolios. New York, United States: Oxford University Press, 2009.
  • Tibshirani, R. Regression shrinkage and selection via the lasso: A retrospective. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73, 3 (June 2011), 273–282.
  • Tong, E.N.C.; Mues, C.; and Thomas, L.C. Mixture cure models in credit scoring: If and when borrowers default. European Journal of Operational Research, 218, 1 (April 2012), 132–139.
  • Wang, G.; Ma, J.; Huang, L.; and Xu, K. Two credit scoring models based on dual strategy ensemble trees. Knowledge-Based Systems, 26, (February 2012), 61–68.
  • Wang, S.; Qi, Y.; Fu, B.; and Liu, H. Credit risk evaluation based on text analysis. International Journal of Cognitive Informatics and Natural Intelligence, 10, 1 ( January 2016), 1–11.
  • Wei, Z.; and Lin, M. Market mechanisms in online peer-to-peer lending. Management Science, 63, 12 ( December 2017), 4236–4257.
  • Yao, X.; Crook, J.; and Andreeva, G. Support vector regression for loss given default modelling. European Journal of Operational Research, 240, 2 (January 2015), 528–538.
  • Zhang, D.; Leung, S.C.H.; and Ye, Z. A decision tree scoring model based on genetic algorithm and K-means algorithm. In Proceedings of the Third International Conference on Convergence and Hybrid Information Technology, Busan, South Korea, 2008, pp. 1043–1047.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.