2,774
Views
0
CrossRef citations to date
0
Altmetric
RESEARCH ARTICLE

Designing a data mining process for the financial services domain

, , &
Pages 140-166 | Received 13 Jun 2021, Accepted 01 Jun 2022, Published online: 07 Jul 2022

References

  • Adam, I. O. (2014). The ontological, epistemological and methodological debates in information systems research: A partial review. Epistemological and Methodological Debates in Information Systems Research: A Partial Review (March 2014). https://dx.doi.org/10.2139/ssrn.2411620.
  • Adner, R. (2017). Ecosystem as structure: An actionable construct for strategy. Journal of Management, 43(1), 39–58. https://doi.org/10.1177/0149206316678451
  • Alturki, A., Gable, G. G., Bandara, W., & Gregor, S. (2012). Validating the design science research roadmap: Through the lens of ”the idealised model for theory development”. In S. L. Pan & T. H. Cao (Eds.), 16th Pacific Asia Conference on information systems, PACIS 2012, ho chi minh city, Vietnam, 11-15 July 2012 (pp. 2). http://aisel.aisnet.org/pacis2012/2
  • Anand, S. S., Patrick, A., Hughes, J. G., & Bell, D. A. (1998). A data mining methodology for cross-sales. Knowledge-based Systems, 10(7), 449–461. https://doi.org/10.1016/S0950-7051(98)00035-5
  • Anand, S. S., & Büchner, A. G. (1998). Decision support using data mining. Financial Times Management. Financial Times Pitman.
  • Arnold, M., Boston, J., Desmond, M., Duesterwald, E., Elder, B., Murthi, A., et al. (2020). Towards automating the AI operations lifecycle. CoRR, abs/2003.12808. https://arxiv.org/abs/2003.12808
  • Berendt, B., & Preibusch, S. Better decision support through exploratory discrimination-aware data mining: Foundations and empirical evidence. (2014). Artificial Intelligence and Law, 22(2), 175–209. https://doi.org/10.1007/s10506-013-9152-0
  • Bhattacherjee, A. (2001). Understanding information systems continuance: An expectation- confirmation model. MIS Quarterly, 25 (3), 351–370. http://misq.org/understanding-information-systems-continuance-an-expectation-confirmation-model.html
  • Bourque, P., & Fairley, R. E. (2014). Guide to the software engineering body of knowledge (swebok (r)): Version 3.0. IEEE Computer Society Press.
  • Brachman, R. J., & Anand, T. (1996). The process of knowledge discovery in databases. In Advances in knowledge discovery and data mining (pp. 37–57). American Association for Artificial Intelligence.
  • Bradley, P. (2020). Risk management standards and the active management of malicious intent in artificial superintelligence. AI & SOCIETY, 35(2), 319–328. https://doi.org/10.1007/s00146-019-00890-2
  • Buchner, A. G., Mulvenna, M. D., Anand, S. S., & Hughes, J. G. (1999). An internet-enabled knowledge discovery process. Proceedings of the 9th international database conference, 1836. San Diego, California, USA: User-Driven Navigation Pattern Discovery from Internet Data, Web Usage Analysis and User Profiling, International WEBKDD'99 Workshop, 74-–91. https://doi.org/10.1007/3-540-44934-5\_5. .
  • Cabena, P., Hadjinian, P., Stadler, R., Verhees, J., & Zanasi, A. (1997). International Business Machines Corporation (San Jose, C., & International Technical Support Organization). Discovering data mining: from concept to implementation. Prentice Hall PTR.
  • Cao, L. (2018). . AI in Finance: A Review. ACM Comput Article 39. https://doi.org/10.1145/nnnnnnn.nnnnnnn.
  • Cao, L. (2021). AI in finance: Challenges, techniques and opportunities. CoRR, abs/2107.09051. https://arxiv.org/abs/2107.09051
  • Chandra, L., Seidel, S., & Gregor, S. (2015). Prescriptive knowledge in IS research: Conceptualizing design principles in terms of materiality, action, and boundary conditions. In S. X. Bui & R. H. S Jr. (Eds.), 48th Hawaii international conference on system sciences, HICSS 2015, kauai, Hawaii, usa, January 5-8, 2015 (pp. 4039–4048). IEEE Computer Society. https://doi.org/10.1109/HICSS.2015.485
  • Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., and Shearer, C., et al. (2000). Crisp-dm 1.0 step-by-step data mining guide, 9(13) . SPSS Inc, pp. 1–73.
  • Cortet, M., Rijks, T., & Nijland, S. (2016). Psd2: The digital transformation accelerator for banks. Journal of Payments Strategy & Systems, 10(1), 13–27.
  • Cusick, J. J. (2020). Business value of ITSM. Requirement or mirage? CoRR. abs/2001.00219. http://arxiv.org/abs/2001.00219.
  • Czarnecki, C., & Dietze, C. (2017). Domain-specific reference modeling in the telecommu- nications industry. In A. Maedche, J. Vom Brocke, & A. R. Hevner (Eds.), Designing the digital transformation - 12th international conference, DESRIST 2017, karlsruhe, Germany, may 30 - June 1, 2017, proceedings, 10243, 313–329. Springer. https://doi.org/10.1007/978-3-319-59144-5/19
  • Davenport, T., & Harris, J. (2017). Competing on analytics: The new science of winning. Harvard Business Press.
  • Debuse, J. (2007). Extending data mining methodologies to encompass organizational factors. Systems Research and Behavioral Science: The Official Journal of the International Federation for Systems Research, 24(2), 183–190. https://doi.org/10.1002/sres.823
  • Diaz, D., Theodoulidis, B., & Abioye, E. (2012). Cross-border challenges in financial markets monitoring and surveillance: A case study of customer-driven service value networks. 2012 annual srii global conference (pp. 146–157).
  • Dignum, V. (2017). Responsible autonomy. In C. Sierra (Ed.), Proceedings of the twenty- sixth international joint conference on artificial intelligence, IJCAI 2017, melbourne, Australia, August 19-25, 2017 (pp. 4698–4704). ijcai.org. https://doi.org/10.24963/ijcai.2017/655
  • Dignum, V. Ethics in artificial intelligence: Introduction to the special issue. (2018). Ethics and Information Technology, 20(1), 1–3. https://doi.org/10.1007/s10676-018-9450-z
  • Dumas, M., Rosa, M. L., Mendling, J., & Reijers, H. A. (2018). Fundamentals of business process management, second edition. Springer. https://doi.org/10.1007/978-3-662-56509-4
  • Fayyad, U. M., Piatetsky-Shapiro, G., & Smyth, P. (1996a). From data mining to knowledge discovery in databases. AI Magazine, 17(3), 37–54. https://doi.org/10.1609/aimag.v17i3.1230
  • Fayyad, U. M., Piatetsky-Shapiro, G., & Smyth, P. The KDD process for extracting useful knowledge from volumes of data. (1996b). Communications of the ACM, 39(11), 27–34. https://doi.org/10.1145/240455.240464
  • Fayyad, U. M., Piatetsky-Shapiro, G., & Smyth, P. (1996c). Knowledge discovery and data mining: Towards a unifying framework. Proceedings of the second international conference on knowledge discovery and data mining (kdd-96) (pp. 82–88). portland, oregon, USA. Association for the Advancement of Artificial Intelligence (AAAI). http://www.aaai.org/Library/KDD/1996/kdd96-014.php
  • Friedler, S. A., Scheidegger, C., Venkatasubramanian, S., Choudhary, S., Hamilton, E. P., & Roth, D. (2019). A comparative study of fairness-enhancing interventions in machine learning. Proceedings of the conference on fairness, accountability, and transparency, fat* 2019, Atlanta, ga, usa, January 29-31, 2019 (pp. 329–338). ACM. https://doi.org/10.1145/3287560.3287589
  • Galup, S. D., Dattero, R., & Quan, J. What do agile, lean, and ITIL mean to DevOps? (2020). Communications of the ACM, 63(10), 48–53. https://doi.org/10.1145/3372114
  • Gertosio, C., & Dussauchoy, A. Knowledge discovery from industrial databases. (2004). Journal of Intelligent Manufacturing, 15(1), 29–37. https://doi.org/10.1023/B/%3AJIMS.0000010073.54241.e7
  • Gregor, S., & Hevner, A. R. (2013). Positioning and presenting design science research for maximum impact. MIS Quarterly, 37 (2), 337–355. http://misq.org/positioning-and-presenting-design-science-research-for-maximum-impact.html
  • Haes, S. D., Grembergen, W. V., & Debreceny, R. S. (2013). COBIT 5 and enterprise governance of information technology: Building blocks and research opportunities. Journal Information Systems, 27(1), 307–324. https://doi.org/10.2308/isys-50422
  • Hajian, S., Bonchi, F., & Castillo, C. (2016). Algorithmic bias: From discrimination discovery to fairness-aware data mining. In B. Krishnapuram, M. Shah, A. J. Smola, C. C. Aggarwal, D. Shen, & R. Rastogi (Eds.), Proceedings of the 22nd A CM SIGKDD international conference on knowledge discovery and datamining, San Francisco, ca, usa, August 13-17, 2016 (pp. 2125–2126). ACM. https://doi.org/10.1145/2939672.2945386
  • Hamada, K., Ishikawa, F., Masuda, S., Myojin, T., Nishi, Y., Ogawa, H., et al. (2020a). Guidelines for quality assurance of machine learning-based artificial intelligence. In R. Garc´ıa-Castro (Ed.), The 32nd international conference on software engineering and knowledge engineering, SEKE 2020, KSIR virtual conference center, usa, July 9-19, 2020 (pp. 335–341). KSI Research Inc. https://doi.org/10.18293/SEKE2020-094
  • Hamada, K., Ishikawa, F., Masuda, S., Myojin, T., Nishi, Y., and Ogawa, H., et al. (2020b). Guidelines for quality assurance of machine learning-based artificial intelligence. International Journal of Software Engineering and Knowledge Engineering, Special Issue: Best Paper from SEKE 2020, 11 (12), 1589–1606. doi:10.18293/SEKE2020-094.
  • Hang, Y., & Fong, S. (2009). A framework of business intelligence-driven data mining for e- business. 2009 fifth international joint conference on inc, ims and idc (pp. 1964–1970).
  • Hevner, A. R., March, S. T., Park, J., & Ram, S. (2004). Design science in information systems research. MIS Quarterly, 28(1), 75–105. https://doi.org/10.2307/25148625
  • Hevner, A., & Chatterjee, S. (2010). Design science research in information systems. Design research in information systems (pp. 9–22). Boston, MA: Springer.
  • Huber, S., Wiemer, H., Schneider, D., & Ihlenfeldt, S. (2019). Dmme: Data mining methodology for engineering applications – A holistic extension to the crisp-dm model. Procedia CIRP, 79, 403–408. Elsevier. https://doi.org/10.1016/j.procir.2019.02.106
  • Hummer, W., Muthusamy, V., Rausch, T., Dube, P., Maghraoui, K. E., Murthi, A., et al. (2019). Modelops: Cloud-based lifecycle management for reliable and trusted AI. IEEE international conference on cloud engineering, IC2E 2019, Prague, Czech Republic, June 24-7, 2019 (pp. 113–120). IEEE. https://doi.org/10.1109/IC2E.2019.00025
  • Kashyap, S., & Iveroth, E. (2021). Transparency and accountability influences of regulation on risk control: The case of a Swedish bank. Journal of Management and Governance 25(2), 475-–508. https://doi.org/10.1007/s10997-020-09550-w
  • Kurgan, L. A., & Musilek, P. (2006). A survey of knowledge discovery and data mining process models. The Knowledge Engineering Review, 21(1), 1–24. https://doi.org/10.1017/S0269888906000737
  • Kurshan, E., Shen, H., & Chen, J. (2020). Towards self-regulating AI: Challenges and opportunities of AI model governance in financial services. Proceedings of the First ACM International Conference on AI in Finance (p. 8).
  • Kusner, M. J., Loftus, J. R., Russell, C., & Silva, R. (2017). Counterfactual fairness. In I. Guyon et al. (Eds.), Advances in neural information processing systems 30: Annual conference on neural information processing systems 2017, December 4-9, 2017 (pp. 4066–4076). CA, USA: Long Beach. https://proceedings.neurips.cc/paper/2017/hash/a486cd07e4ac3d270571622f4f316ec5-Abstract.html
  • Marban, O., Mariscal, G., Menasalvas, E., & Segovia, J. (2007). An engineering approach to data mining projects. In International conference on intelligent data engineering and automated. learning 4881, 578–588. doi:10.1007/978-3-540-77226-2\59.
  • Marban, O., Mariscal, G., & Segovia, J. (2009). A data mining and knowledge discovery process model. In P. Julio & K. Adem (Eds.), Data mining and knowledge discovery in real life applications (pp. 438–453). Paris, I-Tech.
  • Marban, O., Segovia, J., Menasalvas, E., & Fernandez-Baizan, C. (2009). Toward data mining engineering: A software engineering approach. Information Systems, 34(1), 87–107. https://doi.org/10.1016/j.is.2008.04.003
  • Mariscal, G., Marban, O., & Fernandez, C. (2010). A survey of data mining and knowledge discovery process models and methodologies. The Knowledge Engineering Review, 25(2), 137–166. https://doi.org/10.1017/S0269888910000032
  • Mendes, R., & Vilela, J. P. (2017). Privacy-preserving data mining: Methods, metrics, and applications. IEEE Access, 5, 10562–10582. https://doi.org/10.1109/ACCESS.2017.2706947.
  • Meth, H., Müller, B., & Maedche, A. (2015). Designing a requirement mining system. Journal of the Association for Information Systems, 16 (9), 2. http://aisel.aisnet.org/jais/vol16/iss9/2
  • Montiel, J., Bifet, A., & Abdessalem, T. (2017). Predicting over-indebtedness on batch and streaming data. In J. Nie et al. (Eds.), 2017 IEEE international conference on big data, bigdata 2017, Boston, ma, usa, December 11-14, 2017 (pp. 1504–1513). IEEE Computer Society. https://doi.org/10.1109/BigData.2017.8258084
  • Morabito, V. (2016). The future of digital business innovation. Springer Books.
  • Morley, J., Floridi, L., Kinsey, L., & Elhalal, A. (2020). From what to how: An initial review of publicly available AI ethics tools, methods and research to translate principles into practices. Science and Engineering Ethics, 26 (4), 2141–2168. https://doi.org/10.1007/s11948-019-00165-5
  • Niaksu, O. (2015). Crisp data mining methodology extension for medical domain. Baltic Journal of Modern Computing, 3(2), 92.
  • Ostrowski, L., & Helfert, M. (2011). Commonality in various design science methodologies. In M. Ganzha, L. A. Maciaszek, & M. Paprzycki (Eds.), Federated conference on computer science and information systems - fedcsis 2011, szczecin, Poland, 18-21 September 2011, proceedings (pp. 317–320). http://ieeexplore.ieee.org/document/6078172/
  • Palmi´e, M., Wincent, J., Parida, V., & Caglar, U. (2020). The evolution of the financial technology ecosystem: An introduction and agenda for future research on disruptive innovations in ecosystems. Technological Forecasting and Social Change, 151(151), 119779. https://doi.org/10.1016/j.techfore.2019.119779
  • Peffers,K., Tuunanen, T., Rothenberger, M. A., & Chatterjee, S. (2008). A design science research methodology for information systems research. Journal of Management Information Systems, 24 (3), 45–77. http://www.jmis-web.org/articles/765
  • Peffers, K., Tuunanen, T., & Niehaves, B. Design science research genres: Introduction to the special issue on exemplars and criteria for applicable design science research. (2018). European Journal of Information Systems, 27(2), 129–139. https://doi.org/10.1080/0960085X.2018.1458066
  • Pivk, A., Vasilecas, O., Kalibatiene, D., & Rupnik, R. (2013). On approach for the implementation of data mining to business process optimisation in commercial companies. Technological and Economic Development of Economy, 19(2), 237–256. https://doi.org/10.3846/20294913.2013.796501
  • Plotnikova, V., Dumas, M., & Milani, F. (2019b). Data mining methodologies in the banking domain: A systematic literature review. In M. Pankowska & K. Sandkuhl (Eds.), Perspectives in business informatics research - 18th international conference, BIR 2019, katowice,poland, (Vol. 365, pp. 104–118). Springer. Retrieved from https://doi.org/10.1007/978-3-030-31143-8/8
  • Plotnikova, V., Dumas, M., & Milani, F. (2020). Adaptations of data mining methodologies: A systematic literature review. PeerJ Comput. Sci, 6, e267. Retrieved from. https://doi.org/10.7717/peerj-cs.267
  • Plotnikova, V., Dumas, M., & Milani, F. (2021). Adapting the CRISP-DM data mining process: A case study in the financial services domain. In S. S. Cherfi, A. Perini, & S. Nurcan (Eds.), Research challenges in information science - 15th international conference, RCIS 2021,limassol, cyprus, (Vol. 415, pp. 55–71). Springer. https://doi.org/10.1007/978-3-030-75018-3/4
  • Priebe, T., & Markus, S. (2015). Business information modeling: A methodology for data- intensive projects, data science and big data governance. 2015 IEEE international conference on big data, big data 2015, Santa Clara, ca, usa, October 29 - November 1, 2015 (pp. 2056–2065). IEEE Computer Society. https://doi.org/10.1109/BigData.2015.7363987
  • Rahman, F. A., Desa, M. I., & Wibowo, A. (2011). A review of kdd-data mining framework and its application in logistics and transportation. In The 7th international conference on networked computing and advanced information management (pp. 175–180).
  • Richards, J. T., Piorkowski, D., Hind, M., Houde, S., & Mojsilovic, A. (2020). A methodology for creating AI factsheets. CoRR, abs/2006.13796. https://arxiv.org/abs/2006.13796
  • Runeson, P., Höst, M., Rainer, A., & Regnell, B. (2012). Case study research in software engineering – Guidelines and examples. Wiley. http://eu.wiley.com/WileyCDA/WileyTitle/productCd-1118104358.html
  • Saldana, J. (2015). The coding manual for qualitative researchers. Sage publications.
  • SAS Institute, I. (2017). Sas® enterprise miner™ 14.3: Reference help. SAS Institute Inc.
  • Siau, K., & Wang, W. (2020). Artificial intelligence (AI) ethics: Ethics of AI and ethical AI. Journal of Database Management, 31(2), 74–87. https://doi.org/10.4018/JDM.2020040105
  • Solarte, J. (2002). A proposed data mining methodology and its application to industrial engineering [Master’s thesis]. University of Tennessee.
  • Sommerville, I. Integrated requirements engineering: A tutorial. (2005). IEEE Software, 22(1), 16–23. https://doi.org/10.1109/MS.2005.13
  • Studer, S., Bui, T. B., Drescher, C., Hanuschkin, A., Winkler, L., and Peters, S., et al. (2020). Towards CRISP-ML(Q): A machine learning process model with quality assurance methodology, 3(2), 392-413. CoRR, abs/2003.05155. https://arxiv.org/abs/2003.05155
  • Tavares, R., Vieira, R., & Pedro, L. (2017). A preliminary proposal of a conceptual educational data mining framework for science education: Scientific competences development and self- regulated learning. 2017 international symposium on computers in education (siie) (pp. 1–6).
  • Vakkuri, V., Kemell, K., & Abrahamsson, P. (2019). AI ethics in industry: A research framework. In M. M. Rantanen & J. Koskinen (Eds.), Proceedings of the third seminar on technologyethics, tethics 2019, turku, Finland, October 23-24, 2019, 2505, 49–60. CEUR-WS.org. http://ceur-ws.org/Vol-2505/paper06.pdf
  • Venable, J. (2006). The role of theory and theorising in design science research. Proceedings of the 1st international conference on design science in information systems and technology (desrist 2006) (pp. 1–18).
  • Venable, J. R., Pries-Heje, J., & Baskerville, R. L. (2016). FEDS: A framework for evaluation in design science research open. European Journal of Information Systems, 25(1), 77–89. https://doi.org/10.1057/ejis.2014.36
  • Venable, J. R., Pries-Heje, J., & Baskerville, R. L. (2017). Choosing a design science research methodology. Australasian conference on information systems 2017, hobart, Australia. ACIS2017 Conference Proceeding.
  • Wiegers, K., & Beatty, J. (2013). Software requirements. Pearson Education.
  • Xiang, L. (2009). Context-aware data mining methodology for supply chain finance cooperative systems. 2009 Fifth International Conference on Autonomic and Autonomous Systems (pp. 301–306. https://doi.org/10.1109/ICAS.2009.48.