Publication Cover
Cybernetics and Systems
An International Journal
Volume 55, 2024 - Issue 5
59
Views
1
CrossRef citations to date
0
Altmetric
Research Articles

A Hybrid Intelligent Clustering Model for Tackling Incomplete Mixed Data Using Heuristic Algorithm with Artificial Intelligence

, &

References

  • Afridi, M. K., N. Azam, J. T. Yao, and E. Alanazi. 2018. A three-way clustering approach for handling missing data using GTRS. International Journal of Approximate Reasoning 98:11–24. doi:10.1016/j.ijar.2018.04.001.
  • Arshad, A., S. Riaz, and L. Jiao. 2019. Semi-supervised deep fuzzy C-mean clustering for imbalanced multi-class classification. IEEE Access 7:28100–12. doi:10.1109/ACCESS.2019.2901860.
  • Bu, F., Z. Chen, Q. Zhang, and L. T. Yang. 2016. Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud. The Journal of Supercomputing 72 (8):2977–90. doi:10.1007/s11227-015-1433-9.
  • Cao, D., K. Zeng, J. Wang, P. K. Sharma, X. Ma, Y. Liu, and S. Zhou. 2022. BERT-based deep spatial-temporal network for taxi demand prediction. IEEE Transactions on Intelligent Transportation Systems 23 (7):9442–54. doi:10.1109/TITS.2021.3122114.
  • Chao, G., S. Wang, S. Yang, C. Li, and D. Chu. 2022. Incomplete multi-view clustering with multiple imputation and ensemble clustering. Applied Intelligence 52 (13):14811–21. doi:10.1007/s10489-021-02978-z.
  • Chen, C., K. Li, S. G. Teo, X. Zou, K. Li, and Z. Zeng. 2020. Citywide traffic flow prediction based on multiple gated spatio-temporal convolutional neural networks. ACM Transactions on Knowledge Discovery from Data 14 (4):1–23. doi:10.1145/3385414.
  • Choudhury, S. J., and N. R. Pal. 2021. Deep and structure-preserving autoencoders for clustering data with missing information. in IEEE Transactions on Emerging Topics in Computational Intelligence 5 (4):639–50. doi:10.1109/TETCI.2019.2949264.
  • Cui, Z., X. Jing, P. Zhao, W. Zhang, and J. Chen. 2021. A new subspace clustering strategy for AI-based data analysis in IoT system. IEEE Internet of Things Journal 8 (16):12540–9. doi:10.1109/JIOT.2021.3056578.
  • Dataset1: https://www.kaggle.com/datasets/uciml/iris?select=database.sqlite:Access Date: 2022-06-27.
  • Dataset2: https://www.kaggle.com/datasets/uciml/red-wine-quality-cortez-et-al-2009:Access Date: 2022-06-27.
  • Dataset3: https://www.kaggle.com/datasets/uciml/glass:Access Date: 2022-06-27.
  • Dataset4: https://www.kaggle.com/datasets/ritesaluja/bank-note-authentication-uci-data:Access Date: 2022-06-27.
  • Dataset5: https://www.kaggle.com/datasets/faizunnabi/contraceptive-method-choice:Access Date: 2022-06-27.
  • Duan, M., K. Li, X. Liao, and K. Li. 2018. A parallel multiclassification algorithm for big data using an extreme learning machine. IEEE Transactions on Neural Networks and Learning Systems 29 (6):2337–51. doi:10.1109/TNNLS.2017.2654357.
  • Dutta, P., P. Mishra, and S. Saha. 2020. Incomplete multi-view gene clustering with data regeneration using Shape Boltzmann Machine. Computers in Biology and Medicine 125:103965. doi:10.1016/j.compbiomed.2020.103965.
  • Far, R. R., M. F. Zanjani, M. Saif, and S. Chakrabarti. 2020. Correlation clustering imputation for diagnosing attacks and faults with missing power grid data. IEEE Transactions on Smart Grid 11 (2):1453–64.
  • Iam-On, N., T. Boongeon, S. Garrett, and C. Price. 2012. A link-based cluster ensemble approach for categorical data clustering. IEEE Transactions on Knowledge and Data Engineering 24 (3):413–25. doi:10.1109/TKDE.2010.268.
  • Kaveh, A., and A. D. Eslamlou. 2020. Water strider algorithm: A new metaheuristic and applications. Structures 25:520–41. doi:10.1016/j.istruc.2020.03.033.
  • Li, D., H. Gu, and L. Zhang. 2013. A hybrid genetic algorithm–fuzzy c-means approach for incomplete data clustering based on nearest-neighbor intervals. Soft Computing 17 (10):1787–96. doi:10.1007/s00500-013-0997-7.
  • Li, J., S. Song, Y. Zhang, and Z. Zhou. 2016. Robust K-median and K-means clustering algorithms for incomplete data. Mathematical Problems in Engineering 2016:1–8. doi:10.1155/2016/4321928.
  • Li, M., X. Li, and J. Li. 2020. High-dimensional clustering for incomplete mixed dataset using artificial intelligence. IEEE Access 8:69629–38. doi:10.1109/ACCESS.2020.2986813.
  • Li, W., Y. Ding, Y. Yang, R. S. Sherratt, J. H. Park, and J. Wang. 2020. Parameterized algorithms of fundamental NP-hard problems: A survey. Human-centric Computing and Information Sciences 10:29.
  • Liao, L., K. Li, K. Li, C. Yang, and Q. Tian. 2018. A multiple kernel density clustering algorithm for incomplete datasets in bioinformatics. BMC Systems Biology 12 (Suppl 6):111. doi:10.1186/s12918-018-0630-6.
  • Mirjalili, S., and A. Lewis. 2016. The whale optimization algorithm. Advances in Engineering Software 95:51–67. doi:10.1016/j.advengsoft.2016.01.008.
  • Mirjalili, S., S. M. Mirjalili, and A. Lewis. 2014. Grey wolf optimizer. Advances in Engineering Software 69:46–61. doi:10.1016/j.advengsoft.2013.12.007.
  • Pedersen, M. E. H., and A. J. Chipperfield. 2010. Simplifying particle swarm optimization. Applied Soft Computing 10 (2):618–28. doi:10.1016/j.asoc.2009.08.029.
  • Pu, B., K. Li, S. Li, and N. Zhu. 2021. Automatic fetal ultrasound standard plane recognition based on deep learning and IIoT. IEEE Transactions on Industrial Informatics 17 (11):7771–80. doi:10.1109/TII.2021.3069470.
  • Salehnia, N., N. Salehnia, H. Ansari, S. Kolsoumi, and M. Bannayan. 2019. Climate data clustering effects on arid and semi-arid rainfed wheat yield: A comparison of artificial intelligence and K-means approaches. International Journal of Biometeorology 63 (7):861–72.
  • Sangeetha, T., and A. G. Mary. 2021. A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method. Soft Computing Letters 3:100027. doi:10.1016/j.socl.2021.100027.
  • Selvalakshmi, B., M. Subramaniam, and K. Sathiyasekar. 2021. Semantic conceptual relational similarity based web document clustering for efficient information retrieval using semantic ontology. KSII Transactions on Internet and Information Systems 15 (9):3102–19. doi:10.3837/tiis.2021.09.001.
  • Shi, H., P. Wang, X. Yang, and H. Yu. 2022. An improved mean imputation clustering algorithm for incomplete data. Neural Processing Letters 54 (5):3537–50. doi:10.1007/s11063-020-10298-5.
  • Subramaniam, M., A. Kathirvel, E. Sabithay, and H. A. Basha. 2021. Modified firefly algorithm and fuzzy C-mean clustering based semantic information retrieval. Journal of Web Engineering 20 (1):33–52.
  • Swaminathan, D. K., E. Kirubakaran, E. B. Rajsingh, and A. Shamila Ebenezer. 2019. MIFCM: MICE imputed fuzzy C means clustering approach for effective skyline query processing on partially complete data. Journal of Ambient Intelligence and Humanized Computing. doi:10.1007/s12652-019-01538-1.
  • Tabjula, J., S. Kalyani, P. Rajagopal, and B. Srinivasan. 2022. Statistics-based baseline-free approach for rapid inspection of delamination in composite structures using ultrasonic guided waves. Structural Health Monitoring 21 (6). doi:10.1177/14759217211073335.
  • Tabjula, J. L., S. Kanakambaran, S. Kalyani, P. Rajagopal, and B. Srinivasan. 2021. Outlier analysis for defect detection using sparse sampling in guided wave structural health monitoring. Structural Control and Health Monitering 28 (3):e2690. doi:10.1002/stc.2690.
  • Tran, C. T., M. Zhang, P. Andreae, B. Xue, and L. T. Bui. 2018. Improving performance of classification on incomplete data using feature selection and clustering. Applied Soft Computing 73:848–61. doi:10.1016/j.asoc.2018.09.026.
  • Umathe, V. H., and G. Chaudhary. 2015. A review on incomplete data and clustering. International Journal of Computer Science and Information Technologies 6 (2):1225–7.
  • Wang, J., C. Jin, Q. Tang, N. Xiong, and G. Srivastava. 2021. Intelligent ubiquitous network accessibility for wireless-powered MEC in UAV-assisted B5G. IEEE Transactions on Network Science and Engineering 8 (4):2801–13. doi:10.1109/TNSE.2020.3029048.
  • Wang, J., Y. Yang, T. Wang, R. Sherratt, and J. Zhang. 2020. Big data service architecture: A survey. Journal of Internet Technology 21 (2):393–405.
  • Wang, P., and X. Chen. 2020. Three-way ensemble clustering for incomplete data. IEEE Access 8:91855–64. doi:10.1109/ACCESS.2020.2994380.
  • Wang, S., M. Li, N. Hu, E. Zhu, J. Hu, X. Liu, and J. Yin. 2019. K-means clustering with incomplete data. IEEE Access 7:69162–71. doi:10.1109/ACCESS.2019.2910287.
  • Wei, Y., Y. Tang, and P. D. McNicholas. 2020. Flexible high-dimensional unsupervised learning with missing data. IEEE Transactions on Pattern Analysis and Machine Intelligence 42 (3):610–21.
  • Xue, Z., and H. Wang. 2021. Effective density-based clustering algorithms for incomplete data. Big Data Mining and Analytics 4 (3):183–94. doi:10.26599/BDMA.2021.9020001.
  • Zhang, C., R. Gao, H. Qin, and X. Feng. 2021. Three-way clustering method for incomplete information system based on set-pair analysis. Granular Computing 6 (2):389–98. doi:10.1007/s41066-019-00197-z.
  • Zhang, D.-Q., and S.-C. Chen. 2003. Clustering incomplete data using kernel-based fuzzy C-means algorithm. Neural Processing Letters 18 (3):155–62. doi:10.1023/B:NEPL.0000011135.19145.1b.
  • Zhang, J., S. Zhong, T. Wang, H.-C. Chao, and J. Wang. 2020. Blockchain-based systems and applications: A survey. Journal of Internet Technology 21 (1):1–14.
  • Zhang, L., Z. Bing, and L. Zhang. 2015. A hybrid clustering algorithm based on missing attribute interval estimation for incomplete data. Pattern Analysis and Applications 18 (2):377–84. doi:10.1007/s10044-014-0376-8.
  • Zhang, L., W. Lu, X. Liu, W. Pedrycz, and C. Zhong. 2016. Fuzzy C-means clustering of incomplete data based on probabilistic information granules of missing values. Knowledge-Based Systems 99:51–70. doi:10.1016/j.knosys.2016.01.048.
  • Zhang, Q., L. T. Yang, Z. Chen, and F. Xia. 2017. A high-order possibilistic C-means algorithm for clustering incomplete multimedia data. IEEE Systems Journal 11 (4):2160–9. doi:10.1109/JSYST.2015.2423499.
  • Zhao, L., J. Zhang, T. Yang, and Z. Chen. 2022. Incomplete multi-view clustering based on weighted sparse and low rank representation. Applied Intelligence 52 (13):14822–38. doi:10.1007/s10489-022-03246-4.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.