160
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Adaptive Coati Optimization Enabled Deep CNN-based Image Captioning

, , &
Article: 2381166 | Received 04 Nov 2023, Accepted 09 Jul 2024, Published online: 17 Jul 2024

References

  • Al Duhayyim, M., S. Alazwari, H. A. Mengash, R. Marzouk, J. S. Alzahrani, H. Mahgoub, F. Althukair, and A. S. Salama. 2022. Metaheuristics optimization with deep learning enabled automated image captioning system. Applied Sciences 12 (15):7724. doi:10.3390/app12157724.
  • Al-Malla, M. A., A. Jafar, and N. Ghneim. 2022. Image captioning model using attention and object features to mimic human image understanding. Journal of Big Data 9 (1):1–19. doi:10.1186/s40537-022-00571-w.
  • Anderson, P., X. He, C. Buehler, D. Teney, M. Johnson, S. Gould, and L. Zhang. 2018. Bottom-up and top-down attention for image captioning and visual question answering. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, Salt Lake City, UT, USA, 6077–6086. doi:10.1109/CVPR.2018.00636
  • Balasubramaniam, S., and V. Kavitha. 2013. A survey on data retrieval techniques in cloud computing. Journal of Convergence Information Technology 8 (16):1–15.
  • Balasubramaniam, S., M. H. Syed, N. S. More, and V. Polepally. 2023. Deep learning-based power prediction aware charge scheduling approach in cloud based electric vehicular network. Engineering Applications of Artificial Intelligence 121:105869.
  • Chang, Y. H., Y. J. Chen, R. H. Huang, and Y. T. Yu. 2021. Enhanced image captioning with color recognition using deep learning methods. Applied Sciences 12 (1):209. doi:10.3390/app12010209.
  • Chen, T., Z. Li, J. Wu, H. Ma, and B. Su. 2022. Improving image captioning with pyramid attention and SC-GAN. Image and Vision Computing 117:104340. doi:10.1016/j.imavis.2021.104340.
  • Choudhury, A., S. Balasubramaniam, A. P. Kumar, and S. N. P. Kumar. 2023. PSSO: Political squirrel search optimizer-driven deep learning for severity level detection and classification of lung cancer. International Journal of Information Technology & Decision Making 1–34. doi:10.1142/S0219622023500189.
  • Dehghani, M., Z. Montazeri, E. Trojovská, and P. Trojovský. 2023. Coati optimization algorithm: A new bio-inspired metaheuristic algorithm for solving optimization problems. Knowledge-Based Systems 259:110011. doi:10.1016/j.knosys.2022.110011.
  • Fan, K. C., and T. Y. Hung. 2014. A novel local pattern descriptor—local vector pattern in high-order derivative space for face recognition. IEEE Transactions on Image Processing 23 (7):2877–91. doi:10.1109/TIP.2014.2321495.
  • Fausto, F., E. Cuevas, and A. Gonzales. 2017. A new descriptor for image matching based on bionic principles. Pattern Analysis and Applications 20 (4):1245–59. doi:10.1007/s10044-017-0605-z.
  • Ghandi, T., H. Pourreza, and H. Mahyar. 2023. Deep learning approaches on image captioning: A review. ACM Computing Surveys 56 (3):1–39. doi:10.1145/3617592.
  • Humaira, M., P. Shimul, M. A. R. K. Jim, A. S. Ami, and F. M. Shah. 2021. A hybridized deep learning method for Bengali image captioning. International Journal of Advanced Computer Science & Applications 12 (2). doi:10.14569/IJACSA.2021.0120287.
  • Kim, H., and S. Bang. 2020. Data for: Context-based information generation from construction site images using unmanned aerial vehicle (UAV)-acquired data and image captioning. Mendeley Data V1. Accessed July 2023. doi:10.17632/4h68fmktwh.1.
  • Lessa, V., and M. Marengoni. 2016. Applying artificial neural network for the classification of breast cancer using infrared thermographic images. Proceedings of Computer Vision and Graphics. ICCVG 2016, 10 September 2016, Warsaw, Poland, vol. 9972, 429–438. Springer International Publishing. doi:10.1007/978-3-319-46418-3_38.
  • Luo, J., Y. Li, Y. Pan, T. Yao, J. Feng, H. Chao, and T. Mei. 2023. Semantic-conditional diffusion networks for image captioning. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 23359–23368. doi:10.1109/CVPR52729.2023.02237.
  • Minarno, A. E., Z. Ibrahim, A. Nur, M. Y. Hasanuddin, N. M. Diah, and Y. Munarko. 2022. Leaf based plant species classification using deep convolutional neural network. Proceedings of 10th International Conference on Information and Communication Technology (ICoICT), Bandung, Indonesia, 99–104. IEEE. doi:10.1109/ICoICT55009.2022.9914851.
  • Omri, M., S. Abdel-Khalek, E. M. Khalil, J. Bouslimi, and G. P. Joshi. 2022. Modeling of hyperparameter tuned deep learning model for automated image captioning. Mathematics 10 (3):288. doi:10.3390/math10030288.
  • Singh, A., J. Krishna Raguru, G. Prasad, S. Chauhan, P. K. Tiwari, A. Zaguia, M. A. Ullah, and S. K. Gupta. 2022. Medical image captioning using optimized deep learning model. Computational Intelligence and Neuroscience 2022:1–9. doi:10.1155/2022/9638438.
  • Thangavel, K., N. Palanisamy, S. Muthusamy, O. P. Mishra, S. C. M. Sundararajan, H. Panchal, A. K. Loganathan, and P. Ramamoorthi. 2023. A novel method for image captioning using multimodal feature fusion employing mask RNN and LSTM models. Soft Computing 27 (19):14205–14218.
  • Wang, C., and X. Gu. 2023. Learning joint relationship attention network for image captioning. Expert Systems with Applications 211:118474. doi:10.1016/j.eswa.2022.118474.
  • Xian, T., Z. Li, Z. Tang, and H. Ma. 2022. Adaptive path selection for dynamic image captioning. IEEE Transactions on Circuits and Systems for Video Technology 32 (9):5762–75. doi:10.1109/TCSVT.2022.3155795.
  • Xian, T., Z. Li, C. Zhang, and H. Ma. 2022. Dual global enhanced transformer for image captioning. Neural Networks 148:129–41. doi:10.1016/j.neunet.2022.01.011.
  • Xie, J., R. Girshick, and A. Farhadi. 2016. Unsupervised deep embedding for clustering analysis. Proceedings of the 33rd International conference on machine learning, PMLR, New York, NY, USA, vol. 48, 478–487.