1,217
Views
5
CrossRef citations to date
0
Altmetric
Research Article

Dynamic activation and enhanced image contour features for object detection

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon &
Article: 2155614 | Received 11 Oct 2022, Accepted 01 Dec 2022, Published online: 19 Dec 2022

References

  • Bochkovskiy, A., Wang, C. Y., & Liao, H. Y. M. (2020). YOLOv4: Optimal speed and accuracy of object detection. https://doi.org/10.48550/arXiv.2004.10934
  • Dubey, S. R., Singh, S. K., & Chaudhuri, B. B. (2022). Activation functions in deep learning: A comprehensive survey and benchmark. Neurocomputing. https://doi.org/10.1016/j.neucom.2022.06.111
  • Gavrilescu, R., Zet, C., Foalău, C., Skoczylas, M., & Cotovanu, D. (2018, October). Faster R-CNN: An approach to real-time object detection. In 2018 International Conference and Exposition on Electrical and Power Engineering (pp. 165–168). https://doi.org/10.1109/ICEPE.2018.8559776
  • Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., & Xu, C. (2020). GhostNet: More features from cheap operations. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 1580–1589). https://doi.org/10.48550/arXiv.1911.11907
  • He, K., Gkioxari, G., Dollár, P., & Girshick, R. (2017). Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2961–2969). https://doi.org/10.48550/arXiv.1703.06870
  • He, K., Zhang, X., Ren, S., & Sun, J. (2015). Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(9), 1904–1916. https://doi.org/10.1109/TPAMI.2015.2389824
  • Howard, A., Sandler, M., Chu, G., Chen, L. C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., & Vasudevan, V. (2019). Searching for mobilenetv3. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 1314–1324).
  • Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., & Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. https://doi.org/10.48550/arXiv.1704.04861
  • Hu, J., Shen, L., & Sun, G. (2018). Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7132–7141). https://doi.org/10.48550/arXiv.1709.01507
  • Hu, Q., & Zhai, L. (2019). RGB-D image multi-target detection method based on 3D DSF R-CNN. International Journal of Pattern Recognition and Artificial Intelligence, 33(8). https://doi.org/10.1142/S0218001419540260
  • Iandola, F. N., Han, S., Moskewicz, M. W., Ashraf, K., Dally, W. J., & Keutzer, K. (2016). SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5 MB model size. https://doi.org/10.48550/arXiv.1602.07360
  • Jocher, G., Nishimura, K., Mineeva, T., & Vilariño, R. (2020). YOLOv5. Code repository.
  • Lan, R., Sun, L., Liu, Z., Lu, H., Pang, C., & Luo, X. (2020). MADNet: A fast and lightweight network for single-image super resolution. IEEE Transactions on Cybernetics, 51(3), 1443–1453. https://doi.org/10.1109/TCYB.2020.2970104
  • Li, B., & Li, J. (2022). Methods for landslide detection based on lightweight YOLOv4 convolutional neural network. Earth Science Informatics, 765–775. https://doi.org/10.1007/s12145-022-00764-0
  • Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., Nie, W., & Wei, X. (2022). YOLOv6: A single-stage object detection framework for industrial applications. https://doi.org/10.48550/arXiv.2209.02976
  • Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., & Berg, A. C. (2016, October). SSD: Single shot multibox detector. In European Conference on Computer Vision (pp. 21–37). Springer. https://doi.org/10.1007/978-3-319-46448-0_2
  • Ma, N., Zhang, X., Liu, M., & Sun, J. (2021). Activate or not: Learning customized activation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 8032–8042). https://doi.org/10.48550/arXiv.2009.04759
  • Quinn, J. A., Nakasi, R., Mugagga, P. K., Byanyima, P., Lubega, W., & Andama, A. (2016, December). Deep convolutional neural networks for microscopy-based point of care diagnostics. In Machine Learning for Healthcare Conference (pp. 271–281). https://doi.org/10.48550/arXiv.1608.02989
  • Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 779–788).
  • Redmon, J., & Farhadi, A. (2017). YOLO9000: Better, faster, stronger. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7263–7271). https://doi.org/10.48550/arXiv.1612.08242
  • Redmon, J., & Farhadi, A. (2018). YOLOv3: An incremental improvement. https://doi.org/10.48550/arXiv.1804.02767
  • Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., & Chen, L. C. (2018). Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4510–4520). https://doi.org/10.48550/arXiv.1801.04381
  • Wang, C. Y., Bochkovskiy, A., & Liao, H. Y. M. (2021). Scaled-YOLOv4: Scaling cross stage partial network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 13024–13033). https://doi.org/10.1109/CVPR46437.2021.01283
  • Wang, C. Y., Bochkovskiy, A., & Liao, H. Y. M. (2022). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. https://doi.org/10.48550/arXiv.2207.02696
  • Wang, Y., Hua, C., Ding, W., & Wu, R. (2022). Real-time detection of flame and smoke using an improved YOLOv4 network. Signal, Image and Video Processing, 16(4), 1109–1116. https://doi.org/10.1007/s11760-021-02060-8
  • Yan, P., Sun, Q., Yin, N., Hua, L., Shang, S., & Zhang, C. (2022). Detection of coal and gangue based on improved YOLOv5. 1 which embedded scSE module. Measurement, 188, Article 110530. https://doi.org/10.1016/j.measurement.2021.110530
  • Yang, D., Du, Y., Yao, H., & Bao, L. (2022). Image semantic segmentation with hierarchical feature fusion based on deep neural network. Connection Science, 34(1), 1772–1784. https://doi.org/10.1080/09540091.2022.2082384
  • Yu, J., & Zhang, W. (2021). Face mask wearing detection algorithm based on improved YOLO-v4. Sensors, 21(9), 3263. https://doi.org/10.3390/s21093263
  • Zhang, X., Zhou, X., Lin, M., & Sun, J. (2018). Shufflenet: An extremely efficient convolutional neural network for mobile devices. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6848–6856). https://doi.org/10.48550/arXiv.1707.01083
  • Zhu, Y., Xia, Q., & Jin, W. (2022). SRDD: A lightweight end-to-end object detection with transformer. Connection Science, 34(1), 2448–2465. https://doi.org/10.1080/09540091.2022.2125499