A point and density map hybrid network for crowd counting and localization based on unmanned aerial vehicles

Lei Zhaoa Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, People's Republic of ChinaView further author information

Zhengwei Baob Ningbo Jiwang Information Technology Co., Ltd, Ningbo, People's Republic of ChinaCorrespondence[email protected]
View further author information

Zhijun Xiea Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, People's Republic of China;d Zhejiang Engineering Research Center of Advcanced Mass spectrometry and Clinical Application, Ningbo, People's Republic of ChinaView further author information

Guangyan Huangc School of Information Technology, Deakin University, Melbourne, AustraliaView further author information

Zeeshan Ur Rehmana Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, People's Republic of ChinaView further author information

Pages 2481-2499 | Received 07 Jul 2022, Accepted 23 Sep 2022, Published online: 11 Oct 2022

Cite this article
https://doi.org/10.1080/09540091.2022.2130878
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

References

Bai, S., He, Z., Qiao, Y., Hu, H., Wu, W., & Yan, J. (2020). Adaptive dilated network with self-correction supervision for counting. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020 (pp. 4593–4602). Computer Vision Foundation/IEEE. https://doi.org/10.1109/CVPR42600.2020.00465
Google Scholar
Basalamah, S., Khan, S. D., & Ullah, H. (2019). Scale driven convolutional neural network model for people counting and localization in crowd scenes. IEEE Access, 7, 71576–71584. https://doi.org/10.1109/ACCESS.2019.2918650
Web of Science ®Google Scholar
Cao, X., Wang, Z., Zhao, Y., & Su, F. (2018). Scale aggregation network for accurate and efficient crowd counting. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Computer Vision – ECCV 2018 – 15th European Conference, Proceedings, Part V (Vol. 11209, pp. 757–773). Springer. https://doi.org/10.1007/978-3-030-01228-1_45
Google Scholar
Deb, D., & Ventura, J. (2018). An aggregated multicolumn dilated convolution network for perspective-free counting. In 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2018, (pp. 195–204). Computer Vision Foundation/IEEE Computer Society. https://doi.org/10.1109/CVPRW.2018.00057
Google Scholar
Du, D., Wen, L., Zhu, P., Fan, H., Hu, Q., Ling, H., Shah, M., Pan, J., Al-Ali, A., Mohamed, A., Imene, B., Dong, B., Zhang, B., Nesma, B. H., Xu, C., Duan, C., Castiello, C., Mencar, C., Liang, D., … Zhao, Z. (2020). VisDrone-CC2020: The vision meets drone crowd counting challenge results. In A. Bartoli, & A. Fusiello (Eds.), Computer Vision – ECCV 2020 Workshops – Proceedings, Part IV (Vol. 12538, pp. 675–691). Springer. https://doi.org/10.1007/978-3-030-66823-5_41.
Google Scholar
Fan, Z., Zhu, Y., Song, Y., & Liu, Z. (2020). Generating high quality crowd density map based on perceptual loss. Applied Intelligence, 50(4), 1073–1085. https://doi.org/10.1007/s10489-019-01573-7
Web of Science ®Google Scholar
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 (pp. 770–778). IEEE Computer Society. https://doi.org/10.1109/CVPR.2016.90
Google Scholar
Huang, G., Liu, Z., van der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (pp. 2261–2269). IEEE Computer Society. https://doi.org/10.1109/CVPR.2017.243
Google Scholar
Huang, S., Li, X., Cheng, Z., Zhang, Z., & Hauptmann, A. G. (2020). Stacked pooling for boosting scale invariance of crowd counting. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020 (pp. 2578–2582). IEEE. https://doi.org/10.1109/ICASSP40776.2020.9053070
Google Scholar
Jiang, H., & Jin, W. (2019). Effective use of convolutional neural networks and diverse deep supervision for better crowd counting. Applied Intelligence, 49(7), 2415–2433. https://doi.org/10.1007/s10489-018-1394-9
Web of Science ®Google Scholar
Jiang, M., Lin, J., & Wang, Z. J. (2021). A smartly simple way for joint crowd counting and localization. Neurocomputing, 459, 35–43. https://doi.org/10.1016/j.neucom.2021.06.055
Web of Science ®Google Scholar
Laradji, I. H., Rostamzadeh, N., Pinheiro, P. O., Vázquez, D., & Schmidt, M. (2018). Where are the blobs: Counting by localization with point supervision. In V. Ferrari, M. Hebert, C. Sminchisescu, & Y. Weiss (Eds.), Computer Vision – ECCV 2018 – 15th European Conference, Proceedings, Part II (Vol. 11206, pp. 560–576). Springer. https://doi.org/10.1007/978-3-030-01216-8_34
Google Scholar
Lempitsky, V. S., & Zisserman, A. (2010). Learning to count objects in images. In J.D. Lafferty, C.K.I. Williams, J. Shawe-Taylor, R.S. Zemel, & A. Culotta (Eds.), Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010 (pp. 1324–1332). Curran Associates.
Google Scholar
Li, Y., Chen, Y., Wang, N., & Zhang, Z. (2019). Scale-aware trident networks for object detection. In 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019 (pp. 6053–6062). IEEE. https://doi.org/10.1109/ICCV.2019.00615
Google Scholar
Li, Y., Zhang, X., & Chen, D. (2018). CSRNet: Dilated convolutional neural networks for understanding the highly congested scenes. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018 (pp. 1091–1100). Computer Vision Foundation/IEEE Computer Society. https://doi.org/10.1109/CVPR.2018.00120
Google Scholar
Liang, D., Xu, W., & Bai, X. (2022). An end-to-end transformer model for crowd localization. CoRR. https://arxiv.org/abs/2202.13065.
Google Scholar
Liang, D., Xu, W., Zhu, Y., & Zhou, Y. (2022). Focal inverse distance transform maps for crowd localization. IEEE Transactions on Multimedia, 1–13. https://doi.org/10.1109/TMM.2022.3203870
Google Scholar
Lin, T., Dollár, P., Girshick, R. B., He, K., Hariharan, B., & Belongie, S. J. (2017). Feature pyramid networks for object detection. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (pp. 936–944). IEEE Computer Society. https://doi.org/10.1109/CVPR.2017.106
Google Scholar
Lin, T., Goyal, P., Girshick, R. B., He, K., & Dollár, P. (2017). Focal loss for dense object detection. In IEEE International Conference on Computer Vision, ICCV 2017 (pp. 2999–3007). IEEE Computer Society. https://doi.org/10.1109/ICCV.2017.324
Google Scholar
Liu, C., Weng, X., & Mu, Y. (2019). Recurrent attentive zooming for joint crowd counting and precise localization. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 (pp. 1217–1226). Computer Vision Foundation/IEEE. https://doi.org/10.1109/CVPR.2019.00131
Google Scholar
Liu, L., Lu, H., Xiong, H., Xian, K., Cao, Z., & Shen, C. (2020). Counting objects by blockwise classification. IEEE Transactions on Circuits and Systems for Video Technology, 30(10), 3513–3527. https://doi.org/10.1109/TCSVT.2019.2942970
Web of Science ®Google Scholar
Liu, W., Salzmann, M., & Fua, P. (2019). Context-aware crowd counting. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 (pp. 5099–5108). Computer Vision Foundation/IEEE. https://doi.org/10.1109/CVPR.2019.00524
Google Scholar
Liu, Y., Shi, M., Zhao, Q., & Wang, X. (2019). Point in, box out: Beyond counting persons in crowds. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2019 (pp. 6469–6478). Computer Vision Foundation/IEEE. https://doi.org/10.1109/CVPR.2019.00663
Google Scholar
Liu, Y., Yang, F., & Hu, P. (2020). Small-object detection in UAV-captured images via multi-branch parallel feature pyramid networks. IEEE Access, 8, 145740–145750. https://doi.org/10.1109/ACCESS.2020.3014910
Web of Science ®Google Scholar
Liu, Z., He, Z., Wang, L., Wang, W., Yuan, Y., Zhang, D., Zhang, J., Zhu, P., L. V. Gool, Han, J., Hoi, S., Hu, Q., Liu, M., Pan, J., Yin, B., Zhang, B., Liu, C., Ding, D., Liang, D., … Cao, Z. (2021). VisDrone-CC2021: The vision meets drone crowd counting challenge results. In 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (pp. 2830–2838). IEEE. https://doi.org/10.1109/ICCVW54120.2021.00317.
Google Scholar
Liu, Z., Mao, H., Wu, C. Y., Feichtenhofer, C., Darrell, T., & Xie, S. (2022). A convnet for the 2020s. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 11976–11986). IEEE. https://doi.org/10.1109/CVPR52688.2022.01167
Google Scholar
Loshchilov, I., & Hutter, F. (2017). SGDR: Stochastic gradient descent with warm restarts. OpenReview.net. https://openreview.net/forum?id=Skq89Scxx.
Google Scholar
Loshchilov, I., & Hutter, F. (2019). Decoupled weight decay regularization. OpenReview.net. https://openreview.net/forum?id=Bkg6RiCqY7.
Google Scholar
Ma, Z., Wei, X., Hong, X., & Gong, Y. (2019). Bayesian loss for crowd count estimation with point supervision. In 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019 (pp. 6141–6150). IEEE. https://doi.org/10.1109/ICCV.2019.00624.
Google Scholar
Sam, D. B., Peri, S. V., Sundararaman, M. N., Kamath, A., & Babu, R. V. (2021). Locate, size, and count: Accurately resolving people in dense crowds via detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(8), 2739–2751. https://doi.org/10.1109/TPAMI.2020.2974830
PubMed Web of Science ®Google Scholar
Sam, D. B., Surya, S., & Babu, R. V. (2017). Switching convolutional neural network for crowd counting. In 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 (pp. 4031–4039). IEEE Computer Society. https://doi.org/10.1109/CVPR.2017.429
Google Scholar
Shen, Z., Xu, Y., Ni, B., Wang, M., Hu, J., & Yang, X. (2018). Crowd counting via adversarial cross-scale consistency pursuit. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018 (pp. 5245–5254). Computer Vision Foundation/IEEE Computer Society. https://doi.org/10.1109/CVPR.2018.00550
Google Scholar
Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. In Y. Bengio, & Y. LeCun (Eds.), 3rd International Conference on Learning Representations, ICLR 2015, Conference Track Proceedings. OpenReview.net. http://arxiv.org/abs/1409.1556
Google Scholar
Sindagi, V. A., & Patel, V. M. (2017). CNN-Based cascaded multi-task learning of high-level prior and density estimation for crowd counting. In 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2017 (pp. 1–6). IEEE Computer Society. https://doi.org/10.1109/AVSS.2017.8078491
Google Scholar
Song, Q., Wang, C., Jiang, Z., Wang, Y., Tai, Y., Wang, C., Li, J., Huang, F., & Wu, Y. (2021). Rethinking counting and localization in crowds: A purely point-based framework. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021 (pp. 3345–3354). IEEE. https://doi.org/10.1109/ICCV48922.2021.00335
Google Scholar
Tan, M., & Le, Q. V. (2021). EfficientNetV2: Smaller models and faster training. In M. Meila, & T. Zhang (Eds.), Proceedings of the 38th International Conference on Machine Learning, ICML 2021, Virtual Event (Vol. 139, pp. 10096–10106). PMLR. http://proceedings.mlr.press/v139/tan21a.html.
Google Scholar
Wang, B., Liu, H., Samaras, D., & Nguyen, M. H. (2020). Distribution matching for crowd counting. In H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, & H. Lin (Eds.), Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NEURIPS 2020. Virtual. https://proceedings.neurips.cc/paper/2020/hash/118bd558033a1016fcc82560c65cca5f-Abstract.html
Google Scholar
Wang, Q., Gao, J., Lin, W., & Li, X. (2021). NWPU-crowd: A large-scale benchmark for crowd counting and localization. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(6), 2141–2149. https://doi.org/10.1109/TPAMI.2020.3013269
PubMed Web of Science ®Google Scholar
Wang, W., Liu, Q., & Wang, W. (2022). Pyramid-dilated deep convolutional neural network for crowd counting. Applied Intelligence, 52(2), 1825–1837. https://doi.org/10.1007/s10489-021-02537-6
Web of Science ®Google Scholar
Wang, Z., Bovik, A. C., Sheikh, H. R., & Simoncelli, E. P. (2004). Image quality assessment: From error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4), 600–612. https://doi.org/10.1109/TIP.2003.819861
PubMed Web of Science ®Google Scholar
Wen, L., Du, D., Zhu, P., Hu, Q., Wang, Q., Bo, L., & Lyu, S. (2021). Detection, tracking, and counting meets drones in crowds: A benchmark. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, Virtual (pp. 7812–7821). Computer Vision Foundation/IEEE.
Google Scholar
Xu, C., Liang, D., Xu, Y., Bai, S., Zhan, W., Bai, X., & Tomizuka, M. (2022). AutoScale: Learning to scale for crowd counting. International Journal of Computer Vision, 130(2), 405–434. https://doi.org/10.1007/s11263-021-01542-z
Web of Science ®Google Scholar
Zeng, L., Xu, X., Cai, B., Qiu, S., & Zhang, T. (2017). Multi-scale convolutional neural networks for crowd counting. In 2017 IEEE International Conference on Image Processing, ICIP 2017 (pp. 465–469). IEEE. https://doi.org/10.1109/ICIP.2017.8296324
Google Scholar
Zhang, Y., Zhou, D., Chen, S., Gao, S., & Ma, Y. (2016). Single-image crowd counting via multi-column convolutional neural network. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016 (pp. 589–597). IEEE Computer Society. https://doi.org/10.1109/CVPR.2016.70
Google Scholar
Zhu, P., Wen, L., Bian, X., Ling, H., & Hu, Q. (2018). Vision meets drones: A challenge. CoRR. http://arxiv.org/abs/1804.07437.
Google Scholar
Zou, Z., Su, X., Qu, X., & Zhou, P. (2018). DA-Net: Learning the fine-grained density distribution with deformation aggregation network. IEEE Access, 6, 60745–60756. https://doi.org/10.1109/ACCESS.2018.2875495
Web of Science ®Google Scholar

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

A point and density map hybrid network for crowd counting and localization based on unmanned aerial vehicles

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

A point and density map hybrid network for crowd counting and localization based on unmanned aerial vehicles

References

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date