Design and Analysis of a Lightweight Context Fusion CNN Scheme for Crowd Counting
- PMID: 31035697
- PMCID: PMC6539683
- DOI: 10.3390/s19092013
Abstract
Crowd counting, which is widely used in disaster management, traffic monitoring, and other fields of urban security, is a challenging task that is attracting increasing interest from researchers. For better accuracy, most methods have attempted to explicitly handle the scale variation that arises from large changes in object size. However, earlier methods based on convolutional neural networks (CNNs) have focused primarily on improving accuracy while ignoring the complexity of the model. This paper proposes a novel method based on a lightweight CNN for estimating crowd counts and generating density maps under resource constraints. The network is composed of three components: a basic feature extractor (BFE), a stacked à trous convolution module (SACM), and a context fusion module (CFM). The BFE encodes basic feature information at reduced spatial resolution for further refinement. The SACM generates various pieces of contextual information through a short pipeline. The CFM distills the feature maps from the preceding components to generate a context fusion density map. The whole network is trained in an end-to-end fashion and uses a compression factor to restrict its size. Experiments on three highly challenging datasets demonstrate that the proposed method delivers attractive performance.
Keywords: computer vision; convolutional neural networks; crowd counting; deep learning.
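As a rough illustration of two ideas the abstract relies on, the sketch below shows a 1D à trous (dilated) convolution, the receptive field obtained by stacking such layers, and how a count can be read off a density map by summing it, which is the usual convention in density-based counting. This is not the authors' implementation: the function names, the kernel, and the dilation rates (1, 2, 4) are assumptions chosen for illustration, and the actual SACM and CFM configurations are not given in the abstract.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1D a trous (dilated) convolution, cross-correlation form.

    Kernel taps are spaced `dilation` samples apart, so the kernel covers a
    wider span of the input without adding parameters.
    """
    span = (len(kernel) - 1) * dilation  # distance between first and last tap
    out = []
    for i in range(len(signal) - span):
        out.append(sum(k * signal[i + j * dilation] for j, k in enumerate(kernel)))
    return out


def receptive_field(kernel_size, dilations):
    """Receptive field of stride-1 dilated layers applied one after another."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


def count_from_density(density_map):
    """Estimated count: the sum of all values in a 2D density map."""
    return sum(sum(row) for row in density_map)


if __name__ == "__main__":
    # Hypothetical inputs, chosen only to exercise the functions.
    print(dilated_conv1d([1, 2, 3, 4, 5, 6, 7], [1, 0, -1], dilation=2))
    print(receptive_field(3, [1, 2, 4]))
    print(count_from_density([[0.5, 0.5], [1.0, 0.0]]))
```

Stacking dilated layers grows the receptive field quickly while keeping the parameter count fixed per layer, which is one plausible reason such a module suits a lightweight network.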
Conflict of interest statement
The authors declare no conflict of interest.
