Delving Deep into Multiscale Pedestrian Detection via Single Scale Feature Maps
- PMID: 29614807
- PMCID: PMC5948919
- DOI: 10.3390/s18041063
Delving Deep into Multiscale Pedestrian Detection via Single Scale Feature Maps
Abstract
The standard pipeline in pedestrian detection is sliding a pedestrian model on an image feature pyramid to detect pedestrians of different scales. In this pipeline, feature pyramid construction is time consuming and becomes the bottleneck for fast detection. Recently, a method called multiresolution filtered channels (MRFC) was proposed which only used single scale feature maps to achieve fast detection. However, there are two shortcomings in MRFC which limit its accuracy. One is that the receptive field correspondence in different scales is weak. Another is that the features used are not scale invariance. In this paper, two solutions are proposed to tackle with the two shortcomings respectively. Specifically, scale-aware pooling is proposed to make a better receptive field correspondence, and soft decision tree is proposed to relive scale variance problem. When coupled with efficient sliding window classification strategy, our detector achieves fast detecting speed at the same time with state-of-the-art accuracy.
Keywords: boosted decision tree; pedestrian detection; receptive field correspondence; scale invariance; soft decision tree.
Conflict of interest statement
The authors declare no conflict of interest.
Figures
References
-
- Dollár P., Wojek C., Schiele B., Perona P. Pedestrian detection: A benchmark; Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009); Miami, FL, USA. 20–25 June 2009; pp. 304–311.
-
- Geiger A., Lenz P., Urtasun R. Are we ready for autonomous driving? The KITTI vision benchmark suite; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; Providence, RI, USA. 16–21 June 2012; pp. 3354–3361.
-
- Dalal N., Triggs B. Histograms of Oriented Gradients for Human Detection; Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005); San Diego, CA, USA. 20–26 June 2005; pp. 886–893.
-
- Ess A., Leibe B., Schindler K., Gool L.J.V. A mobile vision system for robust multi-person tracking; Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008); Anchorage, AK, USA. 24–26 June 2008.
LinkOut - more resources
Full Text Sources
Other Literature Sources
