Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb 10;17(2):336.
doi: 10.3390/s17020336.

Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining

Affiliations

Vehicle Detection in Aerial Images Based on Region Convolutional Neural Networks and Hard Negative Example Mining

Tianyu Tang et al. Sensors (Basel). .

Abstract

Detecting vehicles in aerial imagery plays an important role in a wide range of applications. The current vehicle detection methods are mostly based on sliding-window search and handcrafted or shallow-learning-based features, having limited description capability and heavy computational costs. Recently, due to the powerful feature representations, region convolutional neural networks (CNN) based detection methods have achieved state-of-the-art performance in computer vision, especially Faster R-CNN. However, directly using it for vehicle detection in aerial images has many limitations: (1) region proposal network (RPN) in Faster R-CNN has poor performance for accurately locating small-sized vehicles, due to the relatively coarse feature maps; and (2) the classifier after RPN cannot distinguish vehicles and complex backgrounds well. In this study, an improved detection method based on Faster R-CNN is proposed in order to accomplish the two challenges mentioned above. Firstly, to improve the recall, we employ a hyper region proposal network (HRPN) to extract vehicle-like targets with a combination of hierarchical feature maps. Then, we replace the classifier after RPN by a cascade of boosted classifiers to verify the candidate regions, aiming at reducing false detection by negative example mining. We evaluate our method on the Munich vehicle dataset and the collected vehicle dataset, with improvements in accuracy and robustness compared to existing methods.

Keywords: convolutional neural networks; hard negative example mining; hyper region proposal network; vehicle detection.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Two challenges for vehicle detection in aerial images. (a) small vehicles; (b) hard negative examples.
Figure 2
Figure 2
Proposed vehicle detection framework. Original large scale images are cropped into small scale blocks for training and testing.
Figure 3
Figure 3
Architecture of Hyper Region Proposal Network (HRPN). For 702×624 images, the feature maps size of Conv1-Conv5 are 351×312×96, 88×79×256, 45×40×384, 45×40×384, 45×40×256, respectively.
Figure 4
Figure 4
Training process of the cascade of boosted classifiers. We adopt the cascade of six stages, and employ {64, 128, 256, 512, 1024, 2048} weak classifiers in each stage.
Figure 5
Figure 5
Ground truth of the original Munich dataset, our modified ground truth and types of vehicles.
Figure 6
Figure 6
Recall versus Intersection-over-Union (IoU) threshold on the Munich Vehicle test set. (a 50 region proposals; (b) 100 region proposals; (c) 150 region proposals.
Figure 7
Figure 7
Recall versus number of proposals for different combining layers (IoU = 0.3).
Figure 8
Figure 8
Performance comparisons of the proposed method, HRPN, H-Faster (HRPN + Fast R-CNN) and aggregated channel features (ACF) detector in terms of mAP values in 10 test images.
Figure 9
Figure 9
precision-recall curve (PRC) of the proposed method and other state-of-the-art approaches for vehicle detection in 10 test images, respectively.
Figure 10
Figure 10
Performance after rescaling the image with different factors.
Figure 11
Figure 11
Performance comparisons of different methods in terms of AP values for the collected vehicle dataset.
Figure 12
Figure 12
Detection results from test images. Red boxes denote correct localization of car, yellow boxes denote correct localization of truck, green boxes and blue boxes denote missing detection and incorrect detection, respectively. (ad) are results for the Munich test aerial image blocks; (eh) are results on unmanned aerial vehicle (UAV) images; (i) is results for pansharpened color infrared (CIR) image; (j) is results for the original large-scale Munich test image; (k) is results for a large satellite image of Tokyo.

References

    1. Leitloff J., Rosenbaum D., Kurz F., Meynberg O., Reinartz P. An Operational System for Estimating Road Traffic Information from Aerial Images. Remote Sens. 2014;6:11315–11341. doi: 10.3390/rs61111315. - DOI
    1. Liu K., Mattyus G. Fast Multiclass Vehicle Detection on Aerial Images. IEEE Geosci. Remote Sens. Lett. 2015;12:1–5.
    1. Moranduzzo T., Melgani F. Automatic Car Counting Method for Unmanned Aerial Vehicle Images. IEEE Trans. Geosci. Remote Sens. 2014;52:1635–1647. doi: 10.1109/TGRS.2013.2253108. - DOI
    1. Moranduzzo T., Melgani F. Detecting Cars in UAV Images With a Catalog-Based Approach. IEEE Trans. Geosci. Remote Sens. 2014;52:6356–6367. doi: 10.1109/TGRS.2013.2296351. - DOI
    1. Chen Z., Wang C., Luo H., Wang H. Vehicle Detection in High-Resolution Aerial Images Based on Fast Sparse Representation Classification and Multiorder Feature. IEEE Trans. Intell. Transp. Syst. 2016;17:2296–2309. doi: 10.1109/TITS.2016.2517826. - DOI