Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jun 18;6(6):50.
doi: 10.3390/jimaging6060050.

Asynchronous Semantic Background Subtraction

Affiliations

Asynchronous Semantic Background Subtraction

Anthony Cioppa et al. J Imaging. .

Abstract

The method of Semantic Background Subtraction (SBS), which combines semantic segmentation and background subtraction, has recently emerged for the task of segmenting moving objects in video sequences. While SBS has been shown to improve background subtraction, a major difficulty is that it combines two streams generated at different frame rates. This results in SBS operating at the slowest frame rate of the two streams, usually being the one of the semantic segmentation algorithm. We present a method, referred to as "Asynchronous Semantic Background Subtraction" (ASBS), able to combine a semantic segmentation algorithm with any background subtraction algorithm asynchronously. It achieves performances close to that of SBS while operating at the fastest possible frame rate, being the one of the background subtraction algorithm. Our method consists in analyzing the temporal evolution of pixel features to possibly replicate the decisions previously enforced by semantics when no semantic information is computed. We showcase ASBS with several background subtraction algorithms and also add a feedback mechanism that feeds the background model of the background subtraction algorithm to upgrade its updating strategy and, consequently, enhance the decision. Experiments show that we systematically improve the performance, even when the semantic stream has a much slower frame rate than the frame rate of the background subtraction algorithm. In addition, we establish that, with the help of ASBS, a real-time background subtraction algorithm, such as ViBe, stays real time and competes with some of the best non-real-time unsupervised background subtraction algorithms such as SuBSENSE.

Keywords: background subtraction; motion detection; scene labeling; semantic segmentation; video processing.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
Timing diagram of a naive real-time implementation of the semantic background subtraction (SBS) method when the frame rate of semantics is too slow to handle all the frames in real time. From top to bottom, the time lines represent: the input frames It, the computation of semantics St by the semantic segmentation algorithm (on GPU), the computation of intermediate segmentation masks Bt by the background subtraction (BGS) algorithm (on CPU), and the computation of output segmentation masks Dt by the SBS method (on CPU). Vertical lines indicate when an image is available and filled rectangular areas display when a GPU or CPU performs a task. Arrows show the inputs required by the different tasks. This diagram shows that even when the background subtraction algorithm is real time with respect to the input frame rate, it is the computation of semantics that dictates the output frame rate.
Figure 2
Figure 2
Schematic representation of our method named ASBS, extending SBS [30], capable to combine the two asynchronous streams of semantics and background subtraction masks to improve the performances of BGS algorithms. When semantics is available, Asynchronous Semantic Background Subtraction (ASBS) applies Rule 1, Rule 2, or selects the fallback, and it updates the color and rule maps. Otherwise, ASBS applies Rule A, Rule B, or it selects the fallback.
Figure 3
Figure 3
Timing diagram of ASBS in the case of a real-time BGS algorithm (ΔB<δI) satisfying the condition ΔB+ΔD<δI. Note that the output stream is delayed by a constant ΔS+ΔD time with respect to the input stream.
Figure 4
Figure 4
Overall F1 scores obtained with SBS and ASBS for four state-of-the-art BGS algorithms and different sub-sampling factors. The performances of ASBS decrease much more slowly than those of SBS with the decrease of the semantic frame rate and, therefore, are much closer to those of the ideal case (SBS with all semantic maps computed, that is SBS 1:1), meaning that ASBS provides better decisions for frames without semantics. On average, ASBS with 1 frame of semantics out of 25 frames (ASBS 25:1) performs as well as SBS, with copy of Bt, with 1 frame of semantics out of 2 frames (SBS 2:1).
Figure 5
Figure 5
Effects of SBS and ASBS on BGS algorithms in the mean ROC space of CDNet 2014 [12]. Each point represents the performance of a BGS algorithm and the end of the associated arrow indicates the performance after application of the methods for a temporal sub-sampling factor of 5:1. We observe that SBS improves the performances, but only marginally, whereas ASBS moves the performances much closer to the oracle (upper left corner).
Figure 6
Figure 6
Per-category analysis. We display the relative improvements of the F1 score of SBS, ASBS, and the second heuristic compared with the original algorithms, by considering only the frames without semantics (at a 5:1 semantic frame rate).
Figure 7
Figure 7
Evolution of the optimal thresholds τA and τB of the ASBS method when the semantic frame rate is reduced. Note that the Manhattan distance associated to these thresholds is computed on 8-bit color values. The results are shown here for the PAWCS algorithm, and follow the same trend for the IUTIS-5, SuBSENSE, and WeSamBe BGS algorithms.
Figure 8
Figure 8
Our feedback mechanism, which impacts the decisions of any BGS algorithm whose model update is conservative, consists to replace the BG/FG segmentation of the BGS algorithm by the final segmentation map improved by semantics (either by SBS or ASBS) to update the internal background model.
Figure 9
Figure 9
Comparison of the performances, computed with the mean F1 score on the CDNet 2014, of SBS and ASBS when there is a feedback that uses Dt to update the model of the BGS algorithm. The results are given with respect to a decreasing semantic frame rate. It can be seen that SBS and ASBS always improve the results of the original BGS algorithm and that a feedback is beneficial. Graphs in the right column show that the intrinsic quality of the BGS algorithms is improved, as their output Bt, prior to any combination with semantics, produces higher mean F1 scores.
Figure 10
Figure 10
Illustration of the results of ASBS using ViBe as BGS algorithm. From left to right, we provide the original color image, the ground truth, the BGS as provided by the original ViBe algorithm, using our ASBS method without any feedback, and using ASBS and a feedback. Each line corresponds to a representative frame of a video in each category of CDNet2014.
Figure 11
Figure 11
Timing diagram of ASBS with a feedback mechanism in the case of a real-time BGS algorithm (ΔB<δI) satisfying the condition ΔB+ΔD<δI and the computation of semantics being not real-time (ΔS>δI). Note that the feedback time ΔF is negligible.

References

    1. Bouwmans T. Traditional and recent approaches in background modeling for foreground detection: An overview. Comput. Sci. Rev. 2014;11–12:31–66. doi: 10.1016/j.cosrev.2014.04.001. - DOI
    1. Stauffer C., Grimson E. Adaptive background mixture models for real-time tracking; Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR); Corfu, Greece. 20–25 September 1995; pp. 246–252.
    1. Elgammal A., Harwood D., Davis L. European Conference on Computer Vision (ECCV) Volume 1843. Lecture Notes in Computer Science; Springer; Berlin, Germany: 2000. Non-parametric Model for Background Subtraction; pp. 751–767.
    1. Maddalena L., Petrosino A. A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications. IEEE Trans. Image Proc. 2008;17:1168–1177. doi: 10.1109/TIP.2008.924285. - DOI - PubMed
    1. Barnich O., Van Droogenbroeck M. ViBe: A universal background subtraction algorithm for video sequences. IEEE Trans. Image Proc. 2011;20:1709–1724. doi: 10.1109/TIP.2010.2101613. - DOI - PubMed

LinkOut - more resources