Asynchronous Semantic Background Subtraction

Anthony Cioppa¹, Marc Braham¹, Marc Van Droogenbroeck¹

Affiliations

PMID: 34460596
PMCID: PMC8321070
DOI: 10.3390/jimaging6060050

Asynchronous Semantic Background Subtraction

Anthony Cioppa et al. J Imaging. 2020.

. 2020 Jun 18;6(6):50.

doi: 10.3390/jimaging6060050.

Authors

Anthony Cioppa¹, Marc Braham¹, Marc Van Droogenbroeck¹

Affiliation

¹ Montefiore Institute, University of Liège, Quartier Polytech 1, Allée de la Découverte 10, 4000 Liège, Belgium.

PMID: 34460596
PMCID: PMC8321070
DOI: 10.3390/jimaging6060050

Abstract

The method of Semantic Background Subtraction (SBS), which combines semantic segmentation and background subtraction, has recently emerged for the task of segmenting moving objects in video sequences. While SBS has been shown to improve background subtraction, a major difficulty is that it combines two streams generated at different frame rates. This results in SBS operating at the slowest frame rate of the two streams, usually being the one of the semantic segmentation algorithm. We present a method, referred to as "Asynchronous Semantic Background Subtraction" (ASBS), able to combine a semantic segmentation algorithm with any background subtraction algorithm asynchronously. It achieves performances close to that of SBS while operating at the fastest possible frame rate, being the one of the background subtraction algorithm. Our method consists in analyzing the temporal evolution of pixel features to possibly replicate the decisions previously enforced by semantics when no semantic information is computed. We showcase ASBS with several background subtraction algorithms and also add a feedback mechanism that feeds the background model of the background subtraction algorithm to upgrade its updating strategy and, consequently, enhance the decision. Experiments show that we systematically improve the performance, even when the semantic stream has a much slower frame rate than the frame rate of the background subtraction algorithm. In addition, we establish that, with the help of ASBS, a real-time background subtraction algorithm, such as ViBe, stays real time and competes with some of the best non-real-time unsupervised background subtraction algorithms such as SuBSENSE.

Keywords: background subtraction; motion detection; scene labeling; semantic segmentation; video processing.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

**Figure 1**
Timing diagram of a naive real-time implementation of the semantic background subtraction (SBS) method when the frame rate of semantics is too slow to handle all the frames in real time. From top to bottom, the time lines represent: the input frames $I_{t}$ , the computation of semantics $S_{t}$ by the semantic segmentation algorithm (on GPU), the computation of intermediate segmentation masks $B_{t}$ by the background subtraction (BGS) algorithm (on CPU), and the computation of output segmentation masks $D_{t}$ by the SBS method (on CPU). Vertical lines indicate when an image is available and filled rectangular areas display when a GPU or CPU performs a task. Arrows show the inputs required by the different tasks. This diagram shows that even when the background subtraction algorithm is real time with respect to the input frame rate, it is the computation of semantics that dictates the output frame rate.

**Figure 2**
Schematic representation of our method named ASBS, extending SBS [30], capable to combine the two asynchronous streams of semantics and background subtraction masks to improve the performances of BGS algorithms. When semantics is available, Asynchronous Semantic Background Subtraction (ASBS) applies Rule 1, Rule 2, or selects the fallback, and it updates the color and rule maps. Otherwise, ASBS applies Rule A, Rule B, or it selects the fallback.

**Figure 3**
Timing diagram of ASBS in the case of a real-time BGS algorithm ( $Δ_{B} < δ_{I}$ ) satisfying the condition $Δ_{B} + Δ_{D} < δ_{I}$ . Note that the output stream is delayed by a constant $Δ_{S} + Δ_{D}$ time with respect to the input stream.

**Figure 4**
Overall $F_{1}$ scores obtained with SBS and ASBS for four state-of-the-art BGS algorithms and different sub-sampling factors. The performances of ASBS decrease much more slowly than those of SBS with the decrease of the semantic frame rate and, therefore, are much closer to those of the ideal case (SBS with all semantic maps computed, that is SBS 1:1), meaning that ASBS provides better decisions for frames without semantics. On average, ASBS with 1 frame of semantics out of 25 frames (ASBS $25 : 1$ ) performs as well as SBS, with copy of $B_{t},$ with 1 frame of semantics out of 2 frames (SBS $2 : 1$ ).

**Figure 5**
Effects of SBS and ASBS on BGS algorithms in the mean ROC space of CDNet 2014 [12]. Each point represents the performance of a BGS algorithm and the end of the associated arrow indicates the performance after application of the methods for a temporal sub-sampling factor of $5 : 1$ . We observe that SBS improves the performances, but only marginally, whereas ASBS moves the performances much closer to the oracle (upper left corner).

**Figure 6**
Per-category analysis. We display the relative improvements of the $F_{1}$ score of SBS, ASBS, and the second heuristic compared with the original algorithms, by considering only the frames without semantics (at a $5 : 1$ semantic frame rate).

**Figure 7**
Evolution of the optimal thresholds $τ_{A}$ and $τ_{B}$ of the ASBS method when the semantic frame rate is reduced. Note that the Manhattan distance associated to these thresholds is computed on 8-bit color values. The results are shown here for the PAWCS algorithm, and follow the same trend for the IUTIS-5, SuBSENSE, and WeSamBe BGS algorithms.

**Figure 8**
Our feedback mechanism, which impacts the decisions of any BGS algorithm whose model update is conservative, consists to replace the $BG / FG$ segmentation of the BGS algorithm by the final segmentation map improved by semantics (either by SBS or ASBS) to update the internal background model.

**Figure 9**
Comparison of the performances, computed with the mean $F_{1}$ score on the CDNet 2014, of SBS and ASBS when there is a feedback that uses $D_{t}$ to update the model of the BGS algorithm. The results are given with respect to a decreasing semantic frame rate. It can be seen that SBS and ASBS always improve the results of the original BGS algorithm and that a feedback is beneficial. Graphs in the right column show that the intrinsic quality of the BGS algorithms is improved, as their output $B_{t}$ , prior to any combination with semantics, produces higher mean $F_{1}$ scores.

**Figure 10**
Illustration of the results of ASBS using ViBe as BGS algorithm. From left to right, we provide the original color image, the ground truth, the BGS as provided by the original ViBe algorithm, using our ASBS method without any feedback, and using ASBS and a feedback. Each line corresponds to a representative frame of a video in each category of CDNet2014.

**Figure 11**
Timing diagram of ASBS with a feedback mechanism in the case of a real-time BGS algorithm ( $Δ_{B} < δ_{I}$ ) satisfying the condition $Δ_{B} + Δ_{D} < δ_{I}$ and the computation of semantics being not real-time ( $Δ_{S} > δ_{I}$ ). Note that the feedback time $Δ_{F}$ is negligible.

See this image and copyright information in PMC

References

1. Bouwmans T. Traditional and recent approaches in background modeling for foreground detection: An overview. Comput. Sci. Rev. 2014;11–12:31–66. doi: 10.1016/j.cosrev.2014.04.001. - DOI
1. Stauffer C., Grimson E. Adaptive background mixture models for real-time tracking; Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR); Corfu, Greece. 20–25 September 1995; pp. 246–252.
1. Elgammal A., Harwood D., Davis L. European Conference on Computer Vision (ECCV) Volume 1843. Lecture Notes in Computer Science; Springer; Berlin, Germany: 2000. Non-parametric Model for Background Subtraction; pp. 751–767.
1. Maddalena L., Petrosino A. A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications. IEEE Trans. Image Proc. 2008;17:1168–1177. doi: 10.1109/TIP.2008.924285. - DOI - PubMed
1. Barnich O., Van Droogenbroeck M. ViBe: A universal background subtraction algorithm for video sequences. IEEE Trans. Image Proc. 2011;20:1709–1724. doi: 10.1109/TIP.2010.2101613. - DOI - PubMed

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Asynchronous Semantic Background Subtraction

Affiliation

Asynchronous Semantic Background Subtraction

Authors

Affiliation

Abstract

Conflict of interest statement

Figures

References

LinkOut - more resources

Full Text Sources