Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 17;24(20):6688.
doi: 10.3390/s24206688.

Explosion Detection Using Smartphones: Ensemble Learning with the Smartphone High-Explosive Audio Recordings Dataset and the ESC-50 Dataset

Affiliations

Explosion Detection Using Smartphones: Ensemble Learning with the Smartphone High-Explosive Audio Recordings Dataset and the ESC-50 Dataset

Samuel K Takazawa et al. Sensors (Basel). .

Abstract

Explosion monitoring is performed by infrasound and seismoacoustic sensor networks that are distributed globally, regionally, and locally. However, these networks are unevenly and sparsely distributed, especially at the local scale, as maintaining and deploying networks is costly. With increasing interest in smaller-yield explosions, the need for more dense networks has increased. To address this issue, we propose using smartphone sensors for explosion detection as they are cost-effective and easy to deploy. Although there are studies using smartphone sensors for explosion detection, the field is still in its infancy and new technologies need to be developed. We applied a machine learning model for explosion detection using smartphone microphones. The data used were from the Smartphone High-explosive Audio Recordings Dataset (SHAReD), a collection of 326 waveforms from 70 high-explosive (HE) events recorded on smartphones, and the ESC-50 dataset, a benchmarking dataset commonly used for environmental sound classification. Two machine learning models were trained and combined into an ensemble model for explosion detection. The resulting ensemble model classified audio signals as either "explosion", "ambient", or "other" with true positive rates (recall) greater than 96% for all three categories.

Keywords: data; detection; explosion; infrasound; machine learning; smartphone.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

Figure 1
Figure 1
The architecture of YAMNet and an example architecture of a transfer learning model using YAMNet.
Figure 2
Figure 2
The smartphone deployment configuration of (a) the vented encasement and (b) the aluminum foil tube.
Figure 3
Figure 3
Histograms of (a) the smartphone’s distance from the explosion source, (b) the effective yield of the explosion source, (c) the number of smartphone recordings per explosion event, and (d) the smartphone model used for data collection in SHAReD.
Figure 4
Figure 4
Flowchart for the ensemble model along with the construction of D-YAMNet and LFM models.
Figure 5
Figure 5
The confusion matrix of D-YAMNet on the test dataset. The percentages are calculated by rows (true labels) and the count for each cell is listed under these in parenthesis.
Figure 6
Figure 6
The (a) normalized amplitude and (b) power spectral density of an “explosion” waveform that was misclassified as “ambient” by D-YAMNet. The explosion was in the 10 kg yield category and recorded by a smartphone ~11 km away from the source at a sample rate of 800 Hz.
Figure 7
Figure 7
The confusion matrix of the LFM on the test dataset. The percentages are calculated by rows (true labels) and the count for each cell is listed under these in parenthesis.
Figure 8
Figure 8
The (a) normalized amplitude and the (b) power spectral density of an “other” waveform that was misclassified as “explosion” by the LFM. The “other” sound was from an ESC-50 waveform labeled “dog”.
Figure 9
Figure 9
The confusion matrix of the ensemble model on the test dataset. The percentages are calculated by rows (true labels) and the count for each cell is listed under these in parenthesis.
Figure 10
Figure 10
The precision-recall curves for (a) D-YAMNet and (b) LFM.
Figure 11
Figure 11
The average confusion matrix for (a) D-YAMNet and (b) LFM.
Figure 12
Figure 12
Expanded normalized microphone data from SHAReD for event INL_20220714_07 for smartphone ID (a) 1806169311 and (c) 2122963039 and the corresponding predicted labels from D-YAMNet, LFM, and the ensemble model for smartphone ID (b) 1806169311 and (d) 2122963039. The predicted labels were obtained on segmented section of the full waveform with 0.96 s duration and 50% overlap.
Figure 13
Figure 13
The (a) confusion matrix and (b) precision-recall curves for the YAMNet model. The average precision (AP) for VINEDA was 0.86.
Figure 14
Figure 14
The (a) confusion matrix and (b) precision-recall curves for the YAMNet model.

References

    1. Garces M.A. Explosion Source Models. In: Le Pichon A., Blanc E., Hauchecorne A., editors. Infrasound Monitoring for Atmospheric Studies. Springer; Berlin/Heidelberg, Germany: 2019.
    1. Vergoz J., Hupe P., Listowski C., Le Pichon A., Garcés M.A., Marchetti E., Labazuy P., Ceranna L., Pilger C., Gaebler P., et al. IMS Observations of Infrasound and Acoustic-Gravity Waves Produced by the January 2022 Volcanic Eruption of Hunga, Tonga: A Global Analysis. Earth Planet. Sci. Lett. 2022;591:117639. doi: 10.1016/j.epsl.2022.117639. - DOI
    1. Ceranna L., Le Pichon A., Green D.N., Mialle P. The Buncefield Explosion: A Benchmark for Infrasound Analysis across Central Europe. Geophys. J. Int. 2009;177:491–508. doi: 10.1111/j.1365-246X.2008.03998.x. - DOI
    1. Gitterman Y. Sayarim Infrasound Calibration Explosion: Near-Source and Local Observations and Yield Estimation; Proceedings of the 2010 Monitoring Research Review: Ground-Based Nuclear Explosion Monitoring Technologies; Orlando, FL, USA. 21–23 September 2010; pp. 708–719. LA-UR-10-0.
    1. Fuchs F., Schneider F.M., Kolínský P., Serafin S., Bokelmann G. Rich Observations of Local and Regional Infrasound Phases Made by the AlpArray Seismic Network after Refinery Explosion. Sci. Rep. 2019;9:13027. doi: 10.1038/s41598-019-49494-2. - DOI - PMC - PubMed

LinkOut - more resources