. 2024 Oct 17;24(20):6688.

doi: 10.3390/s24206688.

Explosion Detection Using Smartphones: Ensemble Learning with the Smartphone High-Explosive Audio Recordings Dataset and the ESC-50 Dataset

Samuel K Takazawa¹, Sarah K Popenhagen¹, Luis A Ocampo Giraldo², Jay D Hix², Scott J Thompson², David L Chichester², Cleat P Zeiler³, Milton A Garcés¹

Affiliations

¹ Infrasound Laboratory, Hawai'i Institute of Geophysics and Planetology, School of Ocean and Earth Science and Technology, University of Hawai'i at Mānoa, Kailua-Kona, HI 96740, USA.
² Idaho National Laboratory, Idaho Falls, ID 83415, USA.
³ Nevada National Security Site, North Las Vegas, NV 89030, USA.

PMID: 39460167
PMCID: PMC11511011
DOI: 10.3390/s24206688

Explosion Detection Using Smartphones: Ensemble Learning with the Smartphone High-Explosive Audio Recordings Dataset and the ESC-50 Dataset

Samuel K Takazawa et al. Sensors (Basel). 2024.

. 2024 Oct 17;24(20):6688.

doi: 10.3390/s24206688.

Authors

Samuel K Takazawa¹, Sarah K Popenhagen¹, Luis A Ocampo Giraldo², Jay D Hix², Scott J Thompson², David L Chichester², Cleat P Zeiler³, Milton A Garcés¹

Affiliations

¹ Infrasound Laboratory, Hawai'i Institute of Geophysics and Planetology, School of Ocean and Earth Science and Technology, University of Hawai'i at Mānoa, Kailua-Kona, HI 96740, USA.
² Idaho National Laboratory, Idaho Falls, ID 83415, USA.
³ Nevada National Security Site, North Las Vegas, NV 89030, USA.

PMID: 39460167
PMCID: PMC11511011
DOI: 10.3390/s24206688

Abstract

Explosion monitoring is performed by infrasound and seismoacoustic sensor networks that are distributed globally, regionally, and locally. However, these networks are unevenly and sparsely distributed, especially at the local scale, as maintaining and deploying networks is costly. With increasing interest in smaller-yield explosions, the need for more dense networks has increased. To address this issue, we propose using smartphone sensors for explosion detection as they are cost-effective and easy to deploy. Although there are studies using smartphone sensors for explosion detection, the field is still in its infancy and new technologies need to be developed. We applied a machine learning model for explosion detection using smartphone microphones. The data used were from the Smartphone High-explosive Audio Recordings Dataset (SHAReD), a collection of 326 waveforms from 70 high-explosive (HE) events recorded on smartphones, and the ESC-50 dataset, a benchmarking dataset commonly used for environmental sound classification. Two machine learning models were trained and combined into an ensemble model for explosion detection. The resulting ensemble model classified audio signals as either "explosion", "ambient", or "other" with true positive rates (recall) greater than 96% for all three categories.

Keywords: data; detection; explosion; infrasound; machine learning; smartphone.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Figures

**Figure 1**
The architecture of YAMNet and an example architecture of a transfer learning model using YAMNet.

**Figure 2**
The smartphone deployment configuration of (a) the vented encasement and (b) the aluminum foil tube.

**Figure 3**
Histograms of (a) the smartphone’s distance from the explosion source, (b) the effective yield of the explosion source, (c) the number of smartphone recordings per explosion event, and (d) the smartphone model used for data collection in SHAReD.

**Figure 4**
Flowchart for the ensemble model along with the construction of D-YAMNet and LFM models.

**Figure 5**
The confusion matrix of D-YAMNet on the test dataset. The percentages are calculated by rows (true labels) and the count for each cell is listed under these in parenthesis.

**Figure 6**
The (a) normalized amplitude and (b) power spectral density of an “explosion” waveform that was misclassified as “ambient” by D-YAMNet. The explosion was in the 10 kg yield category and recorded by a smartphone ~11 km away from the source at a sample rate of 800 Hz.

**Figure 7**
The confusion matrix of the LFM on the test dataset. The percentages are calculated by rows (true labels) and the count for each cell is listed under these in parenthesis.

**Figure 8**
The (a) normalized amplitude and the (b) power spectral density of an “other” waveform that was misclassified as “explosion” by the LFM. The “other” sound was from an ESC-50 waveform labeled “dog”.

**Figure 9**
The confusion matrix of the ensemble model on the test dataset. The percentages are calculated by rows (true labels) and the count for each cell is listed under these in parenthesis.

**Figure 10**
The precision-recall curves for (a) D-YAMNet and (b) LFM.

**Figure 11**
The average confusion matrix for (a) D-YAMNet and (b) LFM.

**Figure 12**
Expanded normalized microphone data from SHAReD for event INL_20220714_07 for smartphone ID (a) 1806169311 and (c) 2122963039 and the corresponding predicted labels from D-YAMNet, LFM, and the ensemble model for smartphone ID (b) 1806169311 and (d) 2122963039. The predicted labels were obtained on segmented section of the full waveform with 0.96 s duration and 50% overlap.

**Figure 13**
The (a) confusion matrix and (b) precision-recall curves for the YAMNet model. The average precision (AP) for VINEDA was 0.86.

**Figure 14**
The (a) confusion matrix and (b) precision-recall curves for the YAMNet model.

See this image and copyright information in PMC

References

1. Garces M.A. Explosion Source Models. In: Le Pichon A., Blanc E., Hauchecorne A., editors. Infrasound Monitoring for Atmospheric Studies. Springer; Berlin/Heidelberg, Germany: 2019.
1. Vergoz J., Hupe P., Listowski C., Le Pichon A., Garcés M.A., Marchetti E., Labazuy P., Ceranna L., Pilger C., Gaebler P., et al. IMS Observations of Infrasound and Acoustic-Gravity Waves Produced by the January 2022 Volcanic Eruption of Hunga, Tonga: A Global Analysis. Earth Planet. Sci. Lett. 2022;591:117639. doi: 10.1016/j.epsl.2022.117639. - DOI
1. Ceranna L., Le Pichon A., Green D.N., Mialle P. The Buncefield Explosion: A Benchmark for Infrasound Analysis across Central Europe. Geophys. J. Int. 2009;177:491–508. doi: 10.1111/j.1365-246X.2008.03998.x. - DOI
1. Gitterman Y. Sayarim Infrasound Calibration Explosion: Near-Source and Local Observations and Yield Estimation; Proceedings of the 2010 Monitoring Research Review: Ground-Based Nuclear Explosion Monitoring Technologies; Orlando, FL, USA. 21–23 September 2010; pp. 708–719. LA-UR-10-0.
1. Fuchs F., Schneider F.M., Kolínský P., Serafin S., Bokelmann G. Rich Observations of Local and Regional Infrasound Phases Made by the AlpArray Seismic Network after Refinery Explosion. Sci. Rep. 2019;9:13027. doi: 10.1038/s41598-019-49494-2. - DOI - PMC - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
- MDPI
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Explosion Detection Using Smartphones: Ensemble Learning with the Smartphone High-Explosive Audio Recordings Dataset and the ESC-50 Dataset

Affiliations

Explosion Detection Using Smartphones: Ensemble Learning with the Smartphone High-Explosive Audio Recordings Dataset and the ESC-50 Dataset

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources