Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 4;19(6):e0304744.
doi: 10.1371/journal.pone.0304744. eCollection 2024.

Machine learning with taxonomic family delimitation aids in the classification of ephemeral beaked whale events in passive acoustic monitoring

Affiliations

Machine learning with taxonomic family delimitation aids in the classification of ephemeral beaked whale events in passive acoustic monitoring

Alba Solsona-Berga et al. PLoS One. .

Abstract

Passive acoustic monitoring is an essential tool for studying beaked whale populations. This approach can monitor elusive and pelagic species, but the volume of data it generates has overwhelmed researchers' ability to quantify species occurrence for effective conservation and management efforts. Automation of data processing is crucial, and machine learning algorithms can rapidly identify species using their sounds. Beaked whale acoustic events, often infrequent and ephemeral, can be missed when co-occurring with signals of more abundant, and acoustically active species that dominate acoustic recordings. Prior efforts on large-scale classification of beaked whale signals with deep neural networks (DNNs) have approached the class as one of many classes, including other odontocete species and anthropogenic signals. That approach tends to miss ephemeral events in favor of more common and dominant classes. Here, we describe a DNN method for improved classification of beaked whale species using an extensive dataset from the western North Atlantic. We demonstrate that by training a DNN to focus on the taxonomic family of beaked whales, ephemeral events were correctly and efficiently identified to species, even with few echolocation clicks. By retrieving ephemeral events, this method can support improved estimation of beaked whale occurrence in regions of high odontocete acoustic activity.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1
Overview of the workflow for a) training and b) applying a generalized and targeted species classification pipeline. Boxes indicate automated algorithms, in black are the standard steps between workflows and in gray is an additional step to delimitate data to a target group.
Fig 2
Fig 2. Map of recording locations.
Map showing the latitude and longitude locations of all passive acoustic monitoring sites. Sites used in the representative training dataset are shown in red; sites included in the case study dataset are shown in yellow. Site abbreviations in the western North Atlantic: Heezen Canyon–HZ, Oceanographer Canyon–OC, Bear Seamount–BR, Nantucket Canyon–NC, Babylon Canyon–BC, Wilmington Canyon–WC, Norfolk Canyon–NFC, Hatteras–HAT, Onslow Bay–ONB, Gulf Stream–GS, Blake Plateau–BP, Bermuda–BM, Blake Spur–BS, Jacksonville–JAX. Site abbreviations in the Gulf of Mexico: Mississippi Canyon–MC, Green Canyon–GC, Dry Tortugas–DT, Howell Hook–HH. Map land cover, shaded relief, and ocean-bottom relief were obtained from Natural Earth (http://naturalearthdata.com/).
Fig 3
Fig 3. Signal classes formed using unsupervised clustering at a 5-minute bin level based on spectra, inter-click interval (ICI), and waveform envelope.
For each signal class, the top panels depict the mean and standard deviation of spectra among all clusters (top left) and concatenated mean cluster spectra (top right); the middle panels depict the mean and standard deviation of ICI distributions among all clusters (middle left) and concatenated cluster ICI distributions (middle right); and the bottom panels depict the mean and standard deviation of waveform envelops among all clusters (bottom left) and concatenated mean cluster waveform envelope (bottom right). Concatenated clusters have been sorted by peak frequency. Color map is normalized amplitudes on a scale from 0 (dark blue) to 1 (dark red). Refer to Table 1 for abbreviation IDs.
Fig 4
Fig 4. Confusion matrix for the DNN classifier on the balanced test set.
Each class consists of 500 cluster examples formed at a 5-minute bin level. Values in the matrix show the total number of examples classified, on the right bar the recall rate, and on the bottom bar the precision rate. Refer to Table 1 for abbreviation IDs.
Fig 5
Fig 5. Correlation between the total number of clicks per 5-minute bins manually labeled to a class (true class) and total number of clicks classified to predicted class by targeted species pipeline with a hard negative filter in the case study dataset.
Each point is a bin, and each subplot displays all bins manually assigned to a species, along with the class predicted by the DNN in color scale, with dark blue points representing true positives and other colored circles representing misclassified bins. The class abbreviation displayed in bold shows the true class of each subplot. Points along the diagonal show bins for which the number of manually labeled clicks were correctly predicted to a class. Above the diagonal, more clicks were predicted to a class than the true number of clicks per bin, whereas below the diagonal, less clicks were predicted than the total number of true clicks per bin.
Fig 6
Fig 6. Case study dataset bins without a manual label classified by the targeted species classification pipeline with a hard negative filter.
For each predicted beaked whale class, concatenated mean cluster spectra (top panel), concatenated cluster ICI distributions (middle panel), and concatenated mean cluster waveform envelope (bottom panel) are shown. Concatenated clusters have been sorted by site and classification prediction probability score. Color map is normalized amplitudes on a scale from 0 (dark blue) to 1 (dark red). Many of the clusters counted as misclassifications appear to have been correctly classified by the network, but were not labeled in the manual dataset. Refer to Table 1 for abbreviation IDs.
Fig 7
Fig 7. Case study dataset bins at site WC without a manual label and classified by a targeted species pipeline with a moderate negative filter and a generalized species pipeline.
For each predicted beaked whale class, concatenated mean cluster spectra (top panel), concatenated cluster ICI distributions (middle panel), and concatenated mean cluster waveform envelope (bottom panel) are shown. Concatenated clusters have been sorted by site and classification prediction probability score. Color map is normalized amplitudes on a scale from 0 (dark blue) to 1 (dark red). Many of the clusters counted as misclassifications appear to have been correctly classified by the network, but were not labeled in the manual dataset. Refer to Table 1 for abbreviation IDs, and Fig 6 for bins without a manual label and classified by a targeted species pipeline with a hard negative filter.

Similar articles

Cited by

References

    1. Dalebout ML, Mead JG, Baker CS, Baker AN, van Helden AL. A new species of beaked whale Mesoplodon perrini sp. N. (Cetacea: Ziphiidae) discovered through phylogenetic analyses of mitochondrial DNA sequences. Mar Mammal Sci. 2002;18: 577–608. 10.1111/j.1748-7692.2002.tb01061.x - DOI
    1. Carroll EL, McGowen MR, McCarthy ML, Marx FG, Aguilar N, Dalebout ML, et al.. Speciation in the deep: genomics and morphology reveal a new species of beaked whale Mesoplodon eueu. Proc R Soc B Biol Sci. 2021;288: 20211213. doi: 10.1098/rspb.2021.1213 - DOI - PMC - PubMed
    1. Brownell RL Jr., Kasuya T. Sato’s beaked whale: A new cetacean species discovered around Japan. Mar Mammal Sci. 2021;37: 768–771. 10.1111/mms.12810 - DOI
    1. Committee on Taxonomy. List of marine mammal species and subspecies. Society for Marine Mammology. Available: www.marinemammalscience.org
    1. IUCN. 2022. The IUCN Red List of Threatened Species. Version 2022–2. [cited 6 Jul 2023]. Available: https://www.iucnredlist.org