Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 11:9:854444.
doi: 10.3389/frobt.2022.854444. eCollection 2022.

ODAS: Open embeddeD Audition System

Affiliations

ODAS: Open embeddeD Audition System

François Grondin et al. Front Robot AI. .

Abstract

Artificial audition aims at providing hearing capabilities to machines, computers and robots. Existing frameworks in robot audition offer interesting sound source localization, tracking and separation performance, although involve a significant amount of computations that limit their use on robots with embedded computing capabilities. This paper presents ODAS, the Open embeddeD Audition System framework, which includes strategies to reduce the computational load and perform robot audition tasks on low-cost embedded computing systems. It presents key features of ODAS, along with cases illustrating its uses in different robots and artificial audition applications.

Keywords: embedded computing; microphone array; open source framework; robot audition; sound source localization.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
ODAS processing pipeline.
FIGURE 2
FIGURE 2
ODAS Studio Web Interface. Colored dots represent potential DOA with power levels, and solid lines illustrate the tracked sound source trajectories over time.
FIGURE 3
FIGURE 3
ODAS strategy exploiting microphone directivity to compute GCC-PHAT using relevant pairs of microphones in a closed array configuration.
FIGURE 4
FIGURE 4
Illustration of the two unit sphere search, first with coarse resolution (A), and then more precise search with finer resolution (B).
FIGURE 5
FIGURE 5
Tracking sound sources with Kalman filters: each DOA is associated to a previously tracked source, a false detection or a new source.
FIGURE 6
FIGURE 6
ODAS subarray SSS strategy to optimize processing.
FIGURE 7
FIGURE 7
Azimut-3 (open array configuration, 16 microphones).
FIGURE 8
FIGURE 8
Azimut-3 (closed array configuration, 16 microphones).
FIGURE 9
FIGURE 9
SecurBot (16-microphone configuration on top and sides).
FIGURE 10
FIGURE 10
Beam (8-microphone on a circular support).

References

    1. Brière S., Valin J.-M., Michaud F., Létourneau D. (2008). “Embedded Auditory System for Small Mobile Robots,” in Proceedings of the IEEE International Conference on Robotics and Automation, Pasadena, CA, May 19–23, 2008, 3463–3468. 10.1109/robot.2008.4543740 - DOI
    1. Chakrabarty S., Habets E. A. P. (2019). Multi-Speaker DOA Estimation Using Deep Convolutional Networks Trained with Noise Signals. IEEE J. Sel. Top. Signal Process. 13, 8–21. 10.1109/jstsp.2019.2901664 - DOI
    1. Cohen I., Berdugo B. (2002). Noise Estimation by Minima Controlled Recursive Averaging for Robust Speech Enhancement. IEEE Signal Process. Lett. 9, 12–15. 10.1109/97.988717 - DOI
    1. Côté C., Létourneau D., Michaud F., Valin J.-M., Brosseau Y., Raievsky C., et al. (2004). “Code Reusability Tools for Programming Mobile Robots,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28–October 2, 2004, 1820–1825.
    1. Culjak I., Abram D., Pribanic T., Dzapo H., Cifrek M. (2012). “A Brief Introduction to OpenCV,” in Proceedings of the International Convention on Information, Communication and Electronic Technology, Opatija, Croatia, May 21–25, 2012, 1725–1730.

LinkOut - more resources