Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Sep 1:2023.08.30.553559.
doi: 10.1101/2023.08.30.553559.

Large-scale annotated dataset for cochlear hair cell detection and classification

Affiliations

Large-scale annotated dataset for cochlear hair cell detection and classification

Christopher J Buswinka et al. bioRxiv. .

Update in

  • Large-scale annotated dataset for cochlear hair cell detection and classification.
    Buswinka CJ, Rosenberg DB, Simikyan RG, Osgood RT, Fernandez K, Nitta H, Hayashi Y, Liberman LW, Nguyen E, Yildiz E, Kim J, Jarysta A, Renauld J, Wesson E, Wang H, Thapa P, Bordiga P, McMurtry N, Llamas J, Kitcher SR, López-Porras AI, Cui R, Behnammanesh G, Bird JE, Ballesteros A, Vélez-Ortega AC, Edge ASB, Deans MR, Gnedeva K, Shrestha BR, Manor U, Zhao B, Ricci AJ, Tarchini B, Basch ML, Stepanyan R, Landegger LD, Rutherford MA, Liberman MC, Walters BJ, Kros CJ, Richardson GP, Cunningham LL, Indzhykulian AA. Buswinka CJ, et al. Sci Data. 2024 Apr 23;11(1):416. doi: 10.1038/s41597-024-03218-y. Sci Data. 2024. PMID: 38653806 Free PMC article.

Abstract

Our sense of hearing is mediated by cochlear hair cells, localized within the sensory epithelium called the organ of Corti. There are two types of hair cells in the cochlea, which are organized in one row of inner hair cells and three rows of outer hair cells. Each cochlea contains a few thousands of hair cells, and their survival is essential for our perception of sound because they are terminally differentiated and do not regenerate after insult. It is often desirable in hearing research to quantify the number of hair cells within cochlear samples, in both pathological conditions, and in response to treatment. However, the sheer number of cells along the cochlea makes manual quantification impractical. Machine learning can be used to overcome this challenge by automating the quantification process but requires a vast and diverse dataset for effective training. In this study, we present a large collection of annotated cochlear hair-cell datasets, labeled with commonly used hair-cell markers and imaged using various fluorescence microscopy techniques. The collection includes samples from mouse, human, pig and guinea pig cochlear tissue, from normal conditions and following in-vivo and in-vitro ototoxic drug application. The dataset includes over 90'000 hair cells, all of which have been manually identified and annotated as one of two cell types: inner hair cells and outer hair cells. This dataset is the result of a collaborative effort from multiple laboratories and has been carefully curated to represent a variety of imaging techniques. With suggested usage parameters and a well-described annotation procedure, this collection can facilitate the development of generalizable cochlear hair cell detection models or serve as a starting point for fine-tuning models for other analysis tasks. By providing this dataset, we aim to supply other groups within the hearing research community with the opportunity to develop their own tools with which to analyze cochlear imaging data more fully, accurately, and with greater ease.

Keywords: annotation; cochlea; detection; hair cells; inner hair cell; machine-learning-ready data; outer hair cell.

PubMed Disclaimer

Figures

Figure 1:
Figure 1:. Cochlear hair cells annotation workflow using a ‘human-in-the-loop’ annotation paradigm.
(A) An exemplar cochlea from our dataset with IHCs and OHCs annotated. Bounding boxes and classification labels were generated for all hair cells along the sensory epithelium. Representative regions in the (B) apex, (C) middle, and (D) base are shown as insets. Bounding boxes and labels were generated using labelImg (E), an open-source object annotation software, with green boxes annotating OHCs and yellow boxes annotating IHCs. The annotation procedure was optimized using the human-in-the-loop paradigm (F), where annotations were first generated by a preliminary neural network, then manually corrected and used to further train and improve the preliminary network, iteratively enabling more accurate candidate detections.

References

    1. Lim D. J. Functional structure of the organ of Corti: a review. Hearing research 22, 117–146 (1986). - PubMed
    1. Ashmore J. Tonotopy of cochlear hair cell biophysics (excl. mechanotransduction). Current opinion in physiology 18, 1–6, doi: 10.1016/j.cophys.2020.06.010 (2020). - DOI
    1. Schmiedt R. A. Acoustic injury and the physiology of hearing. J Acoust Soc Am 76, 1293–1317, doi: 10.1121/1.391446 (1984). - DOI - PubMed
    1. Ryan A. F., Kujawa S. G., Hammill T., Le Prell C. & Kil J. Temporary and Permanent Noise-induced Threshold Shifts: A Review of Basic and Clinical Observations. Otol Neurotol 37, e271–275, doi: 10.1097/MAO.0000000000001071 (2016). - DOI - PMC - PubMed
    1. Kurabi A., Keithley E. M., Housley G. D., Ryan A. F. & Wong A. C. Cellular mechanisms of noise-induced hearing loss. Hear Res 349, 129–137, doi: 10.1016/j.heares.2016.11.013 (2017). - DOI - PMC - PubMed

Publication types