Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 21:60:111502.
doi: 10.1016/j.dib.2025.111502. eCollection 2025 Jun.

Kenyan sign language word-based pose dataset

Affiliations

Kenyan sign language word-based pose dataset

Ezekiel Maina et al. Data Brief. .

Abstract

In an era where technology fosters inclusion, sign language remains underrepresented in linguistic datasets, especially for low-resource languages such as Kenyan Sign Language (KSL). This paper presents a novel dataset created using MediaPipe's pose estimation technology, designed to address the scarcity of resources for KSL. The dataset includes 20,000 video recordings of KSL gestures, converted into anonymized stickman representations alongside detailed 3D pose coordinates stored in .npy files. The data collection process focused on preserving participant privacy while ensuring the integrity of gesture data. By utilizing pose estimation, the dataset captures manual and non-manual features of KSL while maintaining the anonymity of signers. Stickman representations abstract human features, mitigating ethical concerns associated with traditional video datasets and aligning with privacy-preserving practices. The dataset spans a diverse range of themes relevant to KSL, including daily interactions, cultural expressions, and educational contexts, providing comprehensive coverage of the KSL lexicon. This dataset is designed for reuse across multiple domains. Researchers can leverage it to train machine learning models for sign language recognition, while educators can utilize it to develop interactive language learning tools. Additionally, it supports the development of virtual sign language interpreters and 3D avatars for accessibility applications. By enabling seamless integration into machine learning frameworks, the dataset facilitates advancements in KSL-related technologies and contributes to bridging communication gaps within the Deaf community.

Keywords: Kenyan sign language; Low resource languages; Pose estimation; Stickman representation.

PubMed Disclaimer

Figures

Fig 1
Fig. 1
Reading NPY File with landmarks using python.
Fig 2
Fig. 2
Age distribution of respondents by gender.
Fig 3
Fig. 3
School distribution by gender.
Fig 4
Fig. 4
School distribution by experience.
Fig 5
Fig. 5
Video segmentation using ELAN tool.
Fig 6
Fig. 6
Segmentation process based on the tiers.
Fig 7
Fig. 7
Pose estimation.
Fig 8
Fig. 8
Stickman generation.
Fig 9
Fig. 9
Pose estimation and stickman generation process.

References

    1. World Federation of the Deaf (WFD, 2013). Our work. Retrieved March 1, 2025, http://wfdeaf.org/our-work/
    1. Kenya National Association for the Deaf. (2025). KNAD official website. Retrieved March 2, 2025, from https://knad.or.ke/.
    1. Pfau R., Steinbach M., Woll B. De Gruyter Mouton; 2012. Sign Language: An International Handbook. - DOI
    1. Caselli N.K., Sehyr Z.S., Cohen-Goldberg A.M., Emmorey K. ASL-LEX: a lexical database of American Sign Language. Behav. Res. Methods. 2017;49:784–801. doi: 10.3758/s13428-016-0742-0. - DOI - PMC - PubMed
    1. Brentari D. Cambridge University Press; 2019. Sign Languages: A Cambridge Language Survey. - DOI

LinkOut - more resources