Sharing Practices for Datasets Related to Accessibility and Aging
- PMID: 35187541
- PMCID: PMC8855358
- DOI: 10.1145/3441852.3471208
Sharing Practices for Datasets Related to Accessibility and Aging
Abstract
Datasets sourced from people with disabilities and older adults play an important role in innovation, benchmarking, and mitigating bias for both assistive and inclusive AI-infused applications. However, they are scarce. We conduct a systematic review of 137 accessibility datasets manually located across different disciplines over the last 35 years. Our analysis highlights how researchers navigate tensions between benefits and risks in data collection and sharing. We uncover patterns in data collection purpose, terminology, sample size, data types, and data sharing practices across communities of focus. We conclude by critically reflecting on challenges and opportunities related to locating and sharing accessibility datasets calling for technical, legal, and institutional privacy frameworks that are more attuned to concerns from these communities.
Keywords: Accessibility; Human-centered computing → Human computer interaction (HCI); Security and privacy → Human and societal aspects of security and privacy; dataset; disability; machine learning; repository; sharing practices.
Figures







References
-
- 2021. ACL: Association for Computational Linguistics. ACL Data and Code Repository. https://aclweb.org/aclwiki/ACL_Data_and_Code_Repository.
-
- 2021. ACM: Association for Computing Machinery. https://www.acm.org/.
-
- 2021. Amazon. Registry of Open Data on AWS. https://registry.opendata.aws/.
-
- 2021. CVF: Computer Vision Foundation. https://www.thecvf.com/.
-
- 2021. Google Search. https://www.google.com/search/howsearchworks/.