A Simple Standard for Sharing Ontological Mappings (SSSOM)
- PMID: 35616100
- PMCID: PMC9216545
- DOI: 10.1093/database/baac035
Abstract
Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, are two terms equivalent or merely related? Are they narrow or broad matches? Or are they associated in some other way? Such relationships between the mapped terms are often not documented, which leads to incorrect assumptions and makes the mappings hard to use in scenarios that require a high degree of precision (such as diagnostics or risk prediction). Furthermore, the lack of descriptions of how mappings were created makes it hard to combine and reconcile mappings, particularly curated and automated ones. We have developed the Simple Standard for Sharing Ontological Mappings (SSSOM), which addresses these problems by: (i) introducing a machine-readable and extensible vocabulary to describe metadata that makes imprecision, inaccuracy and incompleteness in mappings explicit; (ii) defining an easy-to-use, table-based format that can be integrated into existing data science pipelines without the need to parse or query ontologies, and that integrates seamlessly with Linked Data principles; (iii) implementing open and community-driven collaborative workflows that are designed to evolve the standard continuously to address changing requirements and mapping practices; and (iv) providing reference tools and software libraries for working with the standard. In this paper, we present the SSSOM standard, describe several use cases in detail and survey some of the existing work on standardizing the exchange of mappings, with the goal of making mappings Findable, Accessible, Interoperable and Reusable (FAIR). The SSSOM specification can be found at http://w3id.org/sssom/spec.
Database URL: http://w3id.org/sssom/spec.
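To illustrate what a table-based mapping format allows in a data science pipeline, the following is a minimal sketch, not the reference SSSOM tooling. It assumes a tab-separated file with the columns `subject_id`, `predicate_id`, `object_id` and `confidence`, with optional metadata lines starting with `#`; the specification linked above is the authority on the actual column names and header conventions. All identifiers and confidence values in the sample are invented, and the helper names `load_mappings` and `precise_matches` are hypothetical.

```python
import csv

# Hypothetical mapping table; the identifiers and confidence values are
# invented for illustration and do not describe real ontology terms.
SAMPLE_TSV = (
    "# example metadata line (assumed to be skippable)\n"
    "subject_id\tpredicate_id\tobject_id\tconfidence\n"
    "EXA:0001\tskos:exactMatch\tEXB:1001\t0.95\n"
    "EXA:0002\tskos:broadMatch\tEXB:1002\t0.80\n"
    "EXA:0003\tskos:relatedMatch\tEXB:1003\t0.60\n"
    "EXA:0004\tskos:exactMatch\tEXB:1004\t0.40\n"
)


def load_mappings(text):
    """Parse tab-separated mapping rows into dicts, skipping '#' lines."""
    lines = [ln for ln in text.splitlines() if not ln.startswith("#")]
    return list(csv.DictReader(lines, delimiter="\t"))


def precise_matches(rows, min_confidence=0.9):
    """Keep only exact matches at or above a confidence threshold."""
    return [
        r for r in rows
        if r["predicate_id"] == "skos:exactMatch"
        and float(r["confidence"]) >= min_confidence
    ]


rows = load_mappings(SAMPLE_TSV)
for row in precise_matches(rows):
    print(row["subject_id"], "->", row["object_id"])
```

Because the relationship type and confidence are explicit columns, a precision-sensitive application can select only the mappings it trusts without parsing or querying the ontologies themselves.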
© The Author(s) 2022. Published by Oxford University Press.