UniChem: a unified chemical structure cross-referencing and identifier tracking system
- PMID: 23317286
- PMCID: PMC3616875
- DOI: 10.1186/1758-2946-5-3
UniChem: a unified chemical structure cross-referencing and identifier tracking system
Abstract
UniChem is a freely available compound identifier mapping service on the internet, designed to optimize the efficiency with which structure-based hyperlinks may be built and maintained between chemistry-based resources. In the past, the creation and maintenance of such links at EMBL-EBI, where several chemistry-based resources exist, has required independent efforts by each of the separate teams. These efforts were complicated by the different data models, release schedules, and differing business rules for compound normalization and identifier nomenclature that exist across the organization. UniChem, a large-scale, non-redundant database of Standard InChIs with pointers between these structures and chemical identifiers from all the separate chemistry resources, was developed as a means of efficiently sharing the maintenance overhead of creating these links. Thus, for each source represented in UniChem, all links to and from all other sources are automatically calculated and immediately available for all to use. Updated mappings are immediately available upon loading of new data releases from the sources. Web services in UniChem provide users with a single simple automatable mechanism for maintaining all links from their resource to all other sources represented in UniChem. In addition, functionality to track changes in identifier usage allows users to monitor which identifiers are current, and which are obsolete. Lastly, UniChem has been deliberately designed to allow additional resources to be included with minimal effort. Indeed, the recent inclusion of data sources external to EMBL-EBI has provided a simple means of providing users with an even wider selection of resources with which to link to, all at no extra cost, while at the same time providing a simple mechanism for external resources to link to all EMBL-EBI chemistry resources.
Figures



Similar articles
-
UniChem: extension of InChI-based compound mapping to salt, connectivity and stereochemistry layers.J Cheminform. 2014 Sep 4;6(1):43. doi: 10.1186/s13321-014-0043-5. eCollection 2014 Dec. J Cheminform. 2014. PMID: 25221628 Free PMC article.
-
The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases.BMC Bioinformatics. 2007 Oct 18;8:401. doi: 10.1186/1471-2105-8-401. BMC Bioinformatics. 2007. PMID: 17945017 Free PMC article.
-
The EMBL Nucleotide Sequence Database.Nucleic Acids Res. 2002 Jan 1;30(1):21-6. doi: 10.1093/nar/30.1.21. Nucleic Acids Res. 2002. PMID: 11752244 Free PMC article.
-
Caveat Usor: Assessing Differences between Major Chemistry Databases.ChemMedChem. 2018 Mar 20;13(6):470-481. doi: 10.1002/cmdc.201700724. Epub 2018 Feb 23. ChemMedChem. 2018. PMID: 29451740 Free PMC article. Review.
-
Designing drugs on the internet? Free web tools and services supporting medicinal chemistry.Curr Top Med Chem. 2007;7(15):1491-501. doi: 10.2174/156802607782194707. Curr Top Med Chem. 2007. PMID: 17897035 Review.
Cited by
-
Advancing drug discovery through assay development: a survey of tool compounds within the human solute carrier superfamily.Front Pharmacol. 2024 Jul 9;15:1401599. doi: 10.3389/fphar.2024.1401599. eCollection 2024. Front Pharmacol. 2024. PMID: 39050757 Free PMC article.
-
A Metabolites Merging Strategy (MMS): Harmonization to Enable Studies' Intercomparison.Metabolites. 2023 Nov 21;13(12):1167. doi: 10.3390/metabo13121167. Metabolites. 2023. PMID: 38132849 Free PMC article.
-
Navigating common pitfalls in metabolite identification and metabolomics bioinformatics.Metabolomics. 2024 Sep 21;20(5):103. doi: 10.1007/s11306-024-02167-2. Metabolomics. 2024. PMID: 39305388 Free PMC article. Review.
-
RTX-KG2: a system for building a semantically standardized knowledge graph for translational biomedicine.BMC Bioinformatics. 2022 Sep 29;23(1):400. doi: 10.1186/s12859-022-04932-3. BMC Bioinformatics. 2022. PMID: 36175836 Free PMC article.
-
An interactive retrieval system for clinical trial studies with context-dependent protocol elements.PLoS One. 2020 Sep 18;15(9):e0238290. doi: 10.1371/journal.pone.0238290. eCollection 2020. PLoS One. 2020. PMID: 32946464 Free PMC article.
References
-
- ChEBI. http://www.ebi.ac.uk/chebi.
-
- ChEMBL. https://www.ebi.ac.uk/chembl.
-
- PDBe. http://www.ebi.ac.uk/pdbe.
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous