Using XML to encode TMA DES metadata
- PMID: 21969921
- PMCID: PMC3169921
- DOI: 10.4103/2153-3539.84233
Abstract
Background: The Tissue Microarray Data Exchange Specification (TMA DES) is an XML specification for encoding TMA experiment data. Although TMA DES data is encoded in XML, the files that describe its syntax, structure, and semantics are not. The DTD format is used to describe the syntax and structure of TMA DES, and the ISO 11179 format is used to define its semantics. However, XML Schema can be used in place of DTDs, and another XML-encoded format, RDF, can be used in place of ISO 11179. Encoding all TMA DES data and metadata in XML would simplify the development and use of programs that validate and parse TMA DES data. XML Schema has advantages over DTDs, such as support for data types and a more powerful means of specifying constraints on data values. An advantage of RDF encoded in XML over ISO 11179 is that XML defines rules for encoding data, whereas ISO 11179 does not.
Materials and methods: We created an XML Schema version of the TMA DES DTD. We wrote a program that converted ISO 11179 definitions to RDF encoded in XML, and used it to convert the TMA DES ISO 11179 definitions to RDF.
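The conversion step described above might look roughly like the following minimal sketch. It is not the authors' converter: the abstract does not show the input layout or the RDF vocabulary they used, so the dictionary fields (`name`, `definition`), the placeholder namespace `http://example.org/tma-des#`, the sample entry, and the choice of `rdfs:label` and `rdfs:comment` are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
BASE = "http://example.org/tma-des#"  # placeholder namespace (assumption)

# Use conventional prefixes in the serialized output.
ET.register_namespace("rdf", RDF)
ET.register_namespace("rdfs", RDFS)


def cde_to_rdf(definitions):
    """Serialize simple name/definition records as RDF/XML text."""
    root = ET.Element(f"{{{RDF}}}RDF")
    for record in definitions:
        # One rdf:Description per data element, identified by a URI.
        description = ET.SubElement(
            root,
            f"{{{RDF}}}Description",
            {f"{{{RDF}}}about": BASE + record["name"]},
        )
        ET.SubElement(description, f"{{{RDFS}}}label").text = record["name"]
        ET.SubElement(description, f"{{{RDFS}}}comment").text = record["definition"]
    return ET.tostring(root, encoding="unicode")


# Hypothetical record; the name and wording are not taken from TMA DES.
sample = [
    {
        "name": "block_identifier",
        "definition": "Identifier assigned to a tissue microarray block.",
    }
]
rdf_xml = cde_to_rdf(sample)
print(rdf_xml)
```

Output built this way is well-formed XML by construction, so it can be handed to an RDF validator or to any XML parser that already processes the TMA DES data files.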
Results: Using our XML Schema, we validated a sample TMA DES XML file that was supplied with the publication that originally specified TMA DES. We also successfully validated the RDF produced by our ISO 11179 converter with the W3C RDF validation service.
Conclusions: All TMA DES data could be encoded using XML, which simplifies its processing. XML Schema allows data types and valid value ranges to be specified for common data elements (CDEs), which enables a wider range of error checking than is possible with DTDs.
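To illustrate the kind of constraint meant here, the sketch below reads an integer range from an XML Schema fragment and applies it to sample values. A DTD could only declare such an element as `(#PCDATA)`, which accepts any text. The element name `block_row` and the bounds are hypothetical and not taken from TMA DES, and the hand-written check stands in for a real schema validator, which the Python standard library does not provide.

```python
import xml.etree.ElementTree as ET

XS = "http://www.w3.org/2001/XMLSchema"

# Hypothetical schema fragment; "block_row" is an illustrative name only.
XSD = """<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="block_row">
    <xs:simpleType>
      <xs:restriction base="xs:integer">
        <xs:minInclusive value="1"/>
        <xs:maxInclusive value="50"/>
      </xs:restriction>
    </xs:simpleType>
  </xs:element>
</xs:schema>"""


def read_integer_range(xsd_text, element_name):
    """Return (low, high) from the min/maxInclusive facets of an element."""
    ns = {"xs": XS}
    root = ET.fromstring(xsd_text)
    for element in root.findall("xs:element", ns):
        if element.get("name") == element_name:
            restriction = element.find("xs:simpleType/xs:restriction", ns)
            low = restriction.find("xs:minInclusive", ns).get("value")
            high = restriction.find("xs:maxInclusive", ns).get("value")
            return int(low), int(high)
    raise KeyError(element_name)


def is_valid(value_text, bounds):
    """Approximate the schema check: an integer within the declared range."""
    try:
        value = int(value_text)
    except ValueError:
        return False  # not an integer at all; a DTD could not detect this
    low, high = bounds
    return low <= value <= high


bounds = read_integer_range(XSD, "block_row")
for text in ["12", "0", "abc"]:
    print(text, is_valid(text, bounds))
```

In practice a schema-aware validator would enforce these facets directly from the schema, so the range logic would not need to be written by hand.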
Keywords: CDEs; DTD; TMA DES; XML; statistical analysis; tissue microarray.
