Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Mar 30:2:53.
doi: 10.1186/1756-0500-2-53.

Taxonomic information exchange and copyright: the Plazi approach

Affiliations

Taxonomic information exchange and copyright: the Plazi approach

Donat Agosti et al. BMC Res Notes. .

Abstract

Background: A large part of our knowledge on the world's species is recorded in the corpus of biodiversity literature with well over hundred million pages, and is represented in natural history collections estimated at 2 - 3 billion specimens. But this body of knowledge is almost entirely in paper-print form and is not directly accessible through the Internet. For the digitization of this literature, new territories have to be chartered in the fields of technical, legal and social issues that presently impede its advance. The taxonomic literature seems especially destined for such a transformation.

Discussion: Plazi was founded as an association with the primary goal of transforming both the printed and, more recently, "born-digital" taxonomic literature into semantically enabled, enhanced documents. This includes the creation of a test body of literature, an XML schema modeling its logic content (TaxonX), the development of a mark-up editor (GoldenGATE) allowing also the enhancement of documents with links to external resources via Life Science Identifiers (LSID), a repository for publications and issuance of bibliographic identifiers, a dedicated server to serve the marked up content (the Plazi Search and Retrieval Server, SRS) and semantic tools to mine information. Plazi's workflow is designed to respect copyright protection and achieves extraction by observing exceptions and limitations existent in international copyright law.

Conclusion: The information found in Plazi's databases - taxonomic treatments as well as the metadata of the publications - are in the public domain and can therefore be used for further scientific research without any restriction, whether or not contained in copyrighted publications.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Plazi workflow. Red: data that is served; blue: metadata.
Figure 2
Figure 2
Sample mark-up page. Left: sample of an original, published taxonomic treatment. Right: Same treatment marked-up in TaxonX XML schema and enhanced with external identifiers.
Figure 3
Figure 3
NLM/TaxonX XML document as source document. A wealth of derivative products can easily be derived from a NLM/TaxonX XML document, such as print, PDF or HTML products. Furthermore, single treatments can automatically be extracted and used as input for other applications such as Encyclopedia of Life, a typical data aggregator. The presence of Life Science Identifiers (LSID) in the semantically enhanced XML documents allows cross-linking independent web pages through LSID resolvers.

References

    1. CBD Convention on Biological Diversity http://biodiv.org
    1. Target 2010 http://www.countdown2010.org
    1. Antbase.org http://antbase.org
    1. Biodiversity Heritage Library http://www.biodiversitylibrary.org
    1. Polaszek A, 25 co-authors A universal register for animal names. Nature. 2005;437:477. doi: 10.1038/437477a. - DOI - PubMed

LinkOut - more resources