jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats

Johannes Griss¹, Florian Reisinger, Henning Hermjakob, Juan Antonio Vizcaíno

Affiliations

PMID: 22539430
PMCID: PMC3472022
DOI: 10.1002/pmic.201100578

Free PMC article

jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats

Johannes Griss et al. Proteomics. 2012 Mar.

Free PMC article

. 2012 Mar;12(6):795-8.

doi: 10.1002/pmic.201100578.

Authors

Johannes Griss¹, Florian Reisinger, Henning Hermjakob, Juan Antonio Vizcaíno

Affiliation

¹ EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.

PMID: 22539430
PMCID: PMC3472022
DOI: 10.1002/pmic.201100578

Abstract

We here present the jmzReader library: a collection of Java application programming interfaces (APIs) to parse the most commonly used peak list and XML-based mass spectrometry (MS) data formats: DTA, MS2, MGF, PKL, mzXML, mzData, and mzML (based on the already existing API jmzML). The library is optimized to be used in conjunction with mzIdentML, the recently released standard data format for reporting protein and peptide identifications, developed by the HUPO proteomics standards initiative (PSI). mzIdentML files do not contain spectra data but contain references to different kinds of external MS data files. As a key functionality, all parsers implement a common interface that supports the various methods used by mzIdentML to reference external spectra. Thus, when developing software for mzIdentML, programmers no longer have to support multiple MS data file formats but only this one interface. The library (which includes a viewer) is open source and, together with detailed documentation, can be downloaded from http://code.google.com/p/jmzreader/.

PubMed Disclaimer

Figures

**Figure 1**
Simplified schema of the jmzReader library's structure displaying its main components and their dependencies. All parsers implement the common jmzReader interface and, in addition, provide a file-format specific Java object model to access a file format's specific information. Thereby, each parser can be used independently of the rest of the jmzReader library. If this detailed information is not required, programmers can process all seven supported file formats using a single, easy-to-use Java interface.

**Figure 2**
Screenshot of the jmzReader GUI after opening a mzXML file. The selected spectrum is displayed using the mzGraphBrowser library (http://code.google.com/p/pride-toolsuite/wiki/PRIDEmzGraphBrowser). This library allows the user to manually annotate a spectrum and export the loaded file into MGF format. The jmzReader GUI is built using the jmzReader interface and, thus, supports all file formats supported by the jmzReader library.

See this image and copyright information in PMC

References

1. Carr S, Aebersold R, Baldwin M, Burlingame A, et al. The need for guidelines in publication of peptide and protein identification data: working Group on Publication Guidelines for Peptide and Protein Identification Data. Mol. Cell Proteomics. 2004;3:531–533. - PubMed
1. Eisenacher M. mzIdentML: an open community-built standard format for the results of proteomics spectrum identification algorithms. Methods Mol. Biol. 2011;696:161–177. - PubMed
1. Perkins DN, Pappin DJC, Creasy DM, Cottrell JS. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis. 1999;20:3551–3567. - PubMed
1. MacCoss MJ, Wu CC, Yates JR. Probability-based validation of protein identifications using a modified SEQUEST algorithm. Anal. Chem. 2002;74:5593–5599. - PubMed
1. Geer LY, Markey SP, Kowalak JA, Wagner L, et al. Open mass spectrometry search algorithm. J. Proteome Res. 2004;3:958–964. - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

WT085949MA/Wellcome Trust/United Kingdom

LinkOut - more resources

Full Text Sources
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats

Affiliation

jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats

Authors

Affiliation

Abstract

Figures

References

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Research Materials