Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Sep 17;21(Suppl 13):390.
doi: 10.1186/s12859-020-03684-2.

MAIRA- real-time taxonomic and functional analysis of long reads on a laptop

Affiliations

MAIRA- real-time taxonomic and functional analysis of long reads on a laptop

Benjamin Albrecht et al. BMC Bioinformatics. .

Abstract

Background: Advances in mobile sequencing devices and laptop performance make metagenomic sequencing and analysis in the field a technologically feasible prospect. However, metagenomic analysis pipelines are usually designed to run on servers and in the cloud.

Results: MAIRA is a new standalone program for interactive taxonomic and functional analysis of long read metagenomic sequencing data on a laptop, without requiring external resources. The program performs fast, online, genus-level analysis, and on-demand, detailed taxonomic and functional analysis. It uses two levels of frame-shift-aware alignment of DNA reads against protein reference sequences, and then performs detailed analysis using a protein synteny graph.

Conclusions: We envision this software being used by researchers in the field, when access to servers or cloud facilities is difficult, or by individuals that do not routinely access such facilities, such as medical researchers, crop scientists, or teachers.

Keywords: Antibiotic resistance; Functional analysis; Long read sequencing; Metagenomics; Microbiome; Mobile computing; Open source software; Taxonomic analysis; Virulence factors.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
MAIRA workflow. In an online manner, a batches of long reads obtained from a sequencing device are subjected to protein alignment against a precomputed database of genus-specific marker genes. Based on this, b the list of detected genera is updated in real-time. c Substantial evidence for the presence of a specific genus triggers d batch-wise protein alignment against a full database of proteins associated with the genus. Based on this, a protein graph is updated and used to perform species and-strain level analysis of the long reads, including the identification of antibiotic resistance and virulence factors
Fig. 2
Fig. 2
Performance on mock community. a The presence score for true positive (green) and false positive (red) genera as a function of computation time and number of reads processed. The blue line indicates the threshold used to deem a genus “present”. b The completeness of true positive (green) and false positive (red) species as a function of computation time and number of reads processed. The blue line indicates the threshold used to deem a species “present”. c The coverage of species, as a function of time and number of reads processed
Fig. 3
Fig. 3
MAIRA user interface. During processing of an input file or directory, MAIRA provides an overview of the genera, species and functional classes discovered so far. The user can explore and download results while they are being produced, so as to allow insights into the processed sequences

References

    1. Tyler AD, Mataseje L, Urfano CJ, Schmidt L, Antonation KS, Mulvey MR, Corbett CR. Evaluation of Oxford Nanopore’s MinION sequencing device for microbial whole genome sequencing applications. Sci Rep. 2018;8(1):1–12. doi: 10.1038/s41598-017-17765-5. - DOI - PMC - PubMed
    1. Huson DH, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, Ruscheweyh H-J, Tappu R. MEGAN community edition - interactive exploration and analysis of large-scale microbiome sequencing data. PLoS Comput Biol. 2016;12(6):1004957. doi: 10.1371/journal.pcbi.1004957. - DOI - PMC - PubMed
    1. Weber N, Liou D, Dommer J, MacMenamin P, Quinones M, Misner I, Oler AJ, Wan J, Kim L, Coakley McCarthy M, Ezeji S, Noble K, Hurt DE. Nephele: a cloud platform for simplified, standardized and reproducible microbiome data analysis. Bioinformatics. 2018;34(8):1411–3. doi: 10.1093/bioinformatics/btx617. - DOI - PMC - PubMed
    1. Jia B, Raphenya AR, Alcock B, Waglechner N, Guo P, Tsang KK, Lago BA, Dave BM, Pereira S, Sharma AN, Doshi S, Courtot M, Lo R, Williams LE, Frye JG, Elsayegh T, Sardar D, Westman EL, Pawlowski AC, Johnson TA, Brinkman FSL, Wright GD, McArthur AG. CARD 2017: expansion and model-centric curation of the comprehensive antibiotic resistance database. Nucleic Acids Res. 2017;45(D1):566–73. doi: 10.1093/nar/gkw1004. - DOI - PMC - PubMed
    1. Chen L, Yang J, Yu J, Yao Z, Sun L, Shen Y, Jin Q. VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res. 2005;33(suppl_1):325–8. - PMC - PubMed

LinkOut - more resources