Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar;9(3):mgen000917.
doi: 10.1099/mgen.0.000917.

Palidis: fast discovery of novel insertion sequences

Affiliations

Palidis: fast discovery of novel insertion sequences

Victoria R Carr et al. Microb Genom. 2023 Mar.

Abstract

The diversity of microbial insertion sequences, crucial mobile genetic elements in generating diversity in microbial genomes, needs to be better represented in current microbial databases. Identification of these sequences in microbiome communities presents some significant problems that have led to their underrepresentation. Here, we present a bioinformatics pipeline called Palidis that recognizes insertion sequences in metagenomic sequence data rapidly by identifying inverted terminal repeat regions from mixed microbial community genomes. Applying Palidis to 264 human metagenomes identifies 879 unique insertion sequences, with 519 being novel and not previously characterized. Querying this catalogue against a large database of isolate genomes reveals evidence of horizontal gene transfer events across bacterial classes. We will continue to apply this tool more widely, building the Insertion Sequence Catalogue, a valuable resource for researchers wishing to query their microbial genomes for insertion sequences.

Keywords: antimicrobial resistance; horizontal gene transfer; insertion sequences; metagenome; mobile genetic element; transposon.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicting interests.

Figures

Fig. 1.
Fig. 1.
Steps summarizing Palidis. Step 1: reads from mixed microbial communities are pre-processed and run through pal-MEM to identify reads containing repeat sequences. Step 2: reads containing repeat sequences are mapped against the assemblies using Bowtie2 to find their positions and proximity filters applied to identify candidate ITRs. Step 3: candidate ITRs are clustered using CD-HIT-EST. ISs are identified by ITRs that are of the same cluster and are reverse complements of each other. Step 4: search of transposases using InterProScan. Step 5: final outputs of a FASTA file with insertion sequences and tab-delimited file with information are created.
Fig. 2.
Fig. 2.
(a) Length of insertion sequences and (b) number of transposases within the ISC .
Fig. 3.
Fig. 3.
Number of samples from the 661 k database that contain an IS found in across (a) genera; (b) species in the 661 k database a.
Fig. 4.
Fig. 4.
(a) Number of ISs shared across a number of distinct, known genera in the 661 k database; (b) network of shared ISs between genera. Each vertex represents either an IS (in ISfinder: blue or not in ISfinder: read) or a genus (labelled); (c) Number of ISs that are either in ISfinder (blue) or not (red) that are in distinct, known species in the 661 k database.

References

    1. Roberts AP, Mullany P. Tn916-like genetic elements: a diverse group of modular mobile elements conferring antibiotic resistance. FEMS Microbiol Rev. 2011;35:856–871. doi: 10.1111/j.1574-6976.2011.00283.x. - DOI - PubMed
    1. Frost LS, Leplae R, Summers AO, Toussaint A. Mobile genetic elements: the agents of open source evolution. Nat Rev Microbiol. 2005;3:722–732. doi: 10.1038/nrmicro1235. - DOI - PubMed
    1. Partridge SR, Kwong SM, Firth N, Jensen SO. Mobile genetic elements associated with antimicrobial resistance. Clin Microbiol Rev. 2018;31:e00088-17. doi: 10.1128/CMR.00088-17. - DOI - PMC - PubMed
    1. Mahillon J, Chandler M. Insertion sequences. Microbiol Mol Biol Rev. 1998;62:725–774. doi: 10.1128/MMBR.62.3.725-774.1998. - DOI - PMC - PubMed
    1. Aziz RK, Breitbart M, Edwards RA. Transposases are the most abundant, most ubiquitous genes in nature. Nucleic Acids Res. 2010;38:4207–4217. doi: 10.1093/nar/gkq140. - DOI - PMC - PubMed

Publication types

Substances

LinkOut - more resources