The CRISPR Spacer Space Is Dominated by Sequences from Species-Specific Mobilomes
- PMID: 28928211
- PMCID: PMC5605939
- DOI: 10.1128/mBio.01397-17
The CRISPR Spacer Space Is Dominated by Sequences from Species-Specific Mobilomes
Abstract
Clustered regularly interspaced short palindromic repeats and CRISPR-associated protein (CRISPR-Cas) systems store the memory of past encounters with foreign DNA in unique spacers that are inserted between direct repeats in CRISPR arrays. For only a small fraction of the spacers, homologous sequences, called protospacers, are detectable in viral, plasmid, and microbial genomes. The rest of the spacers remain the CRISPR "dark matter." We performed a comprehensive analysis of the spacers from all CRISPR-cas loci identified in bacterial and archaeal genomes, and we found that, depending on the CRISPR-Cas subtype and the prokaryotic phylum, protospacers were detectable for 1% to about 19% of the spacers (~7% global average). Among the detected protospacers, the majority, typically 80 to 90%, originated from viral genomes, including proviruses, and among the rest, the most common source was genes that are integrated into microbial chromosomes but are involved in plasmid conjugation or replication. Thus, almost all spacers with identifiable protospacers target mobile genetic elements (MGE). The GC content, as well as dinucleotide and tetranucleotide compositions, of microbial genomes, their spacer complements, and the cognate viral genomes showed a nearly perfect correlation and were almost identical. Given the near absence of self-targeting spacers, these findings are most compatible with the possibility that the spacers, including the dark matter, are derived almost completely from the species-specific microbial mobilomes.IMPORTANCE The principal function of CRISPR-Cas systems is thought to be protection of bacteria and archaea against viruses and other parasitic genetic elements. The CRISPR defense function is mediated by sequences from parasitic elements, known as spacers, that are inserted into CRISPR arrays and then transcribed and employed as guides to identify and inactivate the cognate parasitic genomes. However, only a small fraction of the CRISPR spacers match any sequences in the current databases, and of these, only a minority correspond to known parasitic elements. We show that nearly all spacers with matches originate from viral or plasmid genomes that are either free or have been integrated into the host genome. We further demonstrate that spacers with no matches have the same properties as those of identifiable origins, strongly suggesting that all spacers originate from mobile elements.
Keywords: CRISPR-Cas; bacteriophages; mobilome; oligonucleotide composition; spacer acquisition.
Figures








Similar articles
-
CRISPRCasdb a successor of CRISPRdb containing CRISPR arrays and cas genes from complete genome sequences, and tools to download and query lists of repeats and spacers.Nucleic Acids Res. 2020 Jan 8;48(D1):D535-D544. doi: 10.1093/nar/gkz915. Nucleic Acids Res. 2020. PMID: 31624845 Free PMC article.
-
On the Origin of Reverse Transcriptase-Using CRISPR-Cas Systems and Their Hyperdiverse, Enigmatic Spacer Repertoires.mBio. 2017 Jul 11;8(4):e00897-17. doi: 10.1128/mBio.00897-17. mBio. 2017. PMID: 28698278 Free PMC article.
-
Survey of clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) systems in multiple sequenced strains of Klebsiella pneumoniae.BMC Res Notes. 2015 Aug 4;8:332. doi: 10.1186/s13104-015-1285-7. BMC Res Notes. 2015. PMID: 26238567 Free PMC article.
-
Mobile Genetic Elements and Evolution of CRISPR-Cas Systems: All the Way There and Back.Genome Biol Evol. 2017 Oct 1;9(10):2812-2825. doi: 10.1093/gbe/evx192. Genome Biol Evol. 2017. PMID: 28985291 Free PMC article. Review.
-
Molecular mechanisms of CRISPR-Cas spacer acquisition.Nat Rev Microbiol. 2019 Jan;17(1):7-12. doi: 10.1038/s41579-018-0071-7. Nat Rev Microbiol. 2019. PMID: 30171202 Review.
Cited by
-
Positioning Diverse Type IV Structures and Functions Within Class 1 CRISPR-Cas Systems.Front Microbiol. 2021 May 21;12:671522. doi: 10.3389/fmicb.2021.671522. eCollection 2021. Front Microbiol. 2021. PMID: 34093491 Free PMC article.
-
Whole genome sequencing of Moraxella bovis strains from North America reveals two genotypes with different genetic determinants.BMC Microbiol. 2022 Oct 21;22(1):258. doi: 10.1186/s12866-022-02670-3. BMC Microbiol. 2022. PMID: 36271336 Free PMC article.
-
Ecophysiological Features Shape the Distribution of Prophages and CRISPR in Sulfate Reducing Prokaryotes.Microorganisms. 2021 Apr 27;9(5):931. doi: 10.3390/microorganisms9050931. Microorganisms. 2021. PMID: 33925267 Free PMC article.
-
Efficient Recovery of Complete Gut Viral Genomes by Combined Short- and Long-Read Sequencing.Adv Sci (Weinh). 2024 Apr;11(13):e2305818. doi: 10.1002/advs.202305818. Epub 2024 Jan 19. Adv Sci (Weinh). 2024. PMID: 38240578 Free PMC article.
-
Using an Endogenous CRISPR-Cas System for Genome Editing in the Human Pathogen Clostridium difficile.Appl Environ Microbiol. 2019 Oct 1;85(20):e01416-19. doi: 10.1128/AEM.01416-19. Print 2019 Oct 15. Appl Environ Microbiol. 2019. PMID: 31399410 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Miscellaneous