Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014;11(2):156-67.
doi: 10.4161/rna.27990. Epub 2014 Feb 7.

CRISPR adaptive immune systems of Archaea

Affiliations

CRISPR adaptive immune systems of Archaea

Gisle Vestergaard et al. RNA Biol. 2014.

Abstract

CRISPR adaptive immune systems were analyzed for all available completed genomes of archaea, which included representatives of each of the main archaeal phyla. Initially, all proteins encoded within, and proximal to, CRISPR-cas loci were clustered and analyzed using a profile-profile approach. Then cas genes were assigned to gene cassettes and to functional modules for adaptation and interference. CRISPR systems were then classified primarily on the basis of their concatenated Cas protein sequences and gene synteny of the interference modules. With few exceptions, they could be assigned to the universal Type I or Type III systems. For Type I, subtypes I-A, I-B, and I-D dominate but the data support the division of subtype I-B into two subtypes, designated I-B and I-G. About 70% of the Type III systems fall into the universal subtypes III-A and III-B but the remainder, some of which are phyla-specific, diverge significantly in Cas protein sequences, and/or gene synteny, and they are classified separately. Furthermore, a few CRISPR systems that could not be assigned to Type I or Type III are categorized as variant systems. Criteria are presented for assigning newly sequenced archaeal CRISPR systems to the different subtypes. Several accessory proteins were identified that show a specific gene linkage, especially to Type III interference modules, and these may be cofunctional with the CRISPR systems. Evidence is presented for extensive exchange having occurred between adaptation and interference modules of different archaeal CRISPR systems, indicating the wide compatibility of the functionally diverse interference complexes with the relatively conserved adaptation modules.

Keywords: CRISPR; Cas; Type I; Type III; archaea.

PubMed Disclaimer

Figures

None
Figure 1. 16S rRNA phylogenetic tree for all archaeal genomes included in the study. The tree shows the primary archaeal phyla with the total number of genomes analyzed in each phylum indicated (single genomes for a given phylum are not numbered). The putative kingdoms are color-coded on the bar on the far left. Crenarchaeota, green; Aigarchaeota, light green; Korarchaeota, yellow; Thaumarchaeota, light blue; Euryarchaeota, dark blue. Environmental habitats are color-coded on vertical lines to the right of the kingdom bar: hydrothermal, orange; marine, blue; wetland sediment, brown, and hypersaline lake, pink. For each phylum, the Type I and Type III CRISPR subtypes are color-coded on the right and the total numbers of each subtype are given. The branch length ruler corresponds to a 5% difference in 16S RNA sequence.
None
Figure 2. Dendrogram and gene syntenies of archaeal adaptation modules of different CRISPR subtypes and the total numbers of identified subtypes are given on the tree. The dendrogram represents a simplified version of the full dendrogram in Figure S1 showing the different classes of adaptation modules. Gene contents (indicated by bold white numbers), gene sizes, and gene syntenies are shown for representative adaptation modules. Further, the subtypes of interference modules associated with each class of adaptation module are shown. The threshold used to classify the variants is indicated by the light blue area toward the base of the tree, and it was selected so that the resulting classes would match optimally with the associated interference subtypes, which are mainly Type I. Nevertheless, some adaptation module classes are associated with interference modules from different subtypes.
None
Figure 3. Dendrogram and gene syntenies of archaeal interference modules of different Type I subtypes where the total numbers of identified subtypes are given on the tree. Gene contents (indicated by bold numbers), gene sizes, and gene syntenies are shown for representative interference modules of each subtype. The dendrogram is a simplified version of the top half of the full dendrogram in Figure S2. The subtype threshhold is indicated by the light blue vertical line near the base of the tree. Most of the identified modules fall within already established subtypes. The new subtype I-G was separated from the earlier subtype I-B, and variant subtypes VI-I, VI-2, and VI-3 constitute exceptional subtypes with few members.
None
Figure 4. Dendrogram of the classified Type III subtypes. Total numbers of identified subtypes are given on the tree. Gene contents, gene sizes, and gene syntenies are shown for representative interference modules of the different subtypes. The dendrogram represents a simplified and inverted bottom half of the full dendrogram in Figure S2. Gene contents are indicated by bold numbers within genes, and for subtypes III-A and III-B, csm/cmr gene names are also given above the genes. The subtypes have distinct gene syntenies and branch before the defined threshold indicated by the light blue vertical line. Subtypes III-C and III-D are newly defined, and VIII-1 that was found only in some members of the Sulfolobales, and is therefore classified as a variant subtype. All subtypes have cas10, protein S, cas5, and multiple paralogs of cas7. asp denotes the gene of a putative aspartate protease. The subtype I-D gene cassette (in yellow) that branches at the junction of the Type III and Type I subtypes in Figure S2 is included as an outgroup.
None
Figure 5. Examples of Type III-A, III-B, and III-D subtypes in archaea from diverse phyla that are associated with multiple and diverse accessory protein genes. Type III gene cassettes are color-coded red and cmr and csm gene labels are given. Type I-A gene cassettes are color-coded for adaptation (light blue) and interference (yellow). Some of the subtype III-B systems are directly linked to subtype I-A systems. All core cas genes are identified by bold numbers. Non-core cas-associated genes are color-coded violet and are labeled by names above the genes (see also Table 5). Genes for putative regulatory proteins (green) and Cas6 (orange) are also indicated together with CRISPR loci (dark blue) where the number indicates the number of repeat-spacer units. In some Type III gene cassettes, the number of accessory protein genes exceeds the number of core cas genes.
None
Figure 6. A putative example of modular exchange of Type I adaptation and interference modules. Gene cassettes for adaptation (coded blue) and interference (coded yellow) for Methanocaldococcus vulcanius and Methanocaldococcus fervens genomes are depicted showing the adaptation modules with highly similar sequences (93% concatenated amino acid sequence identity) that are associated with three different subtypes (I-A, I-D, and I-G) of Type I interference modules. Gene contents are indicated by bold numbers within genes. R (green) denotes genes of putative transcriptional regulators and 6 (orange) indicates genes of the Cas6 RNA processing enzyme.
None
Figure 7. Schematic comparison of (A) a subtype I-E interference complex from E. coli with (B) a subtype III-B interference complex from Pyrococcus furiosus. The two structures share homologous protein components consistent with the related compositions of their gene cassettes. In (A), the Cas6 protein is considered to be part of the interference complex and Cas3 is essential for the interference mechanism. In (B), Cas 6 has not been shown to be part of the interference complex. Moreover, the two Cas7 proteins in green (Cmr1) and orange Cmr6 (see Fig. 4) are Cas7 paralogs. The estimated binding sites of the crRNAs are color-coded brown.

References

    1. Barrangou R, van der Oost J, eds. CRISPR-Cas systems. Springer press, Heidelberg, 2012; pp. 1-299.
    1. Kunin V, Sorek R, Hugenholtz P. Evolutionary conservation of sequence and secondary structures in CRISPR repeats. Genome Biol. 2007;8:R61. doi: 10.1186/gb-2007-8-4-r61. - DOI - PMC - PubMed
    1. Haft DH, Selengut J, Mongodin EF, Nelson KE. A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS Comput Biol. 2005;1:e60. doi: 10.1371/journal.pcbi.0010060. - DOI - PMC - PubMed
    1. Makarova KS, Grishin NV, Shabalina SA, Wolf YI, Koonin EV. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol Direct. 2006;1:7. doi: 10.1186/1745-6150-1-7. - DOI - PMC - PubMed
    1. Makarova KS, Haft DH, Barrangou R, Brouns SJ, Charpentier E, Horvath P, Moineau S, Mojica FJ, Wolf YI, Yakunin AF, et al. Evolution and classification of the CRISPR-Cas systems. Nat Rev Microbiol. 2011;9:467–77. doi: 10.1038/nrmicro2577. - DOI - PMC - PubMed

Publication types