Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Feb;5(2):e000243.
doi: 10.1099/mgen.0.000243. Epub 2019 Feb 4.

Pan-genome analyses of model fungal species

Affiliations

Pan-genome analyses of model fungal species

Charley G P McCarthy et al. Microb Genom. 2019 Feb.

Abstract

The concept of the species 'pan-genome', the union of 'core' conserved genes and all 'accessory' non-conserved genes across all strains of a species, was first proposed in prokaryotes to account for intraspecific variability. Species pan-genomes have been extensively studied in prokaryotes, but evidence of species pan-genomes has also been demonstrated in eukaryotes such as plants and fungi. Using a previously published methodology based on sequence homology and conserved microsynteny, in addition to bespoke pipelines, we have investigated the pan-genomes of four model fungal species: Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans var. grubii and Aspergillus fumigatus. Between 80 and 90 % of gene models per strain in each of these species are core genes that are highly conserved across all strains of that species, many of which are involved in housekeeping and conserved survival processes. In many of these species, the remaining 'accessory' gene models are clustered within subterminal regions and may be involved in pathogenesis and antimicrobial resistance. Analysis of the ancestry of species core and accessory genomes suggests that fungal pan-genomes evolve by strain-level innovations such as gene duplication as opposed to wide-scale horizontal gene transfer. Our findings lend further supporting evidence to the existence of species pan-genomes in eukaryote taxa.

Keywords: Aspergillus; Cryptococcus; comparative genomics; fungal pangenomes; yeast.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there are no conflicts of interest.

Figures

Fig. 1.
Fig. 1.
Seven-set Venn diagram representing a hypothetical species pan-genome. Each set represents genes/gene models conserved across strains of a given species. The core species genome (grey) is defined as the set of all genes/gene models conserved across all strains of a species, while the accessory genome consists of all genes/gene models not universally conserved within a species.
Fig. 2.
Fig. 2.
Approximate maximum-likelihood supermatrix phylogeny of the Saccharomyces cerevisiae pan-genome dataset based on 4311 core orthologue clusters. Saccharomyces cerevisiae populations are as assigned by Strope et al., clinical strains are indicated by red branches. Numbers below branches refer to Shimodaira–Hasegawa local supports, maximum supports are indicated by asterisks. Dollo parsimony analysis of gene model gain/loss events is annotated above branches in green and orange, respectively.
Fig. 3.
Fig. 3.
Approximate maximum-likelihood supermatrix phylogeny of the Candida albicans pan-genome dataset based on 4327 core orthologue clusters. Numbers below branches refer to Shimodaira–Hasegawa local supports, maximum supports are indicated by asterisks. Dollo parsimony analysis of gene model gain/loss events is annotated above branches in green and orange, respectively.
Fig. 4.
Fig. 4.
Approximate maximum-likelihood supermatrix phylogeny of the Cryptococcus neoformans var. grubii pan-genome dataset based on 4512 core orthologue clusters. Numbers below branches refer to Shimodaira–Hasegawa local supports, maximum supports are indicated by asterisks. Dollo parsimony analysis of gene model gain/loss events is annotated above branches in green and orange, respectively.
Fig. 5.
Fig. 5.
Approximate maximum-likelihood supermatrix phylogeny of the Aspergillus fumigatus pan-genome dataset based on 5724 core orthologue clusters. Numbers below branches refer to Shimodaira–Hasegawa local supports, maximum supports are indicated by asterisks. Dollo parsimony analysis of gene model gain/loss events is annotated above branches in green and orange, respectively.
Fig. 6.
Fig. 6.
Pan-genomes of four fungal species. (a) Saccharomyces cerevisiae, (b) Candida albicans, (c) Cryptococcus neoformans var. grubii, (d) Aspergillus fumigatus. The ring charts represent the total number of gene models in pan-genome complements expressed as a proportion of total pan-genome size. Sections in dark-red represent duplicated core gene models in the accessory genome.

References

    1. Parfrey LW, Lahr DJG, Katz LA. The dynamic nature of eukaryotic genomes. Mol Biol Evol. 2008;25:787–794. doi: 10.1093/molbev/msn032. - DOI - PMC - PubMed
    1. Rouli L, Merhej V, Fournier PE, Raoult D. The bacterial pangenome as a new tool for analysing pathogenic bacteria. New Microbes New Infect. 2015;7:72–85. doi: 10.1016/j.nmni.2015.06.005. - DOI - PMC - PubMed
    1. Diene SM, Merhej V, Henry M, El Filali A, Roux V, et al. The rhizome of the multidrug-resistant Enterobacter aerogenes genome reveals how new “killer bugs” are created because of a sympatric lifestyle. Mol Biol Evol. 2013;30:369–383. doi: 10.1093/molbev/mss236. - DOI - PubMed
    1. Lefébure T, Pavinski Bitar PD, Suzuki H, Stanhope MJ. Evolutionary dynamics of complete Campylobacter pan-genomes and the bacterial species concept. Genome Biol Evol. 2010;2:646–655. doi: 10.1093/gbe/evq048. - DOI - PMC - PubMed
    1. Snipen L, Almøy T, Ussery DW. Microbial comparative pan-genomics using binomial mixture models. BMC Genomics. 2009;10:385. doi: 10.1186/1471-2164-10-385. - DOI - PMC - PubMed

Publication types