Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 May 14;4(4):e00085-19.
doi: 10.1128/mSystems.00085-19. eCollection 2019 Jul-Aug.

Resistance Gene-Directed Genome Mining of 50 Aspergillus Species

Affiliations

Resistance Gene-Directed Genome Mining of 50 Aspergillus Species

Inge Kjærbølling et al. mSystems. .

Abstract

Fungal secondary metabolites are a rich source of valuable natural products, and genome sequencing has revealed a proliferation of predicted biosynthetic gene clusters in the genomes. However, it is currently an unfeasible task to characterize all biosynthetic gene clusters and to identify possible uses of the compounds. Therefore, a rational approach is needed to identify a short list of gene clusters responsible for producing valuable compounds. To this end, several bioactive clusters include a resistance gene, which is a paralog of the target gene inhibited by the compound. This mechanism can be used to identify these clusters. We have developed the FRIGG (fungal resistance gene-directed genome mining) pipeline for identifying this type of biosynthetic gene cluster based on homology patterns of the cluster genes. In this work, the FRIGG pipeline was run using 51 Aspergillus and Penicillium genomes, identifying 72 unique families of putative resistance genes. The pipeline also identified the previously characterized resistance gene inpE from the fellutamide B cluster, thereby validating the approach. We have successfully developed an approach to identify putative valuable bioactive clusters based on a specific resistance mechanism. This approach will be highly useful as an ever-increasing amount of genomic data becomes available; the art of identifying and selecting the right clusters producing novel valuable compounds will only become more crucial. IMPORTANCE Species belonging to the Aspergillus genus are known to produce a large number of secondary metabolites; some of these compounds are used as pharmaceuticals, such as penicillin, cyclosporine, and statin. With whole-genome sequencing, it became apparent that the genetic potential for secondary metabolite production is much larger than expected. As an increasing number of species are whole-genome sequenced, thousands of secondary metabolite genes are predicted, and the question of how to selectively identify novel bioactive compounds from this information arises. To address this question, we have created a pipeline to predict genes involved in the production of bioactive compounds based on a resistance gene hypothesis approach.

Keywords: Aspergillus; bioactive compounds; comparative genomics; fungal; genome mining; genomes; natural products; resistance; secondary metabolism.

PubMed Disclaimer

Figures

FIG 1
FIG 1
Resistance mechanism. (A) Mycophenolic acid chemical structure and biosynthetic cluster with the known resistance gene mpaF (highlighted in red), which is an IMP dehydrogenase (IMPDH) inhibitor. (B) Chemical structure of fellutamide B and overview of the biosynthetic cluster, including the resistance gene inpE (highlighted in red), which is a proteasome inhibitor. (C) Illustration of the resistance mechanism used by some toxin producers. The secondary metabolite is a toxin which inhibits an essential enzyme, the target of the compound. Within the cluster responsible for producing the toxin, a copy of the target gene is found; this version is still functioning despite the compound’s presence and hence makes the organism self-resistant.
FIG 2
FIG 2
Overview of the pipeline illustrating the initial data of predicted secondary metabolite gene clusters and homologous protein families. In step 1, the initial data are combined to generate counts of how many homologs are found in each organisms and how many are within predicted clusters. In step 2, clusters are selected based on specific homology count patterns, either “strict,” where only one gene can have a homolog outside the cluster, or “alternative,” where genes belonging to large protein families are allowed. Step 3 is filtering of the selected clusters based on other clusters having a homologous resistance gene; this step is optional. In step 4, filtering based on the majority of the organisms should provide a homolog of the resistance/target genes. Step 5 is filtering based on the number of organism that should have only a single homolog.
FIG 3
FIG 3
Phylogenetic tree and principal-component analysis of protein family 596635, including the fellutamide B resistance gene. (A) Phylogenetic tree of the protein family (596635) containing the fellutamide B resistance gene (protein identification 2700) from A. nidulans. The phylogenetic tree is a maximum likelihood tree and was created with 500 bootstraps. (B) Principal-component analysis of protein family 596635 containing the fellutamide B resistance gene (protein identification 2700) from A. nidulans. The panels to the left are colored based on the sections to which the proteins belong, while the panels to the right are colored based on if the protein is a putative resistance gene (StrictClust), in a predicted cluster (Clust), a target not in a cluster (0), and the homolog to the putative resistance gene (putative target) (outsideSC).

References

    1. Keller NP, Turner G, Bennett JW. 2005. Fungal secondary metabolism—from biochemistry to genomics. Nat Rev Microbiol 3:937–947. doi:10.1038/nrmicro1286. - DOI - PubMed
    1. Galagan JE, Calvo SE, Cuomo C, Ma L-J, Wortman JR, Batzoglou S, Lee S-I, Baştürkmen M, Spevak CC, Clutterbuck J, Kapitonov V, Jurka J, Scazzocchio C, Farman M, Butler J, Purcell S, Harris S, Braus GH, Draht O, Busch S, D’Enfert C, Bouchier C, Goldman GH, Bell-Pedersen D, Griffiths-Jones S, Doonan JH, Yu J, Vienken K, Pain A, Freitag M, Selker EU, Archer DB, Peñalva MA, Oakley BR, Momany M, Tanaka T, Kumagai T, Asai K, Machida M, Nierman WC, Denning DW, Caddick M, Hynes M, Paoletti M, Fischer R, Miller B, Dyer P, Sachs MS, Osmani SA, Birren BW. 2005. Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae. Nature 438:1105–1115. doi:10.1038/nature04341. - DOI - PubMed
    1. Machida M, Asai K, Sano M, Tanaka T, Kumagai T, Terai G, Kusumoto KI, Arima T, Akita O, Kashiwagi Y, Abe K, Gomi K, Horiuchi H, Kitamoto K, Kobayashi T, Takeuchi M, Denning DW, Galagan JE, Nierman WC, Yu J, Archer DB, Bennett JW, Bhatnagar D, Cleveland TE, Fedorova ND, Gotoh O, Horikawa H, Hosoyama A, Ichinomiya M, Igarashi R, Iwashita K, Juvvadi PR, Kato M, Kato Y, Kin T, Kokubun A, Maeda H, Maeyama N, Maruyama JI, Nagasaki H, Nakajima T, Oda K, Okada K, Paulsen I, Sakamoto K, Sawano T, Takahashi M, Takase K, Terabayashi Y, Wortman JR, et al. . 2005. Genome sequencing and analysis of Aspergillus oryzae. Nature 438:1157–1161. doi:10.1038/nature04300. - DOI - PubMed
    1. Nierman WC, Pain A, Anderson MJ, Wortman JR, Kim HS, Arroyo J, Berriman M, Abe K, Archer DB, Bermejo C, Bennett J, Bowyer P, Chen D, Collins M, Coulsen R, Davies R, Dyer PS, Farman M, Fedorova N, Fedorova N, Feldblyum TV, Fischer R, Fosker N, Fraser A, García JL, García MJ, Goble A, Goldman GH, Gomi K, Griffith-Jones S, Gwilliam R, Haas B, Haas H, Harris D, Horiuchi H, Huang J, Humphray S, Jiménez J, Keller N, Khouri H, Kitamoto K, Kobayashi T, Konzack S, Kulkarni R, Kumagai T, Lafon A, Latgé J-P, Li W, Lord A, Lu C, et al. . 2005. Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus. Nature 438:1151–1156. doi:10.1038/nature04332. - DOI - PubMed
    1. Kjærbølling I, Vesth TC, Frisvad JC, Nybo JL, Theobald S, Kuo A, Bowyer P, Matsuda Y, Mondo S, Lyhne EK, Kogle ME, Clum A, Lipzen A, Salamov A, Ngan CY, Daum C, Chiniquy J, Barry K, LaButti K, Haridas S, Simmons BA, Magnuson JK, Mortensen UH, Larsen TO, Grigoriev IV, Baker SE, Andersen MR. 2018. Linking secondary metabolites to gene clusters through genome sequencing of six diverse Aspergillus species. Proc Natl Acad Sci U S A 115:E753–E761. doi:10.1073/pnas.1715954115. - DOI - PMC - PubMed

LinkOut - more resources