Improving taxonomic accuracy for fungi in public sequence databases: applying 'one name one species' in well-defined genera with Trichoderma/Hypocrea as a test case
- PMID: 29220466
- PMCID: PMC5641268
- DOI: 10.1093/database/bax072
Abstract
The ITS (nuclear ribosomal internal transcribed spacer) RefSeq database at the National Center for Biotechnology Information (NCBI) is dedicated to the clear association between name, specimen and sequence data. This database focuses on sequences obtained from type material stored in public collections. While the initial ITS sequence curation effort, carried out together with numerous fungal taxonomy experts, attempted to cover as many orders as possible, we extended our latest focus to the family and genus ranks. We chose Trichoderma for several reasons, mainly because its asexual and sexual synonyms were well documented and a list of proposed names and type material had recently been published. In this case study, the recent taxonomic information was applied to perform a complete taxonomic audit of the genus Trichoderma in the NCBI Taxonomy database. A name status report is available here: https://www.ncbi.nlm.nih.gov/Taxonomy/TaxIdentifier/tax_identifier.cgi. As a result, the ITS RefSeq Targeted Loci database at NCBI has been augmented with more sequences from type and verified material from Trichoderma species. Additionally, to aid in cross-referencing data from single loci and genomes, we have collected a list of quality records of the RPB2 gene obtained from type material in GenBank that could help validate future submissions. During curation, misidentified genomes were discovered, and sequence records from type material were found hidden under previous classifications. Source metadata curation, although more cumbersome, proved useful as confirmation of the type material designation. Database URL: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353
© Crown copyright 2017.
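The 'one name one species' audit described in the abstract can be pictured as resolving every sequence record's organism label to a single current name and flagging records still filed under a synonym. The following is only a minimal sketch of that idea, not the curation pipeline used at NCBI. The `Record` class, the `audit` function, and the accessions in the usage note are hypothetical, and the synonym table is a three-entry illustration of well-known Hypocrea/Trichoderma pairs rather than the published name list.

```python
from dataclasses import dataclass

# Illustrative synonym table (assumption: a tiny hand-picked subset, not the
# published list). Keys are sexual-morph names, values the current name.
SYNONYMS = {
    "Hypocrea jecorina": "Trichoderma reesei",
    "Hypocrea atroviridis": "Trichoderma atroviride",
    "Hypocrea virens": "Trichoderma virens",
}


@dataclass(frozen=True)
class Record:
    """Hypothetical, simplified view of a sequence record's source metadata."""
    accession: str   # placeholder identifier, not a real GenBank accession
    organism: str    # organism name as submitted
    from_type: bool  # True if the sequence comes from type material


def current_name(name: str) -> str:
    """Collapse stray whitespace, then map a known synonym to its current name."""
    cleaned = " ".join(name.split())
    return SYNONYMS.get(cleaned, cleaned)


def audit(records):
    """Group records under one name per species and list those filed under a synonym.

    Returns (by_name, renamed), where by_name maps each current name to its
    records and renamed holds (accession, submitted name, current name) tuples.
    """
    by_name = {}
    renamed = []
    for rec in records:
        cleaned = " ".join(rec.organism.split())
        name = current_name(rec.organism)
        by_name.setdefault(name, []).append(rec)
        if cleaned in SYNONYMS:
            renamed.append((rec.accession, cleaned, name))
    return by_name, renamed
```

For example, calling `audit([Record("EX0001", "Hypocrea jecorina", True), Record("EX0002", "Trichoderma reesei", False)])` groups both records under Trichoderma reesei and reports EX0001 as a type-derived record that was filed under the older name, which is the kind of 'hidden' type sequence the abstract mentions.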