Database (Oxford). 2017 Jan 1;2017:bax072.
doi: 10.1093/database/bax072.

Improving taxonomic accuracy for fungi in public sequence databases: applying 'one name one species' in well-defined genera with Trichoderma/Hypocrea as a test case

Barbara Robbertse et al. Database (Oxford). 2017.

Abstract

The ITS (nuclear ribosomal internal transcribed spacer) RefSeq database at the National Center for Biotechnology Information (NCBI) is dedicated to the clear association between name, specimen and sequence data. This database is focused on sequences obtained from type material stored in public collections. While the initial ITS sequence curation effort, together with numerous fungal taxonomy experts, attempted to cover as many orders as possible, we extended our latest focus to the family and genus ranks. We focused on Trichoderma for several reasons, mainly because the asexual and sexual synonyms were well documented, and a list of proposed names and type material was recently published. In this case study the recent taxonomic information was applied to do a complete taxonomic audit for the genus Trichoderma in the NCBI Taxonomy database. A name status report is available here: https://www.ncbi.nlm.nih.gov/Taxonomy/TaxIdentifier/tax_identifier.cgi. As a result, the ITS RefSeq Targeted Loci database at NCBI has been augmented with more sequences from type and verified material from Trichoderma species. Additionally, to aid in the cross referencing of data from single loci and genomes, we have collected a list of quality records of the RPB2 gene obtained from type material in GenBank that could help validate future submissions. During the process of curation, misidentified genomes were discovered, and sequence records from type material were found hidden under previous classifications. Source metadata curation, although more cumbersome, proved to be useful as confirmation of the type material designation. Database URL: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353

Figures

Figure 1. Morphology of specimens in the Trichoderma/Hypocrea clade: (A) asexual structures (conidiophore and conidia) of Trichoderma harzianum (FJ967806), (B) growth in culture of a specimen in the Trichoderma harzianum complex and (C) sexual reproduction structures of Hypocrea species.
Figure 2. Bar graph showing the number of formal Trichoderma names associated with different attributes in databases at NCBI.
Figure 3. Graphical display of ITS1 length compared with ITS2 length from Trichoderma ITS RefSeq records. Grey arrows indicate the minimum lengths of ITS1 and ITS2 observed using ITSx annotation.
Figure 4. A graphical summary of RefSeq ITS sequence BLASTn search results (% identity and alignment length) between (Figure 4A) and within clades (Figure 4B), where clades were defined by Jacklitsch and Voglmayr (23).
Figure 5. A phylogenetic tree generated by a FastTree analysis using a MAFFT alignment of RPB2 nucleotide sequences from type material (with asterisk) of Trichoderma and genomes labeled as Trichoderma.
Figure 6. Additions and updates to Trichoderma and Hypocrea binomial names in the NCBI Taxonomy database over the past 22 years.

Similar articles

  • An oligonucleotide barcode for species identification in Trichoderma and Hypocrea.
    Druzhinina IS, Kopchinskiy AG, Komoń M, Bissett J, Szakacs G, Kubicek CP. Fungal Genet Biol. 2005 Oct;42(10):813-28. doi: 10.1016/j.fgb.2005.06.007. PMID: 16154784
  • Finding needles in haystacks: linking scientific names, reference specimens and molecular data for Fungi.
    Schoch CL, Robbertse B, Robert V, Vu D, Cardinali G, Irinyi L, Meyer W, Nilsson RH, Hughes K, Miller AN, Kirk PM, Abarenkov K, Aime MC, Ariyawansa HA, Bidartondo M, Boekhout T, Buyck B, Cai Q, Chen J, Crespo A, Crous PW, Damm U, De Beer ZW, Dentinger BT, Divakar PK, Dueñas M, Feau N, Fliegerova K, García MA, Ge ZW, Griffith GW, Groenewald JZ, Groenewald M, Grube M, Gryzenhout M, Gueidan C, Guo L, Hambleton S, Hamelin R, Hansen K, Hofstetter V, Hong SB, Houbraken J, Hyde KD, Inderbitzin P, Johnston PR, Karunarathna SC, Kõljalg U, Kovács GM, Kraichak E, Krizsan K, Kurtzman CP, Larsson KH, Leavitt S, Letcher PM, Liimatainen K, Liu JK, Lodge DJ, Luangsa-ard JJ, Lumbsch HT, Maharachchikumbura SS, Manamgoda D, Martín MP, Minnis AM, Moncalvo JM, Mulè G, Nakasone KK, Niskanen T, Olariaga I, Papp T, Petkovits T, Pino-Bodas R, Powell MJ, Raja HA, Redecker D, Sarmiento-Ramirez JM, Seifert KA, Shrestha B, Stenroos S, Stielow B, Suh SO, Tanaka K, Tedersoo L, Telleria MT, Udayanga D, Untereiner WA, Diéguez Uribeondo J, Subbarao KV, Vágvölgyi C, Visagie C, Voigt K, Walker DM, Weir BS, Weiß M, Wijayawardene NN, Wingfield MJ, Xu JP, Yang ZL, Zhang N, Zhuang WY, Federhen S. Database (Oxford). 2014 Jun 30;2014:bau061. doi: 10.1093/database/bau061. Print 2014. PMID: 24980130 Free PMC article.
  • NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.
    Pruitt KD, Tatusova T, Maglott DR. Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4. doi: 10.1093/nar/gki025. PMID: 15608248 Free PMC article.
  • NCBI Taxonomy: a comprehensive update on curation, resources and tools.
    Schoch CL, Ciufo S, Domrachev M, Hotton CL, Kannan S, Khovanskaya R, Leipe D, Mcveigh R, O'Neill K, Robbertse B, Sharma S, Soussov V, Sullivan JP, Sun L, Turner S, Karsch-Mizrachi I. Database (Oxford). 2020 Jan 1;2020:baaa062. doi: 10.1093/database/baaa062. PMID: 32761142 Free PMC article. Review.
  • Recent advances and future prospects in peptaibiotics, hydrophobin, and mycotoxin research, and their importance for chemotaxonomy of Trichoderma and Hypocrea.
    Degenkolb T, von Döhren H, Nielsen KF, Samuels GJ, Brückner H. Chem Biodivers. 2008 May;5(5):671-80. doi: 10.1002/cbdv.200890064. PMID: 18493954 Review.

References

    1. Nilsson R.H., Ryberg M., Kristiansson E. et al. (2006) Taxonomic reliability of DNA sequences in public sequence databases: a fungal perspective. PLoS One, 1, e59.
    2. Nagy L.G., Petkovits T., Kovács G.M. et al. (2011) Where is the unseen fungal diversity hidden? A study of Mortierella reveals a large contribution of reference collections to the identification of fungal environmental sequences. New Phytol., 191, 789–794.
    3. O'Donnell K., Humber R.A., Geiser D.M. et al. (2012) Phylogenetic diversity of insecticolous fusaria inferred from multilocus DNA sequence data and their molecular identification via FUSARIUM-ID and Fusarium MLST. Mycologia, 104, 427–445.
    4. Irinyi L., Serena C., Garcia-Hermoso D. et al. (2015) International Society of Human and Animal Mycology (ISHAM)-ITS reference DNA barcoding database: the quality controlled standard tool for routine identification of human and animal pathogenic fungi. Med. Mycol., 53, 313–337.
    5. Kopchinskiy A., Komon M., Kubicek C.P. et al. (2005) TrichoBLAST: a multilocus database for Trichoderma and Hypocrea identifications. Mycol. Res., 109, 658–660.
