Tackling Hypotheticals in Helminth Genomes

Affiliations

¹ Molecular Parasitology, Animal Science, AgResearch Ltd., Grasslands Research Centre, Palmerston North, New Zealand. Electronic address: nik.palevich@agresearch.co.nz.
² Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, UK.
³ Instituto de Microbiología y Parasitología Médica, Universidad de Buenos Aires Consejo Nacional de Investigaciones Científicas y Técnicas (IMPaM-UBA-CONICET), Buenos Aires, Argentina.
⁴ McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA; Division of Infectious Diseases, Department of Medicine, Washington University School of Medicine, St Louis, MO, USA.
⁵ Centro de Pesquisas René Rachou, FIOCRUZ, Belo Horizonte, Minas Gerais, Brazil.
⁶ NIAID, National Institutes of Health, Bethesda, MD, USA.
⁷ BFS, Institute of Parasitology, Justus Liebig University Giessen, Germany.
⁸ McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.
⁹ Molecular Parasitology Division, New England Biolabs, Inc., Ipswich, MA, USA.

PMID: 29249363
PMCID: PMC11021132
DOI: 10.1016/j.pt.2017.11.007

Tackling Hypotheticals in Helminth Genomes

International Molecular Helminthology Annotation Network (IMHAN) et al. Trends Parasitol. 2018 Mar.

. 2018 Mar;34(3):179-183.

doi: 10.1016/j.pt.2017.11.007. Epub 2017 Dec 14.

Affiliations

¹ Molecular Parasitology, Animal Science, AgResearch Ltd., Grasslands Research Centre, Palmerston North, New Zealand. Electronic address: nik.palevich@agresearch.co.nz.
² Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, UK.
³ Instituto de Microbiología y Parasitología Médica, Universidad de Buenos Aires Consejo Nacional de Investigaciones Científicas y Técnicas (IMPaM-UBA-CONICET), Buenos Aires, Argentina.
⁴ McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA; Division of Infectious Diseases, Department of Medicine, Washington University School of Medicine, St Louis, MO, USA.
⁵ Centro de Pesquisas René Rachou, FIOCRUZ, Belo Horizonte, Minas Gerais, Brazil.
⁶ NIAID, National Institutes of Health, Bethesda, MD, USA.
⁷ BFS, Institute of Parasitology, Justus Liebig University Giessen, Germany.
⁸ McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.
⁹ Molecular Parasitology Division, New England Biolabs, Inc., Ipswich, MA, USA.

PMID: 29249363
PMCID: PMC11021132
DOI: 10.1016/j.pt.2017.11.007

Abstract

Advancements in genome sequencing have led to the rapid accumulation of uncharacterized 'hypothetical proteins' in the public databases. Here we provide a community perspective and some best-practice approaches for the accurate functional annotation of uncharacterized genomic sequences.

Keywords: CRISPR; RNAi; annotation; genomes; helminth; hypothetical genes.

PubMed Disclaimer

Figures

**Figure 1.. Approaches for Functional Annotation of Uncharacterized Genes.**
The most efficient means of investigating genes encoded in helminth genomes with the ‘hypothetical’ function annotation is to initially search the currently available sequence databases (typically, NCBI nonredundant database [https://www.ncbi.nlm.nih.gov)] for sequence similarity, using BLAST. This should be followed up by searching structural and specialized databases, for example: protein databases (such as UniProt), enzyme databases (such as BRENDA), and metabolic databases [such as KEGG and Gene Ontology (GO)], for metabolic pathway reconstruction [2]. Several linux-based tools can be used to precisely predict enzyme function, such as DETECT, PRIAM, EFICAz², and InterProScan. Another *in silico* method used to improve functional annotation is phylogenomics [3], where hypothetical proteins from phylogenetically related species are compared. Once putative function is determined, cloning and sequencing of full-length cDNAs, proteomics (such as mass spectrometry), and RNA-Seq data can be used to experimentally validate annotations. Additional techniques, such as gene transformation and CRISPR/Cas-9 gene silencing, can also be applied 5, 6, 7, 8, 9, 10. The above mentioned tools and techniques should be used in concert with extensive literature mining to manually curate genomic content. The resulting genes/protein sequences should be deposited in public databases such as COMBREX and WormBase. As the research community accumulates information regarding experimentally verified and published genes/proteins along with species and strain identifications, a ‘Gold Standard’ database can emerge.

See this image and copyright information in PMC

References

1. Schnoes AM, et al. Annotation error in public databases: misannotation of molecular function in enzyme superfamilies. PLoS Comput. Biol, 5 (2009), Article e1000605 - PMC - PubMed
1. Leale G, et al. Inferring unknown biological functions by integration of GO annotations and gene expression data. IEEE/ACM Trans. Comput. Biol. Bioinform, 99 (2016), pp. 1–19 arXiv:1608.03672 - PubMed
1. Silva LL, et al. The Schistosoma mansoni phylome: using evolutionary genomics to gain insight into a parasite’s biology. BMC Genomics, 13 (2012), p. 617 - PMC - PubMed
1. Green ML, Karp PD. A Bayesian method for identifying missing enzymes in predicted metabolic pathway databases. BMC Bioinformatics, 5 (2004), p. 76 - PMC - PubMed
1. Štefanić S, et al. RNA interference in Schistosoma mansoni Schistosomula: selectivity, sensitivity and operation for larger-scale screening. PLoS Negl. Trop. Dis., 4 (2010), Article e850 - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

Z99 AI999999/ImNIH/Intramural NIH HHS/United States

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Tackling Hypotheticals in Helminth Genomes

Affiliations

Tackling Hypotheticals in Helminth Genomes

Authors

Affiliations

Abstract

Figures

References

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources