The state of play in higher eukaryote gene annotation
- PMID: 27773922
- PMCID: PMC5876476
- DOI: 10.1038/nrg.2016.119
Abstract
A genome sequence is worthless if it cannot be deciphered; therefore, efforts to describe - or 'annotate' - genes began as soon as DNA sequences became available. Whereas early work focused on individual protein-coding genes, the modern genomic ocean is a complex maelstrom of alternative splicing, non-coding transcription and pseudogenes. Scientists - from clinicians to evolutionary biologists - need to navigate these waters, and this has led to the design of high-throughput, computationally driven annotation projects. The catalogues that are being produced are key resources for genome exploration, especially as they become integrated with expression, epigenomic and variation data sets. Their creation, however, remains challenging.