Improved DNA-Versus-Protein Homology Search for Protein Fossils

Yin Yao, Martin C Frith

PMID: 35617174
DOI: 10.1109/TCBB.2022.3177855

Yin Yao et al. IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):1691-1699. doi: 10.1109/TCBB.2022.3177855. Epub 2023 Jun 5.

Abstract

Protein fossils, i.e., noncoding DNA descended from coding DNA, arise frequently from transposable elements (TEs), decayed genes, and viral integrations. They can reveal, and mislead about, evolutionary history and relationships. They have been detected by comparing DNA to protein sequences, but current methods are not optimized for this task. We describe a powerful DNA-protein homology search method. We use a 64×21 substitution matrix, which is fitted to sequence data, automatically learning the genetic code. We detect subtly homologous regions by considering alternative possible alignments between them, and calculate significance (probability of occurring by chance between random sequences). Our method detects TE protein fossils much more sensitively than blastx, and faster. Of the ∼7 major categories of eukaryotic TE, three were long thought absent in mammals: we find two of them in the human genome, polinton and DIRS/Ngaro. This method increases our power to find ancient fossils, and perhaps to detect non-standard genetic codes. The alternative-alignments and significance paradigm is not specific to DNA-protein comparison, and could benefit homology search generally. This is an extended version of a conference paper (Yao & Frith, 2021).
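
To make the shape of a 64×21 matrix concrete, the following minimal sketch builds a codon-versus-protein-symbol score table and scores a gapless, in-frame alignment. It is an illustration only, not the authors' method: the abstract says the matrix is fitted to sequence data, whereas the scores here are toy values derived from the standard genetic code. The names `toy_matrix` and `score_ungapped`, the `match`/`mismatch` values, and the reading of the 21 columns as 20 amino acids plus a stop symbol are all assumptions.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order; '*' marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]  # 64 rows
SYMBOLS = "ACDEFGHIKLMNPQRSTVWY*"  # assumed 21 columns: 20 amino acids + stop


def toy_matrix(match=4, mismatch=-2):
    """Return a 64x21 table (codon -> symbol -> score) with toy, unfitted scores."""
    code = dict(zip(CODONS, AMINO))
    return {
        codon: {s: (match if code[codon] == s else mismatch) for s in SYMBOLS}
        for codon in CODONS
    }


def score_ungapped(dna, protein, matrix):
    """Sum codon-vs-residue scores for an in-frame alignment without gaps."""
    codons = [dna[i:i + 3] for i in range(0, 3 * len(protein), 3)]
    return sum(matrix[c][p] for c, p in zip(codons, protein))


matrix = toy_matrix()
print(score_ungapped("ATGGCC", "MA", matrix))  # ATG=M and GCC=A both match
```

A fitted matrix would replace the two toy values with 64×21 independently estimated scores, which is what lets such a matrix reflect a genetic code rather than assume one.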
