Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 29:2:100.
doi: 10.12688/openreseurope.14936.3. eCollection 2022.

MTaxi: A comparative tool for taxon identification of ultra low coverage ancient genomes

Affiliations

MTaxi: A comparative tool for taxon identification of ultra low coverage ancient genomes

Gözde Atağ et al. Open Res Eur. .

Abstract

A major challenge in zooarchaeology is to morphologically distinguish closely related species' remains, especially using small bone fragments. Shotgun sequencing aDNA from archeological remains and comparative alignment to the candidate species' reference genomes will only apply when reference nuclear genomes of comparable quality are available, and may still fail when coverages are low. Here, we propose an alternative method, MTaxi, that uses highly accessible mitochondrial DNA (mtDNA) to distinguish between pairs of closely related species from ancient DNA sequences. MTaxi utilises mtDNA transversion-type substitutions between pairs of candidate species, assigns reads to either species, and performs a binomial test to determine the sample taxon. We tested MTaxi on sheep/goat and horse/donkey data, between which zooarchaeological classification can be challenging in ways that epitomise our case. The method performed efficiently on simulated ancient genomes down to 0.3x mitochondrial coverage for both sheep/goat and horse/donkey, with no false positives. Trials on n=18 ancient sheep/goat samples and n=10 horse/donkey samples of known species identity also yielded 100% accuracy. Overall, MTaxi provides a straightforward approach to classify closely related species that are difficult to distinguish through zooarchaeological methods using low coverage aDNA data, especially when similar quality reference genomes are unavailable. MTaxi is freely available at https://github.com/goztag/MTaxi.

Keywords: ancient DNA; bioinformatics; genomics; mtDNA; zooarcheology.

PubMed Disclaimer

Conflict of interest statement

No competing interests were disclosed.

Figures

Figure 1.
Figure 1.. Overview of the MTaxi pipeline.
Flowchart and representations of the steps to determine the sample taxon. Here sheep and goat stand for the candidate species pair, but MTaxi can be applied to any pair of species where mitochondrial DNA reference sequences are available. Target sites represent mismatches (candidate substitutions) between the reference genomes, restricted to transversions. Blue coloured alleles represent transitions, red coloured alleles represent transversions. Reads are assigned to either taxon based on target sites. Purple coloured alleles represent sheep alleles, orange coloured alleles represent goat alleles. Reads may be assigned to the wrong taxon due to homoplastic mutations, technical error, or incomplete lineage sorting.
Figure 2.
Figure 2.. Distribution of target sites along reference mitochondrial genomes.
The figure shows the position of target sites along ( a) sheep and ( b) goat ( c) horse ( d) donkey reference mitochondrial genomes. The sites represent transversion-type substitutions (n=197 for sheep and goat, and n=117 for horse and donkey). The figure was generated through CGview .
Figure 3.
Figure 3.. Results of the method applied to simulated ancient genomes at different coverages.
Binomial test p-values for comparing the proportions of reads assigned to ( a) sheep versus goat, and ( b) and horse versus donkey. For sheep and goat, results are based on transversion substitutions, obtained through the default approach. n refers to the number of simulated genomes in each case (100 for each species in a pair). The height of the blue bar represents the number of simulated goat/donkey and sheep/horse genomes identified correctly with p<0.05, and the height of the red bar represents the number of unidentified cases. No cases were misidentified. The height of the grey bar represents the trials that did not contain any reads aligned to target sites, and thus could not be evaluated.
Figure 4.
Figure 4.. Whole Genome Comparative Alignment Results.
Total number of bases aligned to sheep versus goat reference genomes (left panels) and the mismatch proportions (right panels) for whole genome ancient sheep (upper panels) and goat (lower panels) samples.

References

    1. Braje TJ, Rick TC, Szpak P, et al. : Historical ecology and the conservation of large, hermaphroditic fishes in Pacific coast kelp forest ecosystems. Sci Adv. 2017;3(2): e1601759. 10.1126/sciadv.1601759 - DOI - PMC - PubMed
    1. Costa T, Barri F: Lama guanicoe remains from the Chaco ecoregion (Córdoba, Argentina): An osteological approach to the characterization of a relict wild population. PLoS One. 2018;13(4): e0194727. 10.1371/journal.pone.0194727 - DOI - PMC - PubMed
    1. Murray MS: Zooarchaeology and Arctic Marine Mammal Biogeography, conservation, and Management. Ecol Appl. 2008;18(2 Suppl):S41–55. 10.1890/06-0818.1 - DOI - PubMed
    1. Gifford-Gonzalez D: Zooarchaeology and Ecology: Mortality Profiles, Species Abundance, Diversity.In: An Introduction to Zooarchaeology.Cham: Springer;2018;475–501. 10.1007/978-3-319-65682-3_22 - DOI
    1. Wolverton S, Lyman RL: Applied Zooarchaeology History, Value, and Use.In: Conservation Biology and Applied Zooarchaeology.Tucson: University of Arizona Press;2012. Reference Source

LinkOut - more resources