Nat Methods. 2026 Feb 9.
doi: 10.1038/s41592-025-02947-1. Online ahead of print.

Addressing pandemic-wide systematic errors in the SARS-CoV-2 phylogeny

Martin Hunt, Angie S Hinrichs, Daniel Anderson, Lily Karim, Bethany L Dearlove, Jeff Knaggs, Bede Constantinides, Philip W Fowler, Gillian Rodger, Teresa Street, Sheila Lumley, Hermione Webster, Theo Sanderson, Christopher Ruis, Benjamin Kotzen, Nicola de Maio, Lucas N Amenga-Etego, Dominic S Y Amuzu, Martin Avaro, Gordon A Awandare, Reuben Ayivor-Djanie, Timothy Barkham, Matthew Bashton, Elizabeth M Batty, Yaw Bediako, Denise De Belder, Estefania Benedetti, Andreas Bergthaler, Stefan A Boers, Josefina Campos, Rosina Afua Ampomah Carr, Yuan Yi Constance Chen, Facundo Cuba, Maria Elena Dattero, Wanwisa Dejnirattisai, Alexander Dilthey, Kwabena Obeng Duedu, Lukas Endler, Ilka Engelmann, Ngiambudulu M Francisco, Jonas Fuchs, Etienne Z Gnimpieba, Soraya Groc, Jones Gyamfi, Dennis Heemskerk, Torsten Houwaart, Nei-Yuan Hsiao, Matthew Huska, Martin Hölzer, Arash Iranzadeh, Hanna Jarva, Chandima Jeewandara, Bani Jolly, Rageema Joseph, Ravi Kant, Karrie Ko Kwan Ki, Satu Kurkela, Maija Lappalainen, Marie Lataretu, Jacob Lemieux, Chang Liu, Gathsaurie Neelika Malavige, Tapfumanei Mashe, Juthathip Mongkolsapaya, Brigitte Montes, Jose Arturo Molina Mora, Collins M Morang'a, Bernard Mvula, Niranjan Nagarajan, Andrew Nelson, Joyce M Ngoi, Joana Paula da Paixão, Marcus Panning, Tomas Poklepovich, Peter K Quashie, Diyanath Ranasinghe, Mara Russo, James Emmanuel San, Nicholas D Sanderson, Vinod Scaria, Gavin Screaton, October Michael Sessions, Tarja Sironen, Abay Sisay, Darren Smith, Teemu Smura, Piyada Supasa, Chayaporn Suphavilai, Jeremy Swann, Houriiyah Tegally, Bryan Tegomoh, Olli Vapalahti, Andreas Walker, Robert J Wilkinson, Carolyn Williamson, Xavier Zair, IMSSC Laboratory Network Consortium, Tulio de Oliveira, Timothy Ea Peto, Derrick Crook, Russell Corbett-Detig, Zamin Iqbal

Martin Hunt et al. Nat Methods.

Abstract

The majority of SARS-CoV-2 genomes obtained during the pandemic were derived by amplifying overlapping windows of the genome ('tiled amplicons'), reconstructing their sequences and fitting them together. This leads to systematic errors in genomes unless the software is aware of both the amplicon scheme and the error modes of amplicon sequencing. Additionally, over time, amplicon schemes need to be updated as new mutations in the virus interfere with the primer binding sites at the ends of amplicons. Thus, waves of variants swept the world during the pandemic and were followed by waves of systematic errors in the genomes, which had significant impacts on the inferred phylogenetic tree. Here we reconstruct the genomes from all public data as of June 2024 using an assembly tool called Viridian (https://github.com/iqbal-lab-org/viridian), developed to rigorously process amplicon sequence data. With these high-quality consensus sequences we provide a global phylogenetic tree of 4,471,579 samples, viewable at https://viridian.taxonium.org. We provide simulation and empirical validation of the methodology, and quantify the improvement in the phylogeny.
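
To make the tiled-amplicon idea concrete, the following is a minimal illustrative sketch, not Viridian's algorithm. It lays out overlapping windows over a toy genome, simulates the loss of one amplicon (as might happen when a mutation disrupts a primer binding site), and masks the resulting uncovered positions with `N` rather than leaving bases there unsupported by reads. The function names, the half-open coordinate convention and the toy sizes are all assumptions made for this example; the choice to mask with `N` is likewise an illustrative assumption and is not taken from the abstract.

```python
from typing import List, Set, Tuple

# Hypothetical representation: half-open [start, end) in reference coordinates.
Amplicon = Tuple[int, int]


def tile_amplicons(genome_length: int, amplicon_length: int, overlap: int) -> List[Amplicon]:
    """Lay out overlapping windows that together cover the whole genome."""
    if not 0 <= overlap < amplicon_length:
        raise ValueError("overlap must be non-negative and smaller than amplicon_length")
    step = amplicon_length - overlap
    amplicons: List[Amplicon] = []
    start = 0
    while True:
        end = min(start + amplicon_length, genome_length)
        amplicons.append((start, end))
        if end == genome_length:
            break
        start += step
    return amplicons


def uncovered_positions(genome_length: int, amplicons: List[Amplicon], failed: Set[int]) -> List[int]:
    """Return positions not covered by any amplicon that amplified successfully."""
    covered = [False] * genome_length
    for index, (start, end) in enumerate(amplicons):
        if index in failed:
            continue  # this amplicon dropped out, so it contributes no coverage
        for pos in range(start, end):
            covered[pos] = True
    return [pos for pos, is_covered in enumerate(covered) if not is_covered]


def mask_consensus(consensus: str, uncovered: List[int]) -> str:
    """Replace bases at uncovered positions with N (illustrative policy only)."""
    bases = list(consensus)
    for pos in uncovered:
        bases[pos] = "N"
    return "".join(bases)


# Toy example: a 100-base genome, 40-base amplicons, 10-base overlaps.
scheme = tile_amplicons(genome_length=100, amplicon_length=40, overlap=10)
gap = uncovered_positions(100, scheme, failed={1})  # the middle amplicon drops out
masked = mask_consensus("A" * 100, gap)
```

In this toy scheme only the part of the failed amplicon that no neighbour overlaps becomes uncovered, which is why knowledge of the amplicon layout matters when deciding which consensus positions are actually supported by data.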

Conflict of interest statement

Competing interests: G. Screaton is on the GSK Vaccines Scientific Advisory Board, consults for AstraZeneca, and is a founding member of RQ Biotechnology. P. Fowler, D. Crook and Z. Iqbal have consulted for the Ellison Institute of Technology. B. Jolly is employed by Karkinos Healthcare Private Limited. The remaining authors declare no competing interests.

Update of

  • Addressing pandemic-wide systematic errors in the SARS-CoV-2 phylogeny.
    Hunt M, Hinrichs AS, Anderson D, Karim L, Dearlove BL, Knaggs J, Constantinides B, Fowler PW, Rodger G, Street T, Lumley S, Webster H, Sanderson T, Ruis C, Kotzen B, de Maio N, Amenga-Etego LN, Amuzu DSY, Avaro M, Awandare GA, Ayivor-Djanie R, Barkham T, Bashton M, Batty EM, Bediako Y, De Belder D, Benedetti E, Bergthaler A, Boers SA, Campos J, Carr RAA, Chen YYC, Cuba F, Dattero ME, Dejnirattisai W, Dilthey A, Duedu KO, Endler L, Engelmann I, Francisco NM, Fuchs J, Gnimpieba EZ, Groc S, Gyamfi J, Heemskerk D, Houwaart T, Hsiao NY, Huska M, Hölzer M, Iranzadeh A, Jarva H, Jeewandara C, Jolly B, Joseph R, Kant R, Ki KKK, Kurkela S, Lappalainen M, Lataretu M, Lemieux J, Liu C, Malavige GN, Mashe T, Mongkolsapaya J, Montes B, Mora JAM, Morang'a CM, Mvula B, Nagarajan N, Nelson A, Ngoi JM, da Paixão JP, Panning M, Poklepovich T, Quashie PK, Ranasinghe D, Russo M, San JE, Sanderson ND, Scaria V, Screaton G, Sessions OM, Sironen T, Sisay A, Smith D, Smura T, Supasa P, Suphavilai C, Swann J, Tegally H, Tegomoh B, Vapalahti O, Walker A, Wilkinson RJ, Williamson C, Zair X; IMSSC2 Laboratory Network Consortium; de Oliveira T, Peto TE, Crook D, Corbett-Detig R, Iqbal Z. bioRxiv [Preprint]. 2024 Nov 5:2024.04.29.591666. doi: 10.1101/2024.04.29.591666. Update in: Nat Methods. 2026 Feb 9. doi: 10.1038/s41592-025-02947-1. PMID: 38746185. Preprint.
