Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Dec 30;16(1):67.
doi: 10.3390/v16010067.

Reclassification of ASFV into 7 Biotypes Using Unsupervised Machine Learning

Affiliations

Reclassification of ASFV into 7 Biotypes Using Unsupervised Machine Learning

Mark Dinhobl et al. Viruses. .

Abstract

In 2007, an outbreak of African swine fever (ASF), a deadly disease of domestic swine and wild boar caused by the African swine fever virus (ASFV), occurred in Georgia and has since spread globally. Historically, ASFV was classified into 25 different genotypes. However, a newly proposed system recategorized all ASFV isolates into 6 genotypes exclusively using the predicted protein sequences of p72. However, ASFV has a large genome that encodes between 150-200 genes, and classifications using a single gene are insufficient and misleading, as strains encoding an identical p72 often have significant mutations in other areas of the genome. We present here a new classification of ASFV based on comparisons performed considering the entire encoded proteome. A curated database consisting of the protein sequences predicted to be encoded by 220 reannotated ASFV genomes was analyzed for similarity between homologous protein sequences. Weights were applied to the protein identity matrices and averaged to generate a genome-genome identity matrix that was then analyzed by an unsupervised machine learning algorithm, DBSCAN, to separate the genomes into distinct clusters. We conclude that all available ASFV genomes can be classified into 7 distinct biotypes.

Keywords: ASFV; African swine fever; biotype; classification; genotype.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Summary of the analysis pipeline described in Materials and Methods.
Figure 2
Figure 2
Process for going from pairwise percent identity gene-gene identity matrices to DBSCAN. (A) Gene-level identity matrices are multiplied by their weights and averaged resulting in the (B) genome-genome identity matrix. The average genome-genome identity matrix is then analyzed via DBSCAN to identify clusters based on similarity. The weight equation is described in Section 2.4.
Figure 3
Figure 3
Results of DBSCAN demonstrate the number of clusters at the indicated epsilon value. Brackets indicate the convergence of clusters. Dark blue boxes indicate a sequence from a single isolate, and medium blue boxes indicate a group of isolates with similar sequences.

References

    1. Penrith M.L., Kivaria F.M., Masembe C. One hundred years of African swine fever: A tribute to R. Eustace Montgomery. Transbound. Emerg. Dis. 2021;68:2640–2642. doi: 10.1111/tbed.14183. - DOI - PubMed
    1. Chapman D.A., Darby A.C., Da Silva M., Upton C., Radford A.D., Dixon L.K. Genomic analysis of highly virulent Georgia 2007/1 isolate of African swine fever virus. Emerg. Infect. Dis. 2011;17:599–605. doi: 10.3201/eid1704.101283. - DOI - PMC - PubMed
    1. Gonzales W., Moreno C., Duran U., Henao N., Bencosme M., Lora P., Reyes R., Nunez R., De Gracia A., Perez A.M. African swine fever in the Dominican Republic. Transbound. Emerg. Dis. 2021;68:3018–3019. doi: 10.1111/tbed.14341. - DOI - PubMed
    1. Flach B. United States Department of Agriculture (2023). First Case of African Swine Fever Found in Wild Boars in Sweden. Sweden. [(accessed on 1 December 2023)];2023 Available online: https://apps.fas.usda.gov/newgainapi/api/Report/DownloadReportByFileName....
    1. Zhao D., Sun E., Huang L., Ding L., Zhu Y., Zhang J., Shen D., Zhang X., Zhang Z., Ren T., et al. Highly lethal genotype I and II recombinant African swine fever viruses detected in pigs. Nat. Commun. 2023;14:3096. doi: 10.1038/s41467-023-38868-w. - DOI - PMC - PubMed

LinkOut - more resources