Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 29;15(1):15041.
doi: 10.1038/s41598-025-98366-5.

Multifractal analysis and support vector machine for the classification of coronaviruses and SARS-CoV-2 variants

Affiliations

Multifractal analysis and support vector machine for the classification of coronaviruses and SARS-CoV-2 variants

J P Correia et al. Sci Rep. .

Abstract

This study presents a novel approach for the classification of coronavirus species and variants of SARS-CoV-2 using Chaos Game Representation (CGR) and 2D Multifractal Detrended Fluctuation Analysis (2D MF-DFA). By extracting fractal parameters from CGR images, we constructed a state space that effectively distinguishes different species and variants. Our method achieved [Formula: see text] accuracy in species classification, with a notable [Formula: see text] accuracy for SARS-CoV-2 variants despite their genetic similarities. Using a Support Vector Machine (SVM) as a classifier further enhanced the performance. This approach, which requires fewer steps than most existing methods, offers an efficient and effective tool for viral classification, with implications for bioinformatics, public health, and vaccine development.

Keywords: Coronaviridae; CGR; Fractal; RNA; SVM; Viral.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Graphical summary of workflows.
Fig. 2
Fig. 2
Chaos game representation for coronavirus species: HCoV-HKU1, HCoV-OC43, HCoV-NL63, HCoV-229E, HCoV-MERS, SARS-CoV-2. We used the samples identified as reference sequences in NCBI.
Fig. 3
Fig. 3
Power-law multifractal nature of coronavirus species CGR image. Some constants are subtracted to make the contrast between the different curves clearer in graphics of F(q) vs. q. The straight lines are the best-fit lines whose slopes are shown in the legend.
Fig. 4
Fig. 4
Multifractal spectrum of the coronavirus species (above) and the variants of SARS-CoV-2 (below).
Fig. 5
Fig. 5
Scatter plots of fractal parameters.
Fig. 6
Fig. 6
State space constructed using fractal parameters (h(2), formula image, formula image ) for SARS-CoV-2 variants. The apparent mixing of certain variants may reflect their evolutionary proximity or similarities in genomic features.
Fig. 7
Fig. 7
Left: The accuracy of the six coronavirus species for the selected combinations with increasing K. Right: The accuracy of the five SARS-CoV-2 species variants for the selected combinations with increasing K.
Fig. 8
Fig. 8
Confusion Matrix. Each row represents the actual class, and each column represents the predicted class. The diagonal elements indicate correctly classified samples. Classes: 0 (Alpha variant), 1 (Beta variant), 2 (Delta variant), 3 (Gamma variant), 4 (Omicron variant).

Similar articles

References

    1. Nicola, M. et al. The socio-economic implications of the coronavirus pandemic (covid-19): A review. Int. J. Surg.78, 185–193 (2020). - PMC - PubMed
    1. Hiscott, J. et al. The global impact of the coronavirus pandemic. Cytokine Growth Factor Rev.53, 1–9 (2020). - PMC - PubMed
    1. Drake, J. W. & Holland, J. J. Mutation rates among rna viruses. Proc. Natl. Acad. Sci.96(24), 13910–13913 (1999). - PMC - PubMed
    1. Zhao, Z. et al. Moderate mutation rate in the sars coronavirus genome and its implications. BMC Evol. Biol.4, 1–9 (2004). - PMC - PubMed
    1. Duffy, S. Why are rna virus mutation rates so damn high?. PLoS biology16(8), e3000003 (2018). - PMC - PubMed

Supplementary concepts

LinkOut - more resources