Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 8:15:1556521.
doi: 10.3389/fonc.2025.1556521. eCollection 2025.

Progress and current trends in prediction models for the occurrence and prognosis of cancer and cancer-related complications: a bibliometric and visualization analysis

Affiliations

Progress and current trends in prediction models for the occurrence and prognosis of cancer and cancer-related complications: a bibliometric and visualization analysis

Siyu Li et al. Front Oncol. .

Abstract

Objective: Prediction models, which estimate disease or outcome probabilities, are widely used in cancer research. This study aims to identify hotspots and future directions of cancer-related prediction models using bibliometrics.

Methods: A comprehensive literature search was conducted in the Science Citation Index Expanded (SCIE) from the Web of Science Core Collection (WoSCC) up to November 15, 2024, focusing on cancer-related prediction models research. Co-occurrence analyses of countries, institutions, authors, journals, and keywords were conducted using VOSviewer 1.6.20. Additionally, keyword clustering, timeline visualization, and burst term analysis were performed with CiteSpace 6.3.

Results: A total of 1,661 records were retrieved from the SCIE. After deduplication and eligibility screening, 1,556 publications were included in the analysis. The bibliometric analysis revealed a consistent annual increase in cancer-related prediction model research, with China and the United States emerging as the leading contributors. The United States, England, and the Netherlands had the strongest collaborative networks. The most frequent keywords, excluding "prediction model" and "predictive model", included nomogram (frequency=192), survival (191), risk (121), prognosis (112), breast cancer (103), carcinoma (93), validation (87), surgery (85), diagnosis (83), chemotherapy (80), and machine learning (77). Besides, the timeline view analysis indicated that the "#7 machine learning" cluster was experiencing vigorous growth.

Conclusion: Cancer-related prediction models are rapidly advancing, especially in prognostic models. Emerging modeling techniques, such as neural networks and deep learning algorithms, are likely to play a pivotal role in current and future cancer-related prediction model research. Systematic reviews of cancer-related predictive models, which could help clinicians select the optimal model for specific clinical conditions may emerge as potential research directions in this field.

Keywords: bibliometrics; cancer; hotspots and trends; machine learning; prediction models; visualization analysis.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
The process of bibliometric analysis.
Figure 2
Figure 2
Literature screening process.
Figure 3
Figure 3
Distribution of publication types and annual publication volume. (A) Annual publication volume and citations of publications; (B) Distribution of publication types.
Figure 4
Figure 4
Distribution of countries/regions and institutions. (A) A visual mapping of the collaborative networks among countries/regions in relevant publications. Each circle represents a country/region, with the size of the circle proportional to the number of publications; larger circles imply a greater number of publications. (B) A visual mapping of the collaborative networks among institutions. Each circle represents an institution, and the size of the circle proportional to the number of publications; larger circles imply a greater number of publications.
Figure 5
Figure 5
Distribution of authors and journals. (A) Visual mapping of the collaboration networks among authors. Each circle represents an author, and a larger circle indicates more publications. (B) Visual mapping of the journals. Each circle represents a journal, and a larger circle indicates more publications.
Figure 6
Figure 6
Co-occurrence and cluster of keywords. (A) VOSviewer keyword co-occurrence map: Each circle represents a keyword, and a larger circle indicates a higher number of publications associated with that keyword. To ensure readability, only keywords with a frequency of occurrence ≥20 are visually mapped in the VOSviewer keyword co-occurrence map. (B) CiteSpace keyword clustering map: Different colored areas represent different clusters of keywords.
Figure 7
Figure 7
Keyword burst and timeline. (A) CiteSpace burst term map: Burst terms typically represent emerging research directions or shifts in field hotspots. The red segment indicates the burst period of the keyword (i.e., the timeframe when its frequency surged abruptly), while the blue segment corresponds to conventional active periods before or after the burst. Strength refers to the Burst Strength — the higher the value, the more rapidly the attention to the keyword has grown. (B) CiteSpace timeline map: Temporal analysis of keyword clusters, highlighting longitudinal trends, and pivotal milestones. The horizontal axis represents years, while the vertical axis displays keyword clusters. Keywords within the same-color cluster are thematically related. Connecting lines indicate co-occurrence relationships between keywords, and thicker lines signify stronger associations.
Figure 8
Figure 8
Density visualization and timeline of machine learning. (A) VOSviewer keyword density visualization: Each circle represents a keyword, and a brighter circle indicates a higher number of publications associated with that keyword. To ensure readability, only keywords with a frequency of occurrence ≥5 are visually mapped in the density visualization. (B) CiteSpace timeline map: Temporal analysis of keyword clusters, highlighting longitudinal trends, and pivotal milestones. The horizontal axis represents years, while the vertical axis displays keyword clusters. Keywords within the same-color cluster are thematically related. Connecting lines indicate co-occurrence relationships between keywords, and thicker lines signify stronger associations.

Similar articles

References

    1. Global Burden of Disease 2019 Cancer Collaboration. Kocarnik JM, Compton K, Dean FE, Fu W, Gaw BL, Harvey JD, et al. Cancer incidence, mortality, years of life lost, years lived with disability, and disability-adjusted life years for 29 cancer groups from 2010 to 2019: A systematic analysis for the global burden of disease study 2019. JAMA Oncol. (2022) 8:420–44. doi: 10.1001/jamaoncol.2021.6987, PMID: - DOI - PMC - PubMed
    1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834, PMID: - DOI - PubMed
    1. Collaborators GF. Burden of disease scenarios for 204 countries and territories, 2022-2050: a forecasting analysis for the Global Burden of Disease Study 2021. Lancet (London England). (2024) 403:2204–56. doi: 10.1016/S0140-6736(24)00685-8, PMID: - DOI - PMC - PubMed
    1. Cao W, Qin K, Li F, Chen W. Comparative study of cancer profiles between 2020 and 2022 using global cancer statistics (GLOBOCAN). J Natl Cancer Center. (2024) 4:128–34. doi: 10.1016/j.jncc.2024.05.001, PMID: - DOI - PMC - PubMed
    1. Weeden CE, Hill W, Lim EL, Grönroos E, Swanton C. Impact of risk factors on early cancer evolution. Cell. (2023) 186:1541–63. doi: 10.1016/j.cell.2023.03.013, PMID: - DOI - PubMed

Publication types

LinkOut - more resources