Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 18;80(6):1247-1261.
doi: 10.1093/cid/ciaf046.

Long COVID Incidence Proportion in Adults and Children Between 2020 and 2024: An Electronic Health Record-Based Study From the RECOVER Initiative

Affiliations

Long COVID Incidence Proportion in Adults and Children Between 2020 and 2024: An Electronic Health Record-Based Study From the RECOVER Initiative

Hannah Mandel et al. Clin Infect Dis. .

Abstract

Background: Incidence estimates of post-acute sequelae of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection, also known as long COVID, have varied across studies and changed over time. We estimated long COVID incidence among adult and pediatric populations in 3 nationwide research networks of electronic health records (EHRs) participating in the RECOVER (Researching COVID to Enhance Recovery) Initiative using different classification algorithms (computable phenotypes).

Methods: This EHR-based retrospective cohort study included adult and pediatric patients with documented acute SARS-CoV-2 infection and 2 control groups: contemporary coronavirus disease 2019 (COVID-19)-negative and historical patients (2019). We examined the proportion of individuals identified as having symptoms or conditions consistent with probable long COVID within 30-180 days after COVID-19 infection (incidence proportion). Each network (the National COVID Cohort Collaborative [N3C], National Patient-Centered Clinical Research Network [PCORnet], and PEDSnet) implemented its own long COVID definition. We introduced a harmonized definition for adults in a supplementary analysis.

Results: Overall, 4% of children and 10%-26% of adults developed long COVID, depending on computable phenotype used. Excess incidence among SARS-CoV-2 patients was 1.5% in children and ranged from 5% to 6% among adults, representing a lower-bound incidence estimation based on our control groups. Temporal patterns were consistent across networks, with peaks associated with introduction of new viral variants.

Conclusions: Our findings indicate that preventing and mitigating long COVID remains a public health priority. Examining temporal patterns and risk factors for long COVID incidence informs our understanding of etiology and can improve prevention and management.

Keywords: COVID; EHRs; electronic health records; long COVID; public health surveillance.

PubMed Disclaimer

Conflict of interest statement

Potential conflicts of interest . H. P., K. G. T., C. R. O., A. V., L. E. T., J. D., R. A. M., J. D. K., E. W. K., P. A. K., R. S. G., C. R. G., A. D., T. W. C., M. H., and K. R. T. all reports funding from the NIH. Additionally, J. A. reports grants or contracts from Pfizer and Amgen. C. G. C. reports a leadership role on the World Health Organization (WHO) Medical Scientific Advisory Committee. P. A. K. reports a leadership role with the Commonwealth of Virginia Board of Health. L. C. K. reports owning shares in Amgen, Regeneron, Sanofi, and GLAXF. K. E. R. reports funding from the NIH (National Institute on Minority Health and Health Disparities [NIMHD]) and the US Department of Agriculture (USDA). C. R. O. reports holding leadership positions with the American Academy of Pediatrics, Eastern Society of Pediatric Research, and Journal of Pediatric Infectious Diseases Society. S. E. P. received funding from the National Institute of Mental Health (NIMH) and travel support from Columbia University Mailman School of Public Health. J. L. S. reports funding from NYU Langone. K. R. T. reports funding from the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), NIMHD, NIH Director's Office, Centers for Disease Control and Prevention (CDC), Travere, and Bayer; consulting fees and/or payments/honoraria from Eli Lilly, Boehringer Ingelheim, AstraZeneca, Bayer, Novo Nordisk, Pfizer, Travere, and ProKidney; participation in a data safety monitoring or advisory board for NIDDK/NIH and AstraZeneca; and leadership roles with the American Society of Nephrology and Clinical Journal of the American Society of Nephrology. L. E. T. reports a leadership position with the American Journal of Public Health. L. M. reports a leadership position with the International Society of Nurses in Genetics. C. K. reports receiving payments from the NIH. J. D. K. reports funding from the VA and CDC. K. C. H. reports owning stock in Pfizer. M. H. reports being a founder of Alamya Health. All other authors report no potential conflicts. All authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. Conflicts that the editors consider relevant to the content of the manuscript have been disclosed.

Figures

Graphical Abstract
Graphical Abstract
This graphical abstract is also available at Tidbit: https://tidbitapp.io/institutional-portal/clinical-infectious-diseases/tidbits/long-covid-incidence-proportion-in-adults-and-children-between-2020-and-2023/update
Figure 1.
Figure 1.
Proportion of patients with long COVID and associated variables over time. Each patient was counted as having COVID-19 once, and COVID-19 index dates are graphed on the x-axis for panels BF. The dotted red line shows a 20% benchmark across differently sized axes. All percentages were calculated using the monthly COVID-19 case count as the denominator. “NASEM” refers to the harmonized long COVID definition used in both N3C and PCORnet. A, Count of long COVID cases, with the x-axis representing the date of long COVID onset. B, Count of COVID-19 index cases each month. C, Percentage of patients developing long COVID within 180 days of their COVID-19 index date. D, Percentage of COVID-19 cases hospitalized (with or without ICU-level care) for index infection. E, Percentage of COVID-19 cases treated with Paxlovid. This excludes N3C sites not providing any data around Paxlovid orders. F, Percentage of COVID-19 cases with vaccination documented prior to the index event. Abbreviations: COVID-19, coronavirus disease 2019; ICU, intensive care unit; N3C, National COVID Cohort Collaborative; NASEM, National Academies of Sciences, Engineering, and Medicine; PCORnet, National Patient-Centered Clinical Research Network. Purple represents N3C, yellow represents PCORnet, and green represents PEDSnet.
Figure 2.
Figure 2.
Long COVID incidence proportion by patient subpopulation and adjusted monthly relative hazards. A, Univariate analysis. Incidence of long COVID, calculated as the proportion of COVID-19 cases who developed long COVID within 180 days of the index infection. 95% CIs are provided. Patient characteristics align with categories described in Table 2. Vertical dotted lines represent overall long COVID incidence proportion for each network. B, Two-dimensional heatmap. Heatmaps represent the proportion of COVID-19–positive patients who developed long COVID. Percentages are stratified by COVID-19 severity and patient pre-existing conditions. “N” represents the number of patients within the group. “P” represents the number of long COVID patients within the group. Heatmap scales are based on the percentage of long COVID patients from each network, from 0% (blue) to 100% (red). The midpoint (white) of the scale represents the overall long COVID rate from each network. Values 3 or more times greater than the overall long COVID rate are colored red. C, Risk of long COVID over time compared to January 2021. Multivariable hazard ratios for incident long COVID per month are shown. Hazard ratios were generated by a multivariable Cox proportional hazards regression model, and are presented as unadjusted (black) and adjusted (orange) for age group, sex, race/ethnicity, pre-existing conditions, rurality, COVID-19 severity, vaccination status, and month of the index event. 95% CIs are provided. Abbreviations: COVID-19, coronavirus disease 2019; ED, emergency department; ICU, intensive care unit; N3C, National COVID Cohort Collaborative; PCORnet, National Patient-Centered Clinical Research Network. Purple represents N3C, yellow represents PCORnet, and green represents PEDSnet.
Figure 3.
Figure 3.
Control group analysis. Long COVID incidence proportion, by COVID-19 (COVID-19–positive patients) or index event (control patients) severity and pre-existing condition burden, among (A) COVID-19–positive patients identified between 1 January 2021 and 30 May 2021; (B) contemporary COVID-19–negative patients identified between 1 January 2021 and 30 May 2021; and (C) historical control patients identified between 1 January 2019 and 30 May 2019. Heatmaps for each group are centered (white) on the global long COVID incidence estimated in that group. Lower values are shown in shades of blue, and values up to 2.5 times above the global incidence are shown in shades of red. “N” represents the number of patients within the group. “P” represents the number of long COVID patients within the group. Abbreviations: COVID-19, coronavirus disease 2019; ED, emergency department; ICU, intensive care unit; N3C, National COVID Cohort Collaborative; PCORnet, National Patient-Centered Clinical Research Network. Purple represents N3C, yellow represents PCORnet, and green represents PEDSnet.
Figure 4.
Figure 4.
Harmonized long COVID incidence proportions by patient subpopulation. A, Description of logic for the NASEM-aligned definition applied to our adult populations. B, Univariate analysis applied to this additional long COVID definition. Incidence proportion was calculated as the percentage of COVID-19 cases who developed long COVID within 270 days of index infection. 95% CIs are provided. Vertical dotted lines represent overall long COVID incidence proportion for each network. C, Two-dimensional heatmap. Heatmaps represent the proportion of COVID-19–positive patients who developed long COVID. Percentages are stratified by COVID-19 severity and patient pre-existing conditions. “N” represents the number of patients within the group. “P” represents the number of long COVID patients within the group. Heatmap scales are based on the percentage of long COVID patients from each network, from 0% (blue) to 100% (red). The midpoint (white) of the scale represents the overall long COVID rate from each network. Values 3 or more times greater than the overall long COVID rate are colored red. Abbreviations: COVID-19, coronavirus disease 2019; ED, emergency department; ICU, intensive care unit; N3C, National COVID Cohort Collaborative; NASEM, National Academies of Sciences, Engineering, and Medicine; PCORnet, National Patient-Centered Clinical Research Network. Purple represents N3C, yellow represents PCORnet, and green represents PEDSnet.

Update of

References

    1. Davis HE, McCorkell L, Vogel JM, Topol EJ. Long COVID: major findings, mechanisms and recommendations. Nat Rev Microbiol 2023; 21:133–46. - PMC - PubMed
    1. Soriano JB, Murthy S, Marshall JC, Relan P, Diaz JV. WHO Clinical Case Definition Working Group on post-COVID-19 condition. A clinical case definition of post-COVID-19 condition by a Delphi consensus. Lancet Infect Dis 2022; 22:e102–7. - PMC - PubMed
    1. National Heart, Lung, and Blood Institute (NHLBI) . Long COVID Research and Resources. NHLBI.NIH.gov. Available at: https://www.nhlbi.nih.gov/covid/long-covid. Accessed 5 February 2025.
    1. Tartof SY, Malden DE, Liu ILA, et al. Health care utilization in the 6 months following SARS-CoV-2 infection. JAMA Netw Open 2022; 5:e2225657. - PMC - PubMed
    1. Hanson W, Abbafati S, Aerts C, et al. Estimated global proportions of individuals with persistent fatigue, cognitive, and respiratory symptom clusters following symptomatic COVID-19 in 2020 and 2021. JAMA 2022; 328:1604–15. - PMC - PubMed