Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 12;3(1):51.
doi: 10.1038/s43856-023-00283-z.

Discovery and systematic assessment of early biomarkers that predict progression to severe COVID-19 disease

Affiliations

Discovery and systematic assessment of early biomarkers that predict progression to severe COVID-19 disease

Katrin Hufnagel et al. Commun Med (Lond). .

Abstract

Background: The clinical course of COVID-19 patients ranges from asymptomatic infection, via mild and moderate illness, to severe disease and even fatal outcome. Biomarkers which enable an early prediction of the severity of COVID-19 progression, would be enormously beneficial to guide patient care and early intervention prior to hospitalization.

Methods: Here we describe the identification of plasma protein biomarkers using an antibody microarray-based approach in order to predict a severe cause of a COVID-19 disease already in an early phase of SARS-CoV-2 infection. To this end, plasma samples from two independent cohorts were analyzed by antibody microarrays targeting up to 998 different proteins.

Results: In total, we identified 11 promising protein biomarker candidates to predict disease severity during an early phase of COVID-19 infection coherently in both analyzed cohorts. A set of four (S100A8/A9, TSP1, FINC, IFNL1), and two sets of three proteins (S100A8/A9, TSP1, ERBB2 and S100A8/A9, TSP1, IFNL1) were selected using machine learning as multimarker panels with sufficient accuracy for the implementation in a prognostic test.

Conclusions: Using these biomarkers, patients at high risk of developing a severe or critical disease may be selected for treatment with specialized therapeutic options such as neutralizing antibodies or antivirals. Early therapy through early stratification may not only have a positive impact on the outcome of individual COVID-19 patients but could additionally prevent hospitals from being overwhelmed in potential future pandemic situations.

Plain language summary

We aimed to identify components of the blood present during the early phase of SARS-CoV-2 infection that distinguish people who are likely to develop severe symptoms of COVID-19. Blood from people who later developed a mild or moderate course of disease were compared to blood from people who later had a severe or critical course of disease. Here, we identified a combination of three proteins that were present in the blood of patients with COVID-19 who later developed a severe or critical disease. Identifying the presence of these proteins in patients at an early stage of infection could enable physicians to treat these patients early on to avoid progression of the disease.

PubMed Disclaimer

Conflict of interest statement

The authors declare the following competing interests: C.S. and J.D.H are founders and shareholders of Sciomics GmbH. C.S., K.H., N.S., M.K., A.G. and F.S. are patent inventors who contributed to a patent claiming Covid-19 disease severity biomarkers (WO2022028917). All other authors have no competing interests to declare.

Figures

Fig. 1
Fig. 1. Patient and sample characteristics of the 1st cohort.
Characteristics (age, sex) of individuals included in this study and timepoints of sample collection after symptom onset. Letters a–p indicate the individual patients. Male patients are depicted in blue, female patients in red. Circles represent samples of mild or moderate (MM) patients, squares those of critical or severe (CS) patients. Acute phase: 14 MM and 3 CS samples; intermediate phase: 13 MM and 7 CS samples; late phase: 10 and 6 samples. Dotted vertical lines indicate 10 days and 21 days after onset of symptoms.
Fig. 2
Fig. 2. Venn diagram and volcano plots illustrating the number, degree and significance of differential protein expression in the 1st cohort.
The volcano plots visualize the p values (adjusted for multiple testing) and corresponding log-fold changes (logFC) of the identified protein biomarker candidates. A significance level of adj. p value = 0.05 is indicated as a horizontal red line. Absolute logFC cutoffs of |logFC | > 0.5 are indicated as vertical lines. 53 plasma samples from 16 COVID-19 patients were analyzed and divided into an acute (a) an intermediate (b) and a late stage (c), for which we included 18, 20 and 16 samples, respectively. Proteins with a positive logFC had a higher abundance in CS samples, proteins with a negative value in MM samples. d: Venn diagram listing the differential proteins and their numbers in the respective phases. Green numbers and protein IDs indicate proteins more abundant in CS patients and red numbers and IDs proteins with higher abundance in MM patients.
Fig. 3
Fig. 3. Stripcharts representing individual array values for all proteins selected as top candidates in the 1st cohort.
Each protein is measured by four replicate spots per array and is represented by their mean. The y-axis illustrates the log2 ratio of the individual samples and a reference sample while the x-axis is divided based on clinical course of disease of the patient (CS and MM) as well as phase of infection. 53 plasma samples from 16 COVID-19 patients were analyzed and divided into an acute (A) an intermediate (M) and a late stage (L), for which we included 18, 20 and 16 samples, respectively. Acute CS and MM samples are highlighted in red and blue respectively. Diamonds indicate arithmetic sample group means. Whiskers indicate one standard deviation, calculated based on arithmetic means. Empty circles indicate the group coefficients fitted by the linear model with additional factors sex, age and comorbidities, which were used for logFC and p value calculation.
Fig. 4
Fig. 4. Protein biomarker candidates from the 2nd cohort to predict a severe or critical disease in acute phase.
Plasma samples from 94 COVID-19 patients during the acute phase of disease were analyzed on antibody microarrays to identify differentially abundant proteins between patients with either a mild or moderate (MM, n = 47) or a critical or severe (CS, n = 47) disease course. The volcano plot visualizes the p values and corresponding log-fold changes (logFC). A significance level of adj. p value = 0.05 is indicated as a horizontal red line. Absolute logFC cutoffs of |logFC | > 0.5 are indicated as vertical lines. Proteins with a positive logFC had a higher abundance in CS samples, proteins with a negative value in MM samples. Two different antibodies against S10A8/A9 were included on the microarray, with both antibodies showing significant differences between CS and MM samples.
Fig. 5
Fig. 5. Biomarkers to predict a severe / critical disease in both study cohorts.
The y-axis illustrates log2(sample / reference) values after subtracting the group mean of the respective MM samples per cohort/protein, thus setting the mean value of MM samples as a baseline. Within the 1st cohort 18 samples (MM = 17; CS = 3) were analyzed, while 94 sample (MM = 47; CS = 47) were analyzed within the 2nd cohort. The x-axis is divided based on clinical course of disease of the patient (CS and MM). Only acute CS and MM samples are shown. Diamonds indicate arithmetic sample group means. Whiskers indicate one standard deviation, calculated based on the arithmetic means. Empty circles indicate the group coefficients fitted by the linear model with additional factors sex, age and comorbidities, which were used for logFC and p value calculation.
Fig. 6
Fig. 6. Validation of microarray data.
Plasma samples from 94 COVID-19 patients during the acute phase of disease (2nd cohort) were analyzed by ELISA, to validate differentially abundant proteins between patients with either a mild or moderate (MM, n = 47) or a critical or severe (CS, n = 47) disease course. Stripcharts representing individual S100A8/A9 (a) and CRP (b) ELISA measurements. The y-axis displays the log2 of the measured protein concentration while the x-axis is divided based on the later clinical course of disease of the patient (CS and MM). Triangles and whiskers indicate means and one standard deviation of the sample groups with critical/severe or mild/moderate course of the disease respectively. Possible cut-offs with a sensitivity of 89% are indicated by dotted grey lines. c, d Scatter plots demonstrate a high correlation between discovery antibody microarray data (y-axis) and ELISA validation (x-axis) with Pearson’s r > 0.9.
Fig. 7
Fig. 7. Estimation of diagnostic accuracy for disease severity prediction.
For the individual protein biomarkers S100A8/A9 and CRP, ROC curves of ELISA data as well as coherent multiplex antibody array data were aligned. Marker combinations of 2–4 proteins were selected from linSVM models, which outperform the individual biomarker candidates and exhibit a high accuracy with an AUC of up to 0.928. Area under the ROC curve (AUC) is presented for each biomarker or combination.

Similar articles

Cited by

References

    1. Coronavirus disease (COVID-19) – World Health Organization. https://www.who.int/emergencies/diseases/novel-coronavirus-2019.
    1. CDC COVID-19 Response Team. Severe outcomes among patients with coronavirus disease 2019 (COVID-19) - United States, February 12-March 16, 2020. MMWR Morb. Mortal. Wkly. Rep. 2020;69:343–346. - PMC - PubMed
    1. Schilling, J. et al. Krankheitsschwere der ersten COVID-19-Welle in Deutschland basierend auf den Meldungen gemäß Infektionsschutzgesetz. (2020) 10.25646/7169.
    1. Ständige Impfkommission: Empfehlungen der Ständigen Impfkommission (STIKO) beim Robert Koch-Institut. Epid Bull4, 3–67 10.25646/9285.3 (2022).
    1. Marin BG, et al. Predictors of COVID-19 severity: a literature review. Rev. Med. Virol. 2021;31:1–10. - PMC - PubMed