Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation

The blood proteome of imminent lung cancer diagnosis

Lung Cancer Cohort Consortium (LC3). Nat Commun. .

Abstract

Identification of risk biomarkers may enhance early detection of smoking-related lung cancer. We measured between 392 and 1,162 proteins in blood samples drawn at most three years before diagnosis in 731 smoking-matched case-control sets nested within six prospective cohorts from the US, Europe, Singapore, and Australia. We identify 36 proteins with independently reproducible associations with risk of imminent lung cancer diagnosis (all p < 4 × 10-5). These include a few markers (e.g. CA-125/MUC-16 and CEACAM5/CEA) that have previously been reported in studies using pre-diagnostic blood samples for lung cancer. The 36 proteins include several growth factors (e.g. HGF, IGFBP-1, IGFP-2), tumor necrosis factor-receptors (e.g. TNFRSF6B, TNFRSF13B), and chemokines and cytokines (e.g. CXL17, GDF-15, SCF). The odds ratio per standard deviation range from 1.31 for IGFBP-1 (95% CI: 1.17-1.47) to 2.43 for CEACAM5 (95% CI: 2.04-2.89). We map the 36 proteins to the hallmarks of cancer and find that activation of invasion and metastasis, proliferative signaling, tumor-promoting inflammation, and angiogenesis are most frequently implicated.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1. Identification of 36 protein biomarkers associated with risk of imminent lung cancer diagnosis among 731 cases and 731 matched controls in the Lung Cancer Cohort Consortium.
The volcano plot depicts the lung cancer odds ratio per standard deviation increment in relative protein concentrations (log-base-2 transformed) (x axis) and the −Log10 p value (y axis). The 36 identified markers of imminent lung cancer are labeled (see Methods; markers were identified through a resampling process that measured the association of each protein with lung cancer risk in a discovery set and a replication set. The risk markers were required to have a p < 0.05/effective-number-of-tests in the discovery set and p < 0.05 in the replication set in at least 50% of the resampling iterations). Source data are provided as a Source Data file.
Fig. 2
Fig. 2. Lung cancer odds ratios for the 36 proteins associated with imminent lung cancer diagnosis before and after detailed adjustment for smoking intensity, duration, and years since cessation.
Data for 95% confidence intervals are presented ase(β±1.96×sd). β is the estimate from each conditional logistic regression, and sd is their respective standard deviation. Number of samples used are presented in Supplementary Data 10. Source data are provided as a Source Data file.
Fig. 3
Fig. 3. Biological context of the 36 proteins associated with risk of imminent lung cancer diagnosis.
a Relationship between our 36 proteins and the 10 hallmarks of cancer described by Hanahan and Weinberg, based on their descriptions and functions available on GeneCards, the Human Protein Atlas, and Uniprot. Each hallmark is represented by a different color. b Network analysis among the 36 proteins, the figure depicts partial correlation networks (accounting for sex, age, cohort, and all other identified proteins) and stable protein associations. In lung cancer cases, no stable connections were found for ANGPT2, CDCP1, CEACAM5, CFHR5, CXCL13, IGFBP-1, IGFBP-2, IL6, MUC-16, SCF, SFTPA1, TFPI-2. In controls, no stable connections were found for ANGPT2, CEACAM5, CFHR5, CXCL13, CXCL9, IL6, MMP12, MUC-16, SCF, SFTPA1, and SYND1. Source data are provided as a Source Data file.
Fig. 4
Fig. 4. Gene expression of 36 protein biomarkers associated with risk of imminent lung cancer diagnosis in normal and tumor tissue.
Proteins are listed in order of their relative expression in non-cancerous lung cell. a mRNA expression in normal tissue (gtex). b mRNA expression in tumor tissue (TCGA). Source data are provided as a Source Data file.

References

    1. Sung H, et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA. Cancer J. Clin. 2021;71:209–249. doi: 10.3322/caac.21660. - DOI - PubMed
    1. Lung and Bronchus Cancer—Cancer Stat Facts. https://seer.cancer.gov/statfacts/html/lungb.html.
    1. Howlader N, et al. The effect of advances in lung-cancer treatment on population mortality. N. Engl. J. Med. 2020;383:640–649. doi: 10.1056/NEJMoa1916623. - DOI - PMC - PubMed
    1. Thai AA, Solomon BJ, Sequist LV, Gainor JF, Heist RS. Lung cancer. Lancet. 2021;398:535–554. doi: 10.1016/S0140-6736(21)00312-3. - DOI - PubMed
    1. The National Lung Screening Trial Research Team. Reduced lung-cancer mortality with low-dose computed tomographic screening. N. Engl. J. Med. 2011;365:395–409. doi: 10.1056/NEJMoa1102873. - DOI - PMC - PubMed

Publication types

Substances