Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Aug 16:2025.08.13.670169.
doi: 10.1101/2025.08.13.670169.

A prognostic signature for lung adenocarcinoma in people who have never smoked

Wei Zhao  1 Tongwu Zhang  1 Xing Hua  1 Phuc H Hoang  1 Mona Miraftab  1 Monjoy Saha  1 John P McElderry  1 Jian Sang  1 Olivia W Lee  1 Caleb Hartman  1 Azhar Khandekar  1 Sunandini Sharma  1 Frank J Colón-Matos  1 Samuel Anyaso-Samuel  1 Difei Wang  1   2 Kristine Jones  1   2 Amy Hutchinson  1   2 Belynda Hicks  1   2 Jennifer Rosenbaum  3 Xiaoming Zhong  4 Yang Yang  4 Angela Pesatori  5   6 Dario Consonni  6 David C Christiani  7   8 Kin Chung Leung  9 Maria Pik Wong  10 Marta Manczuk  11 Jolanta Lissowska  11 Beata Świątkowska  12 Anush Mukeria  13 Oxana Shangina  13 David Zaridze  13 Ivana Holcatova  14   15 Dana Mates  16 Sasa Milosavljevic  17 Simona Ognjanovic  17 Milan Savic  18 Millica Kontic  19 Valerie Gaborieau  20 Paul Brennan  20 Oscar Arrieta  21 Yohan Bossé  22 Eric S Edell  23 Matthew B Schabath  24 Paul Hofman  25 Luis Mas  26 Sai S Yendamuri  27 Chih-Yi Chen  28   29 I-Shou Chang  30 Chao Agnes Hsiung  31 Geoffrey Liu  32 Jacobo Martínez Santamaría  33 Bonnie E Gould Rothberg  34 Karun Mutreja  35 Scott Lawrence  36   2 Nathaniel Rothman  1 Ludmil B Alexandrov  37 Charles Leduc  38 Marina K Baine  39 Philippe Joubert  22 Lynette M Sholl  40 William D Travis  39 Robert Homer  41 Qing Lan  1 Stephen J Chanock  1 Lixing Yang  4   42   43 Soo-Ryum Yang  39 Jianxin Shi  1 Maria Teresa Landi  1
Affiliations

A prognostic signature for lung adenocarcinoma in people who have never smoked

Wei Zhao et al. bioRxiv. .

Abstract

Knowledge of tumor cell dynamics can inform prognosis and treatment yet is largely lacking for lung adenocarcinoma in people who have never smoked (NS-LUAD). With RNA-seq data from 684 NS-LUAD and validation in an independent dataset, we identified three subtypes with distinct phenotypic traits and cell compositions. Additional genomic and histological data further characterized the subtypes. 'Steady', marked by low proliferation, high alveolar cell fraction, moderate-to-well differentiation, and fewer driver genes' alterations, is linked to prolonged survival and low immune evasion. 'Proliferative' shows high proliferation markers, TP53 mutations, and gene fusions. 'Chaotic', with high epithelial-to-mesenchymal transition markers, has the worst prognosis even within stage I tumors. Lacking known molecular or histological characteristics, this aggressive subtype is solely identified by transcriptomic data. A 60-gene signature recapitulates the overall classification and strongly predicts survival even within subgroups based on tumor stage or known genomic features, emphasizing its potential for improving NS-LUAD prognostication in clinical settings.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1. Transcriptomic clusters in the Sherlock cohort.
(A) Consensus matrices for NMF rank 2 to 10. Cophenetic coefficient from consensus matrix (left) and silhouettes from basis, coefficient and consensus matrices (right) were computed from 80 runs for each rank value. (B) Clustering of subtype-specific genes in the three expression subtypes are defined by NMF. The top panel shows cluster assignment, driver mutations and fusions, data set, genetic ancestry, sex and S-LUAD subtypes. The right panel indicates the subtype specific genes.
Fig. 2
Fig. 2. Comparison of clinical and histopathological features across NS-LUAD expression subtypes.
A-C, Comparison of (A) tumor stages, (B) histologic grades, and (C) signet ring cell features across the NS-LUAD expression subtypes. P-values for comparison of morphological features from chi-squared test are shown. D-E, Violin plots of (D) LUAD histology scores and (E) mix-histological lineage scores across the NS-LUAD expression subtype. Mean values are indicated by the yellow lines. P-values from two-sided Mann–Whitney U-test are shown.
Fig. 3
Fig. 3. Cell type deconvolution in the Sherlock cohort.
A, Summary of the average proportions of cell types in NS-LUAD expression subtypes. B-C, Violin plots of the proportions of (B) epithelial cells and (C) fibroblasts across NS-LUAD expression subtypes. D, Summary of the average proportions of epithelial cell types relative to all epithelial cells. E-F, Violin plots of the proportions of (E) AT2 and (F) basal cells relative to all epithelial cells across NS-LUAD expression subtypes. G-I, Comparison of the relative proportions of (G) COL10A1+, (H) COL4A1+, and (I) non-malignant lung fibroblasts across NS-LUAD expression subtypes. The fibroblast estimates are relative differences in abundances and cannot be compared across cell types. Mean values are indicated by the yellow lines in the violin plots. P-values from two-sided Mann-Whitney U-test are shown.
Fig. 4
Fig. 4. Gene fusion events in the Sherlock cohort.
(A) Comparison of the total numbers of gene fusions per sample across NS-LUAD expression subtypes. Mean values are indicated by the yellow lines. P-value from two-sided Mann-Whitney U-test is shown. (B) Summary of types of all detected gene fusions. (C) Summary of frequent gene fusions between pairs of protein-coding genes. The bar plot indicates the number of tumors containing gene fusion events. Colors indicate fusion types. Recurrent in-frame fusions of protein-coding genes are indicated by the star signs(*). (D) Summary of genes most frequently involved in fusions between protein-coding genes. The bar plot indicates the number of tumors in which gene fusions involving the listed genes are detected. Colors indicate fusion types. (E) Summary of genes most frequently involved in fusions between protein-coding genes within NS-LUAD expression subtypes. The bar plots indicate the percentage of tumors in which gene fusions involving the listed genes are detected. Colors indicate fusion types.
Fig. 5
Fig. 5. Association between NS-LUAD expression subtypes and overall survival in the Sherlock cohort.
A-D, Kaplan-Meier survival curves for overall survival stratified by NS-LUAD expression subtypes in (A) all patients, (B) stage I patients, (C) patients classified as TRU subtype26, and (D) TP53-wt patients. P-values and hazard ratios (HR) were calculated using Cox proportional hazards models adjusting for age, sex, and tumor stage (the analysis restricted to tumor stage I did not include tumor stage in the model).
Fig. 6
Fig. 6. Association between NS-LUAD expression subtypes and overall survival in the Genome Institute of Singapore (GIS) cohort22.
A-B, Kaplan-Meier survival curves for overall survival stratified by NS-LUAD expression subtypes in (A) all patients and (B) TP53-wt patients. P-values and hazard ratios (HR) are calculated using Cox proportional hazards models adjusting for age, sex, and tumor stage. (C) C-indexes of Cox models predicting the overall survival using different sets of predictors in the training (Sherlock) and testing (GIS) data sets. P-values testing for the effect of adding the expression subtypes to models of single modality were calculated using likelihood ratio test.

References

    1. Cufari ME, Proli C, De Sousa P, Raubenheimer H, Al Sahaf M, Chavan H, et al. Increasing frequency of non-smoking lung cancer: Presentation of patients with early disease to a tertiary institution in the UK. Eur J Cancer 2017;84:55–9 doi 10.1016/j.ejca.2017.06.031. - DOI - PubMed
    1. Pelosof L, Ahn C, Gao A, Horn L, Madrigales A, Cox J, et al. Proportion of Never-Smoker Non-Small Cell Lung Cancer Patients at Three Diverse Institutions. J Natl Cancer Inst 2017;109(7) doi 10.1093/jnci/djw295. - DOI
    1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin 2021;71(3):209–49 doi 10.3322/caac.21660. - DOI - PubMed
    1. Sun S, Schiller JH, Gazdar AF. Lung cancer in never smokers--a different disease. Nat Rev Cancer 2007;7(10):778–90 doi 10.1038/nrc2190. - DOI - PubMed
    1. Zhang T, Joubert P, Ansari-Pour N, Zhao W, Hoang PH, Lokanga R, et al. Genomic and evolutionary classification of lung cancer in never smokers. Nat Genet 2021;53(9):1348–59 doi 10.1038/s41588-021-00920-0. - DOI - PMC - PubMed

Publication types