Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Aug 29:8:297.
doi: 10.1186/1471-2164-8-297.

Effect of active smoking on the human bronchial epithelium transcriptome

Affiliations

Effect of active smoking on the human bronchial epithelium transcriptome

Raj Chari et al. BMC Genomics. .

Abstract

Background: Lung cancer is the most common cause of cancer-related deaths. Tobacco smoke exposure is the strongest aetiological factor associated with lung cancer. In this study, using serial analysis of gene expression (SAGE), we comprehensively examined the effect of active smoking by comparing the transcriptomes of clinical specimens obtained from current, former and never smokers, and identified genes showing both reversible and irreversible expression changes upon smoking cessation.

Results: Twenty-four SAGE profiles of the bronchial epithelium of eight current, twelve former and four never smokers were generated and analyzed. In total, 3,111,471 SAGE tags representing over 110 thousand potentially unique transcripts were generated, comprising the largest human SAGE study to date. We identified 1,733 constitutively expressed genes in current, former and never smoker transcriptomes. We have also identified both reversible and irreversible gene expression changes upon cessation of smoking; reversible changes were frequently associated with either xenobiotic metabolism, nucleotide metabolism or mucus secretion. Increased expression of TFF3, CABYR, and ENTPD8 were found to be reversible upon smoking cessation. Expression of GSK3B, which regulates COX2 expression, was irreversibly decreased. MUC5AC expression was only partially reversed. Validation of select genes was performed using quantitative RT-PCR on a secondary cohort of nine current smokers, seven former smokers and six never smokers.

Conclusion: Expression levels of some of the genes related to tobacco smoking return to levels similar to never smokers upon cessation of smoking, while expression of others appears to be permanently altered despite prolonged smoking cessation. These irreversible changes may account for the persistent lung cancer risk despite smoking cessation.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(A) SAGE library statistics: Summary statistics of the 24 SAGE libraries analyzed in this study. Mapping information was based on the May 10th, 2006 version of SAGEGenie [45]. In total, over 3,000,000 SAGE tags were sequenced, with over 110,000 unique tags represented upon the exclusion of super singleton tags. (Super singleton tags are tags which have a count of 1 in a single library only). Approximately 75 % of these 110,000 unique tags, (potentially representing as many unique transcripts), mapped to an annotated UniGene cluster. As multiple SAGE tags frequently map to the same UniGene cluster, we have identified at a total of 25,653 distinct UniGene clusters within our dataset, approximately 68% of which represent previously characterized genes. Notably, 25% of the unique tags had no mapping, suggesting much information is currently unknown. (B) Transcriptome Venn diagram: Venn diagram of the transcriptomes of current, former and never smokers. Reported is the number of tags which are expressed in every library group at a raw tag count greater than or equal to 2, representing the tags which are constitutively expressed in each set. Nearly 2000 SAGE tags, mapping to over 1700 genes are common to all 24 SAGE libraries. A lower number of never smokers may have contributed to a higher number of preferentially expressed transcripts in this group.
Figure 2
Figure 2
(A) Cluster analysis of current, former and never smokers: Single link hierarchical clustering using the 609 SAGE tags comprised in Additional file 5 representing tags differentially expressed between current and never smokers. Distance measure used was a Euclidean distance. The visualization package Genesis [23] was used for clustering. Green rectangles represent samples with lower expression for the particular gene amongst the samples, and red rectangles represent samples where the gene is highly expressed relative to other samples. (B) Principal component analysis of current, former and never smokers. Expression values used were scaled to tags per million (TPM). Each tag was then normalized by dividing its value by the maximum value for that tag seen in all the libraries. Subsequently, this value was then multiplied by 6 and then subtracted by 3 to put the values ratios in the range of -3 to 3. A co-variance based approach was used and the statistics toolbox in MatLab (Mathworks) was used. Current smokers are represented in red, former smokers are represented in blue and never smokers are represented in green.
Figure 3
Figure 3
Principal component of current, former and never smokers using (A) the 161 tags deemed reversible upon smoking cessation (Additional file 6) and (B) the 152 tags deemed irreversible upon smoking cessation (Additional file 7). Expression values used were scaled to tags per million (TPM). Each tag was then normalized by dividing its value by the maximum value for that tag seen in all the libraries. Subsequently, this value was then multiplied by 6 and then subtracted by 3 to put the values in the range of -3 to 3. A co-variance based approach was used and the statistics toolbox in MatLab (Mathworks) was used. Current smokers are represented in red, former smokers are represented in blue and never smokers are represented in green.
Figure 4
Figure 4
SAGE and quantitative PCR (qRT-PCR) analysis of select genes: (A) Genes found to have reversible expression upon smoking cessation. Box plots of SAGE data and histograms for qRT-PCR for CABYR, ENTPD8 and TFF3. Distribution of ratios between both current vs. former and current vs. former and never (Additional file IV) were found to be statistically different. (B) Genes found to be either partially or fully irreversible. Box plots of SAGE data and histograms for qRT-PCR for MUC5AC and GSK3B. Distribution of ratios between current vs. former and former vs. never were statistically different for MUC5AC and in addition, GSK3B was statistically significant for the combination of current and former vs. never. Box plot analysis was done using the Statistics toolbox from the MathWorks MatLab program. Red lines in the boxes represent the median expression value in terms of tags per million (TPM), and red "plus" signs represent outliers (values which are greater than 1.5 times the maximum value). The bottom and top part of the boxes represent the 2nd and 3rd quartiles of the data respectively. The error bars represent the 5th and 95th percentiles of the data. Quantitative RT-PCR validation was performed on a second cohort of nine current smokers, seven former smokers and six never smokers. Plotted is the average expression ratio relative to the average expression in never smokers of current (red), former (blue) and never (green) smokers. Statistical significance was determined using a one-tailed p-value from the Mann Whitney U Test (Supplemental Table IX).
Figure 5
Figure 5
Expression trends of specific genes related to muco-ciliary function and airway restitution as compared with smoking status and lung cancer: TFF3, CABYR, and MUC5AC are over expressed in current smokers with lowered expression in both former and never smokers. Conversely, SCGB1A1 shows the opposite effect, with lowered expression in current smokers as compared to former and never smokers. MUC5AC and TFF3 are known to be components of mucus. EGFR levels are positively correlated with smoking status, with modestly higher levels in current smokers. MUC5AC and EGF have been shown to interact with EGFR in the process of airway restitution and SCGB1A1 has been shown to decrease levels of cyclooxygenase 2 (COX2) in cancer cells. Interestingly, within this process alone, we see reversible (TFF3, CABYR), partially reversible (MUC5AC) and completely irreversible (GSK3B) expression changes upon smoking cessation. Values refer to tag counts as tags-per-million (TPM).

Comment in

Similar articles

Cited by

References

    1. Jemal A, Siegel R, Ward E, Murray T, Xu J, Smigal C, Thun MJ. Cancer statistics, 2006. CA Cancer J Clin. 2006;56:106–130. - PubMed
    1. Halpern MT, Gillespie BW, Warner KE. Patterns of absolute risk of lung cancer mortality in former smokers. J Natl Cancer Inst. 1993;85:457–464. doi: 10.1093/jnci/85.6.457. - DOI - PubMed
    1. Peto R, Darby S, Deo H, Silcocks P, Whitley E, Doll R. Smoking, smoking cessation, and lung cancer in the UK since 1950: combination of national statistics with two case-control studies. Bmj. 2000;321:323–329. doi: 10.1136/bmj.321.7257.323. - DOI - PMC - PubMed
    1. Tong L, Spitz MR, Fueger JJ, Amos CA. Lung carcinoma in former smokers. Cancer. 1996;78:1004–1010. doi: 10.1002/(SICI)1097-0142(19960901)78:5<1004::AID-CNCR10>3.0.CO;2-6. - DOI - PubMed
    1. Spira A, Beane J, Shah V, Liu G, Schembri F, Yang X, Palma J, Brody JS. Effects of cigarette smoke on the human airway epithelial cell transcriptome. Proc Natl Acad Sci U S A. 2004;101:10143–10148. doi: 10.1073/pnas.0401422101. - DOI - PMC - PubMed

Publication types