Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 20;23(5):bbac161.
doi: 10.1093/bib/bbac161.

Refinement of computational identification of somatic copy number alterations using DNA methylation microarrays illustrated in cancers of unknown primary

Affiliations

Refinement of computational identification of somatic copy number alterations using DNA methylation microarrays illustrated in cancers of unknown primary

Pedro Blecua et al. Brief Bioinform. .

Abstract

High-throughput genomic technologies are increasingly used in personalized cancer medicine. However, computational tools to maximize the use of scarce tissues combining distinct molecular layers are needed. Here we present a refined strategy, based on the R-package 'conumee', to better predict somatic copy number alterations (SCNA) from deoxyribonucleic acid (DNA) methylation arrays. Our approach, termed hereafter as 'conumee-KCN', improves SCNA prediction by incorporating tumor purity and dynamic thresholding. We trained our algorithm using paired DNA methylation and SNP Array 6.0 data from The Cancer Genome Atlas samples and confirmed its performance in cancer cell lines. Most importantly, the application of our approach in cancers of unknown primary identified amplified potentially actionable targets that were experimentally validated by Fluorescence in situ hybridization and immunostaining, reaching 100% specificity and 93.3% sensitivity.

Keywords: DNA methylation; actionable target identification; cancers of unknown primary; gene amplification; somatic copy number alterations.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Computational prediction of SCNA from DNA methylation arrays using conumee-KCN. (A) Workflow for the developed strategy to refine SCNA detection, ‘conumee-KCN’, trained over 442 primary tumor samples from TCGA, across 18 cancer types, with matched genotyping (SNP6 array) and DNA methylation array (450K) data. Our thresholding strategy refines conumee outputs and quantitatively calls SCNA from 450K arrays by considering tumor purity ρ (RF_Purity) and rigorous estimation of copy-number-state (CN)-dependent constants KCN. *A list of 94 genes frequently amplified or deleted in cancer was used as reference to define the copy number states. Thus, by using calibrated KCN’s and considering tumor purity, intra-sample variability and copy-number-state-dependent noise, thresholds for each CN can be estimated for each 450K profiled sample to accurately identify SCNA. (B) Benchmarking of our strategy (conumee-KCN) against conumee (fixed threshold of 0.3), cnAnalysis450k and ChAMP in an independent, validation set consisting of 151 TCGA samples, with matched genotyping (SNP6 array) and DNA methylation array (450K) data. True positive (TP) and false-positive (FP) rates of 450K-derived calls versus SNP6-derived calls (ASCAT) for amplifications are depicted, showing the improved performance of our approach. (C) TP and FP rates of conumee-KCN versus ASCAT in the TCGA validation set for the three amplification copy number states (Amp10, Amp and Gain). (D) Representative examples of gene amplifications in two samples from the TCGA validation cohort. Thresholds estimated by conumee-KCN for Amp and Amp10 are depicted (dotted grey lines). TP = #𝑇𝑟u𝑒 p𝑜𝑠𝑖𝑡𝑖v𝑒𝑠/#𝑇𝑟u𝑒 p𝑜𝑠𝑖𝑡𝑖v𝑒𝑠 + #𝐹𝑎l𝑠𝑒 𝑁𝑒g𝑎𝑡𝑖v𝑒𝑠; FP = #𝐹𝑎l𝑠𝑒 p𝑜𝑠𝑖𝑡𝑖v𝑒𝑠/#𝐹𝑎l𝑠𝑒 p𝑜𝑠𝑖𝑡𝑖v𝑒𝑠 + #𝑇𝑟u𝑒 𝑁𝑒g𝑎𝑡𝑖v𝑒𝑠.
Figure 2
Figure 2
Epigenomic-based computational prediction of SCNA in CUP using conumee-KCN. (A) Workflow for the detection of SCNA in CUPs using conumee-KCN and further identification of gene amplifications with clinical relevance. (B) Experimental validation of conumee-KCN predicted SCNA in CUP patient samples. Representative images of copy number state validation by FISH and protein expression by IHC are shown for three well-recognized oncogenes: MYC (c-MYC), CCND1 (Cyclin D1) and ERBB2 (HER2). *Orange/green break-apart FISH probes were used.

References

    1. Berdasco M, Esteller M. Clinical epigenetics: seizing opportunities for translation. Nat Rev Genet 2019;20(2):109–27. - PubMed
    1. Mancarella D, Plass C. Epigenetic signatures in cancer: proper controls, current challenges and the potential for clinical translation. Genome Med 2021;13(1):23. - PMC - PubMed
    1. Locke WJ, Guanzon D, Ma C, et al. . DNA methylation cancer biomarkers: translation to the clinic. Front Genet 2019;10:1150. - PMC - PubMed
    1. Hovestadt V, Zapatka M. Conumee: enhanced copy-number variation analysis using Illumina DNA methylation arrays. R package version 190. http://bioconductor.org/packages/conumee/.
    1. Feber A, Guilhamon P, Lechner M, et al. . Using high-density DNA methylation arrays to profile copy number alterations. Genome Biol 2014;15(2):R30. - PMC - PubMed

Publication types