Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 19;24(1):bbac568.
doi: 10.1093/bib/bbac568.

A Bayesian model for identifying cancer subtypes from paired methylation profiles

Affiliations

A Bayesian model for identifying cancer subtypes from paired methylation profiles

Yetian Fan et al. Brief Bioinform. .

Abstract

Aberrant DNA methylation is the most common molecular lesion that is crucial for the occurrence and development of cancer, but has thus far been underappreciated as a clinical tool for cancer classification, diagnosis or as a guide for therapeutic decisions. Partly, this has been due to a lack of proven algorithms that can use methylation data to stratify patients into clinically relevant risk groups and subtypes that are of prognostic importance. Here, we proposed a novel Bayesian model to capture the methylation signatures of different subtypes from paired normal and tumor methylation array data. Application of our model to synthetic and empirical data showed high clustering accuracy, and was able to identify the possible epigenetic cause of a cancer subtype.

Keywords: Bayesian method; cancer subtyping; clustering; paired methylation profiles.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A flowchart to illustrate the BaySub algorithm. The methylation status of the reference normal cell is represented by a binary vector formula image, and the rate of methylation is set to be formula image. Based on this reference methylation status of normal tissue, a binary underlying methylation profile formula image could be generated with mutation probability formula image. Suppose there are formula image different paths formula image changing the methylation status from normal tissues to become tumor tissues, and the mutation rates is fixed at formula image. After obtaining the membership of the cancer subtypes formula image, the tumor methylation profile can be generated from normal methylation profile formula image, the membership formula image and the mutation paths formula image. Therefore, the M-values of normal tissues follow normal distribution according to its methylation profile formula image, and corresponding M-values of tumor tissues follow normal distribution according to its methylation profile formula image.
Figure 2
Figure 2
The traceplots of key variables for simulation datasets. The whole MCMC samples are shown in the left figures, and the burn-in periods are illustrated in the right figures.
Figure 3
Figure 3
Heatmap of methylation signatures captured by BaySub algorithm on simulation datasets.
Figure 4
Figure 4
The plots of AIC (left) and BIC (right) for different numbers of subtypes in Experiment 2 (dashed line) and Experiment 3 (solid line). The horizontal axis represents the assumed value of the number of subtypes formula image, and the vertical axis represents the corresponding values of AIC or BIC. The true value of formula image for red dashed line is 3, and that for green solid line is 4.
Figure 5
Figure 5
The traceplots of key variables for real datasets. The whole MCMC samples are shown in the left figures, and the burn-in periods are illustrated in the right figures.

References

    1. Bailey P, Chang DK, Nones J, et al. . Genomic analyses identify molecular subtypes of pancreatic cancer. Nature 2016;531(7592):47–52. - PubMed
    1. Felipe De Sousa EM, Wang X, Jansen M, et al. . Poor-prognosis colon cancer is defined by a molecularly distinct subtype and develops from serrated precursor lesions. Nat Med 2013;19(5):614–8. - PubMed
    1. Yu F, Quan F, Xu J, et al. . Breast cancer prognosis signature: linking risk stratification to disease subtypes. Brief Bioinform 2019;20(6):2130–40. - PubMed
    1. Dai X, Li T, Bai Z, et al. . Breast cancer intrinsic subtype classification, clinical use and future trends. Am J Cancer Res 2015;5(10):2929. - PMC - PubMed
    1. Arnold M, Soerjomataram I, Ferlay J, et al. . Global incidence of oesophageal cancer by histological subtype in 2012. Gut 2015;64(3):381–7. - PubMed

Publication types