Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 1;18(4):e70091.
doi: 10.1111/eva.70091. eCollection 2025 Apr.

No Evidence for Distinct Transcriptomic Subgroups of Devil Facial Tumor Disease (DFTD)

Affiliations

No Evidence for Distinct Transcriptomic Subgroups of Devil Facial Tumor Disease (DFTD)

Cleopatra Petrohilos et al. Evol Appl. .

Abstract

Contagious cancers represent one of the least understood types of infections in wildlife. Devil Facial Tumor Disease (comprised of two different contagious cancers, DFT1 and DFT2) has led to an 80% decline in the Tasmanian devil (Sarcophilus harrisii ) population at the regional level since it was first observed in 1996. There are currently no treatment options for the disease, and research efforts are focused on vaccine development. Although DFT1 is clonal, phylogenomic studies have identified different genetic variants of the pathogen. We postulated that different genetic strains may have different gene expression profiles and would therefore require different vaccine components. Here, we aimed to test this hypothesis by applying two types of unsupervised clustering (hierarchical and k-means) to 35 DFT1 transcriptomes selected from the disease's four major phylogenetic clades. The two algorithms produced conflicting results, and there was low support for either method individually. Validation metrics, such as the Gap statistic method, the Elbow method, and the Silhouette method, were ambiguous, contradictory, or indicated that our dataset only consisted of a single cluster. Collectively, our results show that the different phylogenetic clades of DFT1 all have similar gene expression profiles. Previous studies have suggested that transcriptomic differences exist between tumours from different locations. However, our study differs in that it considers both tumor purity and genotypic clade when analysing differences between DFTD biopsies. These results have important implications for therapeutic development, as they indicate that a single vaccine or treatment approach has the potential to be effective for a large cross-section of DFT1 tumors. As one of the largest studies to use transcriptomics to investigate phenotypic variation within a single contagious cancer, it also provides novel insight into this unique group of diseases.

Keywords: contagious cancer; unsupervised clustering; wildlife disease.

PubMed Disclaimer

Conflict of interest statement

Benefit‐sharing statement: This study complies with benefit sharing under the Convention on Biological Diversity.The authors declare no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
(A) MDS plot and (B) t‐SNE plot of distances between TMM normalized gene counts for each DFT1 sample. Neither method of dimension reduction indicated strong separation between the clusters. Clade A2 exhibited the tightest clustering of all groups, but this did not include all samples in A2. Samples are colored by genotypic clade.
FIGURE 2
FIGURE 2
Heatmaps displaying (A) 1000 most variable genes (B) 500 most variable genes (C) 250 most variable genes (D) 100 most variable genes. None showed a clear mosaic pattern that would be expected if distinct clusters were present in the dataset. Yellow represents a higher value (indicating genes are upregulated in that sample) and dark blue represents a lower value (indicating that genes are downregulated in that sample). The name of the clade is in square brackets after the sample name.
FIGURE 3
FIGURE 3
Clustering validation indices. The Gap statistic method indicated the optimum number of clusters was one for both (A) hierarchical clustering and (B) k‐means clustering; the Elbow method was ambiguous for both (C) hierarchical clustering and (D) k‐means clustering; and the Silhouette method indicated the optimum number of clusters was 10 (the maximum) for (E) hierarchical clustering and 4 for (F) k‐means clustering. The dotted line represents the highest value, that is, what the test deems to be the optimum number of clusters in the dataset.
FIGURE 4
FIGURE 4
K means clustering results with k = 4. Each cluster contained samples from at least two different genotypic clades.

Similar articles

References

    1. Adolfsson, A. , Ackerman M., and Brownstein N. C.. 2019. “To Cluster, or Not to Cluster: An Analysis of Clusterability Methods.” Pattern Recognition 88: 13–26.
    1. Andrews, S. 2010. “Babraham Bioinformatics.” Babraham Institute, Cambridge, United Kingdom.
    1. Benson, G. 1999. “Tandem Repeats Finder: A Program to Analyze DNA Sequences.” Nucleic Acids Research 27: 573–580. - PMC - PubMed
    1. Bolger, A. M. , Lohse M., and Usadel B.. 2014. “Trimmomatic: A Flexible Trimmer for Illumina Sequence Data.” Bioinformatics 30: 2114–2120. - PMC - PubMed
    1. Bruzos, A. L. , Santamarina M., García‐Souto D., et al. 2023. “Somatic Evolution of Marine Transmissible Leukemias in the Common Cockle, Cerastoderma edule .” Nature Cancer 4, no. 11: 1575–1591. 10.1038/s43018-023-00641-9. - DOI - PubMed

LinkOut - more resources