Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Nov 19:39:107587.
doi: 10.1016/j.dib.2021.107587. eCollection 2021 Dec.

Non-melanoma skin cancer segmentation for histopathology dataset

Affiliations

Non-melanoma skin cancer segmentation for histopathology dataset

Simon M Thomas et al. Data Brief. .

Abstract

Densely labelled segmentation data for digital pathology images is costly to produce but is invaluable to training effective machine learning models. We make available 290 hand-annotated histopathology tissue sections of the 3 most common skin cancers; basal cell carcinoma (BCC), squamous cell carcinoma (SCC) and intraepidermal carcinoma (IEC). These non-melanoma skin cancers constitute over 90% of all skin cancer diagnoses and hence this dataset gives an opportunity to the scientific community to benchmark analytic methodologies on a significant portion of the dermatopathology workflow. The data represents typical cases of the three cancer types (not requiring a differential diagnosis) across shave, punch and excision biopsy contexts. Each image is accompanied with a segmentation mask which characterizes the section into 12 tissue types, specifically: keratin, epidermis, papillary dermis, reticular dermis, hypodermis, inflammation, glands, hair follicles and background, as well as BCC, SCC and IEC. Included also are cancer margin measurements to work towards automated assessment of surgical margin clearance and tumour invasion. This leaves open many opportunities for researchers to utilize or extend the dataset, building upon recent work on image analysis problems in skin cancer (Thomas et al., 2021).

Keywords: Digital pathology; Image analysis; Machine learning; Medical imaging.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Fig. 1.
Fig. 1
The original histology image of an excisional biopsy (left), with a resolution of 11,412 × 15,940 pixels (7.6 x 10.6 mm) and the corresponding 12 class segmentation mask (right). The mask provides a full characterization of the tissue section by allocating pixels to 1 of 12 broad tissue classes, including background. An example machine learning problem would be to learn the mapping from the input domain to the segmentation mask.

References

    1. Thomas S.M., Lefevre J.G., Baxter G., Hamilton N.A. Interpretable deep learning systems for multi-class segmentation and classification of non-melanoma skin cancer. Med. Image Anal. 2021;68 - PubMed
    1. Thomas S.M., Lefevre J.G., Baxter G., Hamilton N.A. Characterization of tissue types in basal cell carcinoma images via generative modelling and concept vectors. Comput. Med. Imaging Graph. 2021;94 - PubMed
    1. Geller A.C., Annas G.D. Epidemiology of melanoma and nonmelanoma skin cancer. Semin. Oncol. Nurs. 2003;19:2–11. - PubMed
    1. Rogers H.W., Weinstock M.A., Feldman S.R., Coldiron B.M. Incidence estimate of nonmelanoma skin cancer (keratinocyte carcinomas) in the U.S. population, 2012. JAMA Dermatol. 2015;151:1081–1086. doi: 10.1001/jamadermatol.2015.1187. - DOI - PubMed
    1. Staples M.P., Elwood M., Burton R.C., Williams J.L., Marks R., Giles G.G. Non-melanoma skin cancer in Australia: the 2002 national survey and trends since 1985. Med. J. Aust. 2006;184:6–10. - PubMed