Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 2;84(13):2060-2072.
doi: 10.1158/0008-5472.CAN-23-1349.

A Pan-Cancer Patient-Derived Xenograft Histology Image Repository with Genomic and Pathologic Annotations Enables Deep Learning Analysis

Affiliations

A Pan-Cancer Patient-Derived Xenograft Histology Image Repository with Genomic and Pathologic Annotations Enables Deep Learning Analysis

Brian S White et al. Cancer Res. .

Abstract

Patient-derived xenografts (PDX) model human intra- and intertumoral heterogeneity in the context of the intact tissue of immunocompromised mice. Histologic imaging via hematoxylin and eosin (H&E) staining is routinely performed on PDX samples, which could be harnessed for computational analysis. Prior studies of large clinical H&E image repositories have shown that deep learning analysis can identify intercellular and morphologic signals correlated with disease phenotype and therapeutic response. In this study, we developed an extensive, pan-cancer repository of >1,000 PDX and paired parental tumor H&E images. These images, curated from the PDX Development and Trial Centers Research Network Consortium, had a range of associated genomic and transcriptomic data, clinical metadata, pathologic assessments of cell composition, and, in several cases, detailed pathologic annotations of neoplastic, stromal, and necrotic regions. The amenability of these images to deep learning was highlighted through three applications: (i) development of a classifier for neoplastic, stromal, and necrotic regions; (ii) development of a predictor of xenograft-transplant lymphoproliferative disorder; and (iii) application of a published predictor of microsatellite instability. Together, this PDX Development and Trial Centers Research Network image repository provides a valuable resource for controlled digital pathology analysis, both for the evaluation of technical issues and for the development of computational image-based methods that make clinical predictions based on PDX treatment studies. Significance: A pan-cancer repository of >1,000 patient-derived xenograft hematoxylin and eosin-stained images will facilitate cancer biology investigations through histopathologic analysis and contributes important model system data that expand existing human histology repositories.

PubMed Disclaimer

Conflict of interest statement

B.S. White reports grants from the NIH/NCI during the conduct of the study. T. Sheridan reports personal fees from Google outside the submitted work. S.R. Davies reports grants from NIH/NCI U54CA224083 during the conduct of the study. K.W. Evans reports grants from the NCI during the conduct of the study. B.J. Sanderson reports grants from the NIH/NCI during the conduct of the study. M.W. Lloyd reports grants from NCI (HHS—NIH) during the conduct of the study. L.E. Dobrolecki reports grants from the NIH during the conduct of the study, as well as personal fees from StemMed, Ltd. outside the submitted work. B.N. Davis-Dusenbery reports grants and other support from the NCI during the conduct of the study, as well as being an employee and equity holder in Velsera. N. Mitsiades reports grants from the NCI during the conduct of the study. A.L. Welm reports that The University of Utah may license the models described herein to for-profit companies, which may result in tangible property royalties to the university and members of the Welm Labs who developed the models. B.E. Welm reports grants from the NIH/NCI during the conduct of the study; receiving royalties from licenses of previously developed PDX models issued by The University of Utah; and that The University of Utah may issue new licenses in the future at its discretion, which may result in additional royalties to the authors. S. Li reports personal fees from Inotivco outside the submitted work. M.A. Davies reports grants from the NCI during the conduct of the study, as well as personal fees from Roche/Genentech, Pfizer, Novartis, Bristol Myers Squibb, Iovance, and Eisai, grants and personal fees from ABM Therapeutics, and grants from Lead Pharma outside the submitted work. F. Meric-Bernstam reports personal fees from AbbVie, Aduro BioTech Inc., Alkermes, AstraZeneca, Daiichi Sankyo Co., Ltd., Calibr (a division of Scripps Research), Debiopharm, EcoR1 Capital, eFFECTOR Therapeutics, F. Hoffmann-La Roche Ltd., GT Apeiron, Genentech, Inc., Harbinger Health, IBM Watson, Incyte, Infinity Pharmaceuticals, The Jackson Laboratory, KOLON Life Science, LegoChem Bio, Lengo Therapeutics, Menarini Group, OrigiMed, PACT Pharma, Parexel International, Pfizer Inc., Protai Bio Ltd., Samsung Bioepis, Seattle Genetics, Inc., Tallac Therapeutics, Tyra Biosciences, Xencor, Zymeworks, Black Diamond, Biovica, Eisai, FogPharma, Immunomedics, Inflection Biosciences, Karyopharm Therapeutics, Loxo Oncology, Mersana Therapeutics, OnCusp Therapeutics, Puma Biotechnology Inc., Sanofi, Silverback Therapeutics, Spectrum Pharmaceuticals, Theratechnologies, Zentalis and Dava Oncology; grants from Aileron Therapeutics, Inc., AstraZeneca, Bayer Healthcare Pharmaceuticals, Calithera Biosciences Inc., Curis, Inc., CytomX Therapeutics Inc., Daiichi Sankyo Co., Ltd., Debiopharm International, eFFECTOR Therapeutics, Genentech, Inc., Guardant Health, Inc., KLUS Pharma, Takeda Pharmaceuticals, Novartis, Puma Biotechnology, Inc., and Taiho Pharmaceutical Co.; and other support from European Organisation for Research and Treatment of Cancer, European Society for Medical Oncology, Cholangiocarcinoma Foundation, and Dava Oncology outside the submitted work. Y. Xie reports grants from NIH and Cancer Prevention and Research Institute of Texas (CPRIT) during the conduct of the study; grants from NIH and CPRIT outside the submitted work; and being a cofounder of the Adjuvant Genomics, Inc. M.T. Lewis reports grants from the NCI during the conduct of the study, as well as being a founder and limited partner in StemMed Ltd., founder, manager, and general partner in StemMed Holdings LLC, and founder and equity holder in Tvardi Therapeutics Inc. No disclosures were reported by the other authors.

Figures

Figure 1.
Figure 1.
PDXNet image repository captures heterogeneity across tumor types, passages, and treatment status and pairs human and PDX histology images with pathologic assessment and genomic and transcriptomic data. A, Top, distribution of the number of PDX models with H&E images by cancer diagnosis. Fraction of PDX models of a given diagnosis paired with HRD and MSI status, slide-level pathologic assessment (Path), WGD predictions, and RNA-seq, Middle, TMB, tumor volume, and WES data. Bottom, fraction of PDX models with paired human data of the indicated modality. B, Distribution of PDX images by passage. C, Counts of images for those derived from human progenitor/PDX pairs, restricted to diagnoses with at least four such progenitors. B and C, Bar charts are colored according to the contributing site as indicated in A. D, Distribution of PDX images across diagnoses with at least two treated PDXs or at least two treated progenitors. BLCA, bladder cancer; HNSC, head and neck squamous carcinoma;NRSTS, nonrhabdomyosarcoma soft tissue sarcoma; Rx, treatment.
Figure 2.
Figure 2.
Cell segmentation and classification trained on human data correlate with pathologist estimates of cell-type proportions in PDX samples. A and B, Correlation of pathologist (x-axis) and HoVer-Net–based slide-level estimates (y-axis) for neoplastic (left), stromal (middle), and necrotic (right) cell proportions incolorectal cancer (A) andPDAC (B) from WUSTL. Each point corresponds to an individual image and PDX (i.e., no replication). C, Weighted Pearson correlations of pathologist and HoVer-Net–based estimates of neoplastic, stromal, and necrotic cell proportions in colon adenocarcinoma, colorectal cancer, and PDAC in the PDMR and WUSTL datasets. All PDX strains are NSG. Colorectal cancer not otherwise specified, for example, as colon adenocarcinoma or rectal adenocarcinoma. Correlations are computed over images, inversely weighted by the number of H&E images from each patient (see Materials and Methods).
Figure 3.
Figure 3.
Regional classifier predicts cancer epithelium (“tumor”), stroma, and necrotic areas based on nuclei classification. A and B, Cell phenotypes predicted by HoVer-Net (A) or HD-Staining (B) within a PDX lung tumor. C, Labeled tiles (orange, necrosis; blue, stroma; black, tumor) represented according to the cellular fraction that is predicted by HoVer-Net to be neoplastic, necrotic, immune, or stromal. D, Top, confusion matrix comparing pathologist-provided tile labels with corresponding prediction from the random forest (RF) HoVer-Net classifier. Results shown for held-out set following fivefold cross-validation, with folds defined according to the patient/progenitor (see Materials and Methods). Bottom, image and tile count used for cross-validation stratified by diagnosis and site. E, Pathologist regional annotations. F, Random forest HoVer-Net–based classifier predictions of regions (within 512 × 512 pixel tiles). necros, necrotic; neopla, neoplastic; nolabe, unlabeled; no-neo, nonneoplastic epithelial; RBC, red blood cell.
Figure 4.
Figure 4.
XTLD can be computationally predicted from H&E images using segmented and classified cells or DL features. A and B, H&E images of LUSC (A) and XTLD (B). Each image from a distinct human progenitor. Images at 20× resolution. Scale bars, 50 µm. C, Image and tile count used for cross-validation stratified by diagnosis and site. D and E, HoVer-Net–based classifier performance. D, Distribution of tile-level XTLD probability predicted by random forest trained using HoVer-Net–derived features, according to the PDX image (x-axis), from an LUSC (red) or XTLD (blue) sample in the MDACC (solid line) or PDMR (dashed line) dataset. Results shown for held-out set following fivefold cross-validation, with folds defined according to the patient/progenitor (see Materials and Methods). E, Confusion matrix showing tile-level concordance between HoVer-Net–based classification (rows) and annotation (columns). F and G, Inception-based classifier performance, as in D and E, respectively, but with predictions from a random forest trained using Inception features. H, Distribution of tile-level XTLD probability predicted by random forest trained using Inception features, according to the PDX image (x-axis), from LUSC (red) or XTLD (blue) samples in JAX external validation dataset. Each image is from a distinct human progenitor. Samples are ordered by median XTLD prediction probability over tiles in D, F, and H. All PDXs are NSG mice.
Figure 5.
Figure 5.
Inception encodes morphologic features useful in predicting XTLD. A, Tiles with large values of Inception feature 1895. The tile with the largest value is shown for each of four samples (noted above tile) with highest values, all of which are XTLD cases. B, Tiles with small values of Inception feature 1895. The tile with the smallest value is shown for each of four samples (noted above tile) with smallest values, all of which are LUSC cases. All PDXs are NSG mice. Images at 20× resolution. Scale bars, 50 µm.
Figure 6.
Figure 6.
MSI can be computationally predicted from PDX samples using a human-trained model. Distribution of tile-level MSI probability predicted using a published model (34) from human progenitor (A) or PDX (B) model H&E images derived from MSI (red) or MSS (blue) samples. All progenitor samples in A have corresponding derived PDX samples in B. Samples ordered by median (mean over models) MSI prediction over tiles.

References

    1. Tentler JJ, Tan AC, Weekes CD, Jimeno A, Leong S, Pitts TM, et al. . Patient-derived tumour xenografts as models for oncology drug development. Nat Rev Clin Oncol 2012;9:338–50. - PMC - PubMed
    1. Pompili L, Porru M, Caruso C, Biroccio A, Leonetti C. Patient-derived xenografts: a relevant preclinical model for drug development. J Exp Clin Cancer Res 2016;35:189. - PMC - PubMed
    1. DeRose YS, Wang G, Lin Y-C, Bernard PS, Buys SS, Ebbert MTW, et al. . Tumor grafts derived from women with breast cancer authentically reflect tumor pathology, growth, metastasis and disease outcomes. Nat Med 2011;17:1514–20. - PMC - PubMed
    1. Petrillo LA, Wolf DM, Kapoun AM, Wang NJ, Barczak A, Xiao Y, et al. . Xenografts faithfully recapitulate breast cancer-specific gene expression patterns of parent primary breast tumors. Breast Cancer Res Treat 2012;135:913–22. - PMC - PubMed
    1. Li S, Shen D, Shao J, Crowder R, Liu W, Prat A, et al. . Endocrine-therapy-resistant ESR1 variants revealed by genomic characterization of breast-cancer-derived xenografts. Cell Rep 2013;4:1116–30. - PMC - PubMed