Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan 2;16(1):116.
doi: 10.1038/s41467-024-55456-8.

Maximizing the clinical utility and performance of cytology samples for comprehensive genetic profiling

Affiliations

Maximizing the clinical utility and performance of cytology samples for comprehensive genetic profiling

David Kim et al. Nat Commun. .

Abstract

Comprehensive molecular profiling by next-generation sequencing has revolutionized tumor classification and biomarker evaluation. However, routine implementation is challenged by the scant nature of diagnostic material obtained through minimally invasive procedures. Here, we describe our long-term experience in profiling cytology samples with an in-depth assessment of the performance, quality metrics, biomarker identification capabilities, and potential pitfalls. We highlight the impact of several optimization strategies to maximize performance with 4,871 prospectively sequenced clinical cytology samples tested by MSK-IMPACTTM. Special emphasis is given to the use of residual supernatant cell-free DNA (ScfDNA) as a valuable source of tumor DNA. Overall, cytology samples are similar in performance to surgical samples in identifying clinically relevant genomic alterations, achieving success rates up to 93% with full optimization. While cell block (CB) samples have excellent performance overall, low-level cross-contamination is identified in a small proportion of cases (4.7%), a common pitfall intrinsic to the processing of paraffin blocks, suggesting that more stringent precautions and processing modifications should be considered in quality control initiatives. By contrast ScfDNA samples have negligible contamination. Finally, ScfDNA testing exclusively used as a rescue strategy, delivered successful results in 71% of cases where tumor tissue from CB was depleted.

PubMed Disclaimer

Conflict of interest statement

Competing interests: C.V. reports intellectual property rights and equity interest in Paige.AI, Inc. A.R.B. has ownership/equity interests in Johnson and Johnson. M.B. has received advisory or consulting fees from AstraZeneca, Eli Lilly and Company, and PetDx, Inc. M.L. has received advisory or consulting fees from Takeda Oncology, Janssen Pharmaceuticals, AstraZeneca, ADC Therapeutics, Paige.AI, Merck, Bayer, and Lilly Oncology and has received research funding from Loxo Oncology, Helsinn Therapeutics, Merus NV, Elevation Oncology, and Rain Therapeutics. O.L. has received advisory or consulting fees from Hologic and Janssen Research & Development, LLC. M.A. has received advisory or consulting fees from Axis Medical Education, Clinical Education Alliance, LLC, Merck Sharp & Dohme, PeerView Institute for Medical Education (PVI), Physicians’ Education Resource, RMEI Medical Education, LLC, and Roche. The following Authors declare no competing interests: D.K., S.Y., S.N., K.N., R.F., N.R., I.R., J.C., and A.Y.

Figures

Fig. 1
Fig. 1. Overview of clinical cytology samples profiled by MSK-IMPACT.
a The composition of the cytology cohort by sample preparation type, either cellblock (CB) or supernatant cfDNA (ScfDNA), and their respective collection method of either fine needle aspiration (FNA) or cytology fluid (e.g. pleural fluids, bronchial washes, ascites fluid, etc.). b The distribution of cancer types profiled by MSK-IMPACT clinically from cytology samples. The most common cancer types are at the top and are ordered in descending order for specific cancer types. Represented are all cytology samples and by sample type. c The distribution of testing outcome and sample type of cytology samples. In cytology cases that failed testing for MSK-IMPACT the cause of failure is further broken down with the corresponding number of sample types for each reason.
Fig. 2
Fig. 2. Determinants of successful mutational profiling by MSK-IMPACT in cytology samples.
a Success rates of MSK-IMPACT testing on cytology samples by year and sample type. The colors denote successfully tested samples, samples failed due to scant/low tumor tissue, failed due to DNA content below sequencing thresholds, and all other failures (e.g., low DNA quality, contamination, etc.). Throughout the study period the panel genes increased in number and are denoted in the top bar by the number of panel genes included for that year. Various key optimization efforts for cytology samples were introduced in the clinical workflow at the corresponding timepoints including the use of a modified HistoGel cell-block preparation (1), improved bead-extraction procedures (2), dual-indexed libraries (3), and decreased lower DNA input for sequencing (4). b Success rates charted similar to that seen in (2a) but including only samples that were deemed to have adequate tumor for sequencing. The success rates therefore indicate the success rate of cytology sampled deemed to have adequate tumor on manual review as opposed to the overall testing success rate of a cytology sample received for testing. c Overall sequencing success rates on MSK-IMPACT for cytology samples charted by year and stratified by samples processed at an external laboratory (External), samples processed at the study institution (Inhouse), and all cytology cases (All Cases). Two-sided Chi-squared test revealed a significant difference in success rates between External and Inhouse samples in the years 2019 (p = 0.0087), 2021 (p = 0.0001), and 2022 (p = 0.0004). d The distribution of tumor purity for cytology samples by outcome of MSK-IMPACT testing for CB (n = 4457; p = 0.00039) and ScfDNA (n = 268; p = 0.63) samples. e Comparisons of total extracted DNA yields for extracted CB (n = 4457) and ScfDNA (n = 268) cytology samples (p = 2.2 × 10−16). Group comparisons for continuous data (d, e) were performed with a two-tailed Mann–Whitney test set at a p < 0.01. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers). Source data are provided as a Source Data file.
Fig. 3
Fig. 3. Sequencing quality metrics of profiled cytology samples.
a Sample coverage distribution by sequenced CB (n = 3707) and ScfDNA (n = 259) cytology samples (p = 2.2 × 1016). The dashed horizontal line indicates the 200× coverage mark in which samples are deemed to have low coverage with concerns for false negative assessment. The solid red line indicates the 50× mark for which samples below this mark are highly considered to be failed due to low coverage. b The distribution of total sample coverage before and after the lowered DNA threshold for sequencing by MSK-IMPACT compared in failed CB samples (n = 841, p = 0.79), successful CB samples (n = 3616, p = 2.2 × 1016), failed ScfDNA samples (n = 78, p = 0.31), and successful ScfDNA samples (n = 190, p = 0.69). c The distribution of non-patient DNA contamination rates and mean (red dot), determined by comparing homozygous SNP sites between the sequenced tumor and matched patient normal sample for CB (n = 4510) and ScfDNA (n = 259) samples (p = 0.00015). The dashed horizontal line indicates a contamination rate of 0.02 for which samples with a higher rate are considered to have a concern for non-patient DNA contamination. The number and percentage of samples that exceed this threshold are shown adjacent to the representative bracket for each sample type. d The distribution of contamination rates between sample preparation methods charted by sample coverage. Samples are colored based on sequencing results. The dashed vertical line indicates the threshold contamination rate of 0.02 and the dashed horizontal line denotes the threshold for adequate coverage (200×). Samples in the top-right quadrant indicate a high contamination rate in the face of adequate coverage, whereas samples in the bottom-right had low coverage that may falsely elevate the contamination rate. Group comparisons for continuous data (ac) were performed with a two-sided Mann-Whitney test set at a p < 0.01. All boxplots show the median (center line with value) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers). Source data are provided as a Source Data file.
Fig. 4
Fig. 4. Comparison of mutation calls between cytology cellblock and supernatant cfDNA samples.
a The proportion of significant genomic alterations identified by tumor type in ScfDNA and CB cytology samples compared to the pan-cancer GENIE cohort (non-cytology). The various colours indicate the highest OncoKB level and oncogenicity associated with the alteration identified. Comparative oncoprints of significant genomic alterations in three common tumor types profiled (bladder cancer, breast cancer, and non-small cell lung cancer) for samples with reported alterations identified from (b) CB and (c) ScfDNA samples. The significance of genomic alterations is coded by the corresponding OncoKB level. Sample level tumor mutational burden (TMB, mutations per megabase) is provided for each corresponding sample at the top with the horizontal dashed line indicating 10 mutations/megabase, for which samples with a higher number are TMB-High.
Fig. 5
Fig. 5. Comparison of mutation calls by MSK-IMPACT between matched cytology and surgical samples.
Cytology samples with a corresponding surgical sample (e.g., core biopsy, resection, etc.) of the same patient tumor profiled by MSK-IMPACT (n = 526) for comparison were identified. A total of 482 cytology CB samples profiled by MSK-IMPACT had a corresponding surgical sample profiled by MSK-IMPACT. a The number of cases tallied by the proportion of genomic alterations identified on the surgical sample that was also identified on the corresponding cytology CB. b The distribution of variant allele frequency (VAF) of shared mutations identified on both cytology CB samples and corresponding surgical samples (p = 4.9 × 1011) of the same patient tumor. c Venn diagram representing the total mutation calls in cytology CB samples only (left), their corresponding surgical sample only (right), and those found in both (middle). d The proportion of significant alterations identified in the cytology CB and corresponding pairs analyzed by if the mutation was identified exclusively in the cytology CB, corresponding surgical sample, or if it was seen in both (shared). The same analysis performed for ScfDNA samples (n = 44) with the proportion of genomic alterations identified (e), VAF distribution of shared alterations (f), venn diagram of mutation calls (g), and proportion of significant alterations identified by tissue sample and those seen in both (h). The p values were assessed as group comparisons for continuous data with a two-tailed Mann–Whitney test set at a p < 0.01. All boxplots show the median (center line) and 25th and 75th percentiles (bounding box) along with the 1.5 interquartile range (whiskers). Illustrations created with BioRender.com (BioRender.com/n48n733).

Update of

Similar articles

References

    1. Garraway, L. A. Genomics-driven oncology: framework for an emerging paradigm. J. Clin. Oncol.31, 1806–1814 (2013). - PubMed
    1. Wakai, T. et al. Next-generation sequencing-based clinical sequencing: toward precision medicine in solid tumors. Int. J. Clin. Oncol.24, 115–122 (2019). - PubMed
    1. Turner, S. R. et al. Feasibility of endobronchial ultrasound transbronchial needle aspiration for massively parallel next-generation sequencing in thoracic cancer patients. Lung Cancer119, 85–90 (2018). - PMC - PubMed
    1. Ramani, N. S. et al. Utilization of cytology smears improves success rates of RNA-based next-generation sequencing gene fusion assays for clinically relevant predictive biomarkers. Cancer Cytopathol.129, 374–382 (2021). - PMC - PubMed
    1. Baum, J. E. et al. Accuracy of next-generation sequencing for the identification of clinically relevant variants in cytology smears in lung adenocarcinoma. Cancer Cytopathol.125, 398–406 (2017). - PubMed

Publication types

MeSH terms

LinkOut - more resources