Comparative Study

. 2020 Nov 17;324(19):1957-1969.

doi: 10.1001/jama.2020.20457.

Detection of Pathogenic Variants With Germline Genetic Testing Using Deep Learning vs Standard Methods in Patients With Prostate Cancer and Melanoma

Saud H AlDubayan^{1

2

3

4}, Jake R Conway^{1

2

5}, Sabrina Y Camp^{1

2}, Leora Witkowski⁶, Eric Kofman^{1

2}, Brendan Reardon^{1

2}, Seunghun Han^{1

2

7}, Nicholas Moore^{1

2}, Haitham Elmarakeby^{1

2

8}, Keyan Salari⁹, Hani Choudhry¹⁰, Abdullah M Al-Rubaish¹¹, Abdulsalam A Al-Sulaiman¹¹, Amein K Al-Ali¹¹, Amaro Taylor-Weiner^{2

5}, Eliezer M Van Allen^{1

2}

Affiliations

¹ Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Harvard University, Boston, Massachusetts.
² Cancer Program, Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge.
³ Division of Genetics, Brigham and Women's Hospital, Boston, Massachusetts.
⁴ College of Medicine, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia.
⁵ Division of Medical Sciences, Department of Biomedical Informatics, Harvard University, Boston, Massachusetts.
⁶ Genetics Training Program, Harvard Medical School, Harvard University, Boston, Massachusetts.
⁷ Program in Biological and Biomedical Sciences, Division of Medical Sciences, Harvard University, Boston, Massachusetts.
⁸ Department of System and Computer Engineering, Al-Azhar University, Cairo, Egypt.
⁹ Department of Urology, Massachusetts General Hospital, Boston.
¹⁰ Department of Biochemistry, Cancer Metabolism and Epigenetic Unit, Faculty of Science, Cancer and Mutagenesis Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia.
¹¹ College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia.

PMID: 33201204
PMCID: PMC7672519
DOI: 10.1001/jama.2020.20457

Comparative Study

Detection of Pathogenic Variants With Germline Genetic Testing Using Deep Learning vs Standard Methods in Patients With Prostate Cancer and Melanoma

Saud H AlDubayan et al. JAMA. 2020.

. 2020 Nov 17;324(19):1957-1969.

doi: 10.1001/jama.2020.20457.

Authors

Affiliations

¹ Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Harvard University, Boston, Massachusetts.
² Cancer Program, Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge.
³ Division of Genetics, Brigham and Women's Hospital, Boston, Massachusetts.
⁴ College of Medicine, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia.
⁵ Division of Medical Sciences, Department of Biomedical Informatics, Harvard University, Boston, Massachusetts.
⁶ Genetics Training Program, Harvard Medical School, Harvard University, Boston, Massachusetts.
⁷ Program in Biological and Biomedical Sciences, Division of Medical Sciences, Harvard University, Boston, Massachusetts.
⁸ Department of System and Computer Engineering, Al-Azhar University, Cairo, Egypt.
⁹ Department of Urology, Massachusetts General Hospital, Boston.
¹⁰ Department of Biochemistry, Cancer Metabolism and Epigenetic Unit, Faculty of Science, Cancer and Mutagenesis Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia.
¹¹ College of Medicine, Imam Abdulrahman bin Faisal University, Dammam, Saudi Arabia.

PMID: 33201204
PMCID: PMC7672519
DOI: 10.1001/jama.2020.20457

Abstract

Importance: Less than 10% of patients with cancer have detectable pathogenic germline alterations, which may be partially due to incomplete pathogenic variant detection.

Objective: To evaluate if deep learning approaches identify more germline pathogenic variants in patients with cancer.

Design, setting, and participants: A cross-sectional study of a standard germline detection method and a deep learning method in 2 convenience cohorts with prostate cancer and melanoma enrolled in the US and Europe between 2010 and 2017. The final date of clinical data collection was December 2017.

Exposures: Germline variant detection using standard or deep learning methods.

Main outcomes and measures: The primary outcomes included pathogenic variant detection performance in 118 cancer-predisposition genes estimated as sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). The secondary outcomes were pathogenic variant detection performance in 59 genes deemed actionable by the American College of Medical Genetics and Genomics (ACMG) and 5197 clinically relevant mendelian genes. True sensitivity and true specificity could not be calculated due to lack of a criterion reference standard, but were estimated as the proportion of true-positive variants and true-negative variants, respectively, identified by each method in a reference variant set that consisted of all variants judged to be valid from either approach.

Results: The prostate cancer cohort included 1072 men (mean [SD] age at diagnosis, 63.7 [7.9] years; 857 [79.9%] with European ancestry) and the melanoma cohort included 1295 patients (mean [SD] age at diagnosis, 59.8 [15.6] years; 488 [37.7%] women; 1060 [81.9%] with European ancestry). The deep learning method identified more patients with pathogenic variants in cancer-predisposition genes than the standard method (prostate cancer: 198 vs 182; melanoma: 93 vs 74); sensitivity (prostate cancer: 94.7% vs 87.1% [difference, 7.6%; 95% CI, 2.2% to 13.1%]; melanoma: 74.4% vs 59.2% [difference, 15.2%; 95% CI, 3.7% to 26.7%]), specificity (prostate cancer: 64.0% vs 36.0% [difference, 28.0%; 95% CI, 1.4% to 54.6%]; melanoma: 63.4% vs 36.6% [difference, 26.8%; 95% CI, 17.6% to 35.9%]), PPV (prostate cancer: 95.7% vs 91.9% [difference, 3.8%; 95% CI, -1.0% to 8.4%]; melanoma: 54.4% vs 35.4% [difference, 19.0%; 95% CI, 9.1% to 28.9%]), and NPV (prostate cancer: 59.3% vs 25.0% [difference, 34.3%; 95% CI, 10.9% to 57.6%]; melanoma: 80.8% vs 60.5% [difference, 20.3%; 95% CI, 10.0% to 30.7%]). For the ACMG genes, the sensitivity of the 2 methods was not significantly different in the prostate cancer cohort (94.9% vs 90.6% [difference, 4.3%; 95% CI, -2.3% to 10.9%]), but the deep learning method had a higher sensitivity in the melanoma cohort (71.6% vs 53.7% [difference, 17.9%; 95% CI, 1.82% to 34.0%]). The deep learning method had higher sensitivity in the mendelian genes (prostate cancer: 99.7% vs 95.1% [difference, 4.6%; 95% CI, 3.0% to 6.3%]; melanoma: 91.7% vs 86.2% [difference, 5.5%; 95% CI, 2.2% to 8.8%]).

Conclusions and relevance: Among a convenience sample of 2 independent cohorts of patients with prostate cancer and melanoma, germline genetic testing using deep learning, compared with the current standard genetic testing method, was associated with higher sensitivity and specificity for detection of pathogenic variants. Further research is needed to understand the relevance of these findings with regard to clinical outcomes.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Dr Moore reported receiving personal fees from Immunity Health. Dr Van Allen reported serving on advisory boards or as a consultant to Tango Therapeutics, Genome Medical, Invitae, Illumina, Manifold Bio, Monte Rosa Therapeutics, and Enara Bio; receiving personal fees from Invitae, Tango Therapeutics, Genome Medical, Ervaxx, Roche/Genentech, and Janssen; receiving research support from Novartis and Bristol-Myers Squibb; having equity in Tango Therapeutics, Genome Medical, Syapse, Enara Bio, Manifold Bio, and Microsoft; receiving travel reimbursement from Roche and Genentech; and filing institutional patents (for ERCC2 variants and chemotherapy response, chromatin variants and immunotherapy response, and methods for clinical interpretation). No other disclosures were reported.

Figures

**Figure 1.. Overview of the Study Design and the Germline Variant Detection Methods Used**
Germline whole-exome sequencing data of 2 independent cohorts were analyzed using a deep learning variant detection approach and the current standard germline variant detection method to assess the sensitivity, specificity, positive predictive value, and negative predictive value of detecting pathogenic variants in 118 cancer-predisposition genes as well as in 59 American College of Medical Genetics and Genomics (ACMG) genes. In addition, an exome-wide analysis of putative loss-of-function (pLOF) variants in 5197 Online Mendelian Inheritance in Man (OMIM) genes and in 12 clinically oriented multigene panels was conducted in 286 patients with prostate cancer and in 1295 patients with melanoma whose tumors were available for independent variant validation. For each analysis, the performance of the deep learning method and the standard method was compared with the performance obtained when both methods are concurrently used (additional information appears in the Methods section). The performance of the standard and deep learning methods to detect pathogenic variants in the cancer-predisposition and ACMG gene sets was tested on all patients with prostate cancer and melanoma (n = 2367). However, the performance of the 2 methods in the OMIM gene set and the multigene panels was tested on 286 patients with prostate cancer and 1295 patients with melanoma (n = 1581) whose tumor sequencing data were available for computational validation of the identified pLOF variants. ^aRepresents the frequency of the allele (also called variant) in the general population. ^bGenetic variants that are likely to severely disrupt the protein function by truncating the gene transcript. Examples of pLOF variants include frameshift, stop-codon, and canonical splice-site variants.

**Figure 2.. Examples of Germline Pathogenic Variants That Were Exclusively Identified by the Deep Learning or Standard Variant Detection Methods**
Thirty-six germline pathogenic variants were identified by the deep learning method, 75% of which were judged to be valid or true-positive (A), whereas 27 variants were identified by the standard method, 40.7% of which were judged to be valid (B). A similar analysis of 1295 patients with melanoma showed a similarly higher detection rate with the deep learning method (eFigure 4). Examples show pathogenic variants in the cancer-predisposition gene set (C-E) and the clinically actionable American College of Medical Genetics and Genomics gene set (F-H) that were only identified by the deep learning method and judged to be valid on manual assessment.

**Figure 3.. Evaluation of Model Performance and Normalized Confusion Matrices of the Standard and Deep Learning Methods**
Manually assessed pathogenic variants in 151 cancer-predisposition and American College of Medical Genetics and Genomics genes were used to calculate the receiver operating characteristic curve using a set of thresholds for the quality scores. For each quality threshold, the standard and deep learning models predicted if the variant being assessed is real or artifactual. These predictions were then compared with the results of the manual validation of these pathogenic variants (judged to be valid or not) and true-positive, true-negative, false-positive, and false-negative rates were calculated. Analysis of prostate cancer cohort (n = 1072) (A). Normalized confusion matrices (B and C), representing the fractions of the manually validated true-positive and false-positive variants (ie, the reference variant set; y-axis) that were correctly identified as such by the standard method and the deep learning method (x-axis). For each matrix, the left-upper and right-lower squares represent the degree of agreement while the right-upper and left-lower squares represent the degree of mismatching between the results of manual validation and the method-based variant assessment. The intensity of the square shading represents the metric within the square, which is the ratio of the number of variants in each category to the total number of true-positive or true-negative in the reference variant set (ie, the manual validation set). Compared with the standard method, there was more agreement between the variant assessment results of the deep learning method and the manual validation results in the prostate cancer cohort (n = 1072). Analysis of the melanoma cohort (n = 1295) (D). Similarly, there was more agreement between the deep learning–based variant assessment and the manual evaluation of these variants in the raw genomic data of the melanoma cohort (E and F).

See this image and copyright information in PMC

Comment in

Bioinformatics, Sequencing Accuracy, and the Credibility of Clinical Genomics.
Feero WG. Feero WG. JAMA. 2020 Nov 17;324(19):1945-1947. doi: 10.1001/jama.2020.19939. JAMA. 2020. PMID: 33201189 No abstract available.
Urological Oncology: Prostate Cancer.
Taneja SS. Taneja SS. J Urol. 2021 May;205(5):1515-1517. doi: 10.1097/JU.0000000000001673. Epub 2021 Feb 24. J Urol. 2021. PMID: 33625902 No abstract available.

References

1. Rebbeck TR, Friebel T, Lynch HT, et al. Bilateral prophylactic mastectomy reduces breast cancer risk in BRCA1 and BRCA2 mutation carriers: the PROSE Study Group. J Clin Oncol. 2004;22(6):1055-1062. doi: 10.1200/JCO.2004.04.188 - DOI - PubMed
1. AlDubayan SH. Leveraging clinical tumor-profiling programs to achieve comprehensive germline-inclusive precision cancer medicine. JCO Precision Oncology. 2019;3:1-3. doi: 10.1200/PO.19.00108 - DOI - PubMed
1. Litton JK, Rugo HS, Ettl J, et al. Talazoparib in patients with advanced breast cancer and a germline BRCA mutation. N Engl J Med. 2018;379(8):753-763. doi: 10.1056/NEJMoa1802905 - DOI - PMC - PubMed
1. Le DT, Durham JN, Smith KN, et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade. Science. 2017;357(6349):409-413. doi: 10.1126/science.aan6733 - DOI - PMC - PubMed
1. AlDubayan SH, Pyle LC, Gamulin M, et al. ; Regeneron Genetics Center (RGC) Research Team . Association of inherited pathogenic variants in checkpoint kinase 2 (CHEK2) with susceptibility to testicular germ cell tumors. JAMA Oncol. 2019;5(4):514-522. doi: 10.1001/jamaoncol.2018.6477 - DOI - PMC - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Detection of Pathogenic Variants With Germline Genetic Testing Using Deep Learning vs Standard Methods in Patients With Prostate Cancer and Melanoma

Affiliations

Detection of Pathogenic Variants With Germline Genetic Testing Using Deep Learning vs Standard Methods in Patients With Prostate Cancer and Melanoma

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Comment in

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical