Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr:6:e2100326.
doi: 10.1200/PO.21.00326.

Acute Leukemia Classification Using Transcriptional Profiles From Low-Cost Nanopore mRNA Sequencing

Affiliations

Acute Leukemia Classification Using Transcriptional Profiles From Low-Cost Nanopore mRNA Sequencing

Jeremy Wang et al. JCO Precis Oncol. 2022 Apr.

Abstract

Purpose: Most cases of pediatric acute leukemia occur in low- and middle-income countries, where health centers lack the tools required for accurate diagnosis and disease classification. Recent research shows the robustness of using unbiased short-read RNA sequencing to classify genomic subtypes of acute leukemia. Compared with short-read sequencing, nanopore sequencing has low capital and consumable costs, making it suitable for use in locations with limited health infrastructure.

Materials and methods: We show the feasibility of nanopore mRNA sequencing on 134 cryopreserved acute leukemia specimens (26 acute myeloid leukemia [AML], 73 B-lineage acute lymphoblastic leukemia [B-ALL], 34 T-lineage acute lymphoblastic leukemia, and one acute undifferentiated leukemia). Using multiple library preparation approaches, we generated long-read transcripts for each sample. We developed a novel composite classification approach to predict acute leukemia lineage and major B-ALL and AML molecular subtypes directly from gene expression profiles.

Results: We demonstrate accurate classification of acute leukemia samples into AML, B-ALL, or T-lineage acute lymphoblastic leukemia (96.2% of cases are classifiable with a probability of > 0.8, with 100% accuracy) and further classification into clinically actionable genomic subtypes using shallow RNA nanopore sequencing, with 96.2% accuracy for major AML subtypes and 94.1% accuracy for major B-lineage acute lymphoblastic leukemia subtypes.

Conclusion: Transcriptional profiling of acute leukemia samples using nanopore technology for diagnostic classification is feasible and accurate, which has the potential to improve the accuracy of cancer diagnosis in low-resource settings.

PubMed Disclaimer

Conflict of interest statement

Mahler RevsineEmployment: Epic SystemsTravel, Accommodations, Expenses: Epic Systems Mark R. LitzowConsulting or Advisory Role: Omeros, Jazz PharmaceuticalsResearch Funding: Amgen, Astellas Pharma, Actinium Pharmaceuticals, Pluristem Therapeutics, AbbVie/Genentech, Tolero Pharmaceuticals, AbbVieOther Relationship: BioSight Yuri FedoriwHonoraria: Alexion Pharmaceuticals Kathryn G. RobertsStock and Other Ownership Interests: Amgen Charles G. MullighanStock and Other Ownership Interests: AmgenHonoraria: Amgen, IlluminaConsulting or Advisory Role: IlluminaSpeakers' Bureau: Amgen, PfizerResearch Funding: Loxo, Pfizer, AbbViePatents, Royalties, Other Intellectual Property: Inventor on a pending patent application related to gene expression signatures for detection of underlying Philadelphia chromosome–like events and therapeutic targeting in leukemia (PCT/US2012/069228), WO 2021/022076 A1. This Patent Highlight shows representative PROTAC compounds bound to JAK2, where ruxolitinib and baricitinib bind to the human JAK2 JH1. Furthermore, representative data illustrate protein degradation, cytotoxicity, and effect of the JAKSTAT signaling pathway of the PROTAC compounds in MHHCALL-4 cellsTravel, Accommodations, Expenses: Amgen, IlluminaNo other potential conflicts of interest were reported.

Figures

FIG 1.
FIG 1.
Process overview. (A) Sample preparation and sequencing: extract RNA from buffy coat, perform long-range RT-PCR to produce full-length cDNA, and sequence on MinION sequencer. (B) Informatics: raw sequence data are base-called on the local sequencing computer and then transferred to a remote cloud-computing platform where sequences are aligned to the reference transcriptome and a gene expression matrix is constructed and used as input to a machine learning lineage/genomic subtype classifier. (C) Classification: conceptual model of classification approach. We train a set of pairwise PLSDA classifiers, consisting of each type versus each other and each type versus everything else. The set of binary predictions produced by these models is then used to build a new feature matrix, which serves as the input to a SVM classifier for each of the lineage and AML genomic subtypes. The SVM classifiers produce prediction probabilities for each type. AML, acute myeloid leukemia; PCR, polymerase chain reaction; PLSDA, partial least-squares discriminant analysis; RT-PCR, reverse transcriptase polymerase chain reaction; SVM, support vector machine.
FIG 2.
FIG 2.
Association of classification accuracy with the prediction probability. Each point represents a separate case, sorted by known lineage and subtype. The color of each point represents the predicted classification on the basis of ONT RNA sequencing. (A) Classification accuracy of leukemia lineage. (B) Classification accuracy of the AML core genomic subtype. (C) Classification accuracy of the B-ALL genomic subtype. For the leukemia lineage and AML genomic subtype classification, all cases with a prediction probability of > 80% were correctly classified. AML, acute myeloid leukemia; B-ALL, B-lineage acute lymphoblastic leukemia.
FIG 3.
FIG 3.
Transcript versus read length as an indicator of RNA degradation and/or cDNA/PCR fidelity. There is significant variability in quality and proportion of full-length reads, to which classification is very robust. (A) 036, high quality with an average transcript completeness of 58.7%. (B) 043, typical with an average transcript completeness of 36.9%. (C) 048, degraded with an average transcript completeness of 25.5%. (D) 063, degraded with an average transcript completeness of 25.4%. PCR, polymerase chain reaction.

References

    1. Bray F, Soerjomataram I: The changing global burden of cancer: Transitions in human development and implications for cancer prevention and control, in Gelband H, Jha P, Sankaranarayanan R, et al. (eds): Cancer: Disease Control Priorities, Volume 3 (ed 2). Washington (DC), The International Bank for Reconstruction and Development/The World Bank, 2015 - PubMed
    1. Bhakta N, Force LM, Allemani C, et al. : Childhood cancer burden: A review of global estimates. Lancet Oncol 20:e42-e53, 2019 - PubMed
    1. Bray F, Ferlay J, Soerjomataram I, et al. : Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68:394-424, 2018 - PubMed
    1. Ward ZJ, Yeh JM, Bhakta N, et al. : Estimating the total incidence of global childhood cancer: A simulation-based analysis. Lancet Oncol 20:483-493, 2019 - PubMed
    1. Yi M, Li A, Zhou L, et al. : The global burden and attributable risk factor analysis of acute myeloid leukemia in 195 countries and territories from 1990 to 2017: Estimates based on the global burden of disease study 2017. J Hematol Oncol 13:72, 2020 - PMC - PubMed

Publication types

MeSH terms