Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Aug 16;11(1):892.
doi: 10.1038/s41597-024-03741-y.

Targeted DNA-seq and RNA-seq of Reference Samples with Short-read and Long-read Sequencing

Affiliations

Targeted DNA-seq and RNA-seq of Reference Samples with Short-read and Long-read Sequencing

Binsheng Gong et al. Sci Data. .

Abstract

Next-generation sequencing (NGS) has revolutionized genomic research by enabling high-throughput, cost-effective genome and transcriptome sequencing accelerating personalized medicine for complex diseases, including cancer. Whole genome/transcriptome sequencing (WGS/WTS) provides comprehensive insights, while targeted sequencing is more cost-effective and sensitive. In comparison to short-read sequencing, which still dominates the field due to high speed and cost-effectiveness, long-read sequencing can overcome alignment limitations and better discriminate similar sequences from alternative transcripts or repetitive regions. Hybrid sequencing combines the best strengths of different technologies for a more comprehensive view of genomic/transcriptomic variations. Understanding each technology's strengths and limitations is critical for translating cutting-edge technologies into clinical applications. In this study, we sequenced DNA and RNA libraries of reference samples using various targeted DNA and RNA panels and the whole transcriptome on both short-read and long-read platforms. This study design enables a comprehensive analysis of sequencing technologies, targeting protocols, and library preparation methods. Our expanded profiling landscape establishes a reference point for assessing current sequencing technologies, facilitating informed decision-making in genomic research and precision medicine.

PubMed Disclaimer

Conflict of interest statement

Upon the completion of this study, N.N., A.B.L., S.H., and C.P-P. are affiliated with Agilent Technologies, Inc., J.S.L. is affiliated with Illumina Inc., T.A.R. is affiliated with Roche Sequencing Solutions Inc., E.T. is affiliated with PacBio. Other authors declare no competing interest.

Figures

Fig. 1
Fig. 1
Illustration of study design. (a) To create reference samples C, D, and E: samples A and B were mixed in the ratios of 1:1, 1:4, and 4:1 respectively. (b) Three types of libraries were prepared for the reference samples: targeted RNA, targeted DNA, and whole transcriptome RNA. The libraries were sequenced using short-read or long-read sequencing methods, or both. The panel codes are explained in Table 2. The pink “S” represents short-read sequencing, while green “L” represents long-read sequencing. The ILMR3 panel is a whole exome RNA panel, and it was placed under “targeted RNA” for visual simplicity. (c) Both targeted DNA and targeted RNA libraries were sequenced with short-read sequencing. For targeted DNA libraries, four library replicates (lib1-4) were prepared for Samples A, B, and C using AGLR1, AGLR2, and ROCR2. * Three library replicates (lib1-3) were prepared using ROCR1. Sample D was sequenced with ROCR1 instead of Sample C. For targeted RNA libraries, four library replicates (lib1-4) were prepared for Samples A, B, C, D, and E using 7 panels. (d) Targeted RNA libraries of Sample A, B, and C were made with three panels, each library was split into different fractions (F1, F1 + 2, or F3), and sequenced with both long-read sequencing platforms. (e) † ROCR2 was only used for Sample A and the libraries was sequenced only on Nanopore. ‡ Sample B was sequenced by Nanopore Direct RNA protocol only. (f) An illustration shows the flexible options for possible comparison analyses for an in-depth study of the impacts of targeting, size selection, and sequencing protocols.

References

    1. Sequencing Quality Control 2. Nature Biotechnology Web Collectionhttps://www.nature.com/collections/SEQC2 (2021).
    1. Wang, Z., Gerstein, M. & Snyder, M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet10, 57–63 (2009). 10.1038/nrg2484 - DOI - PMC - PubMed
    1. Byron, S. A., Van Keuren-Jensen, K. R., Engelthaler, D. M., Carpten, J. D. & Craig, D. W. Translating RNA sequencing into clinical diagnostics: opportunities and challenges. Nat Rev Genet17, 257–271 (2016). 10.1038/nrg.2016.10 - DOI - PMC - PubMed
    1. Ye, H., Meehan, J., Tong, W. & Hong, H. Alignment of Short Reads: A Crucial Step for Application of Next-Generation Sequencing Data in Precision Medicine. Pharmaceutics7, 523–541 (2015). 10.3390/pharmaceutics7040523 - DOI - PMC - PubMed
    1. Mittempergher, L. et al. MammaPrint and BluePrint Molecular Diagnostics Using Targeted RNA Next-Generation Sequencing Technology. J Mol Diagn21, 808–823 (2019). 10.1016/j.jmoldx.2019.04.007 - DOI - PubMed

LinkOut - more resources