Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011;6(9):e24668.
doi: 10.1371/journal.pone.0024668. Epub 2011 Sep 13.

Novel molecular and computational methods improve the accuracy of insertion site analysis in Sleeping Beauty-induced tumors

Affiliations

Novel molecular and computational methods improve the accuracy of insertion site analysis in Sleeping Beauty-induced tumors

Benjamin T Brett et al. PLoS One. 2011.

Abstract

The recent development of the Sleeping Beauty (SB) system has led to the development of novel mouse models of cancer. Unlike spontaneous models, SB causes cancer through the action of mutagenic transposons that are mobilized in the genomes of somatic cells to induce mutations in cancer genes. While previous methods have successfully identified many transposon-tagged mutations in SB-induced tumors, limitations in DNA sequencing technology have prevented a comprehensive analysis of large tumor cohorts. Here we describe a novel method for producing genetic profiles of SB-induced tumors using Illumina sequencing. This method has dramatically increased the number of transposon-induced mutations identified in each tumor sample to reveal a level of genetic complexity much greater than previously appreciated. In addition, Illumina sequencing has allowed us to more precisely determine the depth of sequencing required to obtain a reproducible signature of transposon-induced mutations within tumor samples. The use of Illumina sequencing to characterize SB-induced tumors should significantly reduce sampling error that undoubtedly occurs using previous sequencing methods. As a consequence, the improved accuracy and precision provided by this method will allow candidate cancer genes to be identified with greater confidence. Overall, this method will facilitate ongoing efforts to decipher the genetic complexity of the human cancer genome by providing more accurate comparative information from Sleeping Beauty models of cancer.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Dynamic filtering to remove background transposon insertion events.
(A) Read distributions from two independent tumor samples are shown along with the calculated cutoff points using three independent methods: negative binomial (NB), 1% of the top (i.e. most abundant) site and 0.1% of total reads (B) An experiment to simulate a variety of sequence read depths shows that the cutoff method used by the analysis pipeline is influenced by read depth.
Figure 2
Figure 2. Illumina-based LM-PCR analysis consistently identifies transposon insertions in SB-induced tumors.
A total of 62 T-cell lymphomas induced by SB mutagenesis using a ligation-mediated PCR approach were analyzed. Two technical replicates were performed to assess the consistency of the results (A). The results of the technical replicates were compared to assess the reproducibility of the approach. Both the rank (B) and abundance (C) of insertion sites showed a strong positive correlation between the replicate runs.
Figure 3
Figure 3. Determining optimal read depth in SB-induced tumors.
(A) An experiment was performed to simulate various read depths in 30 Vav-SB (left) and 32 CD4-SB (right) tumors. The average number of clonal transposon insertion sites was determined from 20 independent iterations of each read depth simulation. (B) The consistency of transposon insertion site identification varied with sequence depth. For example, only 10% of transposon insertion sites were identified in 100% simulations at a simulated read depth of 1,000.
Figure 4
Figure 4. Comparison of three independent methods to identify CISs within SB-induced tumors.
We compared the performance of Monte Carlo simulation (MC), Gaussian Kernel Convolution (GKC) and gene-centric common insertion site analysis (gCIS) in identifying candidate cancer genes in both Vav-SB (A) and CD4-SB (B) tumors. In addition, the number in parentheses indicates the number of total genes in each region that have evidence as human cancer genes in either the COSMIC or CGC databases. In addition, we compared the results of MC and gCIS analysis using data generated by 454 or Illumina sequencing of the same tumor samples in both Vav-SB (C) and CD4-SB (D) lymphoma models.
Figure 5
Figure 5. Determining the affect of sequence depth on the accuracy of CIS identification.
Read depth simulations were performed as previously described. However, gCIS analysis was performed on each of 20 iterations at all simulated read depths. In addition, the values generated from analysis of the actual 454 data obtained for the same tumor samples are shown. (A) The total number of gCIS genes identified in at least one of the 20 iterations is indicated along with the average number of gCIS genes at each simulated sequence depth. (B) The gCIS results obtained by analyzing all sequence data in both tumor models were used as reference data sets. The accuracy (% of genes found in reference set) and sensitivity (% of genes in reference set that were detected) were determined. For example, only 41% of the reference gCIS genes in the Vav-SB model were found at a depth of 1,000 reads per sample (i.e. 41% sensitivity). However, 90% of gCIS genes identified at this read depth were found in the reference data set (i.e. 90% accuracy).
Figure 6
Figure 6. The affect of varying read depth on the genomic distribution of CIS genes.
The genomic position of each CIS gene is indicated as a vertical bar. The color of the bar indicates the minimum number of reads per sample required to consistently identify the CIS gene at the indicated position. The height of the bar indicates the mutation frequency within the each tumor model as determined by analysis of all sequence data. A large number of false-positive CISs were identified in one or more iterations of the simulation to approximate a read depth of 5,000 (white bars).

Similar articles

Cited by

  • Analyzing tumor heterogeneity and driver genes in single myeloid leukemia cells with SBCapSeq.
    Mann KM, Newberg JY, Black MA, Jones DJ, Amaya-Manzanares F, Guzman-Rojas L, Kodama T, Ward JM, Rust AG, van der Weyden L, Yew CC, Waters JL, Leung ML, Rogers K, Rogers SM, McNoe LA, Selvanesan L, Navin N, Jenkins NA, Copeland NG, Mann MB. Mann KM, et al. Nat Biotechnol. 2016 Sep;34(9):962-72. doi: 10.1038/nbt.3637. Epub 2016 Aug 1. Nat Biotechnol. 2016. PMID: 27479497 Free PMC article.
  • Resistance mechanisms to TP53-MDM2 inhibition identified by in vivo piggyBac transposon mutagenesis screen in an Arf-/- mouse model.
    Chapeau EA, Gembarska A, Durand EY, Mandon E, Estadieu C, Romanet V, Wiesmann M, Tiedt R, Lehar J, de Weck A, Rad R, Barys L, Jeay S, Ferretti S, Kauffmann A, Sutter E, Grevot A, Moulin P, Murakami M, Sellers WR, Hofmann F, Jensen MR. Chapeau EA, et al. Proc Natl Acad Sci U S A. 2017 Mar 21;114(12):3151-3156. doi: 10.1073/pnas.1620262114. Epub 2017 Mar 6. Proc Natl Acad Sci U S A. 2017. PMID: 28265066 Free PMC article.
  • Divergent clonal selection dominates medulloblastoma at recurrence.
    Morrissy AS, Garzia L, Shih DJ, Zuyderduyn S, Huang X, Skowron P, Remke M, Cavalli FM, Ramaswamy V, Lindsay PE, Jelveh S, Donovan LK, Wang X, Luu B, Zayne K, Li Y, Mayoh C, Thiessen N, Mercier E, Mungall KL, Ma Y, Tse K, Zeng T, Shumansky K, Roth AJ, Shah S, Farooq H, Kijima N, Holgado BL, Lee JJ, Matan-Lithwick S, Liu J, Mack SC, Manno A, Michealraj KA, Nor C, Peacock J, Qin L, Reimand J, Rolider A, Thompson YY, Wu X, Pugh T, Ally A, Bilenky M, Butterfield YS, Carlsen R, Cheng Y, Chuah E, Corbett RD, Dhalla N, He A, Lee D, Li HI, Long W, Mayo M, Plettner P, Qian JQ, Schein JE, Tam A, Wong T, Birol I, Zhao Y, Faria CC, Pimentel J, Nunes S, Shalaby T, Grotzer M, Pollack IF, Hamilton RL, Li XN, Bendel AE, Fults DW, Walter AW, Kumabe T, Tominaga T, Collins VP, Cho YJ, Hoffman C, Lyden D, Wisoff JH, Garvin JH Jr, Stearns DS, Massimi L, Schüller U, Sterba J, Zitterbart K, Puget S, Ayrault O, Dunn SE, Tirapelli DP, Carlotti CG, Wheeler H, Hallahan AR, Ingram W, MacDonald TJ, Olson JJ, Van Meir EG, Lee JY, Wang KC, Kim SK, Cho BK, Pietsch T, Fleischhack G, Tippelt S, Ra YS, Bailey S, Lindsey JC, Clifford SC, Eberhart CG, Cooper MK, Packer RJ, Massimino M, Garre ML, Bartels U, Tabori U, H… See abstract for full author list ➔ Morrissy AS, et al. Nature. 2016 Jan 21;529(7586):351-7. doi: 10.1038/nature16478. Epub 2016 Jan 13. Nature. 2016. PMID: 26760213 Free PMC article.
  • A Sleeping Beauty forward genetic screen identifies new genes and pathways driving osteosarcoma development and metastasis.
    Moriarity BS, Otto GM, Rahrmann EP, Rathe SK, Wolf NK, Weg MT, Manlove LA, LaRue RS, Temiz NA, Molyneux SD, Choi K, Holly KJ, Sarver AL, Scott MC, Forster CL, Modiano JF, Khanna C, Hewitt SM, Khokha R, Yang Y, Gorlick R, Dyer MA, Largaespada DA. Moriarity BS, et al. Nat Genet. 2015 Jun;47(6):615-24. doi: 10.1038/ng.3293. Epub 2015 May 11. Nat Genet. 2015. PMID: 25961939 Free PMC article.
  • A Sleeping Beauty screen reveals NF-kB activation in CLL mouse model.
    Zanesi N, Balatti V, Riordan J, Burch A, Rizzotto L, Palamarchuk A, Cascione L, Lagana A, Dupuy AJ, Croce CM, Pekarsky Y. Zanesi N, et al. Blood. 2013 May 23;121(21):4355-8. doi: 10.1182/blood-2013-02-486035. Epub 2013 Apr 16. Blood. 2013. PMID: 23591791 Free PMC article.

References

    1. Mardis ER, Wilson RK. Cancer genome sequencing: a review. Hum Mol Genet. 2009;18:R163–168. - PMC - PubMed
    1. Kool J, Berns A. High-throughput insertional mutagenesis screens in mice to identify oncogenic networks. Nat Rev Cancer. 2009;9:389–399. - PubMed
    1. Dupuy AJ. Transposon-based screens for cancer gene discovery in mouse models. Semin Cancer Biol 2010 - PMC - PubMed
    1. Largaespada DA, Collier LS. Transposon-mediated mutagenesis in somatic cells: identification of transposon-genomic DNA junctions. Methods Mol Biol. 2008;435:95–108. - PMC - PubMed
    1. Uren AG, Kool J, Matentzoglu K, de Ridder J, Mattison J, et al. Large-scale mutagenesis in p19(ARF)- and p53-deficient mice identifies cancer genes and their collaborative networks. Cell. 2008;133:727–741. - PMC - PubMed

Publication types

Substances