Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jul 18;2(7):e617.
doi: 10.1371/journal.pone.0000617.

Modeling insertional mutagenesis using gene length and expression in murine embryonic stem cells

Affiliations

Modeling insertional mutagenesis using gene length and expression in murine embryonic stem cells

Alex S Nord et al. PLoS One. .

Abstract

Background: High-throughput mutagenesis of the mammalian genome is a powerful means to facilitate analysis of gene function. Gene trapping in embryonic stem cells (ESCs) is the most widely used form of insertional mutagenesis in mammals. However, the rules governing its efficiency are not fully understood, and the effects of vector design on the likelihood of gene-trapping events have not been tested on a genome-wide scale.

Methodology/principal findings: In this study, we used public gene-trap data to model gene-trap likelihood. Using the association of gene length and gene expression with gene-trap likelihood, we constructed spline-based regression models that characterize which genes are susceptible and which genes are resistant to gene-trapping techniques. We report results for three classes of gene-trap vectors, showing that both length and expression are significant determinants of trap likelihood for all vectors. Using our models, we also quantitatively identified hotspots of gene-trap activity, which represent loci where the high likelihood of vector insertion is controlled by factors other than length and expression. These formalized statistical models describe a high proportion of the variance in the likelihood of a gene being trapped by expression-dependent vectors and a lower, but still significant, proportion of the variance for vectors that are predicted to be independent of endogenous gene expression.

Conclusions/significance: The findings of significant expression and length effects reported here further the understanding of the determinants of vector insertion. Results from this analysis can be applied to help identify other important determinants of this important biological phenomenon and could assist planning of large-scale mutagenesis efforts.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. Diagram of major mechanisms of gene trapping of an endogenous gene with two exons.
(A) In the SA-trap, the SA site allows trapping when inserted into any part of the gene via plasmid or viral integration. (B) The poly-A trap relies on the poly-A (pA) of the endogenous gene because the neomycin-resistance gene does not have a poly-A tail. Note that the poly-A trap has its own constitutive promoter (prom). Also indicated are the splice donor (SD), splice acceptor (SA), and neomycin resistance (NeoR). The major components of each trap were excluded from this diagram to emphasize on the essential elements needed to understand the trapping models. Detailed maps of each major vector type are referenced in the Methods section.
Figure 2
Figure 2. Trapped genes by length and expression.
For each vector type, genes were plotted according to their size and level of expression in ESCs. Genes that have been trapped are shown in red. The circle size is proportional to the number of times a gene has been trapped.
Figure 3
Figure 3. Models of trap likelihood for gene-trap vectors.
Models of the likelihood of trapping a gene with particular length (x-axis) and expression (y-axis) values for each gene-trap event were created through an iterative process, in which outliers (P<0.001) were removed before the final model was created. Probability (z-axis) is given as events per million traps.

Similar articles

Cited by

  • In vivo protein trapping produces a functional expression codex of the vertebrate proteome.
    Clark KJ, Balciunas D, Pogoda HM, Ding Y, Westcot SE, Bedell VM, Greenwood TM, Urban MD, Skuster KJ, Petzold AM, Ni J, Nielsen AL, Patowary A, Scaria V, Sivasubbu S, Xu X, Hammerschmidt M, Ekker SC. Clark KJ, et al. Nat Methods. 2011 Jun;8(6):506-15. doi: 10.1038/nmeth.1606. Epub 2011 May 8. Nat Methods. 2011. PMID: 21552255 Free PMC article.
  • Beyond knockouts: the International Knockout Mouse Consortium delivers modular and evolving tools for investigating mammalian genes.
    Rosen B, Schick J, Wurst W. Rosen B, et al. Mamm Genome. 2015 Oct;26(9-10):456-66. doi: 10.1007/s00335-015-9598-3. Epub 2015 Sep 4. Mamm Genome. 2015. PMID: 26340938
  • Efficient conditional and promoter-specific in vivo expression of cDNAs of choice by taking advantage of recombinase-mediated cassette exchange using FlEx gene traps.
    Schebelle L, Wolf C, Stribl C, Javaheri T, Schnütgen F, Ettinger A, Ivics Z, Hansen J, Ruiz P, von Melchner H, Wurst W, Floss T. Schebelle L, et al. Nucleic Acids Res. 2010 May;38(9):e106. doi: 10.1093/nar/gkq044. Epub 2010 Feb 5. Nucleic Acids Res. 2010. PMID: 20139417 Free PMC article.
  • Genome Wide Conditional Mouse Knockout Resources.
    Kaloff C, Anastassiadis K, Ayadi A, Baldock R, Beig J, Birling MC, Bradley A, Brown S, Bürger A, Bushell W, Chiani F, Collins FS, Doe B, Eppig JT, Finnel RH, Fletcher C, Flicek P, Fray M, Friedel RH, Gambadoro A, Gates H, Hansen J, Herault Y, Hicks GG, Hörlein A, Hrabé de Angelis M, Iyer V, de Jong PJ, Koscielny G, Kühn R, Liu P, Lloyd KC, Lopez RG, Marschall S, Martínez S, McKerlie C, Meehan T, von Melchner H, Moore M, Murray SA, Nagy A, Nutter L, Pavlovic G, Pombero A, Prosser H, Ramirez-Solis R, Ringwald M, Rosen B, Rosenthal N, Rossant J, Ruiz Noppinger P, Ryder E, Skarnes WC, Schick J, Schnütgen F, Schofield P, Seisenberger C, Selloum M, Smedley D, Simpson EM, Stewart AF, Teboul L, Tocchini Valentini GP, Valenzuela D, West A, Wurst W. Kaloff C, et al. Drug Discov Today Dis Models. 2016 Summer;20:3-12. doi: 10.1016/j.ddmod.2017.08.002. Epub 2017 Sep 12. Drug Discov Today Dis Models. 2016. PMID: 39132094 Free PMC article.
  • Enhanced gene trapping in mouse embryonic stem cells.
    Schnütgen F, Hansen J, De-Zolt S, Horn C, Lutz M, Floss T, Wurst W, Noppinger PR, von Melchner H. Schnütgen F, et al. Nucleic Acids Res. 2008 Nov;36(20):e133. doi: 10.1093/nar/gkn603. Epub 2008 Sep 23. Nucleic Acids Res. 2008. PMID: 18812397 Free PMC article.

References

    1. Bellen H, Levis R, Liao G, He Y, Carlson J, et al. The BDGP gene disruption project: single transposon insertions associated with 40% of Drosophila genes. Genetics, 2004;167(2):761–781. - PMC - PubMed
    1. Drysdale R, Crosby M. FlyBase: genes and gene models. Nucleic Acids Res. 2005;33(Database issue):D390–395. - PMC - PubMed
    1. Matthews K, Kaufman T, Gelbart W. Research resources for Drosophila: the expanding universe. Nat Rev Genet. 2005;6(3):179–193. - PubMed
    1. Baba T, Ara T, Hasegawa M, Takai Y, Okumura Y, et al. Construction of Escherichia coli K-12 in frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol. 2006;2:2006–2008. - PMC - PubMed
    1. Datsenko K, Wanner B. One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. Proc Natl Acad Sci U S A. 2000;97(12):6640–6645. - PMC - PubMed

Publication types