2019 Jun 5;9(6):1795-1805.
doi: 10.1534/g3.119.400071.

The Genome of C57BL/6J "Eve", the Mother of the Laboratory Mouse Genome Reference Strain

Vishal Kumar Sarsani et al. G3 (Bethesda).

Abstract

Isogenic laboratory mouse strains enhance reproducibility because individual animals are genetically identical. For the most widely used isogenic strain, C57BL/6, there exists a wealth of genetic, phenotypic, and genomic data, including a high-quality reference genome (GRCm38.p6). Now 20 years after the first release of the mouse reference genome, C57BL/6J mice are at least 26 inbreeding generations removed from GRCm38 and the strain is now maintained with periodic reintroduction of cryorecovered mice derived from a single breeder pair, aptly named Adam and Eve. To provide an update to the mouse reference genome that more accurately represents the genome of today's C57BL/6J mice, we took advantage of long read, short read, and optical mapping technologies to generate a de novo assembly of the C57BL/6J Eve genome (B6Eve). Using these data, we have addressed recurring variants observed in previous mouse genomic studies. We have also identified structural variations, closed gaps in the mouse reference assembly, and revealed previously unannotated coding sequences. This B6Eve assembly explains discrepant observations that have been associated with GRCm38-based analyses, and will inform a reference genome that is more representative of the C57BL/6J mice that are in use today.

Keywords: C57BL/6J; Mus musculus domesticus; de novo genome assembly; laboratory mouse; long read sequencing; reference genomes; reproducibility.

Figures

Figure 1
Origin of the inbred strain C57BL/6J. Inbred laboratory mouse strains are maintained by brother x sister mating. Filial (F) generations from which mice contributing to the reference assembly clone libraries and from which the B6Eve mouse were derived are shown. Cryopreserved embryo stock is represented by blue snowflakes at F226, 3 generations from Adam and Eve at F223. Generations subsequent to the cryopreservation event are F226p###, e.g., F226p230, which means embryos cryopreserved at F226 were recovered and there were an additional 4 generations of subsequent inbreeding.
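
The generation labels in this caption can be read mechanically. The following is a minimal sketch, assuming labels always follow the `F<frozen>p<current>` pattern of the caption's one example (F226p230); the function name and the regular expression are illustrative and do not come from the paper.

```python
import re

# Assumed label shape, inferred only from the caption's example "F226p230":
# "F", the generation at cryopreservation, "p", the current generation.
LABEL = re.compile(r"F(\d+)p(\d+)")


def generations_since_cryopreservation(label: str) -> int:
    """Return how many inbreeding generations followed embryo recovery."""
    match = LABEL.fullmatch(label)
    if match is None:
        raise ValueError(f"unrecognized generation label: {label!r}")
    frozen = int(match.group(1))   # generation at which embryos were frozen
    current = int(match.group(2))  # generation of the animal in question
    if current < frozen:
        raise ValueError("current generation precedes cryopreservation")
    return current - frozen


print(generations_since_cryopreservation("F226p230"))
```

For the caption's example, this returns the 4 additional generations described there.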
Figure 2
Schematic overview of the de novo assembly procedure for B6Eve. Details are described in Methods.
Figure 3
Ideogram of GRCm38 assembly annotated to highlight resolved gaps (vs. current reference), structural variants, and fixed variation using B6Eve data.
Figure 4
The Mia3 locus from the perspective of both the B6Eve assembly (top) and the GRCm38 mouse reference (bottom). CAT annotation of B6Eve identified three isoforms with an IsoSeq supported exon not found in the reference. The cactus alignments (blue bars) show that there are 43 bp of reference sequence that does not align to B6Eve, and that there are 638 bp of B6Eve not seen in the reference. These 638 bp contain the extra exon. This result is confirmed in the B6Eve IsoSeq GRCm38 alignment, which shows an insertion (white blocks between gray exon alignments).
