Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Oct;23(10):1749-62.
doi: 10.1101/gr.157651.113. Epub 2013 Jun 25.

The million mutation project: a new approach to genetics in Caenorhabditis elegans

Affiliations

The million mutation project: a new approach to genetics in Caenorhabditis elegans

Owen Thompson et al. Genome Res. 2013 Oct.

Abstract

We have created a library of 2007 mutagenized Caenorhabditis elegans strains, each sequenced to a target depth of 15-fold coverage, to provide the research community with mutant alleles for each of the worm's more than 20,000 genes. The library contains over 800,000 unique single nucleotide variants (SNVs) with an average of eight nonsynonymous changes per gene and more than 16,000 insertion/deletion (indel) and copy number changes, providing an unprecedented genetic resource for this multicellular organism. To supplement this collection, we also sequenced 40 wild isolates, identifying more than 630,000 unique SNVs and 220,000 indels. Comparison of the two sets demonstrates that the mutant collection has a much richer array of both nonsense and missense mutations than the wild isolate set. We also find a wide range of rDNA and telomere repeat copy number in both sets. Scanning the mutant collection for molecular phenotypes reveals a nonsense suppressor as well as strains with higher levels of indels that harbor mutations in DNA repair genes and strains with abundant males associated with him mutations. All the strains are available through the Caenorhabditis Genetics Center and all the sequence changes have been deposited in WormBase and are available through an interactive website.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Experimental design. Standard mutagenesis protocols were modified by (1) selecting for unc-22/+ animals (twitching in 1% nicotine) in the F1 generation to ensure that the gametes had been effectively exposed to the mutagen; (2) counter-selecting in the F2 generation to generate an unc-22(+) background for the mutant strains; and (3) clonally propagating independent lines for 10 generations, driving strains to homozygosity to simplify variant calling and to create uniform, stable strains for further manipulations.
Figure 2.
Figure 2.
Distribution of mutations along chromosomes. (A) SNV density is plotted across chromosomes I and X for the aggregate variants for mutant strains (Mu) and the wild isolates (WI). The SNV density is essentially uniform along the length of both the autosome and X chromosome in the mutant strains. In contrast, for the wild isolates SNV density on chromosome I is much higher on the arms, where recombination is high and gene density is in general lower than in the centers, where recombination is low and gene density is higher. SNV density along the wild isolates' X chromosome is more uniform. (B) Similar density plots for all short indels and (C) only those outside of homopolymer runs. There is enrichment for short indels on the autosomal arms of both the WI and mutant strains, although this is much more pronounced for the WI. However, the distribution in the mutant strains is much more uniform when only considering the small indel events that do not involve homopolymer runs. The bias toward higher density on the autosomal arms remains for the WI.
Figure 3.
Figure 3.
Large duplication and deletion events. (A) A large duplication of the right half of chromosome II and a deletion within that region (red circles) were detected by increased (duplication) or decreased (deletion) read depth (see Methods for details). Dashed vertical lines demark the change from normal copy number. The boundaries of the arms are indicated by the asterisks along the x-axis. (B) The fraction of reads containing a variant base is plotted across chromosome II for the same strain. Although on the left half of the chromosome, SNVs show essentially 100% of the reads with the variant base, in the region of the duplication the fraction drops <80% for most sites, reflecting the heterogeneity of the sites. The underlying event may represent a translocation event (the right end of chromosome I in this strain is also duplicated) maintained because it provides a function lost from a lethal mutation on the normal diploid chromosome. (C) Distribution of duplication (blue) and deletion (red) events on chromosome II in the 2007 mutant strains. Each segment represents a distinct event present in one or more of the strains. The duplication events predominate and both duplications and deletions are broadly distributed across the chromosomes (see Supplemental Fig. 8 for similar plots for all chromosomes).
Figure 4.
Figure 4.
Mutation effects (SNVs) in mutant strains and wild isolates. The inferred effects of the SNVs in the mutant strains and wild isolates are plotted, showing the disparity in the fraction of mutations affecting coding sequence. Mutations resulting in nonsynonymous, nonsense, and splice site changes are three to 10-fold less frequent in the wild isolates, whereas synonymous changes are similar in number between the two. In addition, the number of mutations in intronic regions is similar between mutant and WI collections, but both UTR and intergenic sites are less abundant in the WI strains.
Figure 5.
Figure 5.
Relative fractions of SNVs in different annotated features in the mutants and wild isolates. (A) The fraction of events observed in different annotated features compared with expected is shown for the mutant collection and the wild isolates. To facilitate comparison, the events have been normalized to synonymous mutations in both sets. In the mutant strains only the nonsense mutants are appreciably depressed as a fraction of expected. In contrast, in the wild isolates, nonsense SNVs are even more severely depressed and missense, splice junction, and to some extent even intergenic SNVs are depressed relative to the expected. (B) The fraction of events in different annotated features for a set of essential and nonessential genes is shown for the mutant set and wild isolates. Genes with deletion alleles produced by the The C. elegans Deletion Mutant Consortium (The C. elegans Deletion Mutant Consortium 2012) were divided into essential (lethal) and nonessential (viable) groups. The fractions of events in essential and nonessential genes within the mutant set are quite similar for all features but nonsense SNVs. The same is true for the wild isolates, but as expected from A the overall proportions are reduced.
Figure 6.
Figure 6.
rDNA copy number varies between strains. The number of copies of the 18S–28S repeat and the 5S repeat is illustrated as estimated by the fraction of reads mapping to the regions. Ninety-two percent of the strains have between 55 and 130 copies of the 18S–28S repeat and 93% have between 130 and 210 copies of the 5S repeat, with extreme outliers for both regions (33–245 and 39–438, respectively).
Figure 7.
Figure 7.
Deletions and duplications in the mitochondrial genome. The coverage of the mitochondrial genome in five mutant strains is shown in a UCSC Genome Browser display, along with the gene content below. Although there are hundreds of copies of the mitochondrial genome in the zygote, both deletions (red bars) and duplications (green bars) were detected in the mutant strains. None of these were fixed in the strains, with both normal and altered length molecules detected in all cases.

Similar articles

Cited by

References

    1. Ahmed S, Hodgkin J 2000. MRT-2 checkpoint protein is required for germline immortality and telomere replication in C. elegans. Nature 403: 159–164 - PubMed
    1. Alkan C, Coe BP, Eichler EE 2011. Genome structural variation discovery and genotyping. Nat Rev Genet 12: 363–376 - PMC - PubMed
    1. Andersen EC, Gerke JP, Shapiro JA, Crissman JR, Ghosh R, Bloom JS, Felix MA, Kruglyak L 2012. Chromosome-scale selective sweeps shape Caenorhabditis elegans genomic diversity. Nat Genet 44: 285–290 - PMC - PubMed
    1. Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S, Adams MD, Myers EW, Li PW, Eichler EE 2002. Recent segmental duplications in the human genome. Science 297: 1003–1007 - PubMed
    1. Bellen HJ, Levis RW, He Y, Carlson JW, Evans-Holm M, Bae E, Kim J, Metaxakis A, Savakis C, Schulze KL, et al. 2011. The Drosophila gene disruption project: Progress using transposons with distinctive site specificities. Genetics 188: 731–743 - PMC - PubMed

Publication types

MeSH terms

Associated data