Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Dec 15;27(24):3430-1.
doi: 10.1093/bioinformatics/btr577. Epub 2011 Oct 17.

Enrich: software for analysis of protein function by enrichment and depletion of variants

Affiliations

Enrich: software for analysis of protein function by enrichment and depletion of variants

Douglas M Fowler et al. Bioinformatics. .

Abstract

Summary: Measuring the consequences of mutation in proteins is critical to understanding their function. These measurements are essential in such applications as protein engineering, drug development, protein design and genome sequence analysis. Recently, high-throughput sequencing has been coupled to assays of protein activity, enabling the analysis of large numbers of mutations in parallel. We present Enrich, a tool for analyzing such deep mutational scanning data. Enrich identifies all unique variants (mutants) of a protein in high-throughput sequencing datasets and can correct for sequencing errors using overlapping paired-end reads. Enrich uses the frequency of each variant before and after selection to calculate an enrichment ratio, which is used to estimate fitness. Enrich provides an interactive interface to guide users. It generates user-accessible output for downstream analyses as well as several visualizations of the effects of mutation on function, thereby allowing the user to rapidly quantify and comprehend sequence-function relationships.

Availability and implementation: Enrich is implemented in Python and is available under a FreeBSD license at http://depts.washington.edu/sfields/software/enrich/. Enrich includes detailed documentation as well as a small example dataset.

Contact: dfowler@uw.edu; fields@uw.edu

Supplementary information: Supplementary data is available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Enrich visualizations. Enrich produces three visualizations; examples from the dataset included with Enrich are shown here. (a) The diversity within a library is illustrated by a heatmap of the frequency of each position–mutation combination. (b) The position-averaged change in mutational frequency between two libraries is shown. (c) The log2-scaled enrichment ratio for each position–mutation combination is plotted, individually organized both by position and by amino acid (a single amino acid, serine, is shown). Blue dots indicate the enrichment or depletion of substitutions. Red squares correspond to wild-type residues. Grey squares correspond to unobserved mutations.

References

    1. Alper H., et al. Engineering yeast transcription machinery for improved ethanol tolerance and production. Science. 2006;314:1565–1568. - PubMed
    1. Araya C.L., Fowler D.M. Deep mutational scanning: assessing protein function on a massive scale. Trends Biotechnol. 2011;29:435–442. - PMC - PubMed
    1. Cock P.J., et al. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 2010;38:1767–1771. - PMC - PubMed
    1. Di Niro R., et al. Rapid interactome profiling by massive sequencing. Nucleic Acids Res. 2010;38:e110. - PMC - PubMed
    1. Dias-Neto E., et al. Next-generation phage display: integrating and comparing available molecular tools to enable cost-effective high-throughput analysis. PLoS One. 2009;4:e8338. - PMC - PubMed

Publication types