Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2007 Mar;175(3):1381-93.
doi: 10.1534/genetics.106.065557. Epub 2006 Dec 28.

Background selection in single genes may explain patterns of codon bias

Affiliations
Comparative Study

Background selection in single genes may explain patterns of codon bias

Laurence Loewe et al. Genetics. 2007 Mar.

Abstract

Background selection involves the reduction in effective population size caused by the removal of recurrent deleterious mutations from a population. Previous work has examined this process for large genomic regions. Here we focus on the level of a single gene or small group of genes and investigate how the effects of background selection caused by nonsynonymous mutations are influenced by the lengths of coding sequences, the number and length of introns, intergenic distances, neighboring genes, mutation rate, and recombination rate. We generate our predictions from estimates of the distribution of the fitness effects of nonsynonymous mutations, obtained from DNA sequence diversity data in Drosophila. Results for genes in regions with typical frequencies of crossing over in Drosophila melanogaster suggest that background selection may influence the effective population sizes of different regions of the same gene, consistent with observed differences in codon usage bias along genes. It may also help to cause the observed effects of gene length and introns on codon usage. Gene conversion plays a crucial role in determining the sizes of these effects. The model overpredicts the effects of background selection with large groups of nonrecombining genes, because it ignores Hill-Robertson interference among the mutations involved.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
Background selection in a single gene with the fixed selection coefficient approximation, showing the effects of gene conversion and mutation rates with a crossover rate of 10−8. The selection coefficient s was set equal to the harmonic mean heterozygous selection coefficient, estimated from a DDME with a width of σg = 50, which gives an Nes value of 12.7 and a frequency of effectively neutral mutations, cne, of 12.6%. All curves are paired, with the bottom curve representing the coding sequence of a gene without introns and the top curve that of a gene with four equidistant introns. (A) Different gene conversion rates for u = 4 × 10−9; pairs of solid lines from top to bottom are for rg = 5 × 10−6, 2.5 × 10−6, 0.5 × 10−6; the dashed line is for rg = 0. (B) Different mutation rates for rg = 2.5 × 10−6; pairs of lines from top to bottom are for u = 2 × 10−9, u = 4 × 10−9, and u = 8 × 10−9. The scale for the bottom two curves is on the right.
F<sc>igure</sc> 2.—
Figure 2.—
Background selection in a single gene: average of 100 samples from a DDME, showing the effects of gene conversion and mutation rate. Otherwise this is similar to Figure 1.
F<sc>igure</sc> 3.—
Figure 3.—
Longer exons experience more background selection. Mean values of B = Ne/N0 for a gene are shown, assuming no neighboring genes, no introns, and rg = 2.5 × 10−6. The thickness of the curves denotes the crossing-over rate (thick, 3 × 10−8; medium, 1 × 10−8; thin, 1 × 10−9), line type represents the mutation rate (dotted, 2 × 10−9; dashed, 4 × 10−9; solid, 8 × 10−9), and the highlighted curve indicates a typical gene (dashed line with medium thickness). (A) Fixed selection coefficient is as in Figure 1. (B) Average of 100 samples from the DDME. The bottom three curves belong to the scale on the right.
F<sc>igure</sc> 4.—
Figure 4.—
Longer introns reduce background selection. Mean values of B for a gene of the standard length are shown, assuming no neighboring genes, four equidistant introns, and rg = 2.5 × 10−6. Other features of the curves are as in Figure 3. (A) Fixed selection coefficients. The bottom three lines belong to the scale on the right. (B) Average of 100 samples from the DDME. The bottom three lines belong to the scale on the right.
F<sc>igure</sc> 5.—
Figure 5.—
The effect of neighboring genes on the mean B for a gene under background selection. Both plots show a focal gene and assume four equidistant introns of 100 bp length, 6000 bp intergenic distance, 2000 bp coding sequence for all genes, and rg = 2.5 × 10−6. Other features of the curves are as in Figure 3. (A) Fixed selection coefficient. (B) Average of 100 samples from the DDME.
F<sc>igure</sc> 6.—
Figure 6.—
Neighboring genes do not affect the patterns of background selection within a gene. This plot is like Figure 2A, except that five neighboring genes were located on both sides of the focal gene (with features as in Figure 5).
F<sc>igure</sc> 7.—
Figure 7.—
The effect of neighboring genes on background selection without crossing over. Both plots show the mean B for a focal gene, with the same features of neighboring genes as in Figure 5. However, crossing over is assumed to be absent and curve thickness denotes the rate of gene conversion rg (thick, 5 × 10−6; medium, 2.5 × 10−6; thin, 0.5 × 10−6). Line type represents the mutation rate (dotted, 2 × 10−9; dashed, 4 × 10−9; solid, 8 × 10−9) and the highlighted line indicates a typical gene (dashed line with medium thickness). (A) Fixed selection coefficient. (B) Average of 100 samples from the DDME.
F<sc>igure</sc> 8.—
Figure 8.—
Longer intergenic distances reduce background selection in a region. This plot shows the fixed selection coefficient prediction for the mean B of a focal gene with varying intergenic distances, when five neighboring genes are located on both sides (with features as in Figure 5) and rg = 2.5 × 10−6. Other features of the curves are as in Figure 3.
F<sc>igure</sc> 9.—
Figure 9.—
Predictions of background selection are insensitive to uncertainty in realistic DDME width estimates. For each assumed DDME shape (x-axis), a corresponding estimate of the location parameter was made from the diversity data. The standard gene structure without neighboring genes was assumed, with rg = 2.5 × 10−6. Other features of the curves are as in Figure 3. The bottom three lines belong to the axis on the right. Note that realistic shapes in this case are in the range 20–100, as narrower shapes predict too few lethal mutations and wider shapes predict too many lethal mutations (on the basis of estimates in Loewe and Charlesworth 2006). Details of the assumed DDMEs can be found in Table 1.

Similar articles

Cited by

References

    1. Adams, M. D., S. E. Celniker, R. A. Holt, C. A. Evans, J. D. Gocayne et al., 2000. The genome sequence of Drosophila melanogaster. Science 287: 2185–2195. - PubMed
    1. Aitchison, J., and J. A. C. Brown, 1957. The Lognormal Distribution, With Special Reference to Its Uses in Economics. Cambridge University Press, Cambridge, UK.
    1. Andolfatto, P., and M. Nordborg, 1998. The effect of gene conversion on intralocus associations. Genetics 148: 1397–1399. - PMC - PubMed
    1. Bartolomé, C., and B. Charlesworth, 2006. Evolution of amino-acid sequences and codon usage on the Drosophila miranda neo-sex chromosomes. Genetics 174: 2033–2044. - PMC - PubMed
    1. Bartolomé, C., X. Maside, S. Yi, A. L. Grant and B. Charlesworth, 2005. Patterns of selection on synonymous and nonsynonymous variants in Drosophila miranda. Genetics 169: 1495–1507. - PMC - PubMed

Publication types