Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2004 Mar;14(3):354-66.
doi: 10.1101/gr.2133704.

Gene conversion and the evolution of protocadherin gene cluster diversity

Affiliations
Comparative Study

Gene conversion and the evolution of protocadherin gene cluster diversity

James P Noonan et al. Genome Res. 2004 Mar.

Abstract

The synaptic cell adhesion molecules encoded by the protocadherin gene cluster are hypothesized to provide a molecular code involved in the generation of synaptic complexity in the developing brain. Variation in copy number and sequence content of protocadherin cluster genes among vertebrate species could reflect adaptive differences in protocadherin function. We have completed an analysis of zebrafish protocadherin cluster genes. Zebrafish have two unlinked protocadherin clusters, DrPcdh1 and DrPcdh2. Like mammalian protocadherin clusters, DrPcdh1 has both alpha and gamma variable and constant region exons. A consensus protocadherin promoter motif sequence identified in mammals is also conserved in zebrafish. Few orthologous relationships, however, are apparent between zebrafish and mammalian protocadherin proteins. Here we show that protocadherin cluster genes in human, mouse, rat, and zebrafish are subject to striking gene conversion events. These events are restricted to regions of the coding sequence, particularly the coding sequences of ectodomain 6 and the cytoplasmic domain. Diversity among paralogs is restricted to particular ectodomains that are excluded from conversion events. Conversion events are also strongly correlated with an increase in third-position GC content. We propose that the combination of lineage-specific duplication, restricted gene conversion, and adaptive variation in diversified ectodomains drives vertebrate protocadherin cluster evolution.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Organization of protocadherin gene clusters in zebrafish. Subgroups of paralogous variable exons are indicated by color. Constant region exons are in white. Pseudogenes (ψ) are in gray. (A) Zebrafish Pcdh1α and Pcdh1γ. (B) Partial zebrafish Pcdh2. Sequenced BAC clone names and their locations are shown below each gene cluster.
Figure 2
Figure 2
WebLogo plots of consensus mammalian Pcdh (A), DrPcdh1 (B), and DrPcdh2α (C) core promoter motifs identified using MEME. Mammalian Pcdh and zebrafish Pcdh1 promoter motifs are virtually identical, but the zebrafish Pcdh2α motif has a divergent CGCT box.
Figure 3
Figure 3
Maximum-likelihood phylogenies of zebrafish and human protocadherin cluster proteins. Members of paralog subgroups are indicated by color as in Figure 1. For clarity, subtrees of human protocadherin subgroups are shown as single branches in each tree, except for human C-type protocadherins, which are shown individually in purple. Trees are rooted by midpoint. (A) Protein tree of DrPcdh1α, DrPcdh2α, and human Pcdhα. (B) Protein tree of DrPcdh1γ, human Pcdhβ, and Pcdhγ.
Figure 3
Figure 3
Maximum-likelihood phylogenies of zebrafish and human protocadherin cluster proteins. Members of paralog subgroups are indicated by color as in Figure 1. For clarity, subtrees of human protocadherin subgroups are shown as single branches in each tree, except for human C-type protocadherins, which are shown individually in purple. Trees are rooted by midpoint. (A) Protein tree of DrPcdh1α, DrPcdh2α, and human Pcdhα. (B) Protein tree of DrPcdh1γ, human Pcdhβ, and Pcdhγ.
Figure 4
Figure 4
Ectodomain-specific sequence homogenization in protocadherin cluster genes. The human Pcdhβ ectodomain (EC) 3 gene tree (A) reflects the sequence diversity among Pcdhβ paralogs. Pcdhβ ectodomain 6, however, is almost completely homogenized (B), with otherwise diverse Pcdhβ genes (e.g., β3 and β9) having identical EC6 sequences. This phenomenon is pronounced in zebrafish, where DrPcdh1γ genes with very divergent EC3 domains (C) have nearly identical EC6 domains (D).
Figure 5
Figure 5
Sequence homogenization in protocadherin cluster genes is lineage-specific. The gene tree of human, mouse, and rat Pcdhα (A) and Pcdhβ (C) EC3 domains recapitulates the orthologous relationships among the full-length genes. These relationships break down in Pcdhα EC5 (B) and Pcdhβ EC6 (D) domains, where paralogs within each species are more similar to each other than they are to their orthologs in related species.
Figure 6
Figure 6
The correlation of paralogous sequence diversity at synonymous sites and third-position GC content implicates gene conversion in Pcdh homogenization. Neutral paralog sequence diversity vs. third-position GC content (GC3) is shown for ectodomains 1–6 and the cytoplasmic domain from various protocadherin paralog subgroups. (A) Human, mouse, and rat Pcdhα. (B) Human, mouse, and rat Pcdhβ. (C) Human, mouse, and rat PcdhγA. (D) Human, mouse, and rat PcdhγB. (E) Zebrafish Pcdh1γ. (F) Zebrafish Pcdh1α and Pcdh2α.

References

    1. Ahn, B.-Y., Dornfeld, K.J., Fagrelius, T.J., and Livingston, D.M. 1988. Effect of limited homology on gene conversion in a Saccharomyces cerevisiae plasmid recombination system. Mol. Cell. Biol. 8: 2442–2448. - PMC - PubMed
    1. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389–3402. - PMC - PubMed
    1. Amores, A., Force, A., Yan, Y.L., Joly, L., Amemiya, C., Fritz, A., Ho, R.K., Langeland, J., Prince, V., Wang, Y.L., et al. 1998. Zebrafish hox clusters and vertebrate genome evolution. Science 282: 1711–1714. - PubMed
    1. Bailey, T. and Elkan, C. 1994. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology, pp. 28–36, AAAI Press, Menlo Park, CA. - PubMed
    1. Birdsell, J.A. 2002. Integrating genomics, bioinformatics, and classical genetics to study the effects of recombination on genome evolution. Mol. Biol. Evol. 19: 1181–1197. - PubMed

Publication types

Associated data

LinkOut - more resources