Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2018 Dec 1;10(12):3176-3187.
doi: 10.1093/gbe/evy244.

Tandem Repeats Contribute to Coding Sequence Variation in Bumblebees (Hymenoptera: Apidae)

Affiliations
Comparative Study

Tandem Repeats Contribute to Coding Sequence Variation in Bumblebees (Hymenoptera: Apidae)

Xiaomeng Zhao et al. Genome Biol Evol. .

Abstract

Tandem repeats (TRs) are highly dynamic regions of the genome. Mutations at these loci represent a significant source of genetic variation and can facilitate rapid adaptation. Bumblebees are important pollinating insects occupying a wide range of habitats. However, to date, molecular mechanisms underlying the potential adaptation of bumblebees to diverse habitats are largely unknown. In the present study, we investigate how TRs contribute to genetic variation in bumblebees, thus potentially facilitating adaptation. We identified 26,595 TRs from the assembled 18 chromosome sequences of the buff-tailed bumblebee (Bombus terrestris), 66.7% of which reside in genic regions. We also compared TRs found in B. terrestris with those present in the assembled genome sequence of a congener, B. impatiens. We found that a total of 1,137 TRs were variable in length between the two sequenced bumblebee species, and further analysis reveals that 101 of them are located within coding regions. These 101 TRs are responsible for coding sequence variation and correspond to protein sequence length variation between the two bumblebee species. The variability of identified TRs in coding regions between bumblebees was confirmed by PCR amplification of a subset of loci. Functional classification of bumblebee genes where coding sequences include variable-length TRs suggests that a majority of genes (87%) that could be assigned to a protein class are related to transcriptional regulation. Our results show that TRs contribute to coding sequence variation in bumblebees, and thus may facilitate the adaptation of bumblebees through diversifying proteins involved in controlling gene expression.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
—Repeat unit features for identified bumblebee TRs. (A) Repeat unit length distribution of TRs. Only repeat unit lengths, at which there are >100 TR loci, are shown. (B) The top ten most abundant repeat unit sequences.
<sc>Fig</sc>. 2.
Fig. 2.
—Distribution features for bumblebee TR loci. (A) TR locus length distribution. (B) The distance between TRs and predicted genes. As shown in the figure, a majority of TRs identified from assembled portion of Bombus terrestris genome reside within genes.
<sc>Fig</sc>. 3.
Fig. 3.
—The relationship between repeat unit length and observed mutation probability for bumblebee TRs. The ratio between the number of TRs showing length variability and the number of TRs that do not exhibit variability in length was plotted against the repeat unit length of TRs.
<sc>Fig</sc>. 4.
Fig. 4.
—An example of TRs contributing to bumblebee coding sequence variation. (A) Pairwise alignments of TR arrays between Bombus terrestris and B. impatiens. Colored letters indicate TR array sequences, while black letters show their flanking sequences. The TR array has a repeat unit of CAG, and there are five more repeat units in B. terrestris than in B. impatiens. The coordinate for the variable TR is NC_015770.1: 2190704–2190753 in B. terrestris. (B) Pairwise alignments of protein sequences encoded by genes containing the variable TR. Colored letters indicate TR array sequences, while black letters show their flanking sequences. There are five more glutamine residues (Q) in B. terrestris than in B. impatiens. Genes containing this variable TR encode nuclear receptor corepressor (protein IDs are XP_012166765.1 and XP_012249688.1 in B. terrestris and B. impatiens, respectively).
<sc>Fig</sc>. 5.
Fig. 5.
—PCR amplification of variable-length TRs residing in coding sequences in specimens of Bombus terrestris and B. impatiens. (A) A schematic showing the principle of primer design and PCR amplification. (B) PCR amplification of the variable-length TRs residing in the gene that encodes FERM, RhoGEF, and pleckstrin domain-containing protein (protein ID: XP_012169724.1 for B. terrestris). This figure indicates that, for the given TR locus, there is fixed length variation between the two species. (C) PCR amplification of the variable-length TRs residing in the gene that encodes hexamerin (protein ID: XP_012169664.1 for B. terrestris). This figure indicates that, for the given TR locus, there is length variation within species.
<sc>Fig</sc>. 6.
Fig. 6.
—Comparison of variable-length TRs in coding sequences across species. (A) TR evolved by the loss of one repeat unit in Bombus impatiens. Colored letters indicate the repeat units of focal TR array, while black letters show its flanking sequences. The sequences involved in the multiple alignments are Apis mellifera (Group9: 2931618–2931857), B. terrestris (NC_015770.1: 10495478–10495717), and B. impatiens (NT_176565.1: 370528–370755). The gene containing this variable TR encodes a cyclin-dependent kinase inhibitor (protein ID is XP_012167390.1 in B. terrestris). (B) TR evolved by the gain of two repeat units in B. terrestris. Colored letters indicate the repeat units of focal TR array, while black letters show its flanking sequences. The sequences involved in the multiple alignments are A. mellifera (Group2: 3483900–3484134), B. terrestris (NC_015763.1: 9110312–9110594), and B. impatiens (NT_176945.1: 210140–210374). The gene containing this variable TR encodes an RNA polymerase-associated protein (protein ID is XP_012175579.1 in B. terrestris).
<sc>Fig</sc>. 7.
Fig. 7.
—Functional classification of genes that include variable-length TRs. (A) The number of genes classified in each molecular function category. (B) The number of genes classified in each protein class. The gene number shown in the nucleic acid binding category excludes transcription factors. (C) Biological processes that genes including variable-length TRs are involved in.

Similar articles

  • Genome-wide identification of accessible chromatin regions in bumblebee by ATAC-seq.
    Zhao X, Su L, Xu W, Schaack S, Sun C. Zhao X, et al. Sci Data. 2020 Oct 26;7(1):367. doi: 10.1038/s41597-020-00713-w. Sci Data. 2020. PMID: 33106500 Free PMC article.
  • The genomes of two key bumblebee species with primitive eusocial organization.
    Sadd BM, Barribeau SM, Bloch G, de Graaf DC, Dearden P, Elsik CG, Gadau J, Grimmelikhuijzen CJ, Hasselmann M, Lozier JD, Robertson HM, Smagghe G, Stolle E, Van Vaerenbergh M, Waterhouse RM, Bornberg-Bauer E, Klasberg S, Bennett AK, Câmara F, Guigó R, Hoff K, Mariotti M, Munoz-Torres M, Murphy T, Santesmasses D, Amdam GV, Beckers M, Beye M, Biewer M, Bitondi MM, Blaxter ML, Bourke AF, Brown MJ, Buechel SD, Cameron R, Cappelle K, Carolan JC, Christiaens O, Ciborowski KL, Clarke DF, Colgan TJ, Collins DH, Cridge AG, Dalmay T, Dreier S, du Plessis L, Duncan E, Erler S, Evans J, Falcon T, Flores K, Freitas FC, Fuchikawa T, Gempe T, Hartfelder K, Hauser F, Helbing S, Humann FC, Irvine F, Jermiin LS, Johnson CE, Johnson RM, Jones AK, Kadowaki T, Kidner JH, Koch V, Köhler A, Kraus FB, Lattorff HM, Leask M, Lockett GA, Mallon EB, Antonio DS, Marxer M, Meeus I, Moritz RF, Nair A, Näpflin K, Nissen I, Niu J, Nunes FM, Oakeshott JG, Osborne A, Otte M, Pinheiro DG, Rossié N, Rueppell O, Santos CG, Schmid-Hempel R, Schmitt BD, Schulte C, Simões ZL, Soares MP, Swevers L, Winnebeck EC, Wolschin F, Yu N, Zdobnov EM, Aqrawi PK, Blankenburg KP, Coyle M, Francisco L, Hernandez AG, Holder M, Hudson ME… See abstract for full author list ➔ Sadd BM, et al. Genome Biol. 2015 Apr 24;16(1):76. doi: 10.1186/s13059-015-0623-3. Genome Biol. 2015. PMID: 25908251 Free PMC article.
  • Analysis of a normalised expressed sequence tag (EST) library from a key pollinator, the bumblebee Bombus terrestris.
    Sadd BM, Kube M, Klages S, Reinhardt R, Schmid-Hempel P. Sadd BM, et al. BMC Genomics. 2010 Feb 15;11:110. doi: 10.1186/1471-2164-11-110. BMC Genomics. 2010. PMID: 20156341 Free PMC article.
  • Microsporidia: An Emerging Threat to Bumblebees?
    Brown MJF. Brown MJF. Trends Parasitol. 2017 Oct;33(10):754-762. doi: 10.1016/j.pt.2017.06.001. Epub 2017 Jun 26. Trends Parasitol. 2017. PMID: 28663099 Review.
  • Genome (in)stability at tandem repeats.
    Balzano E, Pelliccia F, Giunta S. Balzano E, et al. Semin Cell Dev Biol. 2021 May;113:97-112. doi: 10.1016/j.semcdb.2020.10.003. Epub 2020 Oct 24. Semin Cell Dev Biol. 2021. PMID: 33109442 Review.

Cited by

References

    1. Barrett RD, Schluter D.. 2008. Adaptation from standing genetic variation. Trends Ecol Evol. 23(1):38–44. - PubMed
    1. Benson G. 1999. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27(2):573–580. - PMC - PubMed
    1. Caburet S, Cocquet J, Vaiman D, Veitia RA.. 2005. Coding repeats and evolutionary “agility”. Bioessays 27(6):581–587. - PubMed
    1. Cameron SA, et al. 2011. Patterns of widespread decline in North American bumble bees. Proc Natl Acad Sci U S A. 108(2):662–667. - PMC - PubMed
    1. Cameron SA, Hines HM, Williams PH.. 2007. A comprehensive phylogeny of the bumble bees (Bombus). Biol J Linn Soc. 91(1):161–188.

Publication types