Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Oct 31;42(10):113178.
doi: 10.1016/j.celrep.2023.113178. Epub 2023 Sep 23.

Variation in the CENP-A sequence association landscape across diverse inbred mouse strains

Affiliations

Variation in the CENP-A sequence association landscape across diverse inbred mouse strains

Uma P Arora et al. Cell Rep. .

Abstract

Centromeres are crucial for chromosome segregation, but their underlying sequences evolve rapidly, imposing strong selection for compensatory changes in centromere-associated kinetochore proteins to assure the stability of genome transmission. While this co-evolution is well documented between species, it remains unknown whether population-level centromere diversity leads to functional differences in kinetochore protein association. Mice (Mus musculus) exhibit remarkable variation in centromere size and sequence, but the amino acid sequence of the kinetochore protein CENP-A is conserved. Here, we apply k-mer-based analyses to CENP-A chromatin profiling data from diverse inbred mouse strains to investigate the interplay between centromere variation and kinetochore protein sequence association. We show that centromere sequence diversity is associated with strain-level differences in both CENP-A positioning and sequence preference along the mouse core centromere satellite. Our findings reveal intraspecies sequence-dependent differences in CENP-A/centromere association and open additional perspectives for understanding centromere-mediated variation in genome stability.

Keywords: CENP-A association; CENP-A positioning; CP: Genomics; Mus musculus; centromere evolution; centromere transcription; centromere variation; genome instability; kinetochore; meiotic drive.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests The authors declare no competing interests.

Figures

Figure 1.
Figure 1.. Consensus-based alignment analysis of strain differences in CENP-A enrichment at centromeres
(A) Experimental design of ChIP-seq and the consensus-guided genomic analyses for CENP-A ChIP sequence enrichment. (B) Boxplots representing the enrichment of reads that mapped to either the major satellite (left) or minor satellite (right) consensus sequences in CENP-A ChIP relative to input samples. Color represents sample identity (AB, antibody). For each boxplot, the horizontal line represents the median, and the vertical line represents the range of values across replicates. (C) Heatmap showing the enrichment of the consensus nucleotide along the minor satellite consensus sequence in CENP-A ChIP compared with input samples from each strain.
Figure 2.
Figure 2.. Strain differences in CENP-A positioning along the minor satellite consensus
(A) Schematic depicting the difference in CENP-A and H3 nucleosome coverage between 76-bp single-end reads and trimmed reads. (B) Line plot representing the CENP-A ChIP/input enrichment of read coverage (y axis) at each position along the minor satellite consensus sequence (x axis). Solid (dashed) lines correspond to coverage values calculated using untrimmed (trimmed) reads. The 17-bp CENP-B box motif is marked in gray. (C) Boxplots representing the percent enrichment of CENP-B box motif frequency in ChIP/input samples. For each boxplot, the horizontal line represents the median, and the vertical line represents the range of values. (D) Heatmap presenting pairwise strain Pearson correlation coefficients for the average CENP-A ChIP/input enrichment pattern along the minor satellite consensus sequence. All Pearson correlation coefficients are significant (p < 0.05).
Figure 3.
Figure 3.. Shared and unique CENP-A-enriched k-mers across inbred mouse strains
(A) Upset plot representing the extent of k-mer sharing among the top 0.1% most enriched CENP-A-associated k-mers in each strain. Bar height corresponds to the total number of k-mers in each strain set. Each strain set of CENP-A-enriched k-mers is represented by the horizontal bars. The vertical bars represent the number of k-mers that belong to one or more strains, indicated by the dots below. p value was calculated by comparing the observed number of shared k-mers with the distribution of 100,000 bootstrap samples of the data. (B) Bar plots showing the number of CENP-A-enriched 31-mers from each strain with a given number of mismatches (NM) relative to the minor satellite consensus sequence. (C) Line plot depicting where CENP-A-enriched 31-mers map along the minor satellite consensus sequence. The y axis represents the number of 31-mers at a particular position along the minor satellite consensus sequence (x axis). Strains are indicated by line color.
Figure 4.
Figure 4.. CENP-A-associated sequences identified by a reference-agnostic, k-mer-based approach form distinct subgroups
Neighbor joining tree constructed from each strain’s top 1,000 CENP-A-associated sequences. Sequences cluster into seven groups. Strain-level contributions to each group are depicted by bar plots. Groups with skewed strain representation are indicated by red Chi-square p values (groups 1, 3, 4, 7). For clades with biased strain representation, Bonferroni post hoc tests were used to identify strains driving the significant signal.
Figure 5.
Figure 5.. Relationship between CENP-A-associated sequences across strains
Principal component analysis based on the stylistic similarity of CENP-A-associated sequences in each strain.
Figure 6.
Figure 6.. Transcription factor occupancy at centromere satellite DNA
(A) Association of transcription factors at centromere satellite DNA determined from publicly available ChIP-seq datasets. The y axis represents the enrichment of reads that map to the minor satellite consensus sequence in ChIP/input samples for each transcription factor (x axis). Each dot represents the average value of an experiment. The red horizontal line corresponds to ChIP/input value of 1. (B) The frequency of multiple TF motifs among CENP-A-enriched sequences across diverse mouse strains.

Similar articles

Cited by

References

    1. McKinley KL, and Cheeseman IM (2016). The molecular basis for centromere identity and function. Nat. Rev. Mol. Cell Biol. 17, 16–29. 10.1038/nrm.2015.5. - DOI - PMC - PubMed
    1. Bakhoum SF, Thompson SL, Manning AL, and Compton DA (2009). Genome stability is ensured by temporal control of kinetochore-microtubule dynamics. Nat. Cell Biol. 11, 27–35. 10.1038/ncb1809. - DOI - PMC - PubMed
    1. Schalch T, and Steiner FA (2017). Structure of centromere chromatin: from nucleosome to chromosomal architecture. Chromosoma 126, 443–455. 10.1007/s00412-016-0620-7. - DOI - PMC - PubMed
    1. Fukagawa T, and Earnshaw WC (2014). The centromere: chromatin foundation for the kinetochore machinery. Dev. Cell 30, 496–508. 10.1016/j.devcel.2014.08.016. - DOI - PMC - PubMed
    1. Giunta S, and Funabiki H (2017). Integrity of the human centromere DNA repeats is protected by CENP-A, CENP-C, and CENP-T. Proc. Natl. Acad. Sci. USA 114, 1928–1933. 10.1073/pnas.1615133114. - DOI - PMC - PubMed

Publication types