Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan 6:14:giae116.
doi: 10.1093/gigascience/giae116.

A chromosome-level, haplotype-resolved genome assembly and annotation for the Eurasian minnow (Leuciscidae: Phoxinus phoxinus) provide evidence of haplotype diversity

Affiliations

A chromosome-level, haplotype-resolved genome assembly and annotation for the Eurasian minnow (Leuciscidae: Phoxinus phoxinus) provide evidence of haplotype diversity

Temitope Opeyemi Oriowo et al. Gigascience. .

Abstract

Background: In this study, we present an in-depth analysis of the Eurasian minnow (Phoxinus phoxinus) genome, highlighting its genetic diversity, structural variations, and evolutionary adaptations. We generated an annotated haplotype-phased, chromosome-level genome assembly (2n = 50) by integrating high-fidelity (HiFi) long reads and chromosome conformation capture data (Hi-C).

Results: We achieved a haploid size of 940 megabase pairs (Mbp) for haplome 1 and 929 Mbp for haplome 2 with high scaffold N50 values of 36.4 Mb and 36.6 Mb and BUSCO scores of 96.9% and 97.2%, respectively, indicating a highly complete genome assembly. We detected notable heterozygosity (1.43%) and a high repeat content (approximately 54%), primarily consisting of DNA transposons, which contribute to genome rearrangements and variations. We found substantial structural variations within the genome, including insertions, deletions, inversions, and translocations. These variations affect genes enriched in functions such as dephosphorylation, developmental pigmentation, phagocytosis, immunity, and stress response. In the annotation of protein-coding genes, 30,980 messenger RNAs and 23,497 protein-coding genes were identified with a high completeness score, which further underpins the high contiguity of our genome assemblies. We performed a gene family evolution analysis by comparing our proteome to 10 other teleost species, which identified immune system gene families that prioritize histone-based disease prevention over NB-LRR-related-based immune responses. Additionally, demographic analysis indicates historical fluctuations in the effective population size of P. phoxinus, likely correlating with past climatic changes.

Conclusions: This annotated, phased reference genome provides a crucial resource for resolving the taxonomic complexity within the genus Phoxinus and highlights the importance of haplotype-phased assemblies in understanding haplotype diversity in species characterized by high heterozygosity.

Keywords: Phoxinus; annotation; comparative genomics; haplotype-phased genome assembly; immune system; structural variants; transposable elements.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Graphical Abstract
Graphical Abstract
Figure 1:
Figure 1:
K-mer spectra profile of Phoxinus phoxinus generated from raw PacBio HiFi reads with GenomeScope2. The y-axis shows the k-mer counts and the x-axis shows sequencing depth. The clear bimodal pattern observed indicates a diploid genome with high heterozygosity of 1.43%. The haploid genome size was estimated to be around 805 Mb.
Figure 2:
Figure 2:
Repeat landscape of transposable elements (TEs) in the Phoxinus phoxinus genome across different Kimura substitution levels (in %). TEs on the left side of the histogram represent recently active TEs with low divergence from the consensus sequence of TEs, while TEs toward the right side of the histogram represent ancient TEs with higher degrees of divergence.
Figure 3:
Figure 3:
Distribution of heterozygosity along the 25 chromosomes in P. phoxinus shown for Hap1 (upper plot) and Hap2 (lower plot). Mean heterozygosity estimates were calculated in 1-Mb bins and converted to the number of heterozygous sites per kb across all 25 chromosomes with alternating colors to differentiate adjacent chromosomes.
Figure 4:
Figure 4:
Inferred demographic history of P. phoxinus from PSMC analysis based on Hap1. The time ranges cover the Miocene, Pliocene, Early to Late Pleistocene, and Holocene, each shown in different colors. The x-axis shows years before present on a logarithmic scale, and the y-axis shows the estimated effective population size. Bootstrap results are shown as transparent lines. Hapl2 showed a similar curve.
Figure 5:
Figure 5:
Synteny plot of all 25 chromosomes of the P. phoxinus Hap1 (top, blue line) and Hap2 (bottom, orange line) of assemblies. Aligned regions between both haplomes are shown in blocks of gray, and unaligned regions are shown in gaps of white. Inversions are represented in orange, translocations in green, and duplications in light blue.
Figure 6:
Figure 6:
GO network of 54 functionally annotated P. phoxinus–specific genes. Genes cluster in 8 major GO terms: “DNA replication,” “DNA repair,” “regulation of DNA-templated transcription,” “response to oxidative stress,” “defense response to virus,” “cellular response to stimulus,” “regulation of response to reactive oxygen species,” and “negative regulation of TORC1 signalling.” Bubble size indicates the frequency of the GO term in the Gene Ontology annotation database (i.e., larger bubbles correspond to more general terms). Edges on the graph connect highly similar or linked GO terms to each other.
Figure 7:
Figure 7:
Ultrametric phylogenetic tree of selected teleost species and P. phoxinus. Numbers on the branches represent counts for expanded (+) and contracted (–) gene families. Positive numbers indicate significant gene families that have been expanded since the split from the last common ancestor and negative numbers indicate significant gene families that have been contracted since the split from the last common ancestor.

Similar articles

Cited by

References

    1. Kottelat M, Freyhof J. Handbook of European freshwater fishes. Ichthyol Res. 2008;55:99. 10.1007/s10228-007-0012-3. - DOI
    1. Kottelat M. Three new species of Phoxinus from Greece and southern France (Teleostei: cyprinidae). Ichthyol Explor Freshw. 2007;18(2):145–62.
    1. Palandačić A, Bravničar J, Zupančič P, et al. Molecular data suggest a multispecies complex of phoxinus (Cyprinidae) in the Western Balkan Peninsula. Mol Phylogenet Evol. 2015;92:118–23. 10.1016/j.ympev.2015.05.024. - DOI - PubMed
    1. Palandačić A, Naseka A, Ramler D, et al. Contrasting morphology with molecular data: an approach to revision of species complexes based on the example of European Phoxinus (Cyprinidae). BMC Evol Biol. 2017;17:184. 10.1186/s12862-017-1032-x. - DOI - PMC - PubMed
    1. Palandačić A, Kruckenhauser L, Ahnelt H, et al. European minnows through time: museum collections aid genetic assessment of species introductions in freshwater fishes (Cyprinidae: phoxinus species complex). Heredity. 2020;124:410–22. 10.1038/s41437-019-0292-1. - DOI - PMC - PubMed

LinkOut - more resources