Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan 6:14:giaf003.
doi: 10.1093/gigascience/giaf003.

Enhancing inbreeding estimation and global conservation insights through chromosome-level assemblies of the Chinese and Malayan pangolin

Affiliations

Enhancing inbreeding estimation and global conservation insights through chromosome-level assemblies of the Chinese and Malayan pangolin

Tianming Lan et al. Gigascience. .

Abstract

A high-quality reference genome coupled with resequencing data is a promising strategy to address issues in conservation genomics. This has greatly enhanced the development of conservation plans for endangered species. Pangolins are fascinating animals with a variety of unique features. Unfortunately, they are the most trafficked wild animal in the world. In this study, we assembled a chromosome-scale genome with HiFi long reads and Hi-C short reads for the Chinese and Malayan pangolin and provided two new representative reference genomes for the pangolin species. We found a great improvement in the evaluation of genetic diversity and inbreeding based on these high-quality genomes and obtained different results for the detection of genome-wide extinction risks compared with genomes assembled using short reads. Moderate inbreeding and genetic diversity were reverified in these two pangolin species, except for one Malayan pangolin population with high inbreeding and low genetic diversity. Moreover, we identified a much higher inbreeding level (FROH = 0.54) in the Chinese pangolin individual from Taiwan Province compared with that from Mainland China, but more than 99.6% runs of homozygosity (ROH) fragments were restricted to less than 1 Mb, indicating that the high FROH in Taiwan Chinese pangolins may have accumulated from historical inbreeding events. Furthermore, our study is the first to detect relatively mild genetic purging in pangolin populations. These two high-quality reference genomes will provide valuable genetic resources for future studies and contribute to the protection and conservation of pangolins.

Keywords: Chinese pangolin; Malayan pangolin; conservation genomics; genetic purging; inbreeding.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1:
Figure 1:
Introduction to the species distribution and chromosome synteny of the Chinese and Malayan pangolins. (A) The distribution area and sampling sites of the Chinese and Malayan pangolins in this study. The circles represent sampling sites of the Chinese pangolins reported by Wang et al. [27]. Samples without detailed locations are not shown on the map. (B) The chromosome-scale synteny analysis between the Malayan pangolin and Chinese pangolin genomes.
Figure 2:
Figure 2:
Comparison of the genome-wide genetic diversity and inbreeding estimated based on the LG (long-read HiFi assembled genome) and SG (short-read assembled genome). (A) Comparison of genome-wide π calculated based on the LG and SG in the Chinese and Malayan pangolin genomes. (B) Comparison of FROH calculated based on LG and SG in the Malayan pangolin genomes. (C) Comparison of FROH calculated based on LG and SG in the Chinese pangolin genomes. NS: P ≥ 0.5, *P < 0.05, **P < 0.01, ***P < 0.001.
Figure 3:
Figure 3:
Genome-wide inbreeding estimated by ROH in the Chinese and Malayan pangolin populations. (A) The length distribution of ROH across the genome in the Chinese pangolin population. (B) The length distribution of ROH across the genome in the Malayan pangolin population. (C) Genome-wide heterozygosity and inbreeding estimates (FROH) for all five pangolin populations. (D) Comparison of the averaged FROH in the MjavA and MjavB populations of the Malayan pangolins. (E) Comparison of the averaged FROH in the MpenA, MpenB, and MpenC populations of Chinese pangolins.
Figure 4:
Figure 4:
Mutational load in the Chinese and Malayan pangolin populations. (A) Total number of individual-level LoF mutations across the Chinese pangolin populations. (B) Total number of individual-level LoF mutations across the Malayan pangolin populations. (C) The ratio of homozygous LoF mutations in the Chinese pangolin populations was calculated by the following formula: 2 × homozygous sites/(2 × homozygous sites + heterozygous site). (D) The ratio of homozygous LoF mutations in the Malayan pangolin populations was calculated using the same formula as that for the Chinese pangolin. (E) Relative mutational load in the Chinese pangolin populations (top 0.1% of GERP scores). (F) Relative mutational load in the Malayan pangolin populations (top 0.1% of GERP scores). The LoF here means loss-of-function mutations.
Figure 5:
Figure 5:
The SFS and genetic signals of genetic purging in pangolin populations. (A) SFS for putatively damaging (LoF and missense mutations) and neutral mutations (intergenic variants) in the MpenA, MpenB, and MpenC populations. (B) SFS for putatively damaging and neutral mutations in the MjavA and MjavB populations. (C) Dot plot showing the occurrence of LoF mutations in the two Malayan pangolin populations calculated as the ratio of the number of the mutational load to synonymous mutations in the ROH regions (ROHf) or non-ROH (non-ROHf) regions across the genome. Each point signifies the LoF frequency of an individual. The large dots represent the average LoF frequency for the population, while the lines indicate the standard deviation range around the mean. (D) The ratio of ROHf to non-ROHf for the LoF in the two Malayan pangolin populations. The box represents the interquartile range (IQR), stretching from the first quartile (Q1) to the third quartile (Q3). The line that bisects the box indicates the median value. The whiskers extended from the box to show the variability of the data, typically reaching to the minimum and maximum values that fall within 1.5 times of the IQR from Q1 and Q3, respectively. (E) Dot plot showing the occurrence of LoF mutations in the three Chinese pangolin populations calculated as that in (C). (F) The ratio of ROHf to non-ROHf for the LoF in the three Chinese pangolin populations.

References

    1. Formenti G, Theissinger K, Fernandes C, et al. The era of reference genomes in conservation genomics. Trends Ecol Evol. 2022;37(3):197–202.. 10.1016/j.tree.2021.11.008. - DOI - PubMed
    1. Kitts PA, Church DM, Thibaud-Nissen F, et al. Assembly: a resource for assembled genomes at NCBI. Nucleic Acids Res. 2016;44(D1):D73–80.. 10.1093/nar/gkv1226. - DOI - PMC - PubMed
    1. Ouborg NJ, Pertoldi C, Loeschcke V, et al. Conservation genetics in transition to conservation genomics. Trends Genet. 2010;26(4):177–87.. 10.1016/j.tig.2010.01.001. - DOI - PubMed
    1. Brandies P, Peel E, Hogg CJ, et al. The value of reference genomes in the conservation of threatened species. Genes. 2019;10(11):846. 10.3390/genes10110846. - DOI - PMC - PubMed
    1. Guang X, Lan T, Wan Q-H, et al. Chromosome-scale genomes provide new insights into subspecies divergence and evolutionary characteristics of the giant panda. Sci Bull. 2021;66(19):2002–13.. 10.1016/j.scib.2021.02.002. - DOI - PubMed

LinkOut - more resources