Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 1;12(1):1086.
doi: 10.1038/s41597-025-05448-0.

Whole genome sequences of 297 Duolang sheep for litter size

Affiliations

Whole genome sequences of 297 Duolang sheep for litter size

Chao Fang et al. Sci Data. .

Abstract

Litter size is a critical economic trait in the sheep industry. Like many other breeds, Duolang sheep typically produce one lamb per ewe per lambing. To date, genetic studies of this trait in that breed have largely relied on candidate gene approaches. To expand the genomic resources for this breed, we sequenced 297 genomes, generating approximately 10.52 trillion bases with an average coverage of 13.35X. High-quality alignments with a mapping rate exceeding 99% enabled the identification of 43,968,128 SNPs and 6,504,047 InDels. This dataset provides a valuable resource for identifying genetic variants associated with litter size through genome-wide association studies (GWAS) and lays the foundation for future genetic improvement efforts through genomic selection in Duolang sheep. Beyond trait mapping, the dataset also supports broader applications, including analyses of genetic diversity, phylogenetic relationships, population history, adaptive introgression, and breed-specific characteristics. Additionally, the moderate-coverage WGS data are suitable for structural variant (SV) detection and downstream analyses such as association mapping and the identification of SVs underlying phenotypic traits.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Overview of the experimental and analytical workflow. This figure illustrates the full pipeline used in the study, including sample collection, whole-genome sequencing, read trimming, variant calling, and filtering steps. Sequencing reads were processed following standard GATK best practices for variant discovery, with the exception of applying more stringent filtering parameters in the VariantFiltration module to reduce false positives. This workflow ensures high-quality variant calls and provides a reproducible framework for downstream genomic analyses.
Fig. 2
Fig. 2
Quality assessment of sequencing reads across 302 Duolang sheep. This figure presents density plots summarizing key quality metrics of the sequencing data. (a) Total number of bases generated per individual; (b) Total number of reads per individual; (c) GC content distribution across individuals; (d) Proportion of reads with a mean base quality score >20; and (e) Proportion of reads with a mean base quality score >30. These results demonstrate overall high sequencing quality and consistency across samples, supporting the reliability of downstream variant detection and analysis.
Fig. 3
Fig. 3
Alignment quality metrics across 297 Duolang ewes. This figure shows density plots summarizing key alignment statistics following read mapping to the reference genome. (a) Mapping ratio, indicating the proportion of reads successfully aligned to the genome; (b) Mean sequencing depth per individual, reflecting the average number of reads covering each genomic position; and (c) Average mapping quality score, representing the confidence in alignment accuracy. The consistently high mapping metrics across samples demonstrate reliable data quality suitable for downstream variant analysis.
Fig. 4
Fig. 4
Functional annotation and chromosomal distribution of detected variants. (a) Functional classification of SNPs and InDels based on gene structure, showing the proportion of variants located in intergenic, intronic, exonic, and other genomic regions. (b) Density plot illustrating the genomic distribution of variants across autosomes and the X chromosome. The variants are generally evenly distributed, with minor deviations observed on specific chromosomes, consistent with known genomic features in sheep.

Similar articles

Cited by

References

    1. Wolfová, M., Wolf, J., Krupová, Z. & Margetín, M. Estimation of economic values for traits of dairy sheep: II. Model application to a production system with one lambing per year. Journal of dairy science92, 2195–2203 (2009). - PubMed
    1. Wolfová, M., Wolf, J. & Milerski, M. Calculating economic values for growth and functional traits in non‐dairy sheep. Journal of Animal Breeding and Genetetics126, 480–491 (2009). - PubMed
    1. Teletchea, F. in Animal Domestication Ch. Chapter 1 A Brief Overview, 137–159 (Intechopen, 2019).
    1. Souza, C., MacDougall, C., Campbell, B., McNeilly, A. & Baird, D. The Booroola (FecB) phenotype is associated with a mutation in the bone morphogenetic receptor type 1 B (BMPR1B) gene. Journal of Endocrinology169, R1 (2001). - PubMed
    1. Hanrahan, J. P. et al. Mutations in the genes for oocyte-derived growth factors GDF9 and BMP15 are associated with both increased ovulation rate and sterility in Cambridge and Belclare sheep (Ovis aries). Biology of reproduction70, 900–909 (2004). - PubMed