Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 11;57(1):35.
doi: 10.1186/s12711-025-00985-z.

Accurate determination of breed origin of alleles in a simulated smallholder crossbred dairy cattle population

Affiliations

Accurate determination of breed origin of alleles in a simulated smallholder crossbred dairy cattle population

Berihu Welderufael et al. Genet Sel Evol. .

Abstract

Background: Accurate assignment of breed origin of alleles (BOA) at a heterozygote locus may help to introduce a resilient or adaptive haplotype in crossbreeding. In this study, we developed and tested a method to assign breed of origin for individual alleles in crossbred dairy cattle. After generations of mating within and between local breeds as well as the importation of exotic bulls, five rounds of selected crossbred cows were simulated to mimic a dairy breeding program in the low- and middle-income countries (LMICs). In each round of selection, the alleles of those crossbred animals were phased and assigned to their breed of origin (being either local or exotic).

Results: Across all core lengths and modes of phasing (with offset-move 50% of the core length forward or no-offset), the average percentage of alleles correctly assigned a breed origin was 95.76%, with only 1.39% incorrectly assigned and 2.85% missing or unassigned. On consensus, the average percentage of alleles correctly assigned a breed origin was 93.21%, with only 0.46% incorrectly assigned and 6.33% missing or unassigned. This high proportion of alleles correctly assigned a breed origin resulted in a high core-based mean accuracy of 0.99 and a very high consensus-based (most frequently observed assignment across all the scenarios) mean accuracy of 1.00. The algorithm's assignment yield and accuracy were affected by the choice of threshold levels for the best match of assignments. The threshold level had the opposite effect on assignment yield and assignment accuracy. A less stringent threshold generated higher assignment yields and lower assignment accuracy.

Conclusions: We developed an algorithm that accurately assigns a breed origin to alleles of crossbred animals designed to represent breeding programs in the LMICs. The developed algorithm is straightforward in its application and does not require prior knowledge of pedigree, which makes it more relevant and applicable in LMICs breeding programs.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: RCG and JMH are now employed by Bayer Crop Science.

Figures

Fig. 1
Fig. 1
Schematic representation of the simulated breeding program. A founder population on the top of the figure is split into exotic and local breeds
Fig. 2
Fig. 2
Illustration of a core and offset. Phasing was performed in two modes: either using the whole length of a core or by moving it forward 50% of the core length (offset) to define the beginning of a given core
Fig. 3
Fig. 3
Phasing and assignment algorithm overview. a Phasing was performed using AlphaPhase, b Hypothetical haplotype libraries based on a core length of ten SNPs. To assign origin to the haplotype of an individual (bottom genotype sequence), the algorithm searches for the best match in each position in the exotic (top left genotype sequence) and local (top right genotype sequence) haplotypes. In this case, the individual’s haplotype should be assigned as a local haplotype because the local haplotype library contains the haplotype with the fewest number of errors, i.e., mismatches (red)
Fig. 4
Fig. 4
Plot of principal component analysis of SNP genotypes (PC1 vs. PC2 and PC1 vs. PC3). a Genetic data structure of the founders and the first crossbred cows, b of all animals across generations
Fig. 5
Fig. 5
Effect of core length on assignment yield. a Phasing yield was very high for all core lengths but more concise for longer core lengths (core length of 220–280 SNPs), b Assignment yield was optimal for a core length of 200 SNPs
Fig. 6
Fig. 6
Percentage of allele assignment yield and accuracy of assignment. a Yield using the consensus-based allele assignment algorithm as a function of threshold level, b accuracy using the consensus-based allele assignment algorithm as a function of threshold level
Fig. 7
Fig. 7
Percentage of allele assignment yield and accuracy of assignment. a Yield using the consensus-based allele assignment algorithm as a function of crossbreeding (admixture) level, b accuracy using the consensus-based allele assignment algorithm as a function of crossbreeding (admixture) level

References

    1. Leroy G, Baumung R, Boettcher P, Scherf B, Hoffmann I. Review: sustainability of crossbreeding in developing countries; definitely not like crossing a meadow. Animal. 2016;10:262–73. - PubMed
    1. Marshall K, Gibson JP, Mwai O, Mwacharo JM, Haile A, Getachew T, et al. Livestock genomics for developing countries—African examples in practice. Front Genet. 2019;10:297. - PMC - PubMed
    1. Powell O, Mrode R, Gaynor RC, Johnsson M, Gorjanc G, Hickey JM. Genomic evaluations using data recorded on smallholder dairy farms in low- to middle-income countries. JDSC. 2021;2:366–70. - PMC - PubMed
    1. VanRaden PM, Cooper TA. Genomic evaluations and breed composition for crossbred US dairy cattle. Interbull Bull. 2015;49:19–23.
    1. Kuehn LA, Keele JW, Bennett GL, McDaneld TG, Smith TPL, Snelling WM, et al. Predicting breed composition using breed frequencies of 50,000 markers from the US meat animal research center 2,000 bull project. J Anim Sci. 2011;89:1742–50. - PubMed

LinkOut - more resources