Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 7;16(1):21.
doi: 10.1186/s40104-024-01133-1.

Advancing the Indian cattle pangenome: characterizing non-reference sequences in Bos indicus

Affiliations

Advancing the Indian cattle pangenome: characterizing non-reference sequences in Bos indicus

Sarwar Azam et al. J Anim Sci Biotechnol. .

Abstract

Background: India harbors the world's largest cattle population, encompassing over 50 distinct Bos indicus breeds. This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle. To comprehensively characterize the genomic variation within Bos indicus and, specifically, dairy breeds, we aim to identify non-reference sequences and construct a comprehensive pangenome.

Results: Five representative genomes of prominent dairy breeds, including Gir, Kankrej, Tharparkar, Sahiwal, and Red Sindhi, were sequenced using 10X Genomics 'linked-read' technology. Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb, comparable to the Bos indicus Brahman reference genome. A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods, revealing 8 Mb and 17.7 Mb of novel sequence respectively. A confident set of 6,844 Non-reference Unique Insertions (NUIs) spanning 7.57 Mb was identified through both methods, representing the pangenome of Indian Bos indicus breeds. Comparative analysis with previously published pangenomes unveiled 2.8 Mb (37%) commonality with the Chinese indicine pangenome and only 1% commonality with the Bos taurus pangenome. Among these, 2,312 NUIs encompassing ~ 2 Mb, were commonly found in 98 samples of the 5 breeds and designated as Bos indicus Common Insertions (BICIs) in the population. Furthermore, 926 BICIs were identified within 682 protein-coding genes, 54 long non-coding RNAs (lncRNA), and 18 pseudogenes. These protein-coding genes were enriched for functions such as chemical synaptic transmission, cell junction organization, cell-cell adhesion, and cell morphogenesis. The protein-coding genes were found in various prominent quantitative trait locus (QTL) regions, suggesting potential roles of BICIs in traits related to milk production, reproduction, exterior, health, meat, and carcass. Notably, 63.21% of the bases within the BICIs call set contained interspersed repeats, predominantly Long Interspersed Nuclear Elements (LINEs). Additionally, 70.28% of BICIs are shared with other domesticated and wild species, highlighting their evolutionary significance.

Conclusions: This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India. The analyses contribute valuable insights into the genomic landscape of desi cattle breeds.

Keywords: Bos indicus; BICIs; Cattle; Genome assembly; Linked-reads; NUIs; Pangenome.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Cattle samples were obtained in accordance with the guidelines set forth by the Committee for the Purpose of Control and Supervision on Experiments on Animals (CPCSEA), India with the Institutional Animal Ethics Committee (IAEC) approval. Consent for publication: Not applicable. Competing interests: The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Identification of Non-reference Unique Insertions (NUIs) in Bos indicus. The flowchart illustrates the systematic process for identifying the final set of NUIs. The diagram outlines the sequential steps involved in the selection and refinement of NUIs
Fig. 2
Fig. 2
Overview of Non-reference Unique Insertion (NUI) final set and their distribution. A Circos plot of Bos indicus pangenome. From the outer to inner track: Chromosome, Gene track, Red Sindhi, Sahiwal, Tharparkar, Kankrej, Gir. B Size distribution of NUIs with bin size of 500 bp and 50 bp in zoomed area
Fig. 3
Fig. 3
Overview of Bos indicus Common Insertions (BICIs) and their distribution patterns. A This ideogram depicts BICIs occurrences across Brahman genome chromosomes. Pink histograms above each chromosome illustrate BICI density using a 100 kb window size. The black line within chromosomes represents gene distribution. B The plot illustrates the size distribution of BICIs with a bin size of 500 bp, providing an overview and 50 bp in the zoomed area. C The first two principal components are based on the BICI occurrence matrix
Fig. 4
Fig. 4
Distribution of transposable elements on Bos indicus Common Insertions (BICIs). A Stacked bars represent the total number of BICIs split by three different size ranges. The major transposable elements (TEs) are categorized as SINE (Short Interspersed Nuclear Element), LTR (Long Terminal Repeat), LINE (Long Interspersed Nuclear Element), DNA (DNA Transposon), and NONE (No Interspersed Repeat Detected). “Other TEs” encompasses various minor classes. B Bar plot showing the number of TEs flanking and crossing BICIs
Fig. 5
Fig. 5
Characterization of Bos indicus Common Insertions (BICIs) in the transcriptome. A Stacked bars depict the distribution of BICIs across genic and non-genic regions of the genome. B Gene Ontology (GO) annotation analysis for the biological processes associated with BICIs
Fig. 6
Fig. 6
Trait enrichment analysis of BICIs in known cattle QTL regions. A Percentage of QTL type (pie chart) associated with BICIs. B Top 10 enriched QTL traits (bar plots) associated with BICIs
Fig. 7
Fig. 7
Evolutionary analysis of Bos indicus Common Insertions (BICIs). A Venn diagram illustrating the number of BICIs shared within the Bos genus. The overlapping regions reveal the extent of shared BICIs among species within the Bos genus. B Venn diagram showcasing the number of BICIs shared within the Bovidae family. The intersecting areas depict the shared BICIs among species within the broader Bovidae family

References

    1. Taye M, Lee W, Jeon S, Yoon J, Dessie T, Hanotte O, et al. Exploring evidence of positive selection signatures in cattle breeds selected for different traits. Mamm Genome. 2017;28:528–41. - PubMed
    1. Gilbert M, Nicolas G, Cinardi G, Van Boeckel TP, Vanwambeke SO, Wint GRW, et al. Global distribution data for cattle, buffaloes, horses, sheep, goats, pigs, chickens and ducks in 2010. Sci Data. 2018;5:180227. - PMC - PubMed
    1. Pitt D, Sevane N, Nicolazzi EL, MacHugh DE, Park SDE, Colli L, et al. Domestication of cattle: two or three events? Evol Appl. 2019;12:123–36. - PMC - PubMed
    1. Ajmone-Marsan P, Garcia JF, Lenstra JA. On the origin of cattle: how aurochs became cattle and colonized the world. Evol Anthropol. 2010;19:148–57.
    1. Loftus RT, MacHugh DE, Bradley DG, Sharp PM, Cunningham P. Evidence for two independent domestications of cattle. Proc Natl Acad Sci U S A. 1994;91:2757–61. - PMC - PubMed

LinkOut - more resources