Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 25:13:867152.
doi: 10.3389/fgene.2022.867152. eCollection 2022.

Estimating Heritabilities and Breeding Values From Censored Phenotypes Using a Data Augmentation Approach

Affiliations

Estimating Heritabilities and Breeding Values From Censored Phenotypes Using a Data Augmentation Approach

Melissa A Stephen et al. Front Genet. .

Abstract

Time-dependent traits are often subject to censorship, where instead of precise phenotypes, only a lower and/or upper bound can be established for some of the individuals. Censorship reduces the precision of phenotypes but can represent compromise between measurement cost and animal ethics considerations. This compromise is particularly relevant for genetic evaluation because phenotyping initiatives often involve thousands of individuals. This research aimed to: 1) demonstrate a data augmentation approach for analysing censored phenotypes, and 2) quantify the implications of phenotype censorship on estimation of heritabilities and predictions of breeding values. First, we simulated uncensored phenotypes, representing fine-scale "age at puberty" for each individual in a population of some 5,000 animals across 50 herds. Analysis of these uncensored phenotypes provided a gold-standard control. We then produced seven "test" phenotypes by superimposing varying degrees of left, interval, and/or right censorship, as if herds were measured on only one, two or three occasions, with a binary measure categorized for animals at each visit (either pre or post pubertal). We demonstrated that our estimates of heritabilities and predictions of breeding values obtained using a data augmentation approach were remarkably robust to phenotype censorship. Our results have important practical implications for measuring time-dependent traits for genetic evaluation. More specifically, we suggest that data collection can be designed with relatively infrequent repeated measures, thereby reducing costs and increasing feasibility across large numbers of animals.

Keywords: MCMC; baysian; breeding; censored; data augmentation; gibbs sampling.

PubMed Disclaimer

Conflict of interest statement

Authors MS, CB, NS, and CP were employed by DairyNZ Limited. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
An example of interval censoring for “age at puberty”. If this animal was observed daily, it would be recorded as attaining puberty at 345 days old. However, if the herd was observed only three times (early, mid and late), when this animal was 300, 330 and 360 days old, respectively, its phenotype would fall within the bounds of 330–360 days.
FIGURE 2
FIGURE 2
An example of the Markov-chain Monte Carlo (MCMC) sampled phenotypes for a single animal with a left- (A) or interval- (B) censored phenotype. Left censoring occurs where an animal has attained puberty on or before the first herd visit (i.e., sampled phenotypes are truncated by the age of the animal on the first visit). Interval censoring occurs when an animal reaches puberty between two visits (i.e., sampled phenotypes are truncated by the age of the animal on the flanking visits.

Similar articles

Cited by

References

    1. Bezanson J., Edelman A., Karpinski S., Shah V. B. (2017). Julia: A Fresh Approach to Numerical Computing. SIAM Rev. 59, 65–98. 10.1137/141000671 - DOI
    1. Cheng H., Fernando R., Garrick D. (2018). JWAS: Julia Implementation of Whole-Genome Analysis Software. in Proceedings of the world congress on genetics applied to livestock production, Auckland, New Zealand, 859.
    1. Cheng H., Garrick D., Fernando R. (2015). Xsim: Simulation of Descendants from Ancestors with Sequence Data. G3 Genes, Genomes, Genet. 5, 1415–1417. 10.1534/g3.115.016683 - DOI - PMC - PubMed
    1. Dennis N. A., Stachowicz K., Visser B., Hely F. S., Berg D. K., Friggens N. C., et al. (2018). Combining Genetic and Physiological Data to Identify Predictors of Lifetime Reproductive Success and the Effect of Selection on These Predictors on Underlying Fertility Traits. J. Dairy Sci. 101, 3176–3192. 10.3168/jds.2017-13355 - DOI - PubMed
    1. Donoghue K. A., Rekaya R., Bertrand J. K. (2004). Comparison of Methods for Handling Censored Records in Beef Fertility Data: Simulation Study1. J. Anim. Sci. 82, 351–356. 10.2527/2004.822351x - DOI - PubMed

LinkOut - more resources