Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 24:12:707754.
doi: 10.3389/fgene.2021.707754. eCollection 2021.

Harnessing Genetic Diversity in the USDA Pea Germplasm Collection Through Genomic Prediction

Affiliations

Harnessing Genetic Diversity in the USDA Pea Germplasm Collection Through Genomic Prediction

Md Abdullah Al Bari et al. Front Genet. .

Abstract

Phenotypic evaluation and efficient utilization of germplasm collections can be time-intensive, laborious, and expensive. However, with the plummeting costs of next-generation sequencing and the addition of genomic selection to the plant breeder's toolbox, we now can more efficiently tap the genetic diversity within large germplasm collections. In this study, we applied and evaluated genomic prediction's potential to a set of 482 pea (Pisum sativum L.) accessions-genotyped with 30,600 single nucleotide polymorphic (SNP) markers and phenotyped for seed yield and yield-related components-for enhancing selection of accessions from the USDA Pea Germplasm Collection. Genomic prediction models and several factors affecting predictive ability were evaluated in a series of cross-validation schemes across complex traits. Different genomic prediction models gave similar results, with predictive ability across traits ranging from 0.23 to 0.60, with no model working best across all traits. Increasing the training population size improved the predictive ability of most traits, including seed yield. Predictive abilities increased and reached a plateau with increasing number of markers presumably due to extensive linkage disequilibrium in the pea genome. Accounting for population structure effects did not significantly boost predictive ability, but we observed a slight improvement in seed yield. By applying the best genomic prediction model (e.g., RR-BLUP), we then examined the distribution of genotyped but nonphenotyped accessions and the reliability of genomic estimated breeding values (GEBV). The distribution of GEBV suggested that none of the nonphenotyped accessions were expected to perform outside the range of the phenotyped accessions. Desirable breeding values with higher reliability can be used to identify and screen favorable germplasm accessions. Expanding the training set and incorporating additional orthogonal information (e.g., transcriptomics, metabolomics, physiological traits, etc.) into the genomic prediction framework can enhance prediction accuracy.

Keywords: genomic prediction; genomic selection; germplasm accessions; next-generation sequencing; pea (Pisum sativum L); reliability criteria.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Predictive ability with increasing training population size using the RR-BLUP model, DFF is days to first flowering, DM, is days to physiological maturity, NoSeedsPod is number of seeds per pod, PH is plant height in cm, PodsPlant is the number of pods per plant, SeedYield is seed yield in kg ha−1.
FIGURE 2
FIGURE 2
Distribution of phenotyped and predicted nonphenotyped accessions within the USDA pea germplasm collection for seed yield and plant height.
FIGURE 3
FIGURE 3
Reliability criteria for nonphenotyped lines: the top 50 of genomic estimated breeding values are blue, and bottom 50 are in red, intermediates are in green. (A) reliability estimates for seed yield (kg ha−1), (B) days to first flowering, (C) plant height, (D) number of seeds per plant.

References

    1. Alexander D. H., Novembre J., Lange K. (2009). Fast Model-Based Estimation of Ancestry in Unrelated Individuals. Genome Res. 19 (9), 1655–1664. 10.1101/gr.094052.109 - DOI - PMC - PubMed
    1. Annicchiarico P., Nazzicari N., Laouar M., Thami-Alami I., Romani M., Pecetti L. (2020). Development and Proof-Of-Concept Application of Genome-Enabled Selection for Pea Grain Yield under Severe Terminal Drought. Ijms 21 (7), 2414–2420. 10.3390/ijms21072414 - DOI - PMC - PubMed
    1. Annicchiarico P., Nazzicari N., Pecetti L., Romani M., Russi L. (2019). Pea Genomic Selection for Italian Environments. BMC Genomics 20 (1), 1–18. 10.1186/s12864-019-5920-x - DOI - PMC - PubMed
    1. Azodi C. B., Bolger E., McCarren A., Roantree M., de los Campos G., Shiu S.-H. (2019). Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits. G3: Genes, Genomes, Genet. 9 (11), 3691–3702. 10.1534/g3.119.400498 - DOI - PMC - PubMed
    1. Bates D., Mächler M., Bolker B., Walker S. (2015). Fitting Linear Mixed-Effects Models Usinglme4. J. Stat. Soft. 67 (1), 1–48. 10.18637/jss.v067.i01 - DOI