Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
- PMID: 35783963
- PMCID: PMC9244705
- DOI: 10.3389/fpls.2022.882732
Identification of Candidate Genes and Genomic Selection for Seed Protein in Soybean Breeding Pipeline
Abstract
Soybean is a primary meal protein for human consumption, poultry, and livestock feed. In this study, quantitative trait locus (QTL) controlling protein content was explored via genome-wide association studies (GWAS) and linkage mapping approaches based on 284 soybean accessions and 180 recombinant inbred lines (RILs), respectively, which were evaluated for protein content for 4 years. A total of 22 single nucleotide polymorphisms (SNPs) associated with protein content were detected using mixed linear model (MLM) and general linear model (GLM) methods in Tassel and 5 QTLs using Bayesian interval mapping (IM), single-trait multiple interval mapping (SMIM), single-trait composite interval mapping maximum likelihood estimation (SMLE), and single marker regression (SMR) models in Q-Gene and IciMapping. Major QTLs were detected on chromosomes 6 and 20 in both populations. The new QTL genomic region on chromosome 6 (Chr6_18844283-19315351) included 7 candidate genes and the Hap.X AA at the Chr6_19172961 position was associated with high protein content. Genomic selection (GS) of protein content was performed using Bayesian Lasso (BL) and ridge regression best linear unbiased prediction (rrBULP) based on all the SNPs and the SNPs significantly associated with protein content resulted from GWAS. The results showed that BL and rrBLUP performed similarly; GS accuracy was dependent on the SNP set and training population size. GS efficiency was higher for the SNPs derived from GWAS than random SNPs and reached a plateau when the number of markers was >2,000. The SNP markers identified in this study and other information were essential in establishing an efficient marker-assisted selection (MAS) and GS pipelines for improving soybean protein content.
Keywords: Glycine max; genome-wide association study; genomic selection; genotyping by sequencing; protein content; single nucleotide polymorphism.
Copyright © 2022 Qin, Wang, Zhao, Shi, Zhao, Song, Ravelombola, An, Yan, Yang and Zhang.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures





Similar articles
-
Identification of candidate genes and genomic prediction of soybean fatty acid components in two soybean populations.Theor Appl Genet. 2024 Aug 29;137(9):211. doi: 10.1007/s00122-024-04716-8. Theor Appl Genet. 2024. PMID: 39210238
-
A Genome-Wide Association Study Reveals Region Associated with Seed Protein Content in Cowpea.Plants (Basel). 2023 Jul 20;12(14):2705. doi: 10.3390/plants12142705. Plants (Basel). 2023. PMID: 37514320 Free PMC article.
-
Genome-wide association study, genomic prediction and marker-assisted selection for seed weight in soybean (Glycine max).Theor Appl Genet. 2016 Jan;129(1):117-30. doi: 10.1007/s00122-015-2614-x. Epub 2015 Oct 30. Theor Appl Genet. 2016. PMID: 26518570 Free PMC article.
-
Genome-wide association study as a powerful tool for dissecting competitive traits in legumes.Front Plant Sci. 2023 Aug 14;14:1123631. doi: 10.3389/fpls.2023.1123631. eCollection 2023. Front Plant Sci. 2023. PMID: 37645459 Free PMC article. Review.
-
Genome-Wide SNP Markers Accelerate Perennial Forest Tree Breeding Rate for Disease Resistance through Marker-Assisted and Genome-Wide Selection.Int J Mol Sci. 2022 Oct 14;23(20):12315. doi: 10.3390/ijms232012315. Int J Mol Sci. 2022. PMID: 36293169 Free PMC article. Review.
Cited by
-
Genetic mapping and functional genomics of soybean seed protein.Mol Breed. 2023 Apr 12;43(4):29. doi: 10.1007/s11032-023-01373-5. eCollection 2023 Apr. Mol Breed. 2023. PMID: 37313523 Free PMC article.
-
Discovery of genomic regions associated with grain yield and agronomic traits in Bi-parental populations of maize (Zea mays. L) Under optimum and low nitrogen conditions.Front Genet. 2023 Oct 26;14:1266402. doi: 10.3389/fgene.2023.1266402. eCollection 2023. Front Genet. 2023. PMID: 37964777 Free PMC article.
-
Soybean genetic resources contributing to sustainable protein production.Theor Appl Genet. 2022 Nov;135(11):4095-4121. doi: 10.1007/s00122-022-04222-9. Epub 2022 Oct 14. Theor Appl Genet. 2022. PMID: 36239765 Free PMC article. Review.
-
Multiple-statistical genome-wide association analysis and genomic prediction of fruit aroma and agronomic traits in peaches.Hortic Res. 2023 May 31;10(7):uhad117. doi: 10.1093/hr/uhad117. eCollection 2023 Jul. Hortic Res. 2023. PMID: 37577398 Free PMC article.
-
Genomic and phenomic prediction for soybean seed yield, protein, and oil.Plant Genome. 2025 Mar;18(1):e70002. doi: 10.1002/tpg2.70002. Plant Genome. 2025. PMID: 39972529 Free PMC article.
References
-
- Bao Y., Vuong T., Meinhardt C., Tiffin P., Denny R., Chen S., et al. (2014). Potential of association mapping and genomic selection to explore PI 88788 derived soybean cyst nematode resistance. Plant Genome 7 2840–2854.
-
- Brummer E., Graef G., Orf J., Wilcox J., Shoemaker R. (1997). Mapping QTL for seed protein and oil content in eight soybean populations. Crop Sci. 37 370–378.
-
- Chapman A., Pantalone V., Ustun A., Allen F., Landau-Ellis D., Trigiano R., et al. (2003). Quantitative trait loci for agronomic and seed quality traits in an F 2 and F 4: 6 soybean population. Euphytica 129 387–393.
LinkOut - more resources
Full Text Sources