Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 29:13:938100.
doi: 10.3389/fpls.2022.938100. eCollection 2022.

Linkage analysis and residual heterozygotes derived near isogenic lines reveals a novel protein quantitative trait loci from a Glycine soja accession

Affiliations

Linkage analysis and residual heterozygotes derived near isogenic lines reveals a novel protein quantitative trait loci from a Glycine soja accession

Yia Yang et al. Front Plant Sci. .

Abstract

Modern soybean [Glycine max (L.) Merr] cultivars have low overall genetic variation due to repeated bottleneck events that arose during domestication and from selection strategies typical of many soybean breeding programs. In both public and private soybean breeding programs, the introgression of wild soybean (Glycine soja Siebold and Zucc.) alleles is a viable option to increase genetic diversity and identify new sources for traits of value. The objectives of our study were to examine the genetic architecture responsible for seed protein and oil using a recombinant inbred line (RIL) population derived from hybridizing a G. max line ('Osage') with a G. soja accession (PI 593983). Linkage mapping identified a total of seven significant quantitative trait loci on chromosomes 14 and 20 for seed protein and on chromosome 8 for seed oil with LOD scores ranging from 5.3 to 31.7 for seed protein content and from 9.8 to 25.9 for seed oil content. We analyzed 3,015 single F4:9 soybean plants to develop two residual heterozygotes derived near isogenic lines (RHD-NIL) populations by targeting nine SNP markers from genotype-by-sequencing, which corresponded to two novel quantitative trait loci (QTL) derived from G. soja: one for a novel seed oil QTL on chromosome 8 and another for a novel protein QTL on chromosome 14. Single marker analysis and linkage analysis using 50 RHD-NILs validated the chromosome 14 protein QTL, and whole genome sequencing of RHD-NILs allowed us to reduce the QTL interval from ∼16.5 to ∼4.6 Mbp. We identified two genomic regions based on recombination events which had significant increases of 0.65 and 0.72% in seed protein content without a significant decrease in seed oil content. A new Kompetitive allele-specific polymerase chain reaction (KASP) assay, which will be useful for introgression of this trait into modern elite G. max cultivars, was developed in one region. Within the significantly associated genomic regions, a total of eight genes are considered as candidate genes, based on the presence of gene annotations associated with the protein or amino acid metabolism/movement. Our results provide better insights into utilizing wild soybean as a source of genetic diversity for soybean cultivar improvement utilizing native traits.

Keywords: Glycine soja; QTL; seed oil; seed protein; wild soybean (Glycine soja Sieb. and Zucc.).

PubMed Disclaimer

Conflict of interest statement

YY was employed by the company Benson Hill. Mention of any trademark, vendor, or proprietary product does not constitute a guarantee or warranty of the product by the USDA and does not imply its approval to the exclusion of other products or vendor that may also be suitable. USDA is an equal opportunity provider and employer. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Quantitative trait loci (QTL) LOD score traces for seed oil (green) and protein (blue) content in a population of Osage × PI593983 across four environments during 2016 and 2017 in Missouri. The vertical axis indicates the genetic map position along the chromosome. The horizontal axis represents the logarithm of the odds (LOD) score. The black dotted line indicates the threshold of significance (LOD = 6.0). (A) Chr8. (B) Chr14. (C) Chr20.
FIGURE 2
FIGURE 2
Genetic similarity test between individual RHD-NIL and parental lines is shown as a heatmap. Red indicates 1.0 genetically similar, light red indicates 0.90 genetically similar, and light pink indicates less than 0.50 genetically similar. Osage represents parent one and PI 593983 represents parent two.
FIGURE 3
FIGURE 3
Distribution of markers across the Chr.14 protein QTL on the physical map. (A) Five genotyping-by-sequencing (GBS) markers in the initial RIL population. (B) Fifty-one SoySNP6K markers in the RHD-NIL population. (C) Eight WGR markers in the RHD-NIL population. The eight recombination regions are indicated on the physical map.
FIGURE 4
FIGURE 4
Individual RHD-NILs and their eight recombination regions with the physical start and stop positions, protein and oil content, and the mean with the standard deviation (SD) based on their t-test grouping. F-value for the protein and oil content and the number of Wm82.a2.v1 annotated genes are displayed at the bottom. G. max are white, G. soja are dark gray, and the region containing recombination is light gray. Regions 5 and 6 are outlined in red to showcase the most significant regions for seed protein content.
FIGURE 5
FIGURE 5
The differences in phenotypic values of protein content (%) and oil content (%) from CLM&NOV carrying different homozygous alleles for the markers Gm14_8059955 and Gm14_9508613. Allele (CC) is the allele from G. max (Osage) and (TT) is the allele from G. soja (PI 593983) in Gm14_8059955. The alleles in Gm14_9508613 are (TT) for G. max (Osage) and (GG) for G. soja (PI 593983). (A) Protein content for Gm14_8059955. (B) Protein content for Gm14_9508613. (C) Oil content for Gm14_8059955. (D) Oil content for Gm14_9508613. The whiskers represent the maximum and minimum values, the box displays the 25th and 75th percentile, and the line in the box is the median value. The dots represent the density of the phenotypic values.
FIGURE 6
FIGURE 6
Performance of developed KASP assay ED-5 for detection of protein QTL on chr.14. Endpoint fluorescence scattering plots of (A) RHD-NIL population, and (B) BC1F1 breeding population. Allele-specific HEX primer (mutant; MUT) was displayed in green, allele-specific FAM primer (wild type; WT) was displayed in blue, and heterozygous (HET) lines were marked in red. The X-axis displays fluorescence of FAM at 523–568 nm, and the Y-axis displays fluorescence of HEX at 483–533 nm. Molecular marker assay for Gm14_8059955 is displayed above containing the forward FAM allele X wild type (WT), forward HEX allele Y mutant (MUT), and the reverse primer.

Similar articles

Cited by

References

    1. Andrews S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data. Cambridge: Babraham Bioinformatics, Babraham Institute.
    1. Bernardo R. (1994). Prediction of maize single-cross performance using RFLPs and information from related hybrids. Crop Sci. 34 20–25. 10.2135/cropsci1994.0011183X003400010003x - DOI
    1. Bernardo R. (2002). Breeding for Quantitative Traits in PLANTS. Woodbury: Stemma press.
    1. Boehm J., Jr., Abdel-Haleem H., Schapaugh W., Rainey K., Pantalone V., Shannon G., et al. (2019). Genetic Improvement of U.S. Soybean in Maturity Groups V, VI, and VII. Crop Sci. 59 1838–1852. 10.2135/cropsci2018.10.0627 - DOI
    1. Bradbury P. J., Zhang Z., Kroon D. E., Casstevens T. M., Ramdoss Y., Buckler E. S. (2007). TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23 2633–2635. 10.1093/bioinformatics/btm308 - DOI - PubMed