Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec 17;19(1):942.
doi: 10.1186/s12864-018-5247-z.

A catalog of annotated high-confidence SNPs from exome capture and sequencing reveals highly polymorphic genes in Norway spruce (Picea abies)

Affiliations

A catalog of annotated high-confidence SNPs from exome capture and sequencing reveals highly polymorphic genes in Norway spruce (Picea abies)

Aïda Azaiez et al. BMC Genomics. .

Abstract

Background: Norway spruce [Picea abies (L.) Karst.] is ecologically and economically one of the most important conifer worldwide. Our main goal was to develop a large catalog of annotated high confidence gene SNPs that should sustain the development of genomic tools for the conservation of natural and domesticated genetic diversity resources, and hasten tree breeding efforts in this species.

Results: Targeted sequencing was achieved by capturing P. abies exome with probes previously designed from the sequenced transcriptome of white spruce (Picea glauca (Moench) Voss). Capture efficiency was high (74.5%) given a high level of exome conservation between the two species. Using stringent criteria, we delimited a set of 61,771 high-confidence SNPs across 13,543 genes. To validate SNPs, a high-throughput genotyping array was developed for a subset of 5571 predicted SNPs representing as many different gene loci, and was used to genotype over 1000 trees. The estimated true positive rate of the resource was 84.2%, which was comparable with the genotyping success rate obtained for P. abies control SNPs recycled from previous genotyping efforts. We also analyzed SNP abundance across various gene functional categories. Several GO terms and gene families involved in stress response were found over-represented in highly polymorphic genes.

Conclusion: The annotated high-confidence SNP catalog developed herein represents a valuable genomic resource, being representative of over 13 K genes distributed across the P. abies genome. This resource should serve a variety of population genomics and breeding applications in Norway spruce.

Keywords: Annotated gene SNPs; Conifer; Exome sequencing; Illumina Infinium iSelect array; Illumina MiSeq; Picea abies; SNP abundance; Single nucleotide polymorphism.

PubMed Disclaimer

Conflict of interest statement

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Figures

Fig. 1
Fig. 1
Pipeline for exome capture and sequencing, sequence assembly and SNP discovery. 1 [30]. 2 [93]. 3 [94]. 4454 Life Science, Branford, CT.See Methods for details
Fig. 2
Fig. 2
Distribution of SNP abundance across genes. β is the computed SNP abundance parameter correcting for variable sequencing depth among genes, which was used to compare SNP abundance among them
Fig. 3
Fig. 3
Differential representation of gene families among the 10% most SNPed genes versus the other genes. The 16 families found differentially represented after Fisher’s exact test (P < 0.05) are represented; the stars (*) indicate the statistically most significant differences (P < 0.01). Gene family identifiers were retrieved from the ConGenIE database (http://www.congenie.org)

References

    1. Birol I, Raymond A, Jackman SD, Pleasance S, Coope R, Taylor GA, et al. Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data. Bioinformatics. 2013;29:1492–1497. doi: 10.1093/bioinformatics/btt178. - DOI - PMC - PubMed
    1. Nystedt B, Street NR, Wetterborn A, Zuccolo A, Lin Y-C, Scofield D-G, et al. The Norway spruce genome sequence and conifer genome evolution. Nature. 2013;497:579–584. doi: 10.1038/nature12211. - DOI - PubMed
    1. Warren RL, Keeling CI, Yuen MM, Raymond A, Taylor GA, Vandervalke BP, et al. Improved white spruce (Picea glauca) genome assemblies and annotation of large gene families of conifer terpenoid and phenolic defense metabolism. Plant J. 2015;83:189–212. doi: 10.1111/tpj.12886. - DOI - PubMed
    1. De La Torre AR, Birol I, Bousquet J, Ingvarsson PK, Jansson S, Jones SJM, et al. Insights into conifer giga-genomes. Plant Physiol. 2014;166:1724–1732. doi: 10.1104/pp.114.248708. - DOI - PMC - PubMed
    1. Pelgas B, Bousquet J, Meirmans PG, Ritland K, Isabel N. QTL mapping in white spruce: gene maps and genomic regions underlying adaptive traits across pedigrees, years and environments. BMC Genomics. 2011;12:145. doi: 10.1186/1471-2164-12-145. - DOI - PMC - PubMed

LinkOut - more resources