Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 May 22:5:180069.
doi: 10.1038/sdata.2018.69.

The draft genome sequence of cork oak

Affiliations

The draft genome sequence of cork oak

António Marcos Ramos et al. Sci Data. .

Abstract

Cork oak (Quercus suber) is native to southwest Europe and northwest Africa where it plays a crucial environmental and economical role. To tackle the cork oak production and industrial challenges, advanced research is imperative but dependent on the availability of a sequenced genome. To address this, we produced the first draft version of the cork oak genome. We followed a de novo assembly strategy based on high-throughput sequence data, which generated a draft genome comprising 23,347 scaffolds and 953.3 Mb in size. A total of 79,752 genes and 83,814 transcripts were predicted, including 33,658 high-confidence genes. An InterPro signature assignment was detected for 69,218 transcripts, which represented 82.6% of the total. Validation studies demonstrated the genome assembly and annotation completeness and highlighted the usefulness of the draft genome for read mapping of high-throughput sequence data generated using different protocols. All data generated is available through the public databases where it was deposited, being therefore ready to use by the academic and industry communities working on cork oak and/or related species.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1. Illumina DNA sequence data pre-processing workflow.
The pipeline included removal of low quality reads, as well as reads containing adapter sequences and undetermined nucleotides. The reads that remained were subsequently mapped to a set of chloroplast and mitochondrion genomes to remove the reads derived from these plastid genomes.
Figure 2
Figure 2. K-mer distribution used for the estimation of genome size.
The distribution was determined with Jellyfish using a k-mer size of 23.

References

Data Citations

    1. 2018. GenBank. PKMF00000000
    1. 2017. NCBI Sequence Read Archive. SRP111728

References

    1. Pereira-Leal J. B. et al. A comprehensive assessment of the transcriptome of cork oak (Quercus suber) through EST sequencing. BMC Genomics 15, 371 (2014). - PMC - PubMed
    1. Sebastiana M. et al. Oak root response to ectomycorrhizal symbiosis establishment: RNA-seq derived transcript identification and expression profiling. PLoS ONE 9, e98376 (2014). - PMC - PubMed
    1. Magalhães A. P. et al. RNA-seq and gene network analysis uncover activation of an ABA-dependent signalosome during the cork oak root response to drought. Front. Plant Sci. 6, 1195 (2016). - PMC - PubMed
    1. Rocheta M. et al. Comparative transcriptomic analysis of male and female flowers of monoecious Quercus suber. Front. Plant Sci. 5, 599 (2014). - PMC - PubMed
    1. Miguel A. et al. Characterization of the cork oak transcriptome dynamics during acorn development. BMC Plant Biol. 15, 158 (2015). - PMC - PubMed

Publication types

LinkOut - more resources