Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Oct;208(1):96-101.
doi: 10.1111/nph.13545. Epub 2015 Jun 25.

AtRTD - a comprehensive reference transcript dataset resource for accurate quantification of transcript-specific expression in Arabidopsis thaliana

Affiliations

AtRTD - a comprehensive reference transcript dataset resource for accurate quantification of transcript-specific expression in Arabidopsis thaliana

Runxuan Zhang et al. New Phytol. 2015 Oct.

Abstract

RNA-sequencing (RNA-seq) allows global gene expression analysis at the individual transcript level. Accurate quantification of transcript variants generated by alternative splicing (AS) remains a challenge. We have developed a comprehensive, nonredundant Arabidopsis reference transcript dataset (AtRTD) containing over 74 000 transcripts for use with algorithms to quantify AS transcript isoforms in RNA-seq. The AtRTD was formed by merging transcripts from TAIR10 and novel transcripts identified in an AS discovery project. We have estimated transcript abundance in RNA-seq data using the transcriptome-based alignment-free programmes Sailfish and Salmon and have validated quantification of splicing ratios from RNA-seq by high resolution reverse transcription polymerase chain reaction (HR RT-PCR). Good correlations between splicing ratios from RNA-seq and HR RT-PCR were obtained demonstrating the accuracy of abundances calculated for individual transcripts in RNA-seq. The AtRTD is a resource that will have immediate utility in analysing Arabidopsis RNA-seq data to quantify differential transcript abundance and expression.

Keywords: Arabidopsis thaliana; RNA-sequencing (RNA-seq); Sailfish; Salmon; alternative splicing; high resolution reverse transcription (HR RT)-PCR; transcripts per million.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Distribution of the number of transcripts per gene in The Arabidopsis Information Resource version 10 (TAIR10) and the Arabidopsis reference transcript dataset (AtRTD). The number of Arabidopsis thaliana genes (y‐axis) containing two or more transcripts (x‐axis) are shown.
Figure 2
Figure 2
Arabidopsis thaliana gene and transcript structures and histograms of transcript ratios from transcripts per million (TPM) generated by Sailfish and Salmon with Arabidopsis reference transcript dataset (AtRTD) and from relative fluorescence units (RFUs) that measures peak areas from high resolution reverse transcription polymerase chain reaction (HR RTPCR. (a) At5g16820, HEAT SHOCK FACTOR 3 (HSF3); (b) At2g39730, RUBISCO ACTIVASE (RCA). Transcripts are shown below the gene structure. Open boxes, exons; black rectangles, untranslated regions; thin lines, introns; diagonal lines, splicing events; arrowheads, approximate positions of primers used in HR RTPCR. FS, fully spliced; E, exon; I2R/I6R, intron retention of introns 2 and 6 in (a) and (b), respectively; Alt 3′/5′ss, alternative (a) 3′ and (b) 5′ splice sites, respectively; T1 and T2, different time‐points. Transcript variants from The Arabidopsis Information Resource (TAIR) and the alternative splicing (AS) discovery dataset (Marquez et al., 2012) are indicated by .1, .2, or _ID3, respectively. Error bars represent the ± standard deviation (SD) of three independent biological replicates.
Figure 3
Figure 3
Correlation of the splicing ratios calculated from the RNA‐seq data and the high resolution reverse transcription polymerase chain reaction (HR RTPCR. (a) Sailfish, (b) Salmon. Splicing ratios for 50 alternative splicing events from 29 Arabidopsis thaliana genes (three biological replicates of the time points T1 and T2) generated 300 data points in total. AS, alternatively spliced; FS, fully spliced; TPM, transcripts per million; RFU, relative fluorescence unit. Pearson's correlation coefficient: Sailfish = 0.7044, Salmon = 0.9051. Spearman's rank correlation coefficient: Sailfish = 0.712, Salmon = 0.907.

Similar articles

Cited by

References

    1. Airoldi CA, Davies B. 2012. Gene duplication and the evolution of plant MADS‐box transcription factors. Journal of Genetics and Genomics 39: 157–165. - PubMed
    1. Alamancos GP, Pagès A, Trincado JL, Bellora N, Eyras E. 2015. Leveraging transcript quantification for fast computation of alternative splicing profiles. bioRxiv. doi: 10.1101/008763. - DOI - PMC - PubMed
    1. Bardou F, Ariel F, Simpson CG, Romero‐Barrios N, Laporte P, Balzerque S, Brown JW, Crespi M. 2014. Long noncoding RNA modulates alternative splicing regulators in Arabidopsis . Developmental Cell 30: 166–176. - PubMed
    1. Black DL. 2003. Mechanisms of alternative pre‐messenger RNA splicing. Annual Review of Biochemistry 72: 291–336. - PubMed
    1. Carvalho RF, Feijão CV, Duque P. 2013. On the physiological significance of alternative splicing events in higher plants. Protoplasma 250: 639–650. - PubMed

Publication types

LinkOut - more resources