Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Oct;26(10):3829-37.
doi: 10.1105/tpc.114.130641. Epub 2014 Oct 31.

Reliable gene expression analysis by reverse transcription-quantitative PCR: reporting and minimizing the uncertainty in data accuracy

Affiliations

Reliable gene expression analysis by reverse transcription-quantitative PCR: reporting and minimizing the uncertainty in data accuracy

Tony Remans et al. Plant Cell. 2014 Oct.

Abstract

Reverse transcription-quantitative PCR (RT-qPCR) has been widely adopted to measure differences in mRNA levels; however, biological and technical variation strongly affects the accuracy of the reported differences. RT-qPCR specialists have warned that, unless researchers minimize this variability, they may report inaccurate differences and draw incorrect biological conclusions. The Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guidelines describe procedures for conducting and reporting RT-qPCR experiments. The MIQE guidelines enable others to judge the reliability of reported results; however, a recent literature survey found low adherence to these guidelines. Additionally, even experiments that use appropriate procedures remain subject to individual variation that statistical methods cannot correct. For example, since ideal reference genes do not exist, the widely used method of normalizing RT-qPCR data to reference genes generates background noise that affects the accuracy of measured changes in mRNA levels. However, current RT-qPCR data reporting styles ignore this source of variation. In this commentary, we direct researchers to appropriate procedures, outline a method to present the remaining uncertainty in data accuracy, and propose an intuitive way to select reference genes to minimize uncertainty. Reporting the uncertainty in data accuracy also serves for quality assessment, enabling researchers and peer reviewers to confidently evaluate the reliability of gene expression data.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Flow Chart for the Identification of Suitable Reference Genes in New Experimental Conditions and Subsequent Related Experiments. A starting pool of minimum 10 reference genes, which may or not include a minimum number of traditional housekeeping genes, can be selected from sources such as reference gene papers or from transcriptome data. This pool should be evaluated by a chosen algorithm and evaluation criteria to identify appropriate reference genes. For subsequent related experiments (closely related experimental conditions or repeated experiments), it may be sufficient to start with the evaluation of the small number of genes that were used for normalization in previous experiments, but when these genes fail the criteria, additional ones need to be incorporated. Strongly deviating experimental conditions may require reevaluation of the original starting pool to identify the best genes for normalization, or may even need the incorporation of additional candidate reference genes.
Figure 2.
Figure 2.
Representation of RT-qPCR Data in Column Graphs Including the Uncertainty on the Accuracy. Relative expression data (RT-qPCR; average ± se, n = 4 biological replicates from one experiment) of CAT2 in the leaves ([A] to [C]) and RBOHF in the roots ([D] to [F]) of Arabidopsis exposed to excess Zn (100, 250, and 500 μM) and relative to the control (0 μM = 1.0). Fold up- or downregulations are represented on a log2 scale y axis. (The control condition equals 1.0; hence, bars are not visible but standard errors are.) The normalized data are represented in white and the non-normalized data are in black. The difference between these is indicated by a gray bar and visualizes the uncertainty on the accuracy of the estimated GOI up- or downregulation, which varies according to the reference genes used for normalization. Normalization of data was performed with ACT2 only ([A] and [D]), four previously validated reference genes ([B] and [D]), or the gene(s) proposed by the GrayNorm algorithm yielding the lowest level of uncertainty ([C] and [F]). Statistics (one-way ANOVA and Dunnett comparison after testing normality with Shapiro-Wilk test and homoscedasticity with Bartlett test; *P < 0.05) were performed on both normalized and non-normalized data, and both data sets should yield significance to conclude a treatment effect. Uncertainty remains no matter the set of reference genes chosen, but the uncertainty is smaller in (C) and (F), allowing more accurate estimation of true GOI levels. This revealed, for example, that RBOHF is not upregulated in the roots at 500 μM Zn.
Figure 3.
Figure 3.
Representation of RT-qPCR Data in Line Graphs Including the Uncertainty on the Accuracy. Representation of log2 relative expression levels (RT-qPCR; average ± se, n = 3 to 8 biological replicates from one experiment) of NTRA in leaves of Arabidopsis exposed to Cd (5 and 10 μM) over a time period of 0, 2, 24, 48, and 72 h and relative to the control (0 μM, 0 h = 1.0). The normalized data are represented by full lines and the non-normalized data by dotted lines, which visualizes the uncertainty on the accuracy of estimated GOI up- or downregulation. The uncertainty level varies with the (set of) reference genes used for normalization. Normalization of data was performed with ACT2 (A), three previously validated reference genes (B), or the gene proposed by the GrayNorm algorithm yielding the lowest level of uncertainty (C). If statistical differences (one-way ANOVA and Tukey-Kramer adjustment; P < 0.05) were observed between treatments within a time point, they are indicated with different lowercase letters for normalized data and bold italic for non-normalized data (only indicated in [A], as they are the same for [B] and [C]). Both data sets should yield significance to conclude a concentration dependent effect. Uncertainty remains no matter which normalization chosen, but the uncertainty is smaller in (C), allowing more accurate estimation of true GOI expression levels.

Similar articles

Cited by

References

    1. Andersen C.L., Jensen J.L., Ørntoft T.F. (2004). Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets. Cancer Res. 64: 5245–5250. - PubMed
    1. Baker M. (2011). qPCR: quicker and easier but don’t be sloppy. Nat. Methods 8: 207–212.
    1. Bustin S.A., Beaulieu J.F., Huggett J., Jaggi R., Kibenge F.S.B., Olsvik P.A., Penning L.C., Toegel S. (2010). MIQE précis: Practical implementation of minimum standard guidelines for fluorescence-based quantitative real-time PCR experiments. BMC Mol. Biol. 11: 74. - PMC - PubMed
    1. Bustin S.A., et al. . . (2013). The need for transparency and good practices in the qPCR literature. Nat. Methods 10: 1063–1067. - PubMed
    1. Bustin S.A., Benes V., Garson J.A., Hellemans J., Huggett J., Kubista M., Mueller R., Nolan T., Pfaffl M.W., Shipley G.L., Vandesompele J., Wittwer C.T. (2009). The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin. Chem. 55: 611–622. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources