Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep 14;19(1):405.
doi: 10.1186/s12870-019-1988-3.

Genome wide search to identify reference genes candidates for gene expression analysis in Gossypium hirsutum

Affiliations

Genome wide search to identify reference genes candidates for gene expression analysis in Gossypium hirsutum

P K Smitha et al. BMC Plant Biol. .

Abstract

Background: Cotton is one of the most important commercial crops as the source of natural fiber, oil and fodder. To protect it from harmful pest populations number of newer transgenic lines have been developed. For quick expression checks in successful agriculture qPCR (quantitative polymerase chain reaction) have become extremely popular. The selection of appropriate reference genes plays a critical role in the outcome of such experiments as the method quantifies expression of the target gene in comparison with the reference. Traditionally most commonly used reference genes are the "house-keeping genes", involved in basic cellular processes. However, expression levels of such genes often vary in response to experimental conditions, forcing the researchers to validate the reference genes for every experimental platform. This study presents a data science driven unbiased genome-wide search for the selection of reference genes by assessing variation of > 50,000 genes in a publicly available RNA-seq dataset of cotton species Gossypium hirsutum.

Result: Five genes (TMN5, TBL6, UTR5B, AT1g65240 and CYP76B6) identified by data-science driven analysis, along with two commonly used reference genes found in literature (PP2A1 and UBQ14) were taken through qPCR in a set of 33 experimental samples consisting of different tissues (leaves, square, stem and root), different stages of leaf (young and mature) and square development (small, medium and large) in both transgenic and non-transgenic plants. Expression stability of the genes was evaluated using four algorithms - geNorm, BestKeeper, NormFinder and RefFinder.

Conclusion: Based on the results we recommend the usage of TMN5 and TBL6 as the optimal candidate reference genes in qPCR experiments with normal and transgenic cotton plant tissues. AT1g65240 and PP2A1 can also be used if expression study includes squares. This study, for the first time successfully displays a data science driven genome-wide search method followed by experimental validation as a method of choice for selection of stable reference genes over the selection based on function alone.

Keywords: Cotton; Data science; Gossypium hirsutum; Reference gene; Transgenic; qPCR.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Cluster of genes in the three-dimensional space of CV, MAD and 1-p obtained using the PAM method. Genes marked in red represent cluster #1
Fig. 2
Fig. 2
Work Flow to identify candidate reference genes with least variations and validation of the genes in experiment
Fig. 3
Fig. 3
Observed expression values of candidate reference genes across normal and transgenic categories, with median expression value of each gene represented by middle horizontal lines in the box plot
Fig. 4
Fig. 4
Observed expression values of candidate reference genes across (a) different ages of the plant and (b) various plant parts, with median expression value of each gene represented by middle horizontal lines in the box plot
Fig. 5
Fig. 5
Observed expression values of candidate reference genes across (a) two maturity levels of the leaves and (b) different sizes of the squares, with median expression value of each gene represented by middle horizontal lines in the box plot
Fig. 6
Fig. 6
Stability Ranks of the chosen reference genes candidate using four different algorithms - geNorm, BestKeeper, NormFinder and RefFinder

References

    1. Kozera B, Rapacz M. Reference genes in real-time PCR. J Appl Genet. 2013;54(4):391–406. doi: 10.1007/s13353-013-0173-x. - DOI - PMC - PubMed
    1. Sun R, He Q, Zhang B, Wang Q. Selection and validation of reliable reference genes in Gossypium raimondii. Biotechnol Lett. 2015;37(7):1483–1493. doi: 10.1007/s10529-015-1810-8. - DOI - PubMed
    1. Bustin SA. Real-time, fluorescence-based quantitative PCR: a snapshot of current procedures and preferences. Expert Rev Mol Diagn. 2005;5(4):493–498. doi: 10.1586/14737159.5.4.493. - DOI - PubMed
    1. Lü J, Yang C, Zhang Y, Pan H. Selection of reference genes for the normalization of RT-qPCR data in gene expression studies in insects: a systematic review. Front Physiol. 2018;9:1560. doi: 10.3389/fphys.2018.01560. - DOI - PMC - PubMed
    1. Zhu X, Li X, Chen W, Chen J, Lu W, Chen L, et al. Evaluation of new reference genes in papaya for accurate transcript normalization under different experimental conditions. PLoS One. 2012;7(8):e44405. doi: 10.1371/journal.pone.0044405. - DOI - PMC - PubMed

MeSH terms

LinkOut - more resources