Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 14;24(1):554.
doi: 10.1186/s12870-024-05262-7.

Genome-wide analyses of member identification, expression pattern, and protein-protein interaction of EPF/EPFL gene family in Gossypium

Affiliations

Genome-wide analyses of member identification, expression pattern, and protein-protein interaction of EPF/EPFL gene family in Gossypium

Pengtao Li et al. BMC Plant Biol. .

Abstract

Background: Epidermal patterning factor / -like (EPF/EPFL) gene family encodes a class of cysteine-rich secretory peptides, which are widelyfound in terrestrial plants.Multiple studies has indicated that EPF/EPFLs might play significant roles in coordinating plant development and growth, especially as the morphogenesis processes of stoma, awn, stamen, and fruit skin. However, few research on EPF/EPFL gene family was reported in Gossypium.

Results: We separately identified 20 G. raimondii, 24 G. arboreum, 44 G. hirsutum, and 44 G. barbadense EPF/EPFL genes in the 4 representative cotton species, which were divided into four clades together with 11 Arabidopsis thaliana, 13 Oryza sativa, and 17 Selaginella moellendorffii ones based on their evolutionary relationships. The similar gene structure and common motifs indicated the high conservation among the EPF/EPFL members, while the uneven distribution in chromosomes implied the variability during the long-term evolutionary process. Hundreds of collinearity relationships were identified from the pairwise comparisons of intraspecifc and interspecific genomes, which illustrated gene duplication might contribute to the expansion of cotton EPF/EPFL gene family. A total of 15 kinds of cis-regulatory elements were predicted in the promoter regions, and divided into three major categories relevant to the biological processes of development and growth, plant hormone response, and abiotic stress response. Having performing the expression pattern analyses with the basic of the published RNA-seq data, we found most of GhEPF/EPFL and GbEPF/EPFL genes presented the relatively low expression levels among the 9 tissues or organs, while showed more dramatically different responses to high/low temperature and salt or drought stresses. Combined with transcriptome data of developing ovules and fibers and quantitative Real-time PCR results (qRT-PCR) of 15 highly expressed GhEPF/EPFL genes, it could be deduced that the cotton EPF/EPFL genes were closely related with fiber development. Additionally, the networks of protein-protein interacting among EPF/EPFLs concentrated on the cores of GhEPF1 and GhEPF7, and thosefunctional enrichment analyses indicated that most of EPF/EPFLs participate in the GO (Gene Ontology) terms of stomatal development and plant epidermis development, and the KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways of DNA or base excision repair.

Conclusion: Totally, 132 EPF/EPFL genes were identified for the first time in cotton, whose bioinformatic analyses of cis-regulatory elements and expression patterns combined with qRT-PCR experiments to prove the potential functions in the biological processes of plant growth and responding to abiotic stresses, specifically in the fiber development. These results not only provide comprehensive and valuable information for cotton EPF/EPFL gene family, but also lay solid foundation for screening candidate EPF/EPFL genes in further cotton breeding.

Keywords: EPF/EPFL gene family; Cotton; Expression pattern; Protein–protein interaction; qRT-PCR verification.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Phylogenetic analysis of EPF/EPFL proteins from Gossypium, Arabidopsis thaliana, Oryza sativa, and Selaginella moellendorffii. The blue checkmark presents the AtEPF/EPFL proteins, and the purple square presents the OsEPF/EPFL proteins. The red check mark presents the SmEPF/EPFL proteins, and the green triangle presents the GaEPF/EPFL proteins. The red square presents the GrEPF/EPFL proteins, and the yellow star and blue circle present the GhEPF/EPFL and GbEPF/EPFL proteins, respectively
Fig. 2
Fig. 2
Chromosomal location of cotton EPF/EPFL genes
Fig. 3
Fig. 3
Gene structure and conserved motif identification of cotton EPF/EPFLs. a represents the evolutionary relationships of cotton EPF/EPFL genes, and b and c separately represent the conserved motifs and gene structures of cotton EPF/EPFL genes
Fig. 4
Fig. 4
Collinearity events of duplication gene pairs of EPF/EPFL genes in four cotton species. The different color rectangles represented the chromosomes derived from the different cotton species, and the different color lines represented the collinearity relationships between and among the different cotton species
Fig. 5
Fig. 5
Cis-regulatory elements in the promoter regions of cotton EPF/EPFL genes
Fig. 6
Fig. 6
The tissue-specific expression and expressed patterns responding to abiotic stresses. A presented the analyses of specific expression of GhEPF/EPFL and GbEPF/EPFL genes in 9 tissues, and B presented the expression patterns of GhEPF/EPFL genes in response to high-temperature (37℃ treatment), low-temperature (4℃ treatment), salt stress (NaCl treatment), and drought stress (PEG treatment) at 0 h, 1 h, 3 h, 6 h, 12 h, and 24 h, respectively
Fig. 7
Fig. 7
The expression patterns and qRT-PCR verification of cotton EPF/EPFL genes during fiber development. A presented the analyses of expression patterns of GhEPF/EPFL and GbEPF/EPFL genes on the developing ovule (0, 1, 3, 5, 10, and 20 DPA) and fiber (10, 20, and 25 DPA), and B presented the qRT-PCR verification of 15 highly expressed GhEPF/EPFL genes on the high-yield and wide-adaptability CCRI36 and superior fiber-quality and high VW-resistance Hai1 at 10, 20, and 25 DPA, respectively
Fig. 8
Fig. 8
The protein–protein interaction and functional enrichment of GhEPF/EPFL genes. A presented the analysis of protein–protein interaction of GhEPF/EPFL genes based on the String database, and B and C separately presented the enriched GO categories and KEGG pathways of GhEPF/EPFL genes

Similar articles

Cited by

References

    1. Drakakaki G, Dandekar A. Protein secretion: how many secretory routes does a plant cell have? Plant Sci. 2013;203–204:74–78. doi: 10.1016/j.plantsci.2012.12.017. - DOI - PubMed
    1. Grudkowska M, Zagdańska B. Multifunctional role of plant cysteine proteinases. Acta Biochim Pol. 2004;51(3):609–624. doi: 10.18388/abp.2004_3547. - DOI - PubMed
    1. Rawlings ND, Barrett AJ, Thomas PD, Huang X, Bateman A, Finn RD. The MEROPS database of proteolytic enzymes, their substrates and inhibitors in 2017 and a comparison with peptidases in the PANTHER database. Nucleic Acids Res. 2018;46(D1):D624–D632. doi: 10.1093/nar/gkx1134. - DOI - PMC - PubMed
    1. Szewińska J, Simińska J, Bielawski W. The roles of cysteine proteases and phytocystatins in development and germination of cereal seeds. J Plant Physiol. 2016;207:10–21. doi: 10.1016/j.jplph.2016.09.008. - DOI - PubMed
    1. Höwing T, Dann M, Müller B, Helm M, Scholz S, Schneitz K, Hammes UZ, Gietl C. The role of KDEL-tailed cysteine endopeptidases of Arabidopsis (AtCEP2 and AtCEP1) in root development. PLoS ONE. 2018;13(12):e0209407. doi: 10.1371/journal.pone.0209407. - DOI - PMC - PubMed

LinkOut - more resources