Molecular Traits of Long Non-protein Coding RNAs from Diverse Plant Species Show Little Evidence of Phylogenetic Relationships
- PMID: 31235560
- PMCID: PMC6686929
- DOI: 10.1534/g3.119.400201
Abstract
Long non-coding RNAs (lncRNAs) represent a diverse class of regulatory loci with roles in development and stress responses throughout all kingdoms of life. LncRNAs, however, remain under-studied in plants compared to animal systems. To address this deficiency, we applied a machine learning prediction tool, Classifying RNA by Ensemble Machine learning Algorithm (CREMA), to analyze RNAseq data from 11 plant species chosen to represent a wide range of evolutionary histories. Transcript sequences of all expressed and/or annotated loci from plants grown in unstressed (control) conditions were assembled and input into CREMA for comparative analyses. On average, 6.4% of the plant transcripts were identified by CREMA as lncRNAs. Gene annotations associated with the transcripts showed that up to 99% of all predicted lncRNAs for Solanum tuberosum and Amborella trichopoda were missing from their reference annotations, whereas the reference annotation for the genetic model plant Arabidopsis thaliana contains 96% of all predicted lncRNAs for this species. Thus, lncRNA research that relies on reference annotations in less well-studied plants can be impeded by the near absence of annotations for these regulatory transcripts. Moreover, our phylogenetic signal analyses suggest that the molecular traits of plant lncRNAs display different evolutionary patterns than those of all other plant transcripts and do not follow a classic evolutionary pattern. Specifically, GC content was the only tested trait of lncRNAs with consistently significant and high phylogenetic signal, in contrast to the high signal in all tested molecular traits for the other transcripts in our tested plant species.
Keywords: CREMA; RNASeq; evolution; lncRNA; phylogenetic signal.
Copyright © 2019 Simopoulos et al.
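As an illustration of the one lncRNA trait the abstract singles out, GC content can be computed per transcript as the fraction of G and C bases. The sketch below is a minimal example only: the abstract does not describe how the authors computed this trait, and the function name `gc_content`, the toy sequences, and the choice to ignore ambiguous bases such as N are all assumptions made here for illustration.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C among unambiguous bases (A, C, G, T/U).

    Assumption for this sketch: ambiguous symbols such as N are ignored,
    and an empty or fully ambiguous sequence yields 0.0.
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T") + seq.count("U")
    total = gc + at
    return gc / total if total else 0.0


# Hypothetical toy transcripts (not data from the study).
transcripts = {
    "toy_transcript_1": "ATGCGC",
    "toy_transcript_2": "auuauagc",
}

# Per-transcript GC content; values like these would be averaged per species
# before any downstream phylogenetic signal test.
gc_by_id = {name: gc_content(seq) for name, seq in transcripts.items()}
```

Per-species summaries of such values are the kind of trait that a phylogenetic signal analysis would then compare against a species tree; that step is not shown here.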