POLYAR, a new computer program for prediction of poly(A) sites in human sequences
- PMID: 21092114
- PMCID: PMC3053588
- DOI: 10.1186/1471-2164-11-646
POLYAR, a new computer program for prediction of poly(A) sites in human sequences
Abstract
Background: mRNA polyadenylation is an essential step of pre-mRNA processing in eukaryotes. Accurate prediction of the pre-mRNA 3'-end cleavage/polyadenylation sites is important for defining the gene boundaries and understanding gene expression mechanisms.
Results: 28761 human mapped poly(A) sites have been classified into three classes containing different known forms of polyadenylation signal (PAS) or none of them (PAS-strong, PAS-weak and PAS-less, respectively) and a new computer program POLYAR for the prediction of poly(A) sites of each class was developed. In comparison with polya_svm (till date the most accurate computer program for prediction of poly(A) sites) while searching for PAS-strong poly(A) sites in human sequences, POLYAR had a significantly higher prediction sensitivity (80.8% versus 65.7%) and specificity (66.4% versus 51.7%) However, when a similar sort of search was conducted for PAS-weak and PAS-less poly(A) sites, both programs had a very low prediction accuracy, which indicates that our knowledge about factors involved in the determination of the poly(A) sites is not sufficient to identify such polyadenylation regions.
Conclusions: We present a new classification of polyadenylation sites into three classes and a novel computer program POLYAR for prediction of poly(A) sites/regions of each of the class. In tests, POLYAR shows high accuracy of prediction of the PAS-strong poly(A) sites, though this program's efficiency in searching for PAS-weak and PAS-less poly(A) sites is not very high but is comparable to other available programs. These findings suggest that additional characteristics of such poly(A) sites remain to be elucidated. POLYAR program with a stand-alone version for downloading is available at http://cub.comsats.edu.pk/polyapredict.htm.
Figures


Similar articles
-
Sequence determinants in human polyadenylation site selection.BMC Genomics. 2003 Feb 25;4(1):7. doi: 10.1186/1471-2164-4-7. Epub 2003 Feb 25. BMC Genomics. 2003. PMID: 12600277 Free PMC article.
-
Omni-PolyA: a method and tool for accurate recognition of Poly(A) signals in human genomic DNA.BMC Genomics. 2017 Aug 15;18(1):620. doi: 10.1186/s12864-017-4033-7. BMC Genomics. 2017. PMID: 28810905 Free PMC article.
-
Hybrid model for efficient prediction of poly(A) signals in human genomic DNA.Methods. 2019 Aug 15;166:31-39. doi: 10.1016/j.ymeth.2019.04.001. Epub 2019 Apr 13. Methods. 2019. PMID: 30991099
-
Implications of polyadenylation in health and disease.Nucleus. 2014;5(6):508-19. doi: 10.4161/nucl.36360. Epub 2014 Oct 31. Nucleus. 2014. PMID: 25484187 Free PMC article. Review.
-
Genome-wide identification and predictive modeling of polyadenylation sites in eukaryotes.Brief Bioinform. 2015 Mar;16(2):304-13. doi: 10.1093/bib/bbu011. Epub 2014 Apr 1. Brief Bioinform. 2015. PMID: 24695098 Review.
Cited by
-
RNA polyadenylation sites on the genomes of microorganisms, animals, and plants.PLoS One. 2013 Nov 18;8(11):e79511. doi: 10.1371/journal.pone.0079511. eCollection 2013. PLoS One. 2013. PMID: 24260238 Free PMC article.
-
Aptardi predicts polyadenylation sites in sample-specific transcriptomes using high-throughput RNA sequencing and DNA sequence.Nat Commun. 2021 Mar 12;12(1):1652. doi: 10.1038/s41467-021-21894-x. Nat Commun. 2021. PMID: 33712618 Free PMC article.
-
CRISPRs in the human genome are differentially expressed between malignant and normal adjacent to tumor tissue.Commun Biol. 2022 Apr 8;5(1):338. doi: 10.1038/s42003-022-03249-4. Commun Biol. 2022. PMID: 35396392 Free PMC article.
-
Alterations in polyadenylation and its implications for endocrine disease.Front Endocrinol (Lausanne). 2013 May 8;4:53. doi: 10.3389/fendo.2013.00053. eCollection 2013. Front Endocrinol (Lausanne). 2013. PMID: 23658553 Free PMC article.
-
Motif types, motif locations and base composition patterns around the RNA polyadenylation site in microorganisms, plants and animals.BMC Evol Biol. 2014 Jul 23;14:162. doi: 10.1186/s12862-014-0162-7. BMC Evol Biol. 2014. PMID: 25052519 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources