A deep learning based approach for extracting Arabic handwriting: applied calligraphy and old cursive
- PMID: 38192476
- PMCID: PMC10773564
- DOI: 10.7717/peerj-cs.1465
A deep learning based approach for extracting Arabic handwriting: applied calligraphy and old cursive
Abstract
Based on the results of this research, a new method for separating Arabic offline text is presented. This method finds the core splitter between the "Middle" and "Lower" zones by looking for sharp character degeneration in those zones. With the exception of script localization and the essential feature of determining which direction a starting point is pointing, the baseline also functions as a delimiter for horizontal projections. Despite the fact that the bottom half of the characteristics is utilized to differentiate the modifiers in zones, the top half of the characteristics is not. This method works best when the baseline is able to divide features into the bottom zone and the middle zone in a complex pattern where it is hard to find the alphabet, like in ancient scripts. Furthermore, this technique performed well when it came to distinguishing Arabic text, including calligraphy. With the zoning system, the aim is to decrease the number of different element classes that are associated with the total number of alphabets used in Arabic cursive writing. The components are identified using the pixel value origin and center reign (CR) technique, which is combined with letter morphology to achieve complete word-level identification. Using the upper baseline and lower baseline together, this proposed technique produces a consistent Arabic pattern, which is intended to improve identification rates by increasing the number of matches. For Mediterranean keywords (cities in Algeria and Tunisia), the suggested approach makes use of indicators that the correctness of the Othmani and Arabic scripts is greater than 98.14 percent and 90.16 percent, respectively, based on 84 and 117 verses. As a consequence of the auditing method and the assessment section's structure and software, the major problems were identified, with a few of them being specifically highlighted.
Keywords: Pattern Recognition; Recognition.
©2023 Zerdoumi et al.
Conflict of interest statement
Noor Zaman Jhanjhi is an Academic Editor for PeerJ. The authors declare there are no competing interests.
Figures
References
-
- Abdelaziz I, Abdou S, Al-Barhamtoshy H. A large vocabulary system for Arabic online handwriting recognition. Pattern Analysis and Applications. 2016;19:1129–1141. doi: 10.1007/s10044-015-0526-7. - DOI
-
- Al-Dmour A, Fraij F. Segmenting Arabic handwritten documents into text lines and words. International Journal of Advancements in Computing Technology. 2014;6(3):109–119.
-
- Al-Ma’adeed S, Elliman D, Higgins CA. A data base for Arabic handwritten text recognition research. Piscataway. Frontiers in handwriting recognition, 2002. Proceedings. Eighth international workshop on 2002.2002. pp. 485–489.
-
- Ali A, Ahmad M, Rafiq N, Akber J, Ahmad U, Akmal S. Language independent optical character recognition for hand written text. Piscataway. Multitopic conference, 2004. Proceedings of INMIC 2004. 8th international.2004. pp. 79–84.
-
- Amin A, Al-Sadoun H, Fischer S. Hand-printed Arabic character recognition system using an artificial network. Pattern Recognition. 1996;29(4):663–675. doi: 10.1016/0031-3203(95)00110-7. - DOI
LinkOut - more resources
Full Text Sources
Research Materials