Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jul 22;14(1):16800.
doi: 10.1038/s41598-024-67738-8.

Enhancing handwritten text recognition accuracy with gated mechanisms

Affiliations

Enhancing handwritten text recognition accuracy with gated mechanisms

Ravikumar Chinthaginjala et al. Sci Rep. .

Abstract

Handwritten Text Recognition (HTR) is a challenging task due to the complex structures and variations present in handwritten text. In recent years, the application of gated mechanisms, such as Long Short-Term Memory (LSTM) networks, has brought significant advancements to HTR systems. This paper presents an overview of HTR using a gated mechanism and highlights its novelty and advantages. The gated mechanism enables the model to capture long-term dependencies, retain relevant context, handle variable length sequences, mitigate error propagation, and adapt to contextual variations. The pipeline involves preprocessing the handwritten text images, extracting features, modeling the sequential dependencies using the gated mechanism, and decoding the output into readable text. The training process utilizes annotated datasets and optimization techniques to minimize transcription discrepancies. HTR using a gated mechanism has found applications in digitizing historical documents, automatic form processing, and real-time transcription. The results show improved accuracy and robustness compared to traditional HTR approaches. The advancements in HTR using a gated mechanism open up new possibilities for effectively recognizing and transcribing handwritten text in various domains. This research does a better job than the most recent iteration of the HTR system when compared to five different handwritten datasets (Washington, Saint Gall, RIMES, Bentham and IAM). Smartphones and robots are examples of low-cost computing devices that can benefit from this research.

Keywords: Convolutional recurrent neural networks; Deep learning; Gated convolutional neural networks; Handwritten transcript recognition; Natural language processing.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Puigcerver architecture.
Figure 2
Figure 2
Bluche architecture.
Figure 3
Figure 3
Flor architecture.
Figure 4
Figure 4
Proposed architecture.
Figure 5
Figure 5
Bentham database sample.
Figure 6
Figure 6
IAM database sample.
Figure 7
Figure 7
RIMES database sample.
Figure 8
Figure 8
Saint Gall database sample.
Figure 9
Figure 9
Washington database sample.
Figure 10
Figure 10
WER and CER evaluation for all Test partition.

References

    1. Bezerra, B. L. D., Zanchettin, C. & Toselli, A. H. Handwriting: Recognition, Development, and Analysis (Nova Science Publication Inc, 2017).
    1. Darmatasia and Fanany, M. I., Handwriting recognition on form document using convolutional neural network and support vector machines (CNN-SVM), Proc. 2017 5th International Conference on Information and Communication Technology (ICoIC7), Melaka, Malaysia, 2017, pp. 1–6, 10.1109/ICoICT.2017.8074699.
    1. Toselli, A. H. Vidal E. Handwritten Text Recognition Results on the Bentham Collection with Improved Classical N-Gram-HMM methods. Proc. of the 3rd International Workshop on Historical Document Imaging and Processing (HIP '15). Association for Computing Machinery, New York, NY, USA, 15–22. 10.1145/2809544.2809551. (2015).
    1. Sánchez, J. A., Romero, V., Toselli, A. H., Vidal, E. ICFHR2016 Competition on Handwritten Text Recognition on the READ Dataset, Proc. 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), Shenzhen, China, 630–635, 10.1109/ICFHR.2016.0120. (2016).
    1. Kamalanaban, E., Gopinath, M. & Premkumar, S. Medicine box: Doctor’s prescription recognition using deep machine learning. Int. J. Eng. Technol.7(334), 114–117 (2018).10.14419/ijet.v7i3.34.18785 - DOI

LinkOut - more resources