Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Dec 2;7(12):260.
doi: 10.3390/jimaging7120260.

HTR for Greek Historical Handwritten Documents

Affiliations

HTR for Greek Historical Handwritten Documents

Lazaros Tsochatzidis et al. J Imaging. .

Abstract

Offline handwritten text recognition (HTR) for historical documents aims for effective transcription by addressing challenges that originate from the low quality of manuscripts under study as well as from several particularities which are related to the historical period of writing. In this paper, the challenge in HTR is related to a focused goal of the transcription of Greek historical manuscripts that contain several particularities. To this end, in this paper, a convolutional recurrent neural network architecture is proposed that comprises octave convolution and recurrent units which use effective gated mechanisms. The proposed architecture has been evaluated on three newly created collections from Greek historical handwritten documents that will be made publicly available for research purposes as well as on standard datasets like IAM and RIMES. For evaluation we perform a concise study which shows that compared to state of the art architectures, the proposed one deals effectively with the challenging Greek historical manuscripts.

Keywords: convolutional neural networks; document image dataset; gated recurrent unit; handwritten text recognition; recurrent neural networks.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Schematic diagram of the proposed architecture, consisting of (a) the CNN stage and (b) the recurrent stage.
Figure 2
Figure 2
An example text line image (a) before and (b) after the preprocessing stage.
Figure 3
Figure 3
Feature maps produced by each layer of the Octave-CNN model.
Figure 4
Figure 4
An illustration of the gated recurrent unit (GRU) [8].
Figure 5
Figure 5
Example document image from the collection χϕ53.
Figure 6
Figure 6
Example document image from the collection χϕ79.
Figure 7
Figure 7
Example document image from the collection χϕ114.
Figure 8
Figure 8
Floating characters appearing at word endings. The floating portion of the word is represented by a rectangle, while the rest of the word is underlined.
Figure 9
Figure 9
‘Minuscule’ writing example. Key locations in the text line that correspond to this particularity are underlined.
Figure 10
Figure 10
Polytonic orthography example.
Figure 11
Figure 11
An example of a correctly predicted text line image along with the corresponding (a) groud-truth and (b) predicted texts.
Figure 12
Figure 12
An example of a problematic text line image along with the corresponding (a) groud-truth and (b) predicted texts. The errors concern diacritics (circle), spacing (red line) and abbreviations (square).

References

    1. De Sousa Neto A.F., Bezerra B.L.D., Toselli A.H., Lima E.B. HTR-Flor++: A Handwritten Text Recognition System Based on a Pipeline of Optical and Language Models; Proceedings of the DocEng ’20: ACM Symposium on Document Engineering 2020; Virtual Event. 29 September–1 October 2020; pp. 17:1–17:4. - DOI
    1. Puigcerver J. Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?; Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition; Kyoto, Japan. 9–15 November 2017; pp. 67–72. - DOI
    1. Chen Y., Fan H., Xu B., Yan Z., Kalantidis Y., Rohrbach M., Yan S., Feng J. Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution; Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision; Seoul, Korea. 27 October–2 November 2019; pp. 3434–3443. - DOI
    1. Markou K., Tsochatzidis L.T., Zagoris K., Papazoglou A., Karagiannis X., Symeonidis S., Pratikakis I. A Convolutional Recurrent Neural Network for the Handwritten Text Recognition of Historical Greek Manuscripts; Proceedings of the Pattern Recognition, ICPR International Workshops and Challenges; Virtual Event. 10–15 January 2021; pp. 249–262. - DOI
    1. Marti U., Bunke H. The IAM-database: An English sentence database for offline handwriting recognition. Int. J. Doc. Anal. Recognit. 2002;5:39–46. doi: 10.1007/s100320200071. - DOI

LinkOut - more resources