Joint recognition of text and layout in historical Russian documents
Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.23, No. 3)Publication Date: 2023-06-21
Authors : Mohammed S. Teslya N.;
Page : 585-594
Keywords : document understanding; handwritten text recognition; layout analysis; fully connected networks; transformers;
Abstract
In this paper, we evaluated the Document Attention Network (DAN), the first end-to-end segmentation-free architecture on Historical Russian Documents. The DAN model jointly recognizes both text and layout from whole documents, it takes whole documents from any size as an input and output the text as well as logical layout tokens. For comparison purposes, we conduct our experiments on Digital Peter dataset as it has been recognized at line-level. Dataset consists of documents of Peter the Great manuscripts; ground truths are represented according to a sophisticated XML schema which enables an accurate detailed definition of layout and text regions. We achieved good results at page-level: 18.71 % for Character Error Rate (CER), 39.7 % for Word Error Rate (WER), 14.11 % For Layout Ordering Error Rate (LOER), and 66.67 % for mean Average Precision (mAP).
Other Latest Articles
- COMPARATIVE STUDY OF MACHINE TRANSLATION TECHNIQUES
- Perceived Barriers and COVID-19 Vaccine Acceptance Among Health Professions Students in Vietnam
- Blindness detection in diabetic retinopathy using Bayesian variant-based connected component algorithm in Keras and TensorFlow
- Muscle Endurance, Muscle Strength, Coordination, and Balance as Predictors of Archery Accuracy
- Exploring the possibility of predicting users’ career guidance preferences based on analysis of community topics and the gender in the online social network users’ profiles
Last modified: 2023-07-10 19:53:57