Importance of Textlines in Historical Document Classification

01/24/2022
by Martin Kišš, et al.

This paper describes a system prepared at Brno University of Technology for the ICDAR 2021 Competition on Historical Document Classification, the experiments leading to its design, and the main findings. The tasks addressed include script and font classification, document origin localization, and dating. We combined patch-level and line-level approaches; the line-level system uses an existing, publicly available page layout analysis engine. In both systems, neural networks provide local predictions that are combined into page-level decisions, and the results of the two systems are fused using linear or log-linear interpolation. We propose loss functions suitable for weakly supervised classification problems in which multiple possible labels are provided, as well as loss functions suitable for interval regression in the dating task. The line-level system significantly improves results in script and font classification and in the dating task. The full system achieved 98.48 respectively. In the dating task, our system achieved a mean absolute error of 21.91 years.
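The abstract names three mechanisms without giving their formulas: fusion of two systems by linear or log-linear interpolation, a loss for samples with several acceptable labels, and a loss for interval regression. The sketch below shows one common way each could be written, purely as an illustration. All function names, the `weight` parameter, and the specific loss forms are assumptions made here and are not taken from the paper, whose actual definitions may differ.

```python
import math

def fuse_linear(p_patch, p_line, weight=0.5):
    """Linear interpolation: weighted arithmetic mean of two class distributions."""
    return [weight * a + (1.0 - weight) * b for a, b in zip(p_patch, p_line)]

def fuse_log_linear(p_patch, p_line, weight=0.5, eps=1e-12):
    """Log-linear interpolation: weighted geometric mean, renormalized to sum to 1."""
    scores = [
        math.exp(weight * math.log(a + eps) + (1.0 - weight) * math.log(b + eps))
        for a, b in zip(p_patch, p_line)
    ]
    total = sum(scores)
    return [s / total for s in scores]

def multi_label_nll(probs, allowed, eps=1e-12):
    """Assumed weak-supervision loss: negative log of the total probability
    assigned to any of the allowed class indices."""
    return -math.log(sum(probs[i] for i in allowed) + eps)

def interval_loss(pred_year, start, end):
    """Assumed interval-regression loss: zero inside [start, end],
    absolute distance to the nearest bound outside it."""
    return max(start - pred_year, 0.0, pred_year - end)

# Hypothetical page-level posteriors from the patch-level and line-level systems.
patch = [0.7, 0.2, 0.1]
line = [0.5, 0.4, 0.1]
fused_linear = fuse_linear(patch, line)
fused_log_linear = fuse_log_linear(patch, line)
```

Under these assumptions, the multi-label loss does not penalize the model for choosing among the acceptable labels, and the interval loss does not penalize any predicted year that falls within the annotated date range.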

