Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model

03/16/2018
by Manuel Carbonell, et al.

When extracting information from handwritten documents, text transcription and named entity recognition are usually treated as separate, sequential tasks. The drawback is that errors in the first module heavily affect the performance of the second. In this work we propose to perform both tasks jointly, using a single neural network with an architecture commonly used for plain text recognition. The approach has been tested experimentally on a collection of historical marriage records. The experimental results show the effect on performance of different configurations: different ways of encoding the information, training with or without transfer learning, and processing at text-line or multi-line region level. The results are comparable to the state of the art reported in the ICDAR 2017 Information Extraction competition, even though the proposed technique does not use any dictionaries, language modeling or post-processing.
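
One way to picture how a single text recognizer could emit both the transcription and the named entities is to fold the entity labels into the target sequence as extra tokens. The sketch below is only an illustration of that idea and is not taken from the paper: the tag names, the `O` label for untagged words, and the helper functions `encode_joint_target` and `decode_joint_output` are all assumptions.

```python
from typing import List, Tuple

# Hypothetical labels: the abstract does not specify the tag set or encoding.
UNTAGGED = "O"


def encode_joint_target(words: List[Tuple[str, str]]) -> List[str]:
    """Build one token sequence from (word, entity_label) pairs.

    Each tagged word is preceded by a tag token such as '<name>';
    words are emitted character by character, separated by spaces.
    """
    tokens: List[str] = []
    for word, label in words:
        if label != UNTAGGED:
            tokens.append(f"<{label}>")
        tokens.extend(word)   # one token per character
        tokens.append(" ")    # word separator
    return tokens[:-1] if tokens else tokens  # drop trailing separator


def decode_joint_output(tokens: List[str]) -> List[Tuple[str, str]]:
    """Recover (word, entity_label) pairs from a predicted token sequence."""
    pairs: List[Tuple[str, str]] = []
    chars: List[str] = []
    label = UNTAGGED
    for tok in tokens:
        if len(tok) > 2 and tok.startswith("<") and tok.endswith(">"):
            label = tok[1:-1]          # tag token applies to the next word
        elif tok == " ":
            if chars:
                pairs.append(("".join(chars), label))
            chars, label = [], UNTAGGED
        else:
            chars.append(tok)
    if chars:
        pairs.append(("".join(chars), label))
    return pairs


# Usage with an invented line from a marriage record.
line = [("Joan", "name"), ("pages", "occupation"), ("de", "O"), ("Barcelona", "location")]
target = encode_joint_target(line)
recovered = decode_joint_output(target)
```

With this kind of encoding, the recognizer's output alphabet simply grows by one symbol per entity tag, so no separate tagging module is needed. Other encodings, such as a single combined tag per word, are equally plausible readings of "different ways of encoding the information".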

research
12/08/2021

Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents

The extraction of relevant information carried out by named entities in ...
research
12/20/2019

TreyNet: A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages

In the last years, the consolidation of deep neural network architecture...
research
05/23/2022

LexiconNet: An End-to-End Handwritten Paragraph Text Recognition System

Historical documents present in the form of libraries needs to be digiti...
research
04/27/2023

Large Scale Genealogical Information Extraction From Handwritten Quebec Parish Records

This paper presents a complete workflow designed for extracting informat...
research
08/16/2022

The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition

Handwritten Text Recognition (HTR) is an open problem at the intersectio...
research
07/17/2018

Bench-Marking Information Extraction in Semi-Structured Historical Handwritten Records

In this report, we present our findings from benchmarking experiments fo...
