A Saliency-based Convolutional Neural Network for Table and Chart Detection in Digitized Documents

by I. Kavasidis, et al.

Deep Convolutional Neural Networks (DCNNs) have recently been applied successfully to a variety of vision and multimedia tasks, driving the development of novel solutions in several application domains. Document analysis is a particularly promising area for DCNNs: the number of available digital documents has reached unprecedented levels, and humans can no longer discover and retrieve all the information these documents contain without the help of automation. In this scenario, DCNNs offer a viable way to automate information extraction from digital documents. Within information extraction from documents, the detection of tables and charts is particularly needed, as they provide a visual summary of the most valuable information in a document. To fully automate visual information extraction from tables and charts, techniques are needed that localize them and precisely identify their boundaries. In this paper we address the table/chart detection task through an approach that combines deep convolutional neural networks, graphical models and saliency concepts. In particular, we propose a saliency-based fully-convolutional neural network that performs multi-scale reasoning on visual cues, followed by a fully-connected conditional random field (CRF), for localizing tables and charts in digital/digitized documents. Performance analysis carried out on an extended version of ICDAR 2013 (with annotated charts as well as tables) shows that our approach yields promising results, outperforming existing models.
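To make the notion of localizing objects and identifying their boundaries concrete, the following is a minimal sketch of one generic way to turn a per-pixel saliency map into bounding boxes. It is not the method proposed in the paper, which relies on a fully-convolutional network and a fully-connected CRF. The function name `boxes_from_saliency`, the threshold of 0.5 and the toy map are assumptions made purely for illustration.

```python
from collections import deque


def boxes_from_saliency(saliency, threshold=0.5):
    """Return one (top, left, bottom, right) box, with inclusive indices,
    for each 4-connected region whose saliency is >= threshold.

    Hypothetical helper: a plain threshold plus flood fill, standing in for
    the learned saliency network and CRF refinement described in the abstract.
    """
    rows = len(saliency)
    cols = len(saliency[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or saliency[r][c] < threshold:
                continue
            # Breadth-first flood fill over the salient region,
            # tracking the extreme row and column indices it reaches.
            top, left, bottom, right = r, c, r, c
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx]
                            and saliency[ny][nx] >= threshold):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Toy 4x5 saliency map with two salient regions (assumed values).
toy_map = [
    [0.0, 0.9, 0.8, 0.0, 0.0],
    [0.0, 0.7, 0.9, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.6],
    [0.1, 0.0, 0.0, 0.0, 0.9],
]
detected = boxes_from_saliency(toy_map)
print(detected)
```

In a real pipeline the input would be the network's per-pixel output for a page image, and the boxes would typically be filtered by size before being reported as table or chart candidates.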




