Siwen Luo

research

∙ 07/31/2023

Workshop on Document Intelligence Understanding

Document understanding and information extraction include different task...

0 Soyeon Caren Han, et al. ∙

research

∙ 04/13/2023

PDFVQA: A New Dataset for Real-World VQA on PDF Documents

Document-based Visual Question Answering examines the document understan...

0 Yihao Ding, et al. ∙

research

∙ 12/16/2022

SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering

Most TextVQA approaches focus on the integration of objects, scene texts...

0 Siwen Luo, et al. ∙

research

∙ 11/29/2022

PiggyBack: Pretrained Visual Question Answering Environment for Backing up Non-deep Learning Professionals

We propose a PiggyBack, a Visual Question Answering platform that allows...

0 Zhihao Zhang, et al. ∙

research

∙ 08/22/2022

Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis

Recognizing the layout of unstructured digital documents is crucial when...

0 Siwen Luo, et al. ∙

research

∙ 03/20/2021

Local Interpretations for Explainable Natural Language Processing: A Survey

As the use of deep learning techniques has grown across various fields o...

0 Siwen Luo, et al. ∙

research

∙ 02/20/2021

Deep Structured Feature Networks for Table Detection and Tabular Data Extraction from Scanned Financial Document Images

Automatic table detection in PDF documents has achieved a great success ...

0 Siwen Luo, et al. ∙

research

∙ 10/07/2020

VICTR: Visual Information Captured Text Representation for Text-to-Image Multimodal Tasks

Text-to-image multimodal tasks, generating/retrieving an image from a gi...

0 Soyeon Caren Han, et al. ∙

research

∙ 07/27/2020

REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering

Visual question answering (VQA) is a challenging multi-modal task that r...

0 Siwen Luo, et al. ∙

Siwen Luo

Featured Co-authors

Sign in with Google

Consider DeepAI Pro