A framework for information extraction from tables in biomedical literature

02/26/2019
by Nikola Milosevic, et al.

The scientific literature is growing exponentially, and professionals are no longer able to cope with the current volume of publications. Text mining has in the past provided methods to retrieve and extract information from text; however, most of these approaches ignored tables and figures. Research on mining table data still lacks an integrated approach that considers all the complexities and challenges of a table. Our research examines methods for extracting numerical (number of patients, age, gender distribution) and textual (adverse reactions) information from tables in the clinical literature. We present a requirement analysis template and an integral methodology for information extraction from tables in the clinical domain that contains 7 steps: (1) table detection, (2) functional processing, (3) structural processing, (4) semantic tagging, (5) pragmatic processing, (6) cell selection and (7) syntactic processing and extraction. Our approach achieved an F-measure ranging between 82 and 92, depending on the task and its complexity.
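To make the ordering of the seven steps concrete, the following is a minimal sketch of how such a pipeline might be chained. Only the step names are taken from the abstract; the `make_placeholder_stage` and `run_pipeline` helpers, the dictionary-based state, and the sample input string are hypothetical and do not reflect the authors' implementation. Each stage here is a stand-in that only records that it ran.

```python
from typing import Callable, Dict, List

# Step names as listed in the abstract; everything else below is hypothetical.
STEP_NAMES = [
    "table detection",
    "functional processing",
    "structural processing",
    "semantic tagging",
    "pragmatic processing",
    "cell selection",
    "syntactic processing and extraction",
]

Stage = Callable[[Dict], Dict]


def make_placeholder_stage(name: str) -> Stage:
    """Return a stand-in stage that only appends its name to the trace."""
    def stage(state: Dict) -> Dict:
        new_state = dict(state)  # shallow copy so the input state is not mutated
        new_state["trace"] = state.get("trace", []) + [name]
        return new_state
    return stage


def run_pipeline(document: str, stages: List[Stage]) -> Dict:
    """Pass a shared state dictionary through each stage in order."""
    state: Dict = {"document": document, "trace": []}
    for stage in stages:
        state = stage(state)
    return state


stages = [make_placeholder_stage(name) for name in STEP_NAMES]
result = run_pipeline("<article>placeholder input</article>", stages)
```

In a real system each placeholder would be replaced by a function that reads and enriches the shared state, so that later steps such as cell selection can rely on annotations produced by earlier ones.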


