Unsupervised Identification of Study Descriptors in Toxicology Research: An Experimental Study

11/03/2018
by   Drahomira Herrmannova, et al.
0

Identifying and extracting data elements such as study descriptors in publication full texts is a critical yet manual and labor-intensive step required in a number of tasks. In this paper we address the question of identifying data elements in an unsupervised manner. Specifically, provided a set of criteria describing specific study parameters, such as species, route of administration, and dosing regimen, we develop an unsupervised approach to identify text segments (sentences) relevant to the criteria. A binary classifier trained to identify publications that met the criteria performs better when trained on the candidate sentences than when trained on sentences randomly picked from the text, supporting the intuition that our method is able to accurately identify study descriptors.

READ FULL TEXT

page 7

page 12

research
10/21/2020

ReSCo-CC: Unsupervised Identification of Key Disinformation Sentences

Disinformation is often presented in long textual articles, especially w...
research
06/21/2016

A Novel Framework to Expedite Systematic Reviews by Automatically Building Information Extraction Training Corpora

A systematic review identifies and collates various clinical studies and...
research
05/06/2016

Detecting Context Dependence in Exercise Item Candidates Selected from Corpora

We explore the factors influencing the dependence of single sentences on...
research
11/21/2014

A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems

Estimating the difficulty level of math word problems is an important ta...
research
10/06/2022

Detecting Narrative Elements in Informational Text

Automatic extraction of narrative elements from text, combining narrativ...
research
05/25/2020

Pointwise Paraphrase Appraisal is Potentially Problematic

The prevailing approach for training and evaluating paraphrase identific...
research
09/13/2021

Show Me How To Revise: Improving Lexically Constrained Sentence Generation with XLNet

Lexically constrained sentence generation allows the incorporation of pr...

Please sign up or login with your details

Forgot password? Click here to reset