PHEE: A Dataset for Pharmacovigilance Event Extraction from Text

by Zhaoyue Sun, et al.

The primary goal of drug safety researchers and regulators is to promptly identify adverse drug reactions. Doing so may in turn prevent or reduce harm to patients and ultimately improve public health. Evaluating and monitoring drug safety (i.e., pharmacovigilance) involves analyzing an ever-growing collection of spontaneous reports from health professionals, physicians, and pharmacists, as well as information voluntarily submitted by patients. In this scenario, automating the analysis of such reports has the potential to rapidly identify safety signals. Unfortunately, public resources for developing natural language models for this task are scant. We present PHEE, a novel dataset for pharmacovigilance comprising over 5000 annotated events from medical case reports and biomedical literature, making it the largest such public dataset to date. We describe the hierarchical event schema designed to provide coarse and fine-grained information about patients' demographics, treatments and (side) effects. Along with the discussion of the dataset, we present a thorough experimental evaluation of current state-of-the-art approaches for biomedical event extraction, point out their limitations, and highlight open challenges to foster future research in this area.
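To make the idea of a hierarchical event schema more concrete, the following is a minimal sketch of how a single annotated event with coarse and fine-grained levels might be represented in code. It is not the PHEE annotation format: the class names, the role labels (`Subject`, `Treatment`, `Effect`), the sub-argument keys (`age`, `gender`, `drug`), the event type string, and the example sentence are all assumptions made up for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Argument:
    """A coarse-grained argument span with optional fine-grained sub-arguments.

    All role and sub-argument names used below are hypothetical.
    """
    role: str                     # coarse role, e.g. "Subject", "Treatment", "Effect"
    text: str                     # the full argument span
    sub_arguments: Dict[str, List[str]] = field(default_factory=dict)


@dataclass
class Event:
    """One event: a type, a trigger word, and its arguments."""
    event_type: str
    trigger: str
    arguments: List[Argument] = field(default_factory=list)

    def coarse_view(self) -> Dict[str, str]:
        # Coarse level: one span per role.
        return {a.role: a.text for a in self.arguments}

    def fine_view(self) -> Dict[str, List[str]]:
        # Fine level: flatten sub-arguments into "Role.key" entries.
        return {
            f"{a.role}.{key}": values
            for a in self.arguments
            for key, values in a.sub_arguments.items()
        }


# Invented example sentence (not taken from the dataset):
# "A 65-year-old woman developed a rash after treatment with amoxicillin."
event = Event(
    event_type="adverse_event",   # hypothetical label
    trigger="developed",
    arguments=[
        Argument("Subject", "A 65-year-old woman",
                 {"age": ["65-year-old"], "gender": ["woman"]}),
        Argument("Treatment", "treatment with amoxicillin",
                 {"drug": ["amoxicillin"]}),
        Argument("Effect", "a rash"),
    ],
)

print(event.coarse_view())
print(event.fine_view())
```

In this sketch the coarse view answers "who, what treatment, what effect", while the fine view exposes details such as the patient's age or the drug name nested inside those spans.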



BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance

Timely and accurate extraction of Adverse Drug Events (ADE) from biomedi...

Distilling Large Language Models for Biomedical Knowledge Extraction: A Case Study on Adverse Drug Events

Large language models (LLMs), such as GPT-4, have demonstrated remarkabl...

Goal Driven Discovery of Distributional Differences via Language Descriptions

Mining large corpora can generate useful discoveries but is time-consumi...

From Unstructured to Structured: Transforming Chatbot Dialogues into Data Mart Schema for Visualization

Schools are among the primary avenues for public healthcare intervention...

Automagically encoding Adverse Drug Reactions in MedDRA

Pharmacovigilance is the field of science devoted to the collection, ana...

From narrative descriptions to MedDRA: automagically encoding adverse drug reactions

The collection of narrative spontaneous reports is an irreplaceable sour...
