Large Language Models are Zero-Shot Clinical Information Extractors

05/25/2022
by   Monica Agrawal, et al.
0

We show that large language models, such as GPT-3, perform well at zero-shot information extraction from clinical text despite not being trained specifically for the clinical domain. We present several examples showing how to use these models as tools for the diverse tasks of (i) concept disambiguation, (ii) evidence extraction, (iii) coreference resolution, and (iv) concept extraction, all on clinical text. The key to good performance is the use of simple task-specific programs that map from the language model outputs to the label space of the task. We refer to these programs as resolvers, a generalization of the verbalizer, which defines a mapping between output tokens and a discrete label space. We show in our examples that good resolvers share common components (e.g., "safety checks" that ensure the language model outputs faithfully match the input data), and that the common patterns across tasks make resolvers lightweight and easy to create. To better evaluate these systems, we also introduce two new datasets for benchmarking zero-shot clinical information extraction based on manual relabeling of the CASI dataset (Moon et al., 2014) with labels for new tasks. On the clinical extraction tasks we studied, the GPT-3 + resolver systems significantly outperform existing zero- and few-shot baselines.

READ FULL TEXT

page 21

page 22

page 23

page 24

page 25

research
09/23/2021

Zero-Shot Information Extraction as a Unified Text-to-Triple Translation

We cast a suite of information extraction tasks into a text-to-triple tr...
research
02/23/2023

CHiLL: Zero-shot Custom Interpretable Feature Extraction from Clinical Notes with Large Language Models

Large Language Models (LLMs) have yielded fast and dramatic progress in ...
research
02/20/2023

Zero-Shot Information Extraction via Chatting with ChatGPT

Zero-shot information extraction (IE) aims to build IE systems from the ...
research
09/08/2023

Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges

Unstructured Electronic Health Record (EHR) data often contains critical...
research
02/02/2022

Pop Quiz! Can a Large Language Model Help With Reverse Engineering?

Large language models (such as OpenAI's Codex) have demonstrated impress...
research
09/04/2023

Zero-shot information extraction from radiological reports using ChatGPT

Electronic health records contain an enormous amount of valuable informa...
research
05/24/2023

A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction

We consider dyadic zero-shot event extraction (EE) to identify actions b...

Please sign up or login with your details

Forgot password? Click here to reset