PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical Records

07/05/2023
by   Viktor Schlegel, et al.
0

This paper describes PULSAR, our system submission at the ImageClef 2023 MediQA-Sum task on summarising patient-doctor dialogues into clinical records. The proposed framework relies on domain-specific pre-training, to produce a specialised language model which is trained on task-specific natural data augmented by synthetic data generated by a black-box LLM. We find limited evidence towards the efficacy of domain-specific pre-training and data augmentation, while scaling up the language model yields the best performance gains. Our approach was ranked second and third among 13 submissions on task B of the challenge. Our code is available at https://github.com/yuping-wu/PULSAR.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/22/2022

KALA: Knowledge-Augmented Language Model Adaptation

Pre-trained language models (PLMs) have achieved remarkable success on v...
research
04/27/2023

ZeroShotDataAug: Generating and Augmenting Training Data with ChatGPT

In this paper, we investigate the use of data obtained from prompting a ...
research
06/05/2023

PULSAR: Pre-training with Extracted Healthcare Terms for Summarising Patients' Problems and Data Augmentation with Black-box Large Language Models

Medical progress notes play a crucial role in documenting a patient's ho...
research
02/24/2023

HULAT at SemEval-2023 Task 9: Data augmentation for pre-trained transformers applied to Multilingual Tweet Intimacy Analysis

This paper describes our participation in SemEval-2023 Task 9, Intimacy ...
research
11/27/2020

TaylorGAN: Neighbor-Augmented Policy Update for Sample-Efficient Natural Language Generation

Score function-based natural language generation (NLG) approaches such a...
research
07/18/2022

Label2Label: A Language Modeling Framework for Multi-Attribute Learning

Objects are usually associated with multiple attributes, and these attri...
research
01/31/2023

FLAME: A small language model for spreadsheet formulas

The widespread use of spreadsheet environments by billions of users pres...

Please sign up or login with your details

Forgot password? Click here to reset