Enhancing Few-shot NER with Prompt Ordering based Data Augmentation

05/19/2023
by   Huiming Wang, et al.
0

Recently, data augmentation (DA) methods have been proven to be effective for pre-trained language models (PLMs) in low-resource settings, including few-shot named entity recognition (NER). However, conventional NER DA methods are mostly aimed at sequence labeling models, i.e., token-level classification, and few are compatible with unified autoregressive generation frameworks, which can handle a wider range of NER tasks, such as nested NER. Furthermore, these generation frameworks have a strong assumption that the entities will appear in the target sequence with the same left-to-right order as the source sequence. In this paper, we claim that there is no need to keep this strict order, and more diversified but reasonable target entity sequences can be provided during the training stage as a novel DA method. Nevertheless, a naive mixture of augmented data can confuse the model since one source sequence will then be paired with different target sequences. Therefore, we propose a simple but effective Prompt Ordering based Data Augmentation (PODA) method to improve the training of unified autoregressive generation frameworks under few-shot NER scenarios. Experimental results on three public NER datasets and further analyses demonstrate the effectiveness of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2022

EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks

Data augmentation techniques have been used to improve the generalizatio...
research
07/11/2023

RoPDA: Robust Prompt-based Data Augmentation for Low-Resource Named Entity Recognition

Data augmentation has been widely used in low-resource NER tasks to tack...
research
05/18/2023

BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NER

Biomedical Named Entity Recognition (BioNER) is the fundamental task of ...
research
10/05/2020

SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup

Active learning is an important technique for low-resource sequence labe...
research
04/25/2022

Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting

Self-augmentation has received increasing research interest recently to ...
research
06/01/2023

ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER

Complex Named Entity Recognition (NER) is the task of detecting linguist...
research
10/04/2020

Local Additivity Based Data Augmentation for Semi-supervised NER

Named Entity Recognition (NER) is one of the first stages in deep langua...

Please sign up or login with your details

Forgot password? Click here to reset