Addressing Resource and Privacy Constraints in Semantic Parsing Through Data Augmentation

05/18/2022
by Kevin Yang, et al.

We introduce a novel setup for low-resource task-oriented semantic parsing that incorporates several constraints that may arise in real-world scenarios: (1) lack of similar datasets/models from a related domain, (2) inability to sample useful logical forms directly from a grammar, and (3) privacy requirements for unlabeled natural utterances. Our goal is to improve a low-resource semantic parser using utterances collected through user interactions. In this highly challenging but realistic setting, we investigate data augmentation approaches that first generate a set of structured canonical utterances corresponding to logical forms, then simulate corresponding natural language, and finally filter the resulting pairs. We find that such approaches are effective despite our restrictive setup: in a low-resource setting on the complex SMCalFlow calendaring dataset (Andreas et al., 2020), we observe a 33% relative improvement over a non-data-augmented baseline in top-1 match.
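The three augmentation stages named in the abstract (canonical utterance generation, natural language simulation, and filtering) can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the logical-form encoding, the templates, the rule-based paraphraser, and the round-trip filter are hypothetical stand-ins, not the paper's models or SMCalFlow syntax.

```python
# Hypothetical toy logical forms: (intent, slots) pairs. Not SMCalFlow syntax.
LOGICAL_FORMS = [
    ("create_event", {"subject": "lunch", "day": "friday"}),
    ("delete_event", {"subject": "standup", "day": "monday"}),
]

# Stage 1: render each logical form as a structured canonical utterance.
CANONICAL_TEMPLATES = {
    "create_event": "create event with subject {subject} on day {day}",
    "delete_event": "delete event with subject {subject} on day {day}",
}

def to_canonical(intent, slots):
    return CANONICAL_TEMPLATES[intent].format(**slots)

# Stage 2: simulate natural language. A real system would likely use a
# learned paraphraser; these rewrite rules are a stand-in.
PARAPHRASES = {
    # "cancel" under create_event is deliberate noise for the filter to catch.
    "create event with subject": ["schedule", "set up", "cancel"],
    "delete event with subject": ["cancel", "remove"],
}

def simulate_natural(canonical):
    candidates = []
    for prefix, verbs in PARAPHRASES.items():
        if canonical.startswith(prefix):
            rest = canonical[len(prefix):].replace(" on day ", " on ")
            candidates.extend(verb + rest for verb in verbs)
    return candidates

# Stage 3: filter pairs. Assumed criterion: keep a pair only if a (toy)
# parser maps the simulated utterance back to the original logical form.
VERB_TO_INTENT = {
    "schedule": "create_event",
    "set up": "create_event",
    "cancel": "delete_event",
    "remove": "delete_event",
}

def toy_parse(utterance):
    for verb, intent in VERB_TO_INTENT.items():
        if utterance.startswith(verb + " "):
            subject, _, day = utterance[len(verb) + 1:].partition(" on ")
            return (intent, {"subject": subject, "day": day})
    return None

def augment(logical_forms):
    pairs = []
    for intent, slots in logical_forms:
        canonical = to_canonical(intent, slots)
        for utterance in simulate_natural(canonical):
            if toy_parse(utterance) == (intent, slots):
                pairs.append((utterance, (intent, slots)))
    return pairs

if __name__ == "__main__":
    for utterance, logical_form in augment(LOGICAL_FORMS):
        print(utterance, "->", logical_form)
```

In this sketch the noisy candidate "cancel lunch on friday" is discarded because the toy parser maps it to `delete_event` rather than the `create_event` form it was generated from; the surviving pairs would then be added to the parser's training data.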

