From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

04/25/2017
by   Kelvin Guu, et al.
0

Our goal is to learn a semantic parser that maps natural language utterances into executable programs when only indirect supervision is available: examples are labeled with the correct execution result, but not the program itself. Consequently, we must search the space of programs for those that output the correct result, while not being misled by spurious programs: incorrect programs that coincidentally output the correct result. We connect two common learning paradigms, reinforcement learning (RL) and maximum marginal likelihood (MML), and then present a new learning algorithm that combines the strengths of both. The new algorithm guards against spurious programs by combining the systematic search traditionally employed in MML with the randomized exploration of RL, and by updating parameters such that probability is spread more evenly across consistent programs. We apply our learning algorithm to a new neural semantic parser and show significant gains over existing state-of-the-art results on a recent context-dependent semantic parsing task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2017

Weakly-supervised Semantic Parsing with Abstract Examples

Semantic parsers translate language utterances to programs, but are ofte...
research
04/12/2021

Learning from Executions for Semantic Parsing

Semantic parsing aims at translating natural language (NL) utterances on...
research
09/09/2019

Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs

Semantic parsing aims to map natural language utterances onto machine in...
research
02/22/2019

Learning to Learn Semantic Parsers from Natural Language Supervision

As humans, we often rely on language to learn language. For example, whe...
research
05/11/2018

Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

Program synthesis is the task of automatically generating a program cons...
research
01/30/2023

Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs

Aiming to produce reinforcement learning (RL) policies that are human-in...
research
05/25/2022

Active Programming by Example with a Natural Language Prior

We introduce APEL, a new framework that enables non-programmers to indir...

Please sign up or login with your details

Forgot password? Click here to reset