Contrastive Reinforcement Learning of Symbolic Reasoning Domains

06/16/2021
by   Gabriel Poesia, et al.
0

Abstract symbolic reasoning, as required in domains such as mathematics and logic, is a key component of human intelligence. Solvers for these domains have important applications, especially to computer-assisted education. But learning to solve symbolic problems is challenging for machine learning algorithms. Existing models either learn from human solutions or use hand-engineered features, making them expensive to apply in new domains. In this paper, we instead consider symbolic domains as simple environments where states and actions are given as unstructured text, and binary rewards indicate whether a problem is solved. This flexible setup makes it easy to specify new domains, but search and planning become challenging. We introduce four environments inspired by the Mathematics Common Core Curriculum, and observe that existing Reinforcement Learning baselines perform poorly. We then present a novel learning algorithm, Contrastive Policy Learning (ConPoLe) that explicitly optimizes the InfoNCE loss, which lower bounds the mutual information between the current state and next states that continue on a path to the solution. ConPoLe successfully solves all four domains. Moreover, problem representations learned by ConPoLe enable accurate prediction of the categories of problems in a real mathematics curriculum. Our results suggest new directions for reinforcement learning in symbolic domains, as well as applications to mathematics education.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2018

PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making

Reinforcement learning and symbolic planning have both been used to buil...
research
11/29/2022

Peano: Learning Formal Mathematical Reasoning

General mathematical reasoning is computationally undecidable, but human...
research
03/09/2022

SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning

Model-based reinforcement learning algorithms are typically more sample ...
research
12/12/2019

The Use of Deep Learning for Symbolic Integration: A Review of (Lample and Charton, 2019)

Lample and Charton (2019) describe a system that uses deep learning tech...
research
10/19/2021

Generating Symbolic Reasoning Problems with Transformer GANs

Constructing training data for symbolic reasoning domains is challenging...
research
08/26/2022

Symbolic Explanation of Affinity-Based Reinforcement Learning Agents with Markov Models

The proliferation of artificial intelligence is increasingly dependent o...
research
09/12/2019

Learning First-Order Symbolic Planning Representations from Plain Graphs

One of the main obstacles for developing flexible AI system is the split...

Please sign up or login with your details

Forgot password? Click here to reset