Self-directed Learning of Action Models using Exploratory Planning

by   Dustin Dannenhauer, et al.

Complex, real-world domains may not be fully modeled for an agent, especially if the agent has never operated in the domain before. The agent's ability to effectively plan and act in such a domain is influenced by its knowledge of when it can perform specific actions and the effects of those actions. We describe a novel exploratory planning agent that is capable of learning action preconditions and effects without expert traces or a given goal. The agent's architecture allows it to perform both exploratory actions as well as goal-directed actions, which opens up important considerations for how exploratory planning and goal planning should be controlled, as well as how the agent's behavior should be explained to any teammates it may have. The contributions of this work include a new representation for contexts called Lifted Linked Clauses, a novel exploration action selection approach using these clauses, an exploration planner that uses lifted linked clauses as goals in order to reach new states, and an empirical evaluation in a scenario from an exploration-focused video game demonstrating that lifted linked clauses improve exploration and action model learning against non-planning baseline agents.


page 1

page 2

page 3

page 4


A Probabilistic Model of Action for Least-Commitment Planning with Information Gather

AI planning algorithms have addressed the problem of generating sequence...

Minimizing the Negative Side Effects of Planning with Reduced Models

Reduced models of large Markov decision processes accelerate planning by...

Thinker: Learning to Plan and Act

We propose the Thinker algorithm, a novel approach that enables reinforc...

Task Scoping: Building Goal-Specific Abstractions for Planning in Complex Domains

A generally intelligent agent requires an open-scope world model: one ri...

On Reasonable and Forced Goal Orderings and their Use in an Agenda-Driven Planning Algorithm

The paper addresses the problem of computing goal orderings, which is on...

Design and Implementation of Linked Planning Domain Definition Language

Planning is a critical component of any artificial intelligence system t...

Universal Memory Architectures for Autonomous Machines

We propose a self-organizing memory architecture for perceptual experien...

Please sign up or login with your details

Forgot password? Click here to reset