Learning Macromanagement in StarCraft from Replays using Deep Learning

07/12/2017
by   Niels Justesen, et al.

The real-time strategy game StarCraft has proven to be a challenging environment for artificial intelligence techniques, and as a result, current state-of-the-art solutions consist of numerous hand-crafted modules. In this paper, we show how macromanagement decisions in StarCraft can be learned directly from game replays using deep learning. Neural networks are trained on 789,571 state-action pairs extracted from 2,005 replays of highly skilled players, achieving top-1 and top-3 error rates of 54.6% and 22.9% in predicting the next build action. By integrating the trained network into UAlbertaBot, an open source StarCraft bot, the system can significantly outperform the game's built-in Terran bot and play competitively against UAlbertaBot with a fixed rush strategy. To our knowledge, this is the first time macromanagement tasks have been learned directly from replays in StarCraft. While the best hand-crafted strategies are still the state of the art, the deep network approach is able to express a wide range of different strategies. Improving the network's performance further with deep reinforcement learning is therefore an immediately promising avenue for future research. Ultimately, this approach could lead to strong StarCraft bots that are less reliant on hard-coded strategies.
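
The top-1 and top-3 error rates mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each prediction is a list of scores over a fixed set of build actions and that an example counts as an error when the true action is not among the k highest-scored ones. The function name `top_k_error` and the toy data are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def top_k_error(
    scores_per_state: Sequence[Sequence[float]],
    targets: Sequence[int],
    k: int,
) -> float:
    """Fraction of examples whose true action is NOT among the k top-scored actions."""
    if len(scores_per_state) != len(targets):
        raise ValueError("scores_per_state and targets must have the same length")
    if not targets:
        raise ValueError("at least one example is required")

    misses = 0
    for scores, target in zip(scores_per_state, targets):
        # Rank action indices from highest to lowest score.
        ranked = sorted(range(len(scores)), key=lambda a: scores[a], reverse=True)
        if target not in ranked[:k]:
            misses += 1
    return misses / len(targets)


# Toy data (hypothetical): 4 game states, 4 possible build actions.
predictions = [
    [0.6, 0.2, 0.1, 0.1],  # true action 0: ranked first
    [0.1, 0.5, 0.3, 0.1],  # true action 2: ranked second
    [0.4, 0.3, 0.2, 0.1],  # true action 3: ranked last
    [0.2, 0.2, 0.5, 0.1],  # true action 2: ranked first
]
targets = [0, 2, 3, 2]

top1 = top_k_error(predictions, targets, k=1)
top3 = top_k_error(predictions, targets, k=3)
print(f"top-1 error: {top1:.2f}, top-3 error: {top3:.2f}")
```

The top-3 error is never larger than the top-1 error, because any prediction that ranks the true action first also ranks it within the first three.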
