NetHack is Hard to Hack

05/30/2023
by   Ulyana Piterbarg, et al.
0

Neural policy learning methods have achieved remarkable results in various control problems, ranging from Atari games to simulated locomotion. However, these methods struggle in long-horizon tasks, especially in open-ended environments with multi-modal observations, such as the popular dungeon-crawler game, NetHack. Intriguingly, the NeurIPS 2021 NetHack Challenge revealed that symbolic agents outperformed neural approaches by over four times in median game score. In this paper, we delve into the reasons behind this performance gap and present an extensive study on neural policy learning for NetHack. To conduct this study, we analyze the winning symbolic agent, extending its codebase to track internal strategy selection in order to generate one of the largest available demonstration datasets. Utilizing this dataset, we examine (i) the advantages of an action hierarchy; (ii) enhancements in neural architecture; and (iii) the integration of reinforcement learning with imitation learning. Our investigations produce a state-of-the-art neural agent that surpasses previous fully neural policies by 127 25 mere scaling is insufficient to bridge the performance gap with the best symbolic models or even the top human players.

READ FULL TEXT
research
12/24/2020

SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II

AlphaStar, the AI that reaches GrandMaster level in StarCraft II, is a r...
research
08/20/2023

Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Competitive Games

Training agents in multi-agent competitive games presents significant ch...
research
10/21/2021

LOA: Logical Optimal Actions for Text-based Interaction Games

We present Logical Optimal Actions (LOA), an action decision architectur...
research
11/28/2020

Human-Agent Cooperation in Bridge Bidding

We introduce a human-compatible reinforcement-learning approach to a coo...
research
11/27/2020

TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game

StarCraft, one of the most difficult esport games with long-standing his...
research
07/02/2023

Neuro-Symbolic Sudoku Solver

Deep Neural Networks have achieved great success in some of the complex ...

Please sign up or login with your details

Forgot password? Click here to reset