Path Planning of Cleaning Robot with Reinforcement Learning

by   Woohyeon Moon, et al.

Recently, as the demand for cleaning robots has steadily increased, therefore household electricity consumption is also increasing. To solve this electricity consumption issue, the problem of efficient path planning for cleaning robot has become important and many studies have been conducted. However, most of them are about moving along a simple path segment, not about the whole path to clean all places. As the emerging deep learning technique, reinforcement learning (RL) has been adopted for cleaning robot. However, the models for RL operate only in a specific cleaning environment, not the various cleaning environment. The problem is that the models have to retrain whenever the cleaning environment changes. To solve this problem, the proximal policy optimization (PPO) algorithm is combined with an efficient path planning that operates in various cleaning environments, using transfer learning (TL), detection nearest cleaned tile, reward shaping, and making elite set methods. The proposed method is validated with an ablation study and comparison with conventional methods such as random and zigzag. The experimental results demonstrate that the proposed method achieves improved training performance and increased convergence speed over the original PPO. And it also demonstrates that this proposed method is better performance than conventional methods (random, zigzag).


page 3

page 4

page 5

page 6


Capability Iteration Network for Robot Path Planning

Path planning is an important topic in robotics. Recently, value iterati...

Simulating Coverage Path Planning with Roomba

Coverage Path Planning involves visiting every unoccupied state in an en...

An Improved Algorithm of Robot Path Planning in Complex Environment Based on Double DQN

Deep Q Network (DQN) has several limitations when applied in planning a ...

Combining Subgoal Graphs with Reinforcement Learning to Build a Rational Pathfinder

In this paper, we present a hierarchical path planning framework called ...

Informative Path Planning for Mobile Sensing with Reinforcement Learning

Large-scale spatial data such as air quality, thermal conditions and loc...

Learning to View: Decision Transformers for Active Object Detection

Active perception describes a broad class of techniques that couple plan...

Surveillance Evasion Through Bayesian Reinforcement Learning

We consider a 2D continuous path planning problem with a completely unkn...

Please sign up or login with your details

Forgot password? Click here to reset