Experience enrichment based task independent reward model

05/21/2017
by   Min Xu, et al.
0

For most reinforcement learning approaches, the learning is performed by maximizing an accumulative reward that is expectedly and manually defined for specific tasks. However, in real world, rewards are emergent phenomena from the complex interactions between agents and environments. In this paper, we propose an implicit generic reward model for reinforcement learning. Unlike those rewards that are manually defined for specific tasks, such implicit reward is task independent. It only comes from the deviation from the agents' previous experiences.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset