首页 | 本学科首页   官方微博 | 高级检索  
     


Parallel reinforcement learning for weighted multi-criteria model with adaptive margin
Authors:Kazuyuki Hiraoka  Manabu Yoshida  Taketoshi Mishima
Affiliation:(1) Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Japan
Abstract:Reinforcement learning (RL) for a linear family of tasks is described in this paper. The key of our discussion is nonlinearity of the optimal solution even if the task family is linear; we cannot obtain the optimal policy using a naive approach. Although an algorithm exists for calculating the equivalent result to Q-learning for each task simultaneously, it presents the problem of explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.
Keywords:Reinforcement learning  Multi-criteria  Convex hull  Minkowski sum
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号