Parallel reinforcement learning for weighted multi-criteria model with adaptive margin

Authors:	Kazuyuki Hiraoka, Manabu Yoshida, Taketoshi Mishima

Affiliation:	(1) Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, Japan

Abstract:	This paper describes reinforcement learning (RL) for a linear family of tasks. The key point of our discussion is that the optimal solution is nonlinear even when the task family is linear, so a naive approach cannot obtain the optimal policy. Although an algorithm exists that computes, for every task simultaneously, a result equivalent to Q-learning, it suffers from an explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.

Keywords:	Reinforcement learning, Multi-criteria, Convex hull, Minkowski sum
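
The abstract's remark that the optimal solution is nonlinear even for a linear task family can be illustrated with a toy case. The sketch below is not the paper's algorithm: the one-step task, the reward vectors in `REWARDS`, the point set `S`, and the helper names `optimal_value` and `minkowski_sum` are all hypothetical, chosen only for illustration. It assumes each task is a weight vector w applied to a vector-valued reward, and it reads the "Minkowski sum" keyword as a hint that sets of candidate value vectors are combined element by element, which is one way set sizes can grow.

```python
# Hypothetical one-step task: each action yields a 2-dimensional reward vector.
REWARDS = {"a": (1.0, 0.0), "b": (0.0, 1.0)}


def optimal_value(w):
    """Best achievable weighted reward for the task with weight vector w."""
    return max(w[0] * r[0] + w[1] * r[1] for r in REWARDS.values())


def minkowski_sum(xs, ys):
    """All pairwise sums of 2-D points drawn from xs and ys."""
    return {(x[0] + y[0], x[1] + y[1]) for x in xs for y in ys}


# Optimal values at two tasks and at the task halfway between them.
v_first = optimal_value((1.0, 0.0))
v_second = optimal_value((0.0, 1.0))
v_middle = optimal_value((0.5, 0.5))

# If the optimal value were linear in w, the middle task would get the average.
linear_guess = 0.5 * (v_first + v_second)
print(v_middle, linear_guess)  # the two numbers differ

# Hypothetical set of candidate value vectors; its Minkowski sum with itself
# already contains more points than the original set.
S = {(0, 0), (1, 3), (4, 1)}
print(len(S), len(minkowski_sum(S, S)))
```

Here the weighted reward of each fixed action is linear in w, but the maximum over actions is only piecewise linear, so interpolating optimal values between tasks gives the wrong answer. The second print shows a three-point set growing after a single Minkowski sum. Whether and how the paper prunes such sets is not stated in this abstract beyond the mention of adaptive margins.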
This article is indexed in SpringerLink and other databases.