Reinforcement learning with internal expectation in the random neural networks for cascaded decisions
Authors: Halici U
Institution: Computer Vision and Artificial Neural Networks Research Laboratory, Department of Electrical and Electronics Engineering, 06531 METU, Ankara, Turkey. halici@metu.edu.tr
Abstract: The reinforcement learning scheme proposed in Halici (J. Biosystems 40 (1997) 83) for the random neural network (RNN) (Neural Computation 1 (1989) 502) is based on reward and performs well in stationary environments. However, when the environment is not stationary, it suffers from getting stuck in the previously learned action, and extinction is not possible. To overcome this problem, the reinforcement scheme is extended in Halici (Eur. J. Oper. Res. 126 (2000) 288) by introducing a new weight update rule (E-rule) which takes into account the internal expectation of reinforcement. Although the E-rule is proposed for the RNN, it can also be used for training learning automata or other intelligent systems based on reinforcement learning. This paper examines the behavior of the learning scheme with internal expectation in environments where the reinforcement is obtained only after a sequence of cascaded decisions. Simulation results show that the RNN learns well and that extinction is possible even in cases with several decision steps and hundreds of possible decision paths.
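The abstract does not reproduce the E-rule equations, so the following is only an illustrative Python sketch of the general idea it describes: action preferences updated against an internal expectation of reinforcement, with the end-of-path reward shared across a cascade of decision stages. The class name, the preference and expectation update forms, the learning-rate values, and the goal_path in the usage are assumptions for illustration, not the paper's definitions.

```python
import random


class ExpectationReinforcementLearner:
    """Illustrative learner that keeps an internal expectation of reinforcement.

    NOTE: this is a hedged sketch, not the E-rule of Halici (2000); the exact
    update form and parameter values below are assumptions.
    """

    def __init__(self, n_actions, alpha=0.1, beta=0.9):
        self.pref = [1.0] * n_actions   # non-negative action preferences
        self.expectation = 0.0          # internal expectation of reinforcement
        self.alpha = alpha              # preference learning rate (assumed)
        self.beta = beta                # expectation smoothing factor (assumed)

    def choose(self):
        # Roulette-wheel selection proportional to current preferences.
        total = sum(self.pref)
        threshold, acc = random.random() * total, 0.0
        for action, p in enumerate(self.pref):
            acc += p
            if threshold <= acc:
                return action
        return len(self.pref) - 1

    def update(self, action, reward):
        # Compare the received reinforcement with the internal expectation.
        surprise = reward - self.expectation
        # Strengthen the chosen action when reward exceeds expectation,
        # weaken it otherwise; the weakening is what permits extinction
        # once the environment changes and the old action stops paying off.
        self.pref[action] = max(1e-3, self.pref[action] + self.alpha * surprise)
        # Track the expectation as a running average of observed rewards.
        self.expectation = self.beta * self.expectation + (1 - self.beta) * reward


# Cascaded decisions: reinforcement arrives only after a sequence of choices.
stages = [ExpectationReinforcementLearner(n_actions=3) for _ in range(4)]
goal_path = [0, 2, 1, 0]  # hypothetical rewarded decision path

for episode in range(2000):
    path = [stage.choose() for stage in stages]
    reward = 1.0 if path == goal_path else 0.0
    for stage, action in zip(stages, path):
        stage.update(action, reward)  # same end-of-path reward at every stage
```

Under this toy setup, switching goal_path mid-run lets one observe the extinction behavior the abstract refers to: the previously preferred path loses preference once rewards fall below the learned expectation.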
This article is indexed in PubMed and other databases.