
A Reinforcement Learning Model Simulating Visual-Behavioral Choice in Insects

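Cite this article:	MA Qi, ZHANG Li-ming. A Reinforcement Learning Model Simulating Visual-Behavioral Choice in Insects [J]. Acta Biophysica Sinica (生物物理学报), 2008, 24(3): 211-220.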

Authors:	MA Qi, ZHANG Li-ming

Affiliation:	Department of Electronics Engineering, School of Information, Fudan University, Shanghai 200433, China

Funding:	National Natural Science Foundation of China; Shanghai Leading Academic Discipline Project

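Abstract:	Using visual information to make behavioral choices is an extremely complex information process in the brain. The way insects and other animals learn about their environment is governed by value, which in turn affects their choice behavior; studying this process is important for revealing how the human brain itself works. Building on biological evidence from the Drosophila experiments of Guo Aike's research group, this paper proposes a neural network model that simulates visual choice behavior in Drosophila. The model introduces value and a value-based reinforcement learning algorithm, applies them to reinforcement learning on input visual images, and thereby builds a value system representing the role of dopamine and the mushroom body in the fly brain's choice decisions. Simulation results show that the model can reproduce both the learning of visual information and the choice behavior of Drosophila, and that the results agree with the biological experiments. The model also provides a basis for applications in which visual information controls a robot's behavioral choices.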
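Keywords:	reinforcement learning; value system; neural network; choice behavior; simulation; insect; robot vision; reinforcement learning algorithm; learning model; information control; biological experimental results; judgment; mushroom body; brain dopamine; visual images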
Received:	2008-06-10
A Reinforcement Learning Model to Simulate Insect’s Choice Behavior Facing Visual Cues

MA Qi, ZHANG Li-ming. A Reinforcement Learning Model to Simulate Insect’s Choice Behavior Facing Visual Cues [J]. Acta Biophysica Sinica, 2008, 24(3): 211-220.

Authors:	MA Qi, ZHANG Li-ming

Institution:	Department of Electronics Engineering, Fudan University, Shanghai 200433, China

Abstract:	Choice behavior based on visual cues is a highly complex information process in the brain. Biological experiments have shown that the learning of insects and other animals in their environment is controlled by a value system, which affects their choice behavior. Studying this process is important for revealing the mechanisms of the human brain. Based on biological findings from Guo's group, a neural network model is proposed to simulate the choice behavior of insects facing visual cues. The proposed model introduces a value system and a reinforcement learning algorithm based on that value system. It learns input visual images under reward or punishment and establishes a value system that simulates choice behavior based on the dopamine and mushroom body circuit in the insect brain. The simulation results show that the proposed model can simulate the development of the value system and the choice behavior of Drosophila, and the simulated curves agree with biological experiments. The model provides a basis for further application to robot automatic control systems that respond to visual cues.

Keywords:	Reinforcement learning; Value system; Neural network; Choice behavior
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).