首页 | 本学科首页   官方微博 | 高级检索  
   检索      

模拟昆虫视觉-行为抉择的强化学习模型
引用本文:马奇,张立明.模拟昆虫视觉-行为抉择的强化学习模型[J].生物物理学报,2008,24(3):211-220.
作者姓名:马奇  张立明
作者单位:复旦大学信息学院电子工程系,上海,200433
基金项目:国家自然科学基金 , 上海市重点学科建设项目
摘    要:视觉信息用于行为抉择的过程是一个极其复杂的脑信息处理过程,昆虫或动物对外界环境的学习是以价值来控制的,并可影响其行为抉择,研究这一过程对揭示人类自身脑运行机制有重要意义.文章在郭爱克研究小组果蝇实验提供的生物依据基础上,提出了一种模拟果蝇视觉-行为抉择的神经网络模型.该模型引入了价值和基于价值的强化学习算法,应用于输入视觉图像的强化学习,以此建立果蝇脑内多巴胺和蘑菇体对于抉择判断的价值体系.模拟的结果表明,该模型可以模拟果蝇视觉信息的学习和行为抉择过程,其结果与生物实验相符,同时也为机器人视觉信息控制行为抉择的应用提供了基础.

关 键 词:强化学习  价值系统  神经网络  行为抉择  模拟  昆虫  机器人视觉  行为抉择  强化学习算法  学习模型  INSECT  SIMULATE  LEARNING  MODEL  REINFORCEMENT  VISUAL  BEHAVIOR  信息控制  生物实验  结果  价值体系  判断  蘑菇体  脑内多巴胺  视觉图像
收稿时间:2008-06-10

A Reinforcement Learning Model to Simulate Insect’s Choice Behavior Facing Visual Cues
MA Qi,ZHANG Li-ming.A Reinforcement Learning Model to Simulate Insect’s Choice Behavior Facing Visual Cues[J].Acta Biophysica Sinica,2008,24(3):211-220.
Authors:MA Qi  ZHANG Li-ming
Institution:Department of Electronics Engineering, Fudan University, Shanghai 200433, China
Abstract:Choice behavior based on visual cues is a very complicated information process of brain. Biological experiments showed that learning of insects or animals in the environment is controlled by value system which affects their choice behavior. This study is significant to reveal the mechanism of human brain. A neural network model was proposed to simulate the choice behavior of insects facing visual cues based on biological facts of Guo's group. The proposed model introduces value system and a reinforcement learning algorithm based on the value system. It can learn input visual images under the reward or punishment and establish a value system, which simulates choice behavior based on dopamine and mushroom body circuit in the brain of insects. The simulation results show that the proposed model can simulate the development of value system and choice behavior of drosophila. The simulating curves accord with biological experiments. This model provides a basis for the further application on robot auto-control system facing visual cues.
Keywords:Reinforcement learning  Value system  Neural network  Choice behavior
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《生物物理学报》浏览原始摘要信息
点击此处可从《生物物理学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号