首页 | 本学科首页   官方微博 | 高级检索  
   检索      


Using Reinforcement Learning to Provide Stable Brain-Machine Interface Control Despite Neural Input Reorganization
Authors:Eric A Pohlmeyer  Babak Mahmoudi  Shijia Geng  Noeline W Prins  Justin C Sanchez
Institution:1. Department of Biomedical Engineering, University of Miami, Coral Gables, Florida, United States of America.; 2. Department of Neurosurgery, Emory University, Atlanta, Georgia, United States of America.; 3. Department of Neuroscience, University of Miami, Miami, Florida, United States of America.; 4. Miami Project to Cure Paralysis, University of Miami, Miami, Florida, United States of America.; Georgia State University, United States of America,
Abstract:Brain-machine interface (BMI) systems give users direct neural control of robotic, communication, or functional electrical stimulation systems. As BMI systems begin transitioning from laboratory settings into activities of daily living, an important goal is to develop neural decoding algorithms that can be calibrated with a minimal burden on the user, provide stable control for long periods of time, and can be responsive to fluctuations in the decoder’s neural input space (e.g. neurons appearing or being lost amongst electrode recordings). These are significant challenges for static neural decoding algorithms that assume stationary input/output relationships. Here we use an actor-critic reinforcement learning architecture to provide an adaptive BMI controller that can successfully adapt to dramatic neural reorganizations, can maintain its performance over long time periods, and which does not require the user to produce specific kinetic or kinematic activities to calibrate the BMI. Two marmoset monkeys used the Reinforcement Learning BMI (RLBMI) to successfully control a robotic arm during a two-target reaching task. The RLBMI was initialized using random initial conditions, and it quickly learned to control the robot from brain states using only a binary evaluative feedback regarding whether previously chosen robot actions were good or bad. The RLBMI was able to maintain control over the system throughout sessions spanning multiple weeks. Furthermore, the RLBMI was able to quickly adapt and maintain control of the robot despite dramatic perturbations to the neural inputs, including a series of tests in which the neuron input space was deliberately halved or doubled.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号