Title
Pneumatic artificial muscle-driven robot control using local update reinforcement learning.
Abstract
In this study, a new value function based Reinforcement learning (RL) algorithm, Local Update Dynamic Policy Programming (LUDPP), is proposed. It exploits the nature of smooth policy update using Kullback-Leibler divergence to update its value function locally and considerably reduces the computational complexity. We firstly investigated the learning performance of LUDPP and other algorithms without smooth policy update for tasks of pendulum swing up and n DOFs manipulator reaching in simulation. Only LUDPP could efficiently and stably learn good control policies in high dimensional systems with limited number of training samples. In real word application, we applied LUDPP to control Pneumatic Artificial Muscles (PAMs) driven robots without the knowledge of model which is challenging for traditional methods due to the high nonlinearities of PAM's air pressure dynamics and mechanical structure. LUDPP successfully achieved one finger control of Shadow Dexterous Hand, a PAM-driven humanoid robot hand, with far lower computational resource compared with other conventional value function based RL algorithms.
Year
DOI
Venue
2017
10.1080/01691864.2016.1274680
ADVANCED ROBOTICS
Keywords
Field
DocType
Smooth policy update,dynamic policy programming,robot motor learning
Robot learning,Robot control,Control engineering,Engineering,Artificial muscle,Reinforcement learning
Journal
Volume
Issue
ISSN
31.0
8
0169-1864
Citations 
PageRank 
References 
4
0.43
18
Authors
3
Name
Order
Citations
PageRank
Yunduan Cui1247.07
Takamitsu Matsubara235139.84
Kenji Sugimoto33010.35