Title
Deep reinforcement learning for tensegrity robot locomotion
Abstract
Tensegrity robots, composed of rigid rods connected by elastic cables, have a number of unique properties that make them appealing for use as planetary exploration rovers. However, control of tensegrity robots remains a difficult problem due to their unusual structures and complex dynamics. In this work, we show how locomotion gaits can be learned automatically using a novel extension of mirror descent guided policy search (MDGPS) applied to periodic locomotion movements, and we demonstrate the effectiveness of our approach on tensegrity robot locomotion. We evaluate our method with real-world and simulated experiments on the SUPERball tensegrity robot, showing that the learned policies generalize to changes in system parameters, unreliable sensor measurements, and variation in environmental conditions, including varied terrains and a range of different gravities. Our experiments demonstrate that our method not only learns fast, power-efficient feedback policies for rolling gaits, but that these policies can succeed with only the limited onboard sensing provided by SUPERball's accelerometers. We compare the learned feedback policies to learned open-loop policies and hand-engineered controllers, and demonstrate that the learned policy enables the first continuous, reliable locomotion gait for the real SUPERball robot. Our code and supplementary material is available from http://rll.berkeley.edu/drl_tensegrity.
Year
DOI
Venue
2017
10.1109/ICRA.2017.7989079
2017 IEEE International Conference on Robotics and Automation (ICRA)
Keywords
Field
DocType
deep reinforcement learning,tensegrity robot locomotion,rigid rods,elastic cables,planetary exploration rovers,complex dynamics,locomotion gaits,mirror descent guided policy search,MDGPS,periodic locomotion movements,SUPERball tensegrity robot,unreliable sensor measurements,power-efficient feedback policies,rolling gaits,onboard sensing,SUPERball accelerometers,open-loop policies,hand-engineered controllers,reliable locomotion gait,SUPERball robot
Complex dynamics,Gait,Accelerometer,Simulation,Control engineering,Tensegrity,Robot locomotion,Planetary exploration,Engineering,Robot,Reinforcement learning
Conference
Volume
Issue
ISBN
2017
1
978-1-5090-4634-8
Citations 
PageRank 
References 
6
0.52
10
Authors
8
Name
Order
Citations
PageRank
Marvin Zhang1192.47
Xinyang Geng2714.43
Jonathan Bruce3112.68
Ken Caluwaerts4184.29
Massimo Vespignani513911.02
Vytas SunSpiral6367.03
Pieter Abbeel76363376.48
Sergey Levine83377182.21