Deep reinforcement learning for tensegrity robot locomotion - Citegraph

Paper Info

Title
Deep reinforcement learning for tensegrity robot locomotion

Abstract
Tensegrity robots, composed of rigid rods connected by elastic cables, have a number of unique properties that make them appealing for use as planetary exploration rovers. However, control of tensegrity robots remains a difficult problem due to their unusual structures and complex dynamics. In this work, we show how locomotion gaits can be learned automatically using a novel extension of mirror descent guided policy search (MDGPS) applied to periodic locomotion movements, and we demonstrate the effectiveness of our approach on tensegrity robot locomotion. We evaluate our method with real-world and simulated experiments on the SUPERball tensegrity robot, showing that the learned policies generalize to changes in system parameters, unreliable sensor measurements, and variation in environmental conditions, including varied terrains and a range of different gravities. Our experiments demonstrate that our method not only learns fast, power-efficient feedback policies for rolling gaits, but that these policies can succeed with only the limited onboard sensing provided by SUPERball's accelerometers. We compare the learned feedback policies to learned open-loop policies and hand-engineered controllers, and demonstrate that the learned policy enables the first continuous, reliable locomotion gait for the real SUPERball robot. Our code and supplementary material is available from http://rll.berkeley.edu/drl_tensegrity.

Year	DOI	Venue
2017	10.1109/ICRA.2017.7989079	2017 IEEE International Conference on Robotics and Automation (ICRA)
Keywords	Field	DocType
deep reinforcement learning,tensegrity robot locomotion,rigid rods,elastic cables,planetary exploration rovers,complex dynamics,locomotion gaits,mirror descent guided policy search,MDGPS,periodic locomotion movements,SUPERball tensegrity robot,unreliable sensor measurements,power-efficient feedback policies,rolling gaits,onboard sensing,SUPERball accelerometers,open-loop policies,hand-engineered controllers,reliable locomotion gait,SUPERball robot	Complex dynamics,Gait,Accelerometer,Simulation,Control engineering,Tensegrity,Robot locomotion,Planetary exploration,Engineering,Robot,Reinforcement learning	Conference
Volume	Issue	ISBN
2017	1	978-1-5090-4634-8
Citations	PageRank	References
6	0.52	10
Authors
8

Authors (8 rows)

Cited by (6 rows)

References (10 rows)

Name	Order	Citations	PageRank
Marvin Zhang	1	19	2.47
Xinyang Geng	2	71	4.43
Jonathan Bruce	3	11	2.68
Ken Caluwaerts	4	18	4.29
Massimo Vespignani	5	139	11.02
Vytas SunSpiral	6	36	7.03
Pieter Abbeel	7	6363	376.48
Sergey Levine	8	3377	182.21

1