Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations

Tirumala, Sashank ; Gubbi, Sagar ; Paigwar, Kartik ; Sagi, Aditya ; Joglekar, Ashish ; Bhatnagar, Shalabh ; Ghosal, Ashitava ; Amrutur, Bharadwaj ; Kolathaya, Shishir (2020) Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations In: 29th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 31 Aug.-4 Sept. 2020, Naples, Italy.

Full text not available from this repository.

Official URL: http://doi.org/10.1109/RO-MAN47096.2020.9223511

Related URL: http://dx.doi.org/10.1109/RO-MAN47096.2020.9223511

Abstract

With the research into development of quadruped robots picking up pace, learning based techniques are being explored for developing locomotion controllers for such robots. A key problem is to generate leg trajectories for continuously varying target linear and angular velocities, in a stable manner. In this paper, we propose a two pronged approach to address this problem. First, multiple simpler policies are trained to generate trajectories for a discrete set of target velocities and turning radius. These policies are then augmented using a higher level neural network for handling the transition between the learned trajectories. Specifically, we develop a neural network based filter that takes in target velocity, radius and transforms them into new commands that enable smooth transitions to the new trajectory. This transformation is achieved by learning from expert demonstrations. An application of this is the transformation of a novice user's input into an expert user's input, thereby ensuring stable manoeuvres regardless of the user's experience. Training our proposed architecture requires much less expert demonstrations compared to standard neural network architectures. Finally, we demonstrate experimentally these results in the in-house quadruped Stoch 2.

Item Type:Conference or Workshop Item (Paper)
Source:Copyright of this article belongs to Institute of Electrical and Electronics Engineers.
Keywords:Quadrupedal Walking; Reinforcement Learning; Random Search; Gait Transitions.
ID Code:116610
Deposited On:12 Apr 2021 07:09
Last Modified:12 Apr 2021 07:09

Repository Staff Only: item control page