UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Learning Needle Pick-And-Place without expert demonstrations

Bendikas, Rokas; Modugno, Valerio; Kanoulas, Dimitrios; Vasconcelos, Francisco; Stoyanov, Danail; (2023) Learning Needle Pick-And-Place without expert demonstrations. IEEE Robotics and Automation Letters 10.1109/lra.2023.3266720. (In press). Green open access

[thumbnail of Learning_Needle_Pick-And-Place_without_expert_demonstrations.pdf]
Preview
Text
Learning_Needle_Pick-And-Place_without_expert_demonstrations.pdf - Accepted Version

Download (3MB) | Preview

Abstract

We introduce a novel approach for learning a complex multi-stage needle pick-and-place manipulation task for surgical applications using Reinforcement Learning without expert demonstrations or explicit curriculum. The proposed method is based on a recursive decomposition of the original task into a sequence of sub-tasks with increasing complexity and utilizes an actor-critic algorithm with deterministic policy output. In this work, exploratory bottlenecks have been used by a human expert as convenient boundary points for partitioning complex tasks into simpler subunits. Our method has successfully learnt a policy for the needle pick-and-place task, whereas the state-of-the-art TD3+HER method is unable to achieve success without the help of expert demonstrations. Comparison results show that our method achieves the highest performance with a 91% average success rate.

Type: Article
Title: Learning Needle Pick-And-Place without expert demonstrations
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/lra.2023.3266720
Publisher version: https://doi.org/10.1109/lra.2023.3266720
Language: English
Additional information: This version is the author accepted manuscript. For the purpose of open access, the author has applied a CC BY public copyright licence to any author accepted manuscript version arising from this submission.
Keywords: Task analysis, Needles, Grasping, Robots, Medical robotics, Trajectory, Transfer learning
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10168660
Downloads since deposit
2,928Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item