Skip to main navigation Skip to search Skip to main content

Path Planning for Multi-Arm Manipulators Using Soft Actor-Critic Algorithm with Position Prediction of Moving Obstacles via LSTM

  • Kwan Woo Park
  • , Myeong Seop Kim
  • , Jung Su Kim
  • , Jae Han Park
  • Seoul National University of Science and Technology (SNUST)
  • Korea Institute of Industrial Technology

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

This paper presents a deep reinforcement learning-based path planning algorithm for the multi-arm robot manipulator when there are both fixed and moving obstacles in the workspace. Considering the problem properties such as high dimensionality and continuous action, the proposed algorithm employs the SAC (soft actor-critic). Moreover, in order to predict explicitly the future position of the moving obstacle, LSTM (long short-term memory) is used. The SAC-based path planning algorithm is developed using the LSTM. In order to show the performance of the proposed algorithm, simulation results using GAZEBO and experimental results using real manipulators are presented. The simulation and experiment results show that the success ratio of path generation for arbitrary starting and goal points converges to 100%. It is also confirmed that the LSTM successfully predicts the future position of the obstacle.

Original languageEnglish
Article number9837
JournalApplied Sciences (Switzerland)
Volume12
Issue number19
DOIs
StatePublished - Oct 2022

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • collision avoidance
  • hindsight experience replay (HER)
  • long short-term memory (LSTM)
  • moving obstacles
  • multi-arm manipulators
  • path planning
  • reinforcement learning
  • soft actor-critic (SAC)

Fingerprint

Dive into the research topics of 'Path Planning for Multi-Arm Manipulators Using Soft Actor-Critic Algorithm with Position Prediction of Moving Obstacles via LSTM'. Together they form a unique fingerprint.

Cite this